Wayne Heller
November 25, 2017
You could build a word prediction application customized to your needs?
NextWord is a new word prediction application that allows you to predict your next word based on state of the art prediction algorithms and a flexible system to tune its performance!
Behind and intuitive user interface, NextWord uses twitter, blog and news article data provided by SwiftKey. The configurable build system samples, processes, analyzes, creates, and evaluates the prediction model.
Using advanced natural language processing techniques such as Modified Kneser-Nye smoothing, NextWord delivers accuracy and blazing fast performance.
| File Size (Mb) | NGrams | Next Word Accuracy % | In Top 3 Words Accuracy % | Average Query Time (msec) |
|---|---|---|---|---|
| 354.3 | 35,429,660 | 18.5 | 20.8 | 19.1 |
| 323.7 | 31,296,396 | 18.4 | 20.7 | 19.1 |
| 164.8 | 18,543,372 | 17.1 | 19.1 | 17.3 |
| 126.5 | 13,875,685 | 17.1 | 18.9 | 18.1 |
| 65.0 | 7,290,744 | 15.6 | 17.1 | 19.1 |
| 57.8 | 6,756,321 | 15.6 | 17.0 | 18.6 |
| 38.3 | 3,409,113 | 14.1 | 15.4 | 22.2 |
| 36.0 | 3,157,480 | 15.5 | 17.1 | 20.0 |
| 35.6 | 3,189,369 | 15.5 | 17.0 | 17.2 |
| 28.9 | 2,623,057 | 15.5 | 17.0 | 17.4 |
There is an entire automated workflow of natural language processing to tune a prediction model to fit your needs:
Shiny Application: https://rougeone.shinyapps.io/nextword/
Code Repo: https://github.com/wayneheller/DataScienceSpecializationCapstone
Pitch Deck (github): https://wayneheller.github.io/DataScienceSpecializationCapstone/
Pitch Deck (rpubs): http://rpubs.com/wgheller/nextword