Next Word Predictor Pitch

Wayne Heller

November 25, 2017

Wouldn’t Be Great If…

You could build a word prediction application customized to your needs?

NextWord is a new word prediction application that allows you to predict your next word based on state of the art prediction algorithms and a flexible system to tune its performance!

Using SwiftKey Data & Natural Language Processing

Behind and intuitive user interface, NextWord uses twitter, blog and news article data provided by SwiftKey. The configurable build system samples, processes, analyzes, creates, and evaluates the prediction model.

Using advanced natural language processing techniques such as Modified Kneser-Nye smoothing, NextWord delivers accuracy and blazing fast performance.

Performance Characteristics

File Size (Mb) NGrams Next Word Accuracy % In Top 3 Words Accuracy % Average Query Time (msec)
354.3 35,429,660 18.5 20.8 19.1
323.7 31,296,396 18.4 20.7 19.1
164.8 18,543,372 17.1 19.1 17.3
126.5 13,875,685 17.1 18.9 18.1
65.0 7,290,744 15.6 17.1 19.1
57.8 6,756,321 15.6 17.0 18.6
38.3 3,409,113 14.1 15.4 22.2
36.0 3,157,480 15.5 17.1 20.0
35.6 3,189,369 15.5 17.0 17.2
28.9 2,623,057 15.5 17.0 17.4

More Than Just An Application

There is an entire automated workflow of natural language processing to tune a prediction model to fit your needs:

Make Predicting Your Next Word Fast and Easy!

Shiny Application: https://rougeone.shinyapps.io/nextword/

Code Repo: https://github.com/wayneheller/DataScienceSpecializationCapstone

Pitch Deck (github): https://wayneheller.github.io/DataScienceSpecializationCapstone/

Pitch Deck (rpubs): http://rpubs.com/wgheller/nextword