nlp_next_word

This project produced a predictive text algorithm, and demonstration web interface, as part of the Coursera Data Science Capstone by Johns Hopkins University on Coursera (view certificate).

Background

Due to the complexities, subtleties and ever-changing nature of language, the most successful predictive text algorithms tend to take the approach of training models on a large body of text sources "in the wild" rather than alternatives such as applying grammatical rules (although the combination of both has potential to be even better).

To this end we will be using a large body of text (corpus) provided by SwiftKey as the training source for our predictive text models. Here we report on the nature of the data and search for insight on effective strategies on how to build text predictive algorithms.

Data

The data for this project kindly provided by SwiftKey (large zip archive).

Code

Analyses were peformed using R. Reporting written in Rmarkdown format and rendered in HTML using knitr.

Usage

A brief explanation of this project and how to use the app can be found on Rpubs.

The predictive text web app is hosted on shinyapps.io.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
next_word		next_word
word_prediction-figure		word_prediction-figure
LICENSE		LICENSE
MilestoneReport.Rmd		MilestoneReport.Rmd
README.md		README.md
UI_figure.png		UI_figure.png
predictive_text.R		predictive_text.R
word_prediction-rpubs.html		word_prediction-rpubs.html
word_prediction.Rpres		word_prediction.Rpres
word_prediction.md		word_prediction.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nlp_next_word

Background

Data

Code

Usage

About

Releases

Packages

Languages

License

msinjin/nlp_next_word

Folders and files

Latest commit

History

Repository files navigation

nlp_next_word

Background

Data

Code

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages