This contains two example data sets:

Text Data (ptb): Data from the Penn Treebank dataset provided by Mikolov: http://www.fit.vutbr.cz/~imikolov/rnnlm/
Tree Data (trees): The tree data from the Stanford Sentiment Treebank: http://nlp.stanford.edu/sentiment/index.html
Classification Data (classes): The data from the Stanford Sentiment Treebank with tree info removed.
Parallel Data (parallel): Data from the Tanaka corpus, reduced to only have 10,000 training examples: http://www.edrdg.org/wiki/index.php/Tanaka_Corpus
Tagging Data (tags): Data from WikiNER, reduced to only have 10,000 training examples: http://schwa.org/projects/resources/wiki/Wikiner

Provide feedback

Saved searches