Exploiting-BiLSTM-for-Proper-Word-Choice

Code and resources for training Bidirectional LSTM model, to supoort correct and sound writing for ESL learners.

Please cite this work if you find this code and datasets useful: https://arxiv.org/abs/1901.02490

Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems by: Victor Makarenkov, Lior Rokach, Bracha Shapira

In case of testing for scientific domain specific model, you should initilize the vocab dictionaries with values from: w2i.txt. This file already contains the dict and word serial numbers, so no need for the whole corpus, upon vocabulary initialization.

You need the model file itself: adam_batch_bilstm_bigmodel2.txt You can download it from here: https://drive.google.com/file/d/0B5iAITPoL9L2ald0VzNxZWtxdGM/view?usp=sharing

Otherwise, for any other domain or corpora you please replace long30k.txt filename in the code with your own training text file. You should also uncomment the reading of the file in code, and comment the vocabulary initialization.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
README.md		README.md
bilstm_word_choice.py		bilstm_word_choice.py
bilstm_word_choice_bert.py		bilstm_word_choice_bert.py
conll_test.zip		conll_test.zip
evaluation_176_3.txt		evaluation_176_3.txt
i2w.txt		i2w.txt
test_set_176_sentences.txt		test_set_176_sentences.txt
w2i.txt		w2i.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exploiting-BiLSTM-for-Proper-Word-Choice

About

Releases

Packages

Languages

vicmak/Exploiting-BiLSTM-for-Proper-Word-Choice

Folders and files

Latest commit

History

Repository files navigation

Exploiting-BiLSTM-for-Proper-Word-Choice

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages