From 204b788e60817917ef1d5182c3d4fe19c1c7243c Mon Sep 17 00:00:00 2001 From: sonvx Date: Tue, 25 Sep 2018 14:06:37 +0200 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 9192750..4857ac7 100644 --- a/README.md +++ b/README.md @@ -11,7 +11,7 @@ Word2Vec models for Vietnamese - word2vec-simple-visualization: It is working well. Please read the readme file inside that folder to know how to test the model. ## Note: - This model is trained using data of Le et al. http://mim.hus.vnu.edu.vn/phuonglh/node/72 - + Data information: 7.1G text with 1,675,819 word types from a corpus of 974,393,244 raw words and 97,440 sentences. Note that all words are tokenized words. + + Data information: 7.1G text with 1,675,819 word types from a corpus of 974,393,244 raw words and 97,440 documents. Note that all words are tokenized words. ### Citation