forked from piskvorky/gensim
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Evaluation of word2vec models against semantic similarity datasets (p…
…iskvorky#1047) * Update CHANGELOG.txt * Update CHANGELOG.txt * cbow_mean default changed from 0 to 1. * Hyperparameters' default values are aligned with Mikolov's word2vec. * Fix for piskvorky#538: cbow_mean default changed from 0 to 1. * Update changelog * (main) defaults aligned to Mikolov's word2vec. * word2vec (main) now mimics command-line arguments for Mikolov's word2vec. * Fix for piskvorky#538 * Fix for piskvorky#538 (tabs and spaces). * Fix for piskvorky#538 (tests). * For piskvorky#538: slightly relaxed sanity check demands (because now default vector size is 100, not 200). * Fixes as per @gojomo comments. * Test fixes due to negative sampling becoming default behavior. * Commented out tests which work for HS only. * Fix for piskvorky#538. * Yet another fix. * Merging. * Fix for CBOW test. * Changelog mention of piskvorky#538 * Fix for CBOW negative sampling tests. * Factoring out word2vec _main__ into gensim/scripts * Use logger instead of logging. * Made Changelog less verbose about word2vec defaults changed. * Fixes to word2vec_standalone.py as per Radim's comments. * Alpha argument. with different defaults for CBOW ans skipgram. * Release version typo fix * 'fisrt_push' * Finalizing. * Initial shippable release * Evaluation function to measure model correlation with human similarity judgments datasets. * Updating semantic similarity evaluation. * Scipy stats import * Evaluation function to measure model correlation with human similarity judgments datasets. * Remove unneccessary. * Changing the neame of the word pairs evaluation function.
- Loading branch information
Showing
3 changed files
with
90 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Empty file.