-
Notifications
You must be signed in to change notification settings - Fork 2
mmedek/nlp-hal
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
You will need install unidecode for running the scripts. You can download and install it with command 'pip install unidecode'. Run script: python /src/script.py Content of repo: |----- data - folder contains data for building hal model in Czech language |----- train.txt - news feeds in Czech |----- stopwords.txt - my stopwords which I use in script (stopwords are grouped from several sources) |----- src - python scripts |----- script.py |----- czech_stemmer.py - czech stemmer developed by Luís Gomes which I am using |----- README
About
Implementation of HAL (Hyperspace Analogue to Language) algorithm with linear weighting and usage idf multiplicator in python 3.6. Czech news feeds are used as input data.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published