cgl/zemberek-nlp

zemberek-nlp

A natural language processing library. Some modules are developed specifically for the Turkish language.

Core

Core classes, such as specialized collection classes, hash functions, and helpers.

Morphology

Turkish morphological parsing, disambiguation and generation.

Spelling

Statistical spell checker.

Tokenization

Turkish tokenization and sentence boundary detection. So far, only rule-based algorithms are implemented.
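To give a feel for what a rule-based approach to sentence boundary detection can look like, here is a minimal, self-contained sketch. It is not the module's actual implementation: the class name `RuleBasedSentenceSplitter`, the two rules, and the tiny abbreviation list are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Illustrative sketch only; not part of zemberek-nlp.
final class RuleBasedSentenceSplitter {

    // Hypothetical abbreviation list; a real one would be much larger.
    private static final Set<String> ABBREVIATIONS = Set.of("Dr", "Prof", "vb", "vs");

    static List<String> split(String text) {
        List<String> sentences = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c != '.' && c != '!' && c != '?') {
                continue;
            }
            // Rule 1: the mark must end the text or be followed by whitespace,
            // so the period inside "10.30" is not treated as a boundary.
            boolean atEnd = i + 1 == text.length();
            if (!atEnd && !Character.isWhitespace(text.charAt(i + 1))) {
                continue;
            }
            // Rule 2: a period right after a known abbreviation is not a boundary.
            if (c == '.' && ABBREVIATIONS.contains(lastWord(text, start, i))) {
                continue;
            }
            String sentence = text.substring(start, i + 1).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
            start = i + 1;
        }
        // Keep any trailing text that has no closing punctuation.
        String rest = text.substring(start).trim();
        if (!rest.isEmpty()) {
            sentences.add(rest);
        }
        return sentences;
    }

    // Returns the run of letters that ends just before index 'to'.
    private static String lastWord(String text, int from, int to) {
        int j = to;
        while (j > from && Character.isLetter(text.charAt(j - 1))) {
            j--;
        }
        return text.substring(j, to);
    }
}
```

With these rules, `RuleBasedSentenceSplitter.split("Dr. Ahmet geldi. Saat 10.30 oldu! Tamam mi?")` yields three sentences. A known weakness of this sketch is that an abbreviation at the very end of a sentence suppresses the boundary.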

Hyphenation

Turkish syllabification and hyphenation.
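As a rough illustration of Turkish syllabification, the sketch below applies the common textbook rule that every syllable contains exactly one vowel and that a single consonant directly before a vowel starts that vowel's syllable. The class name `TurkishSyllabifier` and its methods are hypothetical and are not taken from the module; loanwords that begin a syllable with a consonant cluster (for example "elektrik") are not handled correctly by this simple rule.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; not part of zemberek-nlp.
final class TurkishSyllabifier {

    // Turkish vowels in both cases. Unicode escapes cover dotless i (U+0131),
    // o-umlaut (U+00F6), u-umlaut (U+00FC) and their upper-case forms.
    private static final String VOWELS =
            "aeiou\u0131\u00f6\u00fc" + "AEIOU\u0130\u00d6\u00dc";

    static boolean isVowel(char c) {
        return VOWELS.indexOf(c) >= 0;
    }

    static List<String> syllabify(String word) {
        List<String> syllables = new ArrayList<>();
        int start = 0;       // start index of the syllable being built
        int prevVowel = -1;  // index of the most recent vowel seen
        for (int i = 0; i < word.length(); i++) {
            if (!isVowel(word.charAt(i))) {
                continue;
            }
            if (prevVowel >= 0) {
                // The consonant right before this vowel opens the new syllable.
                // If the two vowels are adjacent, split directly between them.
                int boundary = (i - 1 > prevVowel) ? i - 1 : i;
                syllables.add(word.substring(start, boundary));
                start = boundary;
            }
            prevVowel = i;
        }
        if (start < word.length()) {
            syllables.add(word.substring(start));
        }
        return syllables;
    }
}
```

For example, `TurkishSyllabifier.syllabify("merhaba")` splits the word into "mer", "ha", "ba". Hyphenation points can then be chosen from the boundaries between syllables.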

Language modelling

Language model compression.

Language identification

Text-based statistical language identification.
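One widely used statistical technique for identifying the language of a text is to compare its character n-gram counts against per-language profiles. The sketch below shows that idea with character trigrams and cosine similarity. It is an assumption about the general approach, not a description of how this module works; the class `NgramLanguageIdentifier` and its methods are hypothetical, and the training samples in the usage note are toy data.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Illustrative sketch only; not part of zemberek-nlp.
final class NgramLanguageIdentifier {

    // Language code -> accumulated character trigram counts.
    private final Map<String, Map<String, Integer>> profiles = new HashMap<>();

    // Counts character trigrams of the lower-cased, space-padded text.
    static Map<String, Integer> trigrams(String text) {
        String s = " " + text.toLowerCase(Locale.ROOT).replaceAll("\\s+", " ").trim() + " ";
        Map<String, Integer> counts = new HashMap<>();
        for (int i = 0; i + 3 <= s.length(); i++) {
            counts.merge(s.substring(i, i + 3), 1, Integer::sum);
        }
        return counts;
    }

    // Adds the trigram counts of a sample to the profile of a language.
    void train(String language, String sample) {
        Map<String, Integer> profile = profiles.computeIfAbsent(language, k -> new HashMap<>());
        trigrams(sample).forEach((gram, n) -> profile.merge(gram, n, Integer::sum));
    }

    // Cosine similarity between two sparse count vectors.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0, normA = 0, normB = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            normA += e.getValue() * e.getValue();
            Integer other = b.get(e.getKey());
            if (other != null) {
                dot += e.getValue() * other;
            }
        }
        for (int v : b.values()) {
            normB += v * v;
        }
        return (normA == 0 || normB == 0) ? 0 : dot / Math.sqrt(normA * normB);
    }

    // Returns the best-matching language, or null if nothing was trained.
    String identify(String text) {
        Map<String, Integer> query = trigrams(text);
        String best = null;
        double bestScore = -1;
        for (Map.Entry<String, Map<String, Integer>> e : profiles.entrySet()) {
            double score = cosine(query, e.getValue());
            if (score > bestScore) {
                bestScore = score;
                best = e.getKey();
            }
        }
        return best;
    }
}
```

A usage sketch: after `train("tr", "bu bir deneme cumlesidir ve turkce yazilmistir")` and `train("en", "this is a test sentence and it is written in english")`, calling `identify("bu cumle turkce")` picks "tr". Practical identifiers train on far larger corpora and typically use smoothed probabilistic scoring instead of raw cosine similarity.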

About

Turkish NLP libraries

Languages

  • Java 100.0%