Stars
6 stars written in Python
An open-source NLP research library, built on PyTorch.
ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.
A thread-safe, disk-based persistent queue in Python
Blazingly fast cleaning of swear words (and their leetspeak variants) in strings
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the d…
cLang-8 is a dataset for grammatical error correction.