University of Cambridge
United Kingdom
web-languages Public
Forked from commoncrawl/web-languages. Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code.
Updated Feb 21, 2025
num2words Public
Forked from savoirfairelinux/num2words. Modules to convert numbers to words. 42 --> forty-two
Python GNU Lesser General Public License v2.1 Updated Feb 17, 2025
espeak-ng Public
Forked from espeak-ng/espeak-ng. eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
C GNU General Public License v3.0 Updated Feb 4, 2025
locales Public
Forked from citation-style-language/locales. Official repository for Citation Style Language (CSL) locale files.
Ruby Updated Jan 9, 2025
spaCy Public
Forked from explosion/spaCy. 💫 Industrial-strength Natural Language Processing (NLP) in Python
Python MIT License Updated Nov 25, 2024
flores Public
Forked from facebookresearch/flores. Facebook Low Resource (FLoRes) MT Benchmark
Python Other Updated Nov 20, 2023
mtdata Public
Forked from thammegowda/mtdata. A tool that locates, downloads, and extracts machine translation corpora
Python Apache License 2.0 Updated May 23, 2023
epitran Public
Forked from dmort27/epitran. A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
Python MIT License Updated Jan 11, 2023
docs Public
Forked from UniversalDependencies/docs. Universal Dependencies online documentation
HTML Apache License 2.0 Updated Oct 31, 2021
stanza Public
Forked from stanfordnlp/stanza. Official Stanford NLP Python Library for Many Human Languages
Python Other Updated Oct 5, 2021
submitit Public
Forked from facebookincubator/submitit. Python 3.6+ toolbox for submitting jobs to Slurm
Python MIT License Updated Sep 15, 2021
KILT Public
Forked from facebookresearch/KILT. Library for Knowledge Intensive Language Tasks
Python MIT License Updated Jan 6, 2021
text Public
Forked from hudeven/text. Data loaders and abstractions for text and NLP
Python Updated Dec 14, 2020
DPR Public
Forked from facebookresearch/DPR. Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.
Python Other Updated Nov 20, 2020
pytext-1 Public
Forked from facebookresearch/pytext. A natural language modeling framework based on PyTorch
Python Other Updated Oct 14, 2020
cldr Public
Forked from unicode-org/cldr. The new home of the Unicode Common Locale Data Repository
Java Other Updated Sep 3, 2020
fairseq Public
Forked from facebookresearch/fairseq. Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License Updated Nov 13, 2019
translate Public
Forked from pytorch/translate. Translate - a PyTorch Language Library
Python BSD 3-Clause "New" or "Revised" License Updated Sep 17, 2019
calamari Public
Forked from Calamari-OCR/calamari. OCR Engine based on OCRopy and Kraken
Python GNU General Public License v3.0 Updated Apr 21, 2019
sgns-embeddings Public
NLP word and phrase embeddings, computed via skip-gram with negative sampling
relpron Public
Code for the article: Laura Rimell, Jean Maillard, Tamara Polajnar and Stephen Clark. 2016. RELPRON: A Relative Clause Composition Data Set for Compositional Distributional Semantics. Computational…
bib-parser Public archive
.bib file parser (BibTeX, BibLaTeX)
Rust Apache License 2.0 Updated Jan 24, 2017