jeanm
  • University of Cambridge
  • United Kingdom

  • Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code.

    Updated Feb 21, 2025
  • Modules to convert numbers to words. 42 --> forty-two

    Python GNU Lesser General Public License v2.1 Updated Feb 17, 2025
  • espeak-ng Public

    Forked from espeak-ng/espeak-ng

    eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.

    C GNU General Public License v3.0 Updated Feb 4, 2025
  • Official repository for Citation Style Language (CSL) locale files.

    Ruby Updated Jan 9, 2025
  • spaCy Public

    Forked from explosion/spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Python MIT License Updated Nov 25, 2024
  • flores Public

    Forked from facebookresearch/flores

    Facebook Low Resource (FLoRes) MT Benchmark

    Python Other Updated Nov 20, 2023
  • lijspell Public

    Ligurian spellchecking

    2 Updated Oct 11, 2023
  • mtdata Public

    Forked from thammegowda/mtdata

    A tool that locates, downloads, and extracts machine translation corpora

    Python Apache License 2.0 Updated May 23, 2023
  • epitran Public

    Forked from dmort27/epitran

    A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

    Python MIT License Updated Jan 11, 2023
  • url-nlp Public

    Forked from google-research/url-nlp
    Other Updated Jun 24, 2022
  • docs Public

    Forked from UniversalDependencies/docs

    Universal Dependencies online documentation

    HTML Apache License 2.0 Updated Oct 31, 2021
  • stanza Public

    Forked from stanfordnlp/stanza

    Official Stanford NLP Python Library for Many Human Languages

    Python Other Updated Oct 5, 2021
  • Python 3.6+ toolbox for submitting jobs to Slurm

    Python MIT License Updated Sep 15, 2021
  • KILT Public

    Forked from facebookresearch/KILT

    Library for Knowledge Intensive Language Tasks

    Python MIT License Updated Jan 6, 2021
  • text Public

    Forked from hudeven/text

    Data loaders and abstractions for text and NLP

    Python Updated Dec 14, 2020
  • DPR Public

    Forked from facebookresearch/DPR

    Dense Passage Retriever: a set of tools and models for open-domain Q&A tasks.

    Python Other Updated Nov 20, 2020
  • A natural language modeling framework based on PyTorch

    Python Other Updated Oct 14, 2020
  • cldr Public

    Forked from unicode-org/cldr

    The new home of the Unicode Common Locale Data Repository

    Java Other Updated Sep 3, 2020
  • Updated Jul 15, 2020
  • Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

    Python MIT License Updated Nov 13, 2019
  • dotfiles Public

    Some dotfiles

    Emacs Lisp 1 Updated Nov 5, 2019
  • translate Public

    Forked from pytorch/translate

    Translate - a PyTorch Language Library

    Python BSD 3-Clause "New" or "Revised" License Updated Sep 17, 2019
  • calamari Public

    Forked from Calamari-OCR/calamari

    OCR Engine based on OCRopy and Kraken

    Python GNU General Public License v3.0 Updated Apr 21, 2019
  • NLP word and phrase embeddings, computed via skip-gram with negative sampling

    Python 1 Updated Mar 7, 2017
  • relpron Public

    Code for the article: Laura Rimell, Jean Maillard, Tamara Polajnar and Stephen Clark. 2016. RELPRON: A Relative Clause Composition Data Set for Compositional Distributional Semantics. Computational…

    Python 1 GNU General Public License v3.0 Updated Feb 7, 2017
  • mkweb Public archive

    Minimal static website generator

    Rust Updated Jan 30, 2017
  • bib-parser Public archive

    .bib file parser (BibTeX, BibLaTeX)

    Rust Apache License 2.0 Updated Jan 24, 2017
  • Rust Updated Jun 23, 2016
  • nlip Public

    NLIP python package

    Python GNU General Public License v3.0 Updated Apr 28, 2016