Skip to content
View amittai's full-sized avatar

Block or report amittai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

These are lists for a variety of languages containing words that are distinctive to each language.

35 4 Updated Apr 5, 2022

is scot injured?

HTML 2 Updated Aug 15, 2023

End-to-End Speech Processing Toolkit

Python 8,649 2,198 Updated Jan 5, 2025

End-to-end framework to build automatic agents (chatbots) for task-oriented dialogs

Python 18 3 Updated Dec 2, 2020

NAACL website

HTML 4 7 Updated Dec 21, 2024

Reference implementations of MLPerf™ training benchmarks

Python 1,630 563 Updated Oct 17, 2024

Reference implementations of MLPerf™ inference benchmarks

Python 1,266 539 Updated Jan 2, 2025

Quick & dirty hack to read AMD Ryzen rapl counters

C 67 10 Updated Sep 4, 2018

Democratizing NLP!

Jupyter Notebook 105 29 Updated Dec 6, 2023

Python port of Moses tokenizer, truecaser and normalizer

Python 489 59 Updated May 26, 2024

[Discontinued] Auryo - Unofficial Soundcloud Desktop App

TypeScript 638 46 Updated Dec 10, 2022

Fast Neural Machine Translation in C++

C++ 1,271 234 Updated Aug 25, 2023

Cynical data selection

Perl 20 7 Updated Jan 16, 2021
Python 122 41 Updated Mar 15, 2017

C++/CUDA toolkit for training sequence and sequence-to-sequence models across multiple GPUs

C++ 186 65 Updated May 15, 2017

Examples and scripts using Blocks

Python 147 93 Updated Aug 22, 2016

A Multilingual and Multilevel Representation Learning Toolkit for NLP

C++ 117 31 Updated Feb 14, 2018

SALM: Suffix Array and its Applications in Empirical Language Processing by Joy

C++ 11 5 Updated Dec 22, 2017

Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better Hypothesis Testing for Statistical Machine Translation: Cont…

Groff 202 39 Updated Feb 25, 2023

GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates th…

C++ 266 82 Updated Mar 31, 2023

A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.

C++ 161 60 Updated May 12, 2021

Simple, fast unsupervised word aligner

C++ 742 160 Updated Jul 19, 2022

Moses, the machine translation system

Roff 1,585 778 Updated Jun 7, 2024

A workflow management system for researchers who heart Unix.

Scala 121 14 Updated Sep 23, 2015
C++ 1 Updated Nov 3, 2014

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

C++ 183 77 Updated May 26, 2020