Skip to content
View mod-cpu's full-sized avatar

Highlights

  • Pro

Block or report mod-cpu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Faster Whisper transcription with CTranslate2

Python 14,611 1,232 Updated Jan 1, 2025

Latex code for making neural networks diagrams

TeX 22,875 2,929 Updated Aug 21, 2023

The goal of this library is to generate more helpful exception messages for matrix algebra expressions for numpy, pytorch, jax, tensorflow, keras, fastai.

Jupyter Notebook 803 40 Updated Apr 7, 2022

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,681 8,904 Updated Aug 14, 2024

🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Python 1,749 248 Updated Dec 20, 2023

✍️ A carefully curated list of NLP paper summaries

1,479 242 Updated Dec 4, 2021

Leaderboard implementations for datasets produced by the Mosaic Team.

Jupyter Notebook 19 5 Updated Jul 6, 2023

Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://amzn.to/3JUgR2L

Jupyter Notebook 2,016 810 Updated Mar 12, 2023

PyTorch Tutorial materials for Duke University +Data Science Initiative

Jupyter Notebook 9 20 Updated Jan 8, 2020

PyTorch Tutorial materials for Duke University +Data Science Initiative

Jupyter Notebook 1 1 Updated Jan 1, 2020

Materials for +DataScience In-Person Learning Experiences (IPLEs)

Jupyter Notebook 28 45 Updated Aug 19, 2020

Tools to download and cleanup Common Crawl data

Python 990 147 Updated Apr 25, 2023

Streamlit — A faster way to build and share data apps.

Python 38,037 3,301 Updated Mar 9, 2025

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,203 103 Updated Oct 1, 2024

An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP.

Python 388 60 Updated Mar 26, 2024

Ongoing research training transformer models at scale

Python 11,679 2,621 Updated Mar 8, 2025

Just draw a bounding box and you can remove the object you want to remove.

Python 2,681 474 Updated Sep 27, 2019

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,184 1,176 Updated May 28, 2023

A mix of GAN implementations including progressive growing

Python 1,623 270 Updated Oct 12, 2021

Code and documentation for my Eyeo Festival talk 2019

Python 28 5 Updated Oct 11, 2020

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 1,147 443 Updated Oct 31, 2022

Dataset of GPT-2 outputs for research in detection, biases, and more

Python 1,965 548 Updated Dec 13, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,105 6,488 Updated Jan 9, 2025

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,554 192 Updated Aug 12, 2020

NLP research experiments, built on PyTorch within the AllenNLP framework.

Python 91 9 Updated Mar 20, 2024

PyTorch deep learning models for document classification

Python 594 126 Updated Jul 21, 2023

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,100 2,112 Updated Mar 7, 2025

Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.

Python 54 20 Updated May 23, 2021
Next