Skip to content
View JaejinCho's full-sized avatar

Block or report JaejinCho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python toolkit for speech processing

Python 68 21 Updated Nov 20, 2024

Take Bible Study notes easily in the popular note-taking app Obsidian, with automatic verse and reference suggestions.

TypeScript 243 48 Updated Dec 14, 2024

Interactive e-book for Python to C++ transition

Python 44 48 Updated Sep 13, 2024

A framework to enable multimodal models to operate a computer.

Python 9,043 1,218 Updated Dec 19, 2024

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Jupyter Notebook 36,802 5,326 Updated Jan 5, 2025

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 400 38 Updated Apr 24, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,178 27,455 Updated Jan 5, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,057 1,080 Updated Nov 14, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,254 2,198 Updated Nov 11, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,595 2,579 Updated Jan 5, 2025

Awesome-LLM: a curated list of Large Language Model

20,332 1,656 Updated Dec 31, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,701 2,290 Updated Jun 18, 2024

Personal homepage for Desh Raj

HTML 12 29 Updated Nov 12, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,658 228 Updated Oct 16, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,807 8,821 Updated Jan 4, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,736 1,480 Updated Oct 21, 2024

Think DSP: Digital Signal Processing in Python, by Allen B. Downey.

Jupyter Notebook 4,012 3,251 Updated Nov 15, 2024
Python 211 54 Updated Sep 25, 2024

Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021

Perl 18 Updated Jul 21, 2021

A curated list of awesome self-supervised methods

6,196 830 Updated Jul 3, 2024
Python 4 1 Updated May 8, 2020

Contrastive Predictive Coding for Automatic Speaker Verification

Python 483 100 Updated Oct 29, 2019

End-to-End Speech Processing Toolkit

Python 8,649 2,198 Updated Jan 5, 2025