Stars
Take Bible Study notes easily in the popular note-taking app Obsidian, with automatic verse and reference suggestions.
Interactive e-book for Python to C++ transition
A framework to enable multimodal models to operate a computer.
A programming framework for agentic AI 🤖. PyPI: autogen-agentchat; Discord: https://aka.ms/autogen-discord; Office Hour: https://aka.ms/autogen-officehour
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Foundational Models for State-of-the-Art Speech and Text Translation
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Awesome-LLM: a curated list of Large Language Models
A playbook for systematically maximizing the performance of deep learning models.
Personal homepage for Desh Raj
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Robust Speech Recognition via Large-Scale Weak Supervision
This repository contains demos I made with the Transformers library by HuggingFace.
Think DSP: Digital Signal Processing in Python, by Allen B. Downey.
JaesungHuh / VoxSRC2021
Forked from a-nagrani/VoxSRC2020
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021
A curated list of awesome self-supervised methods
Contrastive Predictive Coding for Automatic Speaker Verification