kundan2510

Kundan Kumar kundan2510

Founder Lyrebird-AI (YC S17). Head of AI, Descript. Previously, phd-student at MILA, UdeM

164 followers · 23 following

Montreal
kundan2510.github.io

Achievements

x2 x2

Achievements

x2 x2

Organizations

Stars

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

629 36 Updated Aug 3, 2024

kyutai-labs / moshi

Python 7,083 552 Updated Dec 20, 2024

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 464 26 Updated Nov 19, 2024

lyuchenyang / Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Python 1,573 128 Updated Jan 1, 2025

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 4,079 409 Updated Jan 2, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 24,917 2,829 Updated Oct 2, 2024

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,468 267 Updated Nov 8, 2024

kuprel / min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,488 257 Updated Nov 21, 2022

tomwetjens / boardgamefiesta

Project to play board games like Great Western Trail and Dominant Species online. Backend code for Quarkus, AWS Lambda, DynamoDB. Front end code: https://github.com/tomwetjens/boardgamefiesta-app

Java 4 1 Updated Sep 22, 2022

nussl / nussl

A flexible source separation library in Python

Python 624 92 Updated Dec 9, 2024

infiloop2 / personal-stock-ticker

Scripts powering https://infiloop.io/personalstockticker

JavaScript 4 1 Updated Jan 23, 2021

pseeth / torch-stft

An STFT/iSTFT for PyTorch.

Python 353 52 Updated Oct 31, 2023

descriptinc / melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 989 215 Updated Aug 28, 2023

swechhasingh / Handwriting-synthesis

Implementation of "Generating Sequences With Recurrent Neural Networks" https://arxiv.org/abs/1308.0850

Jupyter Notebook 232 31 Updated May 1, 2023

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,109 8,842 Updated Aug 14, 2024

lowerquality / gentle

gentle forced aligner

Python 1,478 297 Updated Apr 25, 2024

vickianand / kaggle_cats_vs_dogs

Using Convnet to classify images of cats from those of dogs. :)

Python 1 Updated Feb 17, 2019

mravanelli / pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,374 445 Updated Mar 14, 2022