Skip to content
View kundan2510's full-sized avatar

Organizations

@lyrebird-ai

Block or report kundan2510

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

629 36 Updated Aug 3, 2024
Python 7,083 552 Updated Dec 20, 2024

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 464 26 Updated Nov 19, 2024

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Python 1,573 128 Updated Jan 1, 2025

Open Source framework for voice and multimodal conversational AI

Python 4,079 409 Updated Jan 2, 2025

LLM training in simple, raw C/CUDA

Cuda 24,917 2,829 Updated Oct 2, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,468 267 Updated Nov 8, 2024

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,488 257 Updated Nov 21, 2022

Project to play board games like Great Western Trail and Dominant Species online. Backend code for Quarkus, AWS Lambda, DynamoDB. Front end code: https://github.com/tomwetjens/boardgamefiesta-app

Java 4 1 Updated Sep 22, 2022

A flexible source separation library in Python

Python 624 92 Updated Dec 9, 2024

Scripts powering https://infiloop.io/personalstockticker

JavaScript 4 1 Updated Jan 23, 2021

An STFT/iSTFT for PyTorch.

Python 353 52 Updated Oct 31, 2023

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 989 215 Updated Aug 28, 2023

Implementation of "Generating Sequences With Recurrent Neural Networks" https://arxiv.org/abs/1308.0850

Jupyter Notebook 232 31 Updated May 1, 2023

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,109 8,842 Updated Aug 14, 2024

gentle forced aligner

Python 1,478 297 Updated Apr 25, 2024

Using Convnet to classify images of cats from those of dogs. :)

Python 1 Updated Feb 17, 2019

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,374 445 Updated Mar 14, 2022

Code for replication of the paper "The relativistic discriminator: a key element missing from standard GAN"

Python 721 104 Updated Mar 12, 2020

Recurrent neural network for audio noise reduction

C 4,231 913 Updated Jan 1, 2025

Send voicified messages on Slack using your vocal avatar!

JavaScript 33 11 Updated Oct 10, 2018

Minimalist Attention-based RNN for NMT (tested on Multi30k)

Python 5 3 Updated May 17, 2018

A domain specific language to express machine learning workloads.

C++ 1,759 211 Updated Apr 28, 2023

PyTorch based Deep Learning Toolbox

Python 204 14 Updated Jul 27, 2018

Basic DQN implementation

Python 221 71 Updated Dec 28, 2017

Decoupled Neural Interfaces using Synthetic Gradients for PyTorch

Python 236 36 Updated Jan 12, 2019

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Python 78 31 Updated Oct 14, 2019

A repository of state of the art Deep Learning modules implemented in Tensorflow

Python 5 Updated Aug 18, 2017
Next