rwesterman

Ryan Westerman rwesterman

2 followers · 5 following

Stars

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

793 47 Updated Dec 21, 2024

bytedance / SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,095 85 Updated Dec 12, 2024

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

624 35 Updated Aug 3, 2024

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 1,647 116 Updated Dec 12, 2024

revdotcom / reverb

Open source inference code for Rev's model

Python 348 24 Updated Dec 19, 2024

TorchJD / torchjd

Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).

Python 168 1 Updated Dec 22, 2024

huangruizhe / ConEC

10 Updated Jun 17, 2024

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,463 44,634 Updated Dec 23, 2024

corca-ai / EVAL

EVAL(Elastic Versatile Agent with Langchain) will execute all your requests. Just like an eval method!

Python 869 81 Updated May 30, 2023

e2b-dev / E2B

Secure open source cloud runtime for AI apps & AI agents

HTML 7,160 473 Updated Dec 23, 2024

nttcslab-sp / EEND-vector-clustering

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Python 73 17 Updated Oct 18, 2022

microsoft / UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Python 440 74 Updated Apr 5, 2024

RunLLM / aqueduct

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

Go 521 18 Updated Jun 7, 2023

bitsandbytes-foundation / bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python 6,444 639 Updated Dec 23, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,265 8,745 Updated Dec 1, 2024

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 144,802 27,197 Updated Dec 23, 2024

nicklansley / nick-stable-diffusion

Nick's Docker-based version of Stable Diffusion

Jupyter Notebook 55 5 Updated Dec 26, 2022

iral-lab / gold

Multimodal grounded language dataset

10 Updated Dec 14, 2021

google-research / perceiver-ar

Python 232 20 Updated Dec 23, 2024

invoke-ai / InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 23,979 2,460 Updated Dec 23, 2024

google / flax

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,210 652 Updated Dec 23, 2024

nryant / dscore

Diarization scoring tools.

Python 227 43 Updated Mar 28, 2023

hitachi-speech / EEND

End-to-End Neural Diarization

Python 382 59 Updated Aug 30, 2021

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,508 2,575 Updated Dec 24, 2024

grantjenks / python-sortedcontainers

Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set

Python 3,594 204 Updated Mar 8, 2024

Xflick / EEND_PyTorch

A PyTorch implementation of End-to-End Neural Diarization

Python 98 16 Updated Jun 19, 2023

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 518 73 Updated Sep 25, 2024

revdotcom / words2num

Convert words to numbers

Python 20 10 Updated Apr 13, 2022

revdotcom / speech-datasets

Various speech datasets made available to the public

Jupyter Notebook 107 13 Updated Dec 13, 2024

facebookresearch / AugLy

A data augmentations library for audio, image, text, and video.

Python 4,979 302 Updated Nov 21, 2024
