Skip to content
View david-rx's full-sized avatar

Block or report david-rx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 118 2 Updated Dec 13, 2024

Fast and memory-efficient exact attention

Python 16,153 1,528 Updated Mar 7, 2025

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Python 191 16 Updated Feb 26, 2025

Juptyer notebook tutorials on using the data in the AWS pacific-sound registry for ocean soundscape research, education, and the arts

23 6 Updated Jan 8, 2024

BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing

Python 50 10 Updated Mar 11, 2024
Python 23 1 Updated Nov 14, 2024

Contrastive language-audio pretraining for bioacoustics

Python 17 Updated Oct 17, 2023
Jupyter Notebook 954 153 Updated Mar 3, 2025

AnuraSet: A dataset for classification of tropical anurans from passive acoustic monitoring

Python 25 5 Updated Oct 20, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,273 511 Updated May 3, 2024
Python 24 5 Updated Nov 28, 2024

Text-to-Audio/Music Generation

Python 2,381 187 Updated Sep 29, 2024

Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.

HTML 104 14 Updated Sep 9, 2023

The Biologger Ethogram Benchmark

Python 15 2 Updated Feb 24, 2025

Contrast-agnostic segmentation of MRI scans

Python 418 105 Updated Jul 17, 2024

Let us control diffusion models!

Python 31,653 2,834 Updated Feb 25, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,571 230 Updated Dec 9, 2024

Audio generation using diffusion models, in PyTorch.

Python 2,020 169 Updated Jun 12, 2023

A benchmark dataset for data-driven weather forecasting

Jupyter Notebook 738 172 Updated Dec 8, 2023
Jupyter Notebook 89 24 Updated Apr 23, 2024

Reimplementing Karpathy's micrograd for fun

Python 1 Updated Jul 27, 2023

AVES: Animal Vocalization Encoder based on Self-Supervision

Python 97 5 Updated Mar 6, 2025

BEANS: The Benchmark of Animal Sounds

Python 91 8 Updated Oct 16, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,811 28,216 Updated Mar 7, 2025

A modular RL library to fine-tune language models to human preferences

Python 2,287 196 Updated Mar 1, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,884 3,085 Updated Mar 8, 2025

Grounded Language-Image Pre-training

Python 2,344 200 Updated Jan 24, 2024

Code for ALBEF: a new vision-language pre-training method

Python 1,613 203 Updated Sep 20, 2022

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Python 857 106 Updated Oct 30, 2023

Library for clinical NLP with spaCy.

Jupyter Notebook 551 93 Updated Mar 6, 2025
Next