fatchord

Ollie McCarthy fatchord

ML Engineer at Resemble AI

245 followers · 32 following

@resemble-ai
Barcelona

Achievements

x3 x3

Achievements

x3 x3

Stars

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 689 30 Updated Feb 25, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 5,332 447 Updated Feb 28, 2025

koudounasalkis / voc2vec

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Python 11 Updated Feb 24, 2025

ArcInstitute / evo2

Genome modeling and design across all domains of life

Jupyter Notebook 2,284 204 Updated Feb 24, 2025

MoonshotAI / Moonlight

884 34 Updated Feb 28, 2025

LiuZH-19 / SongGen

Python 169 15 Updated Feb 23, 2025

KyungsuKim42 / tokensynth

The official implementation of TokenSynth (ICASSP 2025)

Python 44 1 Updated Feb 19, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 381 22 Updated Feb 13, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 5,767 656 Updated Feb 23, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 21,802 1,931 Updated Mar 1, 2025

xingchensong / S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 257 33 Updated Jan 15, 2025

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,138 440 Updated Mar 1, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,807 1,386 Updated Feb 1, 2025

hs-oh-prml / DurFlexEVC

Python 69 4 Updated Jan 22, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 187 20 Updated Feb 24, 2025

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 373 27 Updated Feb 14, 2025

facebookresearch / coconut

Training Large Language Model to Reason in a Continuous Latent Space

Python 922 78 Updated Jan 24, 2025

brownvc / R3GAN

Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.

Python 677 22 Updated Jan 23, 2025

jjunak-yun / FLowHigh_code

Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"

Python 54 5 Updated Jan 17, 2025

emo-box / EmoBox

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Python 205 9 Updated Jun 17, 2024

martenlienen / torchode

A parallel ODE solver for PyTorch

Python 248 18 Updated Oct 3, 2024

genmoai / mochi

The best OSS video generation models

Python 2,950 310 Updated Jan 8, 2025

parlance-zz / dualdiffusion

Fourier Dual Diffusion

Python 46 1 Updated Feb 28, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,470 209 Updated Feb 12, 2025

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,332 126 Updated Feb 26, 2025

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

907 58 Updated Feb 28, 2025

r9y9 / speech-trident

Forked from ga642381/speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

3 Updated Oct 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ollie McCarthy fatchord

Achievements

Achievements

Block or report fatchord

Stars

LTH14 / fractalgen

Wan-Video / Wan2.1

koudounasalkis / voc2vec

ArcInstitute / evo2

MoonshotAI / Moonlight

LiuZH-19 / SongGen

KyungsuKim42 / tokensynth

facebookresearch / audiobox-aesthetics

simplescaling / s1

huggingface / open-r1

xingchensong / S3Tokenizer

multimodal-art-projection / YuE

Jiayi-Pan / TinyZero

hs-oh-prml / DurFlexEVC

zhenye234 / X-Codec-2.0

zhenye234 / LLaSA_training

facebookresearch / coconut

brownvc / R3GAN

jjunak-yun / FLowHigh_code

emo-box / EmoBox

martenlienen / torchode

genmoai / mochi

parlance-zz / dualdiffusion

NVlabs / Sana

showlab / computer_use_ootb

ga642381 / speech-trident

r9y9 / speech-trident

HHousen / speaker-change-detection

facebookresearch / SONAR

voideditor / void