Highlights
- Pro
Lists (11)
Sort Name ascending (A-Z)
Stars
Open-source code for paper "Dataset Distillation"
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
A curated list of awesome papers on dataset distillation and related applications.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Wan: Open and Advanced Large-Scale Video Generative Models
SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
A Sana-like text-to-image model trained from scratch.
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Everything you need to know to build your own RAG application
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Code for paper https://arxiv.org/abs/2409.02958
A lightweight open-source UI component library that provides free Nativewind UI components for react native mobile apps.
[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
A FLAX NNX implementation of GPT2
Everything about the SmolLM2 and SmolVLM family of models
Clean, minimal, accessible reproduction of DeepSeek R1-Zero