Skip to content
View kelechi-c's full-sized avatar
🎨
model bending
🎨
model bending

Highlights

  • Pro

Block or report kelechi-c

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source code for paper "Dataset Distillation"

Python 787 116 Updated May 20, 2022

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 197 29 Updated Mar 5, 2025

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,552 143 Updated Mar 5, 2025

More relighting!

Python 7,641 469 Updated Feb 20, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,871 5,741 Updated Mar 6, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 7,272 715 Updated Mar 6, 2025

SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Python 171 7 Updated Mar 5, 2025

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 457 24 Updated Mar 1, 2025

A Sana-like text-to-image model trained from scratch.

Jupyter Notebook 3 Updated Mar 5, 2025

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,231 1,092 Updated May 11, 2024

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,255 454 Updated Mar 1, 2025

Everything you need to know to build your own RAG application

Jupyter Notebook 2,541 245 Updated Feb 25, 2025

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Jupyter Notebook 336 26 Updated May 30, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 131,395 10,787 Updated Mar 6, 2025

multimodal experiments/adaptation

Python 1 Updated Mar 2, 2025

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python 132 8 Updated Dec 23, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,407 1,646 Updated Feb 26, 2025

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,566 396 Updated Apr 3, 2024

Code for paper https://arxiv.org/abs/2409.02958

Jupyter Notebook 3 1 Updated Sep 10, 2024

A lightweight open-source UI component library that provides free Nativewind UI components for react native mobile apps.

TypeScript 136 4 Updated Mar 6, 2025

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Python 185 9 Updated Sep 30, 2024

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,208 778 Updated Oct 7, 2024

my playground for diffusion/image gen

Python 2 Updated Feb 8, 2025

My take on Flow Matching

Jupyter Notebook 38 3 Updated Jan 11, 2025
Jupyter Notebook 176 52 Updated Feb 28, 2025

A FLAX NNX implementation of GPT2

Jupyter Notebook 1 Updated Aug 16, 2024

Simple diffusion in Flax NNX.

Python 1 Updated Oct 20, 2024

A transformer model in flax.nnx

Python 4 Updated Dec 20, 2024

Everything about the SmolLM2 and SmolVLM family of models

Python 1,980 111 Updated Feb 20, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,992 1,402 Updated Feb 1, 2025
Next