Stars
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Data and code for the Algonauts Project 2025 challenge.
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
Convert negative film to positive images using Python
A one-click python script for film negative inversion with dust or scratch removal
Two self-contained notebooks to perform "weight transformer" from pretrained Transformer model to neuron-astrocyte network.
This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.
Scaling Properties of Diffusion Models For Perceptual Tasks
The official implementation of Mind's eye: image recognition by EEG via multimodal similarity-keeping contrastive learning.
Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers
PyTorch Implementation of AudioLCM (ACM-MM'24): an efficient and high-quality text-to-audio generation with latent consistency model.
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Image reconstruction from visual evoked potentials using latent diffusion
Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
High-Resolution Image Synthesis with Latent Diffusion Models
Invert scroll direction for physical scroll wheels while maintaining "Natural" scrolling for trackpads on macOS
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
✨🔬 A flexible diffraction simulator for exploring and visualizing physical optics.
Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
Long Range Arena for Benchmarking Efficient Transformers
Lot no. 004 | A Rehousing for Kodak Funsaver Lenses