-
Shanghai Jiao Tong University
Highlights
- Pro
Stars
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
Rich is a Python library for rich text and beautiful formatting in the terminal.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Generative Models by Stability AI
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
Official inference repo for FLUX.1 models
End-to-End Object Detection with Transformers
pix2tex: Using a ViT to convert images of equations into LaTeX code.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Infinite Photorealistic Worlds using Procedural Generation
Simple examples to introduce PyTorch
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
The devkit of the nuScenes dataset.