Stars
Models and examples built with TensorFlow
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A high-throughput and memory-efficient inference and serving engine for LLMs
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
OpenMMLab Detection Toolbox and Benchmark
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Generative Models by Stability AI
Open-Sora: Democratizing Efficient Video Production for All
Code for the paper "Language Models are Unsupervised Multitask Learners"
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official inference repo for FLUX.1 models
Fast and memory-efficient exact attention
StyleGAN - Official TensorFlow Implementation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Ongoing research training transformer models at scale
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"