The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,428 1,297 Updated Dec 25, 2024

test-time-training / ttt-lm-jax

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 379 32 Updated Aug 11, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,816 122 Updated Oct 30, 2024

XavierXiao / Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,643 795 Updated Dec 8, 2022

LLaVA-VL / LLaVA-NeXT

Python 3,193 281 Updated Oct 16, 2024

dengxl0520 / MemSAM

[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.

Python 138 12 Updated Aug 1, 2024

ytongbai / LVM

Python 1,783 54 Updated Jun 28, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,239 178 Updated Dec 31, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,966 2,305 Updated Aug 12, 2024

zhaoyue-zephyrus / bsq-vit

[arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 108 Updated Jun 12, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,432 57 Updated Aug 15, 2024

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 601 31 Updated Sep 27, 2024

syp2ysy / VRP-SAM

[CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"

Python 112 13 Updated Sep 27, 2024

baaivision / tokenize-anything

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 553 23 Updated Dec 11, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,826 347 Updated Aug 7, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,777 3,179 Updated Aug 12, 2024

Xingyi Zhou xingyizhou

Stars

kakaobrain / coyo-dataset

FoundationVision / Infinity

facebookresearch / lingua

rese1f / aurora

baaivision / Emu3

huggingface / diffusers

google-deepmind / neptune

THUDM / CogVideo

mlfoundations / MINT-1T

kvfrans / jax-fid-parallel

Nathan-Li123 / SMOTer

NVlabs / VILA

UX-Decoder / FIND

HengLan / SMOT

facebookresearch / sam2