Stars
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
The code for "Label-efficient Segmentation via Affinity Propagation". [NeurIPS2023]
Awesome box-supervised instance segmentation papers.
A toolbox for box-supervised instance segmentation.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
The code for "Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport", ICCV2023