A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,498 2,573 Updated Dec 22, 2024

shufangxun / LLaVA-MoD

Making LLaVA Tiny via MoE-Knowledge Distillation

Python 72 4 Updated Oct 24, 2024

ZhangAIPI / YOPO_MLLM_Pruning

Pruning the VLLMs

Python 71 3 Updated Dec 9, 2024

datawhalechina / unlock-hf

解锁HuggingFace生态的百般用法

HTML 79 12 Updated Dec 14, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,209 5,702 Updated Sep 18, 2024

LeapLabTHU / GSVA

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 106 Updated Sep 12, 2024

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,345 50 Updated Dec 11, 2024

zamling / PSALM

[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"

Python 201 10 Updated Nov 19, 2024

wangjunchi / LLMSeg

LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning

Python 104 9 Updated Apr 16, 2024

dvlab-research / LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,942 130 Updated Jul 2, 2024

yuhangzang / ContextDET

Contextual Object Detection with Multimodal Large Language Models

Python 208 5 Updated Oct 14, 2024

343gltysprk / ovow

Python 17 1 Updated Dec 3, 2024

zjysteven / lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 208 24 Updated Dec 16, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,038 712 Updated Aug 12, 2024

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,474 91 Updated Dec 11, 2024

megvii-research / Sparsebit

A model compression and acceleration toolbox based on pytorch.

Python 329 40 Updated Jan 12, 2024

luban-agi / Awesome-Domain-LLM

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,308 181 Updated Dec 26, 2023

allenai / open-instruct

Python 2,167 244 Updated Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

donghong1

Block or report donghong1

Stars

hkproj / pytorch-paligemma

hkproj / pytorch-llama

yuanzhoulvpi2017 / zero_nlp

karpathy / build-nanogpt

karpathy / nanoGPT

hulianyuyy / iLLaVA

jingyaogong / minimind-v

meta-llama / llama

lzhxmu / VTW

lm-sys / FastChat

mlabonne / llm-course

casper-hansen / AutoAWQ

NVIDIA / NeMo