Stars
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"
Official repo of Griffon series including v1 (ECCV 2024), v2, and G
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloading the trained model checkpoints, and example notebooks / gra…
Chat with RS-ChatGPT and get the remote sensing interpretation results and the response!
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)