Skip to content
View skx6's full-sized avatar

Block or report skx6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python 309 16 Updated Nov 4, 2024
Jupyter Notebook 760 71 Updated Aug 7, 2024

[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering

Python 88 2 Updated Dec 2, 2024

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

36 1 Updated Nov 20, 2024

The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Python 228 9 Updated Feb 5, 2024

The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"

Python 297 14 Updated Aug 5, 2024

Official repo of Griffon series including v1(ECCV 2024), v2, and G

Python 119 6 Updated Nov 27, 2024

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Python 26 Updated Oct 21, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 797 38 Updated Nov 23, 2024
Jupyter Notebook 110 3 Updated Jun 7, 2023

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 107 Updated Sep 12, 2024

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 142 4 Updated Aug 5, 2024

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 780 42 Updated Aug 5, 2024

Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".

Python 50 2 Updated Apr 29, 2024

LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning

Python 105 9 Updated Apr 16, 2024

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,346 50 Updated Dec 11, 2024

(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

Jupyter Notebook 45 4 Updated Jul 18, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,945 130 Updated Jul 2, 2024
25 1 Updated Sep 27, 2024
Python 367 14 Updated Jul 29, 2024

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gra…

Python 208 8 Updated Sep 30, 2024

Chat with RS-ChatGPT and get the remote sensing interpretation results and the response!

Python 218 27 Updated Mar 27, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,438 1,421 Updated Sep 5, 2024

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

Jupyter Notebook 330 22 Updated Jun 27, 2024

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 474 38 Updated Nov 28, 2024
Python 38 2 Updated May 2, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,351 460 Updated Dec 19, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,625 4,508 Updated Dec 23, 2024
Next