Lists (32)
Sort Name ascending (A-Z)
3d & RGBD
algorithm
Anomaly Detection
applications
change detection
classification & cv
competition
cv learning
develop
foundation models
generative
GPT
image captioning
image processing
inspiration
Mamba
medical image
misc & cv
object detection& cv
others
panoptic segmentation
production & light weight
remote sensing
satellite data
segmentation & cv
self-supervised
semi-supervised learning
SOD
tools
transformer & cv
unsupervised
video
Stars
[IEEE GRSS DFC 2025 Track II] BRIGHT: A globally distributed multimodal dataset for all-weather disaster response
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…
A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue
This repository contains demos I made with the Transformers library by HuggingFace.
This repository contains details of the release of the Prithvi-EO-2.0 foundation model.
Semantic segmentation from multi-source optical data (baseline for the FLAIR#2 challenge)
Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey
🦕 [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
[arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding
PySAL: Python Spatial Analysis Library Meta-Package
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption