Segment Anything has led to a new breakthrough in the field of Computer Vision (CV), and this repository will continue to track and summarize the research progress of Segment Anything in various fields, including Papers/Projects, etc.
If you find this repository helpful, please consider Stars ⭐ or Sharing ⬆️. Thanks.
- 2923.5.4: Add SEEM paper.
- 2023.4.18: Add two nice job Inpainting Anything and SAM-Track.
- 2023.4.12: Add some presentations.
- 2023.4.12: An initial version of recent papers or projects.
Title | Presentation | Paper page | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|---|
CLIP | arXiv | Colab | Code | OpenAI | Contrastive Language-Image Pre-Training. | |
OWL-ViT | ECCV2022 | - | Code | A open-vocabulary object detector. | ||
OvSeg | CVPR2023 | Project | Code | META | Segment an image into semantic regions according to text descriptions. | |
Painter | CVPR2023 | - | Code | BAAI | A Generalist Painter for In-Context Visual Learning. | |
Grounding DINO | arXiv | Colab &Huggingface | Code | IDEA | A stronger open-set object detector | |
Segment Anything | arXiv | Project page | Code | Meta | A stronger Large model which can be used to generate masks for all objects in an image. | |
SegGPT | arXiv | Project page | Code | BAAI | Segmenting Everything In Context based on Painter. | |
Segment Everything Everywhere All at Once (SEEM) | arXiv | Project Page | Code | Microsoft | Semantic Segmentation with various prompt types. |
Title | Presentation | Paper page | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|---|
CLIP_Surgery | arXiv | Demo | Code | HKUST | This work about SAM based on CLIP's explainability to achieve text to mask without manual points. | |
Segment Anything Model (SAM) for Digital Pathology | arXiv | - | - | - | SAM + Tumor segmentation/Tissue segmentation/Cell nuclei segmentation. | |
SAMCOD | - | arXiv | - | Code | - | SAM + Camouflaged object detection (COD) task. |
Segment Anything Is Not Always Perfect | arXiv | - | - | Samsung | This paper analyze and discuss the benefits and limitations of SAM. | |
Inpaint Anything | arXiv | - | Code | USTC & EIT | SAM + Inpainting, which is able to remove the object smoothly. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
Grounded Segment Anything | Colab & Huggingface | Code | - | Combining Grounding DINO and Segment Anything | |
GroundedSAM Anomaly Detection | - | Code | - | Grounding DINO + SAM to segment any anomaly. | |
Semantic Segment Anything | - | Code | Fudan | A dense category annotation engine. | |
Magic Copy | - | Code | - | Magic Copy is a Chrome extension that uses SAM. | |
Segment Anything with Clip | - | Code | - | SAM + CLIP | |
SAM-Clip | - | Code | - | SAM + CLIP. | |
Prompt Segment Anything | - | Code | - | SAM + Zero-shot Instance Segmentation. | |
RefSAM | - | - | Code | - | Evaluating the basic performance of SAM on the Referring Image Segmementation task. |
SAM-RBox | - | Code | - | An implementation of SAM for generating rotated bounding boxes with MMRotate. | |
Open Vocabulary Segment Anything | - | Code | - | An interesting demo by combining OWL-ViT of Google and SAM. | |
SegDrawer | - | Code | - | Simple static web-based mask drawer, supporting semantic drawing with SAM. | |
Annotation Anything Pipeline | YoutubeDemo | Code | - | SAM + Labelme + LabelImg + Auto-labeling. | |
AnyLabel | - | Code | - | GPT + SAM. | |
Roboflow Annotate | App | Blog | Roboflow | SAM-assisted labeling for training computer vision models. | |
SALT | - | Code | - | A tool that adds a basic interface for image labeling and saves the generated masks in COCO format.] | |
SAM U Specify | - | Code | - | Use SAM and CLIP model to segment unique instances you want.] | |
SAM web UI | App | Code | - | This is a new web interface for the SAM. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
MetaSeg | HuggingFace | Code | - | SAM + Video. | |
SAM-Track | Video | YoutubeDemo | Code | Zhejiang University | This project, which is based on SAM and DeAOT, focuses on segmenting and tracking objects in videos. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
SAM in Napari | Video | - | Code | - | Segment anything with Napari integration of SAM. |
SAM Medical Imaging | - | Code | - | SAM for Medical Imaging. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
SegAnythingPro | - | Code | - | SAM + Inpainting/Replacing. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
3D-Box | - | Code | - | SAM is extended to 3D perception by combining it with VoxelNeXt. | |
Anything 3DNovel View | - | Code | - | SAM + Zero 1-to-3. | |
Any 3DFace | - | Code | - | SAM + HRN. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
Edit Anything | - | Code | - | Edit and Generate Anything in an image. | |
Image Edit Anything | - | Code | - | Stable Diffusion + SAM. | |
SAM for Stable Diffusion Webui | - | Code | - | Stable Diffusion + SAM. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
Earth Observation Tools | Colab | Code | - | SAM + Remote Sensing. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
Moving Object Detection | - | Code | - | SAM + Moving Object Detection. |
Title | Presentation | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|
OCR-SAM | Blog | Code | - | Optical Character Recognition with SAM. |
Some of the presentations in this repository are borrowed from the original author, and we are very thankful for their contribution.
This project is released under the MIT license. Please see the LICENSE file for more information.