sam-clip

Use Grounding DINO, Segment Anything, and CLIP to label objects in images.

The image below shows segmentation masks for every McDonald's logo found in a photo.

This demo was created by sending the prompt "logo" to Grounding DINO and SAM to get a mask for each detected logo, then classifying each masked region with CLIP against two prompts: "McDonalds" and "Burger King".

[Screenshot: segmentation masks over the detected McDonald's logos]
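
The flow described above (detect and segment with a generic prompt, then pick the best-matching class prompt per region) can be sketched roughly as follows. This is a minimal illustration, not this project's actual code: `label_regions`, `classify`, and the `segment`, `embed_region`, and `embed_text` callables are hypothetical names. In a real setup they would wrap Grounding DINO + SAM and CLIP's image and text encoders. The toy vectors at the bottom are made-up stand-ins for embeddings so the sketch runs without any model weights.

```python
import math
from typing import Any, Callable, List, Mapping, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(region_vec: Vector, prompt_vecs: Mapping[str, Vector]) -> str:
    """Return the class prompt whose embedding is closest to the region's."""
    return max(prompt_vecs, key=lambda label: cosine(region_vec, prompt_vecs[label]))


def label_regions(
    image: Any,
    detect_prompt: str,
    class_prompts: Sequence[str],
    segment: Callable[[Any, str], List[Any]],      # hypothetical: Grounding DINO + SAM
    embed_region: Callable[[Any, Any], Vector],    # hypothetical: CLIP image encoder
    embed_text: Callable[[str], Vector],           # hypothetical: CLIP text encoder
) -> List[Tuple[Any, str]]:
    """Segment with a generic prompt, then assign each mask a class prompt."""
    prompt_vecs = {p: embed_text(p) for p in class_prompts}
    return [
        (mask, classify(embed_region(image, mask), prompt_vecs))
        for mask in segment(image, detect_prompt)
    ]


# Toy stand-ins (assumptions for illustration only, not real model outputs).
fake_text = {"McDonalds": [1.0, 0.0], "Burger King": [0.0, 1.0]}
fake_regions = {"mask_a": [0.9, 0.1], "mask_b": [0.2, 0.8]}

results = label_regions(
    image=None,
    detect_prompt="logo",
    class_prompts=["McDonalds", "Burger King"],
    segment=lambda image, prompt: list(fake_regions),
    embed_region=lambda image, mask: fake_regions[mask],
    embed_text=lambda prompt: fake_text[prompt],
)

# Keep only the masks classified as McDonalds, as in the demo image.
mcdonalds_masks = [mask for mask, label in results if label == "McDonalds"]
```

Passing the models in as callables keeps the classification step independent of any particular Grounding DINO, SAM, or CLIP wrapper, so each stage can be swapped or tested on its own.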

License

This project is licensed under the MIT License.