Starred repositories
A curated list of foundation models for vision and language tasks
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
[ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
ImageBind One Embedding Space to Bind Them All
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Collects papers on transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆 25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented.
Official repository of the first-ranking solution for the UPAR2024 Challenge - Track 1.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
A deep learning library for video understanding research.
Code2Prompt is a powerful command-line tool that simplifies the process of providing context to Large Language Models (LLMs) by generating a comprehensive Markdown file containing the content of your codebase.
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
Measures and metrics for image2image tasks. PyTorch.
Fully open reproduction of DeepSeek-R1
Python tool for converting files and office documents to Markdown.