Skip to content
View cysin's full-sized avatar

Block or report cysin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Extract partitions from Android OTA files.

Rust 168 19 Updated Feb 26, 2025

A curated list of foundation models for vision and language tasks

951 42 Updated Feb 20, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 3,571 324 Updated Mar 6, 2025

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,609 243 Updated Mar 4, 2025

[ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Python 415 17 Updated Feb 18, 2025

ImageBind One Embedding Space to Bind Them All

Python 8,536 796 Updated Jul 31, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 440 18 Updated Apr 8, 2024

[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation

Python 367 17 Updated Sep 19, 2023

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,324 3,150 Updated Mar 6, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 897 38 Updated Jan 21, 2025

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,523 420 Updated Aug 19, 2024

CoRL 2024

Python 383 49 Updated Oct 29, 2024

Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)

1,324 114 Updated Jul 4, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,508 127 Updated Jul 19, 2024

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

Python 62 8 Updated Aug 15, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,773 173 Updated Dec 21, 2024

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemen…

Python 1,459 131 Updated Feb 28, 2025

Official repository of the first-ranking solution for the UPAR2024 Challenge - Track 1.

Python 22 7 Updated Dec 26, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,861 1,451 Updated Sep 5, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,720 102 Updated Feb 27, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,407 2,098 Updated Jul 24, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,419 591 Updated Mar 4, 2025

A deep learning library for video understanding research.

Python 3,399 418 Updated Jan 25, 2025

Code review assistant powered by LLM

Python 112 18 Updated Jul 31, 2024

Code2Prompt is a powerful command-line tool that simplifies the process of providing context to Large Language Models (LLMs) by generating a comprehensive Markdown file containing the content of yo…

Python 800 56 Updated Jan 28, 2025

A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

Rust 4,955 287 Updated Mar 6, 2025

Measures and metrics for image2image tasks. PyTorch.

Python 1,452 123 Updated May 12, 2024

Fully open reproduction of DeepSeek-R1

Python 22,259 1,996 Updated Mar 6, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,169 775 Updated Mar 1, 2025

Python tool for converting files and office documents to Markdown.

Python 39,590 1,840 Updated Mar 6, 2025
Next