Skip to content
View zhiheLu's full-sized avatar

Block or report zhiheLu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"

Python 9 1 Updated Feb 22, 2025

Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’

Python 237 33 Updated Feb 26, 2025

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

43 1 Updated Feb 13, 2025

[ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation

Python 54 1 Updated Nov 4, 2024
Python 4 Updated Apr 8, 2024

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 905 34 Updated Jan 21, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,740 174 Updated Dec 21, 2024

Official repository for VisionZip (CVPR 2025)

Python 241 10 Updated Feb 27, 2025
51 Updated Dec 12, 2024

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 15,677 1,955 Updated Feb 25, 2025

结构化的Prompts, 用于各种大语言模型

524 52 Updated Jul 20, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,820 1,448 Updated Sep 5, 2024

[ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"

Python 18 Updated Sep 25, 2024

[ICLR2025] Kolmogorov-Arnold Transformer

Python 696 41 Updated Feb 4, 2025

Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"

Python 291 18 Updated Dec 23, 2024

[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"

Jupyter Notebook 115 11 Updated Jul 10, 2024

[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis

Python 501 48 Updated Apr 9, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 11,102 1,110 Updated Mar 1, 2025
Python 13 1 Updated Feb 11, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 9,101 1,327 Updated Feb 7, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,307 1,471 Updated Dec 25, 2024

A More Fair and Comprehensive Comparison between KAN and MLP

Jupyter Notebook 160 11 Updated Aug 17, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 938 32 Updated Jul 31, 2024

Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)

Python 149 7 Updated Dec 18, 2024

Collection of awesome parameter-efficient fine-tuning resources.

512 12 Updated Aug 15, 2024

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

397 25 Updated Sep 26, 2024

A curated list of awesome vision and language resources (still under construction... stay tuned!)

529 41 Updated Nov 4, 2024

Collection of AWESOME vision-language models for vision tasks

2,531 200 Updated Dec 3, 2024
Python 722 59 Updated May 24, 2024
Next