
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 10,814 1,232 Updated Feb 26, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 5,945 507 Updated Feb 21, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,934 732 Updated Mar 5, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,813 442 Updated Jan 12, 2025

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

1,151 80 Updated Mar 5, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites

3,428 273 Updated Nov 1, 2024

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 437 16 Updated Oct 29, 2024

High-resolution models for human tasks.

Python 4,856 288 Updated Nov 18, 2024

A new large-scale person ReID dataset: LSMS

1 Updated Jun 17, 2024

⛹️ Pytorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-id / vehicle re-id baseline. Tutorial 👉 https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Python 4,215 1,018 Updated Feb 24, 2025

[CVPR 2024] Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification

Python 98 6 Updated Oct 24, 2024

The official code of "PLIP: Language-Image Pre-training for Person Representation Learning"

Python 107 10 Updated Dec 17, 2024
Python 504 34 Updated Jul 29, 2024

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 786 51 Updated Jul 30, 2024

[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Python 40 4 Updated Mar 25, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,220 59 Updated Nov 22, 2024

A comprehensive list of awesome contrastive self-supervised learning papers.

1,254 127 Updated Sep 10, 2024
Python 52 3 Updated Oct 5, 2022

Collection of awesome parameter-efficient fine-tuning resources.

518 12 Updated Aug 15, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,624 1,769 Updated Mar 5, 2025

Collection of AWESOME vision-language models for vision tasks

2,541 200 Updated Dec 3, 2024

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Python 714 57 Updated Jul 24, 2023

Multimodal Models in Real World

Jupyter Notebook 441 20 Updated Feb 24, 2025

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,085 95 Updated Sep 2, 2023

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Python 231 8 Updated Jan 17, 2024

Paper collection for cloth-variation-based person re-identification

123 18 Updated Oct 6, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,095 5,277 Updated Mar 5, 2025

[ECCV2022] PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification

Python 62 16 Updated Jul 8, 2022

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications

681 38 Updated Feb 23, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,116 906 Updated Mar 5, 2025