Stars
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Deep Learning-based Image Fusion: A Survey
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
A collection of resources on controllable generation with text-to-image diffusion models.
Notes, slides, and assignments for Hung-yi Lee's (李宏毅) 2021 Machine Learning / Deep Learning course
Source code for the paper "Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts" (ICML 2024)
Constructing Balanced Training Samples: A New Perspective on Long-tailed Classification
Convert images of LaTeX math equations into LaTeX code.
PyTorch implementation of the paper "Text-Guided Mixup Towards Long-Tailed Image Categorization" (BMVC 2024)
Learning paths and knowledge summaries for machine learning and deep learning
[CVPR'24] MESA: Matching Everything by Segmenting Anything
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
PyTorch implementation of the paper "RSMamba: Remote Sensing Image Classification with State Space Model"
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World
[CVPR 2024 Oral] Official code for LTGC: Long-Tail Recognition via Leveraging LLMs-driven Generated Content
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Foundation Models for Video Understanding: A Survey
Large World Model -- Modeling Text and Video with Million-Length Context
Turn any computer or edge device into a command center for your computer vision projects.
Implementation of the paper "A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation"
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
PyTorch 3D U-Net convolutional neural network (CNN) designed for medical image segmentation