Stars
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.
🔥CVPR 2025 Multimodal Large Language Models Paper List
LaTeX template for various conferences, as well as wise-man's Overleaf (Overleaf is terrible!)
Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Paper)
[CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
[AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
A curated list of recent diffusion models for video generation, editing, and various other applications.
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Hung-yi Lee's Spring 2021/2022/2023 Machine Learning course slides and assignments
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
ST-SSL (STSSL): Spatio-Temporal Self-Supervised Learning for Traffic Flow Forecasting/Prediction
MomentDiff: Generative Video Moment Retrieval from Random to Real (NeurIPS 2023)
Official pytorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Understanding Deep Learning - Simon J.D. Prince
TPAMI: Frequency-aware Feature Fusion for Dense Image Prediction
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
My self-made Guangzhou University LaTeX report templates, covering graduation projects, course projects, graduation theses, and more 🎈
Pytorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.
3D Medical Image Segmentation Models: a small framework integrating various medical image segmentation models, mainly 3D, continuously updated...
Pytorch implementation for Semantic Segmentation with multiple models (Deeplabv3, Deeplabv3_plus, PSPNet, UNet, UNet_AutoEncoder, UNet_nested, R2AttUNet, AttentionUNet, RecurrentUNet, SEGNet, CENet, …