- Sun Yat-Sen University; CASIA
- ShenZhen, China
- https://scholar.google.com/citations?user=PZajgHYAAAAJ&hl=zh-CN
Stars
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
UniMD: Towards Unifying Moment retrieval and temporal action Detection
Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…
[ECCV 2024] Tokenize Anything via Prompting
LAVIS - A One-stop Library for Language-Vision Intelligence
[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
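V2rayU, a Mac client based on the v2ray core for bypassing network restrictions, written in Swift. Supports the trojan, vmess, shadowsocks, socks5 and other protocols, as well as subscriptions, QR code and clipboard import, manual configuration, and QR code sharing.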
PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022
Scenic: A Jax Library for Computer Vision Research and Beyond
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
2021-2022 International Conferences in Artificial Intelligence, Machine Learning, Computer Vision, Data Mining, Natural Language Processing and Robotics
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Download papers and supplemental materials from open-access paper website, such as AAAI, AAMAS, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, ICLR, ICML, IJCAI, JMLR, NIPS, RSS, WACV.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".