Tsinghua University
- Shenzhen, Guangdong
Stars
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
[ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
A bibliography and survey of the papers surrounding o1
The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning (WACV 2024)
Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024
Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization (BMVC 2024 Oral ✨)
💫 Models for the spaCy Natural Language Processing (NLP) library
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.
EVA Series: Visual Representation Fantasies from BAAI
Code Release for MViTv2 on Image Recognition.
An open source implementation of CLIP.
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset
Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
[ICLR 2024, Spotlight] Sentence-level Prompts Benefit Composed Image Retrieval
Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Description and pointers of laion datasets
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
High-Resolution Image Synthesis with Latent Diffusion Models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Guidance for postgraduate entrance examination in Department of Computer Science and Technology, Tsinghua University