Skip to content
View Listeningx's full-sized avatar
  • Tsinghua University
  • Shen Zhen, Guangdong
  • 05:10 (UTC -12:00)

Block or report Listeningx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[BMVC 2023] Zero-shot Composed Text-Image Retrieval

Jupyter Notebook 51 1 Updated Nov 26, 2024

[ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"

Python 34 3 Updated Dec 23, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,879 347 Updated Aug 7, 2024

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

Python 556 114 Updated May 18, 2023

A bibliography and survey of the papers surrounding o1

TeX 1,138 49 Updated Nov 16, 2024

The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning (WACV 2024)

Python 28 3 Updated Feb 7, 2024

Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024

Python 18 1 Updated May 30, 2024

Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization (BMVC 2024 Oral ✨)

Python 16 Updated Sep 11, 2024

💫 Models for the spaCy Natural Language Processing (NLP) library

Python 1,692 302 Updated Sep 30, 2024

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Python 264 18 Updated Jan 21, 2025

EVA Series: Visual Representation Fantasies from BAAI

Python 2,410 176 Updated Aug 1, 2024

Code Release for MViTv2 on Image Recognition.

Python 416 47 Updated Nov 26, 2024

An open source implementation of CLIP.

Python 10,949 1,035 Updated Jan 4, 2025

[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion

Python 166 9 Updated May 7, 2024

Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)

Python 124 7 Updated Jul 25, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,283 3,435 Updated Jul 23, 2024

[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset

Python 62 2 Updated Aug 14, 2024

Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

107 3 Updated Dec 11, 2024

【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval

Python 74 5 Updated Apr 16, 2024

清华主题PPT模板

1,204 85 Updated Jan 2, 2025

Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval

Python 18 Updated Aug 9, 2024

[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features

Python 172 17 Updated Sep 5, 2023

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 31,194 7,572 Updated Jan 14, 2025

CIFAR-100 dataset by classes folder

10 1 Updated Nov 7, 2024

Description and pointers of laion datasets

HTML 243 10 Updated Nov 5, 2022

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,801 1,086 Updated Oct 9, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,038 5,138 Updated Oct 10, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,029 5,242 Updated Jun 27, 2024

清华大学计算机系考研攻略 Guidance for postgraduate entrance examination in Department of Computer Science and Technology, Tsinghua University

HTML 2,652 525 Updated Oct 8, 2024
Next