Improving Zero-Shot Models with Label Distribution Priors
Jonathan Kahana, Niv Cohen, Yedid Hoshen
Official PyTorch Implementation
Abstract: Labeling large image datasets with attributes such as facial age or object type is tedious and sometimes infeasible. Supervised machine learning methods provide a highly accurate solution, but require manual labels which are often unavailable. Zero-shot models (e.g., CLIP) do not require manual labels but are not as accurate as supervised ones, particularly when the attribute is numeric. We propose a new approach, CLIPPR (CLIP with Priors), which adapts zero-shot models for regression and classification on unlabelled datasets. Our method does not use any annotated images. Instead, we assume a prior over the label distribution in the dataset. We then train an adapter network on top of CLIP under two competing objectives: (i) minimal change of predictions from the original CLIP model; (ii) minimal distance between the predicted and prior distributions of labels. Additionally, we present a novel approach for selecting prompts for Vision & Language models using a distributional prior. Our method is effective and presents a significant improvement over the original model. We demonstrate an improvement of 28% in mean absolute error on the UTK age regression task. We also present promising results for classification benchmarks, improving the classification accuracy on the ImageNet dataset by 2.83%, without using any labels.
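To make the two competing objectives concrete, below is a minimal sketch of the training loss for the classification case. It is an illustration under stated assumptions, not the repository's actual code: the function and argument names (`clippr_loss`, `lambda_prior`, etc.) are hypothetical, and both distances are written as KL divergences here while the paper's actual choice of distributional distance may differ.

```python
import torch
import torch.nn.functional as F

def clippr_loss(
    adapter: torch.nn.Module,
    image_features: torch.Tensor,  # frozen CLIP image embeddings, shape (B, D)
    clip_logits: torch.Tensor,     # frozen zero-shot CLIP logits, shape (B, C)
    prior_probs: torch.Tensor,     # assumed prior over the C labels, sums to 1
    lambda_prior: float = 1.0,     # hypothetical weighting between the objectives
) -> torch.Tensor:
    logits = adapter(image_features)  # adapter predictions, shape (B, C)

    # Objective (i): keep predictions close to the original zero-shot CLIP
    # predictions (here, a KL divergence to CLIP's softmax outputs).
    stay_close = F.kl_div(
        F.log_softmax(logits, dim=-1),
        F.softmax(clip_logits, dim=-1),
        reduction="batchmean",
    )

    # Objective (ii): make the batch-averaged predicted label distribution
    # match the assumed prior (again via KL for illustration).
    pred_marginal = F.softmax(logits, dim=-1).mean(dim=0)
    match_prior = F.kl_div(pred_marginal.log(), prior_probs, reduction="sum")

    return stay_close + lambda_prior * match_prior
```

The prompt-selection idea from the abstract can be read analogously: candidate prompts would be scored by how closely the label distribution they induce on the unlabelled set matches the prior.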
This repository is the official PyTorch implementation of Improving Zero-Shot Models with Label Distribution Priors.
You need to download the datasets first. Download each one to a separate directory under the same parent directory.
NOTE: Please update the DATA_PATH parameter in dataset.py and in scripts/prepare_stanford_cars.py to point to the parent directory of the datasets.
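For illustration, assuming a layout like the following (the path and dataset folder names are hypothetical), DATA_PATH would point to the shared parent directory:

```python
# Hypothetical directory layout; actual folder names may differ:
#
#   /data/clippr_datasets/
#       utk/
#       imagenet/
#       stanford_cars/
#
# In dataset.py and scripts/prepare_stanford_cars.py:
DATA_PATH = "/data/clippr_datasets"  # the shared parent directory
```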
For the ImageNet dataset, please run the pre-processing script found here. (ImageNet dataset coming soon...)
For the Stanford Cars dataset, please run our pre-processing script: scripts/prepare_stanford_cars.py.
We provide training & evaluation scripts for CLIPPR, Zero-Shot CLIP (evaluation only), and a supervised adapter on top of CLIP, for each of the evaluated datasets.
The scripts can be found in the bash_scripts folder, sorted by dataset.
NOTE: Inside the bash_scripts/utk folder you can also find the scripts for our ablation studies.
If you find this useful, please cite our paper: