
DECIDER

Paper | Website

Abstract

In this work, we introduce DECIDER (Debiasing Classifiers to Identify Errors Reliably), a novel method for detecting failures in image classification models. DECIDER uses large language models (LLMs) to identify key task-relevant attributes and vision-language models (VLMs) to align visual features to these attributes, creating a "debiased" version of the classifier. Potential failures are detected by measuring the disagreement between the original and debiased models. DECIDER not only identifies likely failures but also provides interpretable explanations through an attribute-ablation strategy. Across various benchmarks, DECIDER outperforms existing methods in failure detection.
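As a rough illustration of the detection step, a failure score can be computed from the disagreement between the predictive distributions of the original and debiased models. The sketch below is illustrative only: it assumes softmax outputs and uses a symmetric KL divergence as the disagreement measure, which is not necessarily the exact score used in this repository.

# Illustrative sketch (not the repository's implementation): flag likely
# failures by scoring disagreement between the original classifier and the
# debiased classifier described in the abstract.
import torch
import torch.nn.functional as F

def failure_scores(original_logits: torch.Tensor, debiased_logits: torch.Tensor) -> torch.Tensor:
    """Higher score = larger disagreement = more likely failure."""
    p_orig = F.softmax(original_logits, dim=-1)
    p_deb = F.softmax(debiased_logits, dim=-1)
    # Symmetric KL divergence as one possible disagreement measure (assumption).
    log_orig = p_orig.clamp_min(1e-8).log()
    log_deb = p_deb.clamp_min(1e-8).log()
    kl_od = (p_orig * (log_orig - log_deb)).sum(-1)
    kl_do = (p_deb * (log_deb - log_orig)).sum(-1)
    return 0.5 * (kl_od + kl_do)
    # Samples whose score exceeds a validation-tuned threshold are flagged as likely failures.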

Architecture

[Architecture diagram of DECIDER]

Folder Setup

├── data/                  # Folder containing datasets
├── logs/                  # Folder where logs and results are saved
├── models/                # Folder to store model files
├── scripts/               # Folder containing bash scripts
├── train_classifier.py    # Script to train the classifier
├── train_failure_evaluator.py    # Script to train the failure evaluator
└── failure_eval.py        # Script to run failure evaluation

Requirements

The project dependencies can be installed using the following command:

pip install -r requirements.txt

Instructions to Run

1. Train the Image Classifier

The train_classifier.py script trains the base image classifier. The dataset, model, and hyperparameters can be customized through command-line arguments:

python train_classifier.py --dataset_name <DATASET_NAME> --data_path <DATA_PATH> [other optional arguments]

Example Command:

python train_classifier.py \
    --dataset_name "cifar10" \
    --data_path "./data" \
    --image_size 224 \
    --batch_size 64 \
    --num_epochs 50 \
    --classifier_model "resnet18" \
    --use_pretrained
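For reference, here is a minimal sketch of what these flags roughly correspond to, assuming a torchvision backbone; the actual script may construct the model and data pipeline differently.

# Illustrative only: an ImageNet-pretrained ResNet-18 with its classification
# head replaced for the 10 CIFAR-10 classes and 224x224 inputs.
import torch.nn as nn
from torchvision import models, transforms

model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)  # --use_pretrained
model.fc = nn.Linear(model.fc.in_features, 10)                          # CIFAR-10 head

train_transform = transforms.Compose([
    transforms.Resize((224, 224)),  # --image_size 224
    transforms.ToTensor(),
])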

2. Train the Failure Evaluator

After training the classifier, you can train the failure evaluator using the train_failure_evaluator.py script.

python train_failure_evaluator.py --dataset_name <DATASET_NAME> --data_dir <DATA_PATH> --classifier_name <CLASSIFIER_NAME> [other optional arguments]

Example Command:

python train_failure_evaluator.py \
    --dataset_name "cifar100" \
    --data_dir "./data" \
    --classifier_name "resnet50" \
    --num_epochs 100 \
    --learning_rate 1e-3 \
    --scheduler "cosine" \
    --save_dir "./logs"
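The failure evaluator builds on the attribute-alignment idea described in the abstract: an LLM proposes task-relevant attributes and a VLM aligns image features to them. The sketch below illustrates only that idea; it assumes OpenAI's clip package and a hypothetical attribute list, whereas the actual attributes and training objective are defined by train_failure_evaluator.py.

# Illustrative sketch of attribute alignment with a VLM (assumed CLIP setup).
import clip
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
vlm, preprocess = clip.load("ViT-B/32", device=device)

attributes = ["has whiskers", "has pointed ears", "furry texture"]  # hypothetical LLM output for "cat"
text_tokens = clip.tokenize(attributes).to(device)
with torch.no_grad():
    text_feats = vlm.encode_text(text_tokens)
    text_feats = text_feats / text_feats.norm(dim=-1, keepdim=True)

def attribute_scores(image):
    """Cosine similarity between an image and each task-relevant attribute."""
    with torch.no_grad():
        img_feat = vlm.encode_image(preprocess(image).unsqueeze(0).to(device))
        img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
    return (img_feat @ text_feats.T).squeeze(0)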

3. Run Failure Evaluation

After training both the classifier and the failure evaluator, you can evaluate failures using the failure_eval.py script.

python failure_eval.py --dataset_name <DATASET_NAME> --data_dir <DATA_PATH> --method <METHOD> [other optional arguments]

Example Command:

python failure_eval.py \
    --dataset_name "cifar100" \
    --data_dir "./data" \
    --method "PIM" \
    --score "msp" \
    --eval_dataset "cifar100" \
    --filename "cifar100c.log" \
    --cifar100c_corruption "gaussian_blur" \
    --severity 5
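A common way to quantify failure-detection quality is to treat misclassified samples as the positive class and measure how well the failure score ranks them. The sketch below assumes scikit-learn's AUROC; the metrics actually reported by failure_eval.py may differ.

# Illustrative evaluation sketch: AUROC of a failure score against ground-truth correctness.
import numpy as np
from sklearn.metrics import roc_auc_score

def failure_detection_auroc(scores: np.ndarray, preds: np.ndarray, labels: np.ndarray) -> float:
    is_wrong = (preds != labels).astype(int)  # 1 = classifier failure
    return roc_auc_score(is_wrong, scores)    # higher score should indicate failure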

Notes

  • Modify the bash scripts as needed for your specific environment and dataset paths.
  • The --data_path / --data_dir argument should point to the directory where your datasets are stored.
  • Logs and results will be saved in the logs/ folder unless a different directory is specified.
