Combining MixMatch with Transfer Learning

This script combines the Semi-Supervised-Learning method MixMatch with transfer learning to fine-tune a pre-trained Efficient-Net model on a chest x-ray images dataset.

The MixMatch method was proposed by the Google Research team, details here: MixMatch: A Holistic Approach to Semi-Supervised Learning. The official Tensorflow implementation is here and the pytorch implementation I based my project on is from MixMatch-pytorch.

Currently, the script only contains the CIFAR-10 as well as a chest x-ray images dataset.

Requirements

Python 3.6+
PyTorch 1.0
torchvision
tensorboardX
progress
matplotlib
numpy
efficientnet_pytorch

Usage

Download EfficientNet Model

The pip version currently contains a CUDA/CPU bug, so please use github to install the efficient model:

git clone https://github.com/lukemelas/EfficientNet-PyTorch && cd EfficientNet-PyTorch && pip install -e .

Preprocess X-ray dataset

Download the X-ray dataset from the kaggle website. Next, create the dataset and x_ray_images folder:

mkdir -p dataset/x_ray_images

Then extract the zip file into /dataset/x_ray_images and run make_x_ray_dataset.py

Train

Train the EfficientNet model with 250 labeled data of the x-ray dataset, a batch size of 16, and a learning rate of 0.0001. Freeze all layers except the last for the first 5 epochs, then unfreeze all layers for fine-tuning:

python main.py --lr 0.0001 --out x_ray@250 --batch_size 16 --unfreeze 5 --dataset x_ray --n-labeled 250 --model efficient

Train the EfficientNet model in traditional transfer-learning fashion, without MixMatch. Train it on the x-ray dataset with a batch size of 64, and a learning rate of 0.0001. Freeze all layers except the last for the first 5 epochs, then unfreeze all layers for fine-tuning:

main_no_ssl.py --out x_ray@250 --batch_size 64 --lr 0.0001 --unfreeze 5  --dataset x_ray --model efficient

Current Performance

Given computational limitations, I've only been able to run the MixMatch script with the efficientNet-b0 model for around 50 epochs (see To-Do). Adding the SSL method largely improved on the classification performance, increasing from ±30% test set accuracy (transfer learning with all 2800 labels) to ±44.5% test accuracy using only 250 labeled images.

TO-DO

run models for full epoch duration
add requirements.txt

References

@article{berthelot2019mixmatch,
  title={MixMatch: A Holistic Approach to Semi-Supervised Learning},
  author={Berthelot, David and Carlini, Nicholas and Goodfellow, Ian and Papernot, Nicolas and Oliver, Avital and Raffel, Colin},
  journal={arXiv preprint arXiv:1905.02249},
  year={2019}
}
@article{tan2019efficientnet,
  title={EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks},
  author={Tan, Mingxing and Le, Quoc V},
  journal={arXiv preprint arXiv:1905.11946},
  year={2019}
}

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
dataset/x_ray_images		dataset/x_ray_images
models		models
results		results
utils		utils
.gitignore		.gitignore
README.md		README.md
main.py		main.py
main_no_ssl.py		main_no_ssl.py
make_x_ray_dataset.py		make_x_ray_dataset.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Combining MixMatch with Transfer Learning

Requirements

Usage

Download EfficientNet Model

Preprocess X-ray dataset

Train

Current Performance

TO-DO

References

About

Releases

Packages

Languages

daniellutscher/MixMatch-TransferLearning

Folders and files

Latest commit

History

Repository files navigation

Combining MixMatch with Transfer Learning

Requirements

Usage

Download EfficientNet Model

Preprocess X-ray dataset

Train

Current Performance

TO-DO

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages