We are continuously adding new COVID CT images and we would like to invite the community to contribute COVID CTs as well.
The COVID-CT-Dataset has 349 CT images containing clinical findings of COVID-19. They are in ./Images-processed/CT_COVID.zip
Non-COVID CT scans are in ./Images-processed/CT_NonCOVID.zip
We provide a data split in ./Data-split
The meta information (e.g., patient ID, DOI, image caption) is in COVID-CT-MetaInfo.xlsx
The images are collected from COVID19-related papers from medRxiv, bioRxiv, NEJM, JAMA, Lancet, etc. CTs containing COVID-19 abnormalities are selected by reading the figure captions in the papers. All copyrights of the data belong to the authors and publishers of these papers.
We developed a baseline method for the community to benchmark with. The details are in README for DenseNet_predict.md
- To contribute to our project, please email your data to [email protected] with the corresponding meta information (Patient ID, DOI and Captions).
- We recommend you also extract images from publications or preprints. Make sure the original papers you crawled have different DOIs from those listed in
COVID-CT-MetaInfo.xlsx
. - In
COVID-CT-MetaInfo.xlsx
, images with the form of2020.mm.dd.xxxx
are crawled from bioRxiv or medRxiv. The DOIs for these preprints are10.1101/2020.mm.dd.xxxx
.
Please refer to the preprint for details: COVID-CT-Dataset: A CT Scan Dataset about COVID-19
If you find this dataset and code useful, please cite:
@article{zhao2020COVID-CT-Dataset,
title={COVID-CT-Dataset: a CT scan dataset about COVID-19},
author={Zhao, Jinyu and Zhang, Yichen and He, Xuehai and Xie, Pengtao},
journal={arXiv preprint arXiv:2003.13865},
year={2020}
}