Available datasets

Full descriptions of the datasets and their studies of origin can be found in the manuscript. The links below give access to the processed datasets.

Pretraining data

To download data from CellXGene and build the pretraining dataset, go to the cellxgene folder and follow the instructions there.

Datasets for cell type annotation

  • Multiple Sclerosis (M.S.) dataset: link

  • Myeloid (Mye.) dataset: link

  • hPancreas dataset: link

Datasets for multi-batch integration

  • PBMC 10K: link

  • Perirhinal Cortex dataset: link

  • COVID-19 dataset: link

Datasets for multi-omics integration

  • BMMC dataset: link

  • 10x Multiome PBMC dataset: link

Datasets for perturbation prediction

  • Adamson dataset: link

  • Norman dataset: link

Datasets for the GRN analysis

  • Immune Human dataset: link

Datasets for zero-shot integration

  • Lung-Kim dataset: link

  • COVID-19 dataset: link

  • Multiple Sclerosis (M.S.) dataset: link

Datasets for zero-shot integration

  • COVID-19 dataset (split): link

  • Lung-Kim dataset (split): link