Updates data section
asabjorklund committed Feb 9, 2024
1 parent 8cc0531 commit f076e8f
Showing 2 changed files with 58 additions and 67 deletions.
88 changes: 21 additions & 67 deletions docs/other/data.html
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@

<meta name="author" content="">

<title>Data sets</title>
<title>Data</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand All @@ -21,40 +21,6 @@
margin: 0 0.8em 0.2em -1em; /* quarto-specific, see https://github.com/quarto-dev/quarto-cli/issues/4556 */
vertical-align: middle;
}
/* CSS for syntax highlighting */
pre > code.sourceCode { white-space: pre; position: relative; }
pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
pre > code.sourceCode > span:empty { height: 1.2em; }
.sourceCode { overflow: visible; }
code.sourceCode > span { color: inherit; text-decoration: inherit; }
div.sourceCode { margin: 1em 0; }
pre.sourceCode { margin: 0; }
@media screen {
div.sourceCode { overflow: auto; }
}
@media print {
pre > code.sourceCode { white-space: pre-wrap; }
pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
}
pre.numberSource code
{ counter-reset: source-line 0; }
pre.numberSource code > span
{ position: relative; left: -4em; counter-increment: source-line; }
pre.numberSource code > span > a:first-child::before
{ content: counter(source-line);
position: relative; left: -1em; text-align: right; vertical-align: baseline;
border: none; display: inline-block;
-webkit-touch-callout: none; -webkit-user-select: none;
-khtml-user-select: none; -moz-user-select: none;
-ms-user-select: none; user-select: none;
padding: 0 4px; width: 4em;
}
pre.numberSource { margin-left: 3em; padding-left: 4px; }
div.sourceCode
{ }
@media screen {
pre > code.sourceCode > span > a:first-child::before { text-decoration: underline; }
}
</style>


Expand All @@ -75,8 +41,6 @@
<script src="../site_libs/bootstrap/bootstrap.min.js"></script>
<link href="../site_libs/bootstrap/bootstrap-icons.css" rel="stylesheet">
<link href="../site_libs/bootstrap/bootstrap.min.css" rel="stylesheet" id="quarto-bootstrap" data-mode="light">
<link href="../site_libs/quarto-contrib/fontawesome6-0.1.0/all.css" rel="stylesheet">
<link href="../site_libs/quarto-contrib/fontawesome6-0.1.0/latex-fontsize.css" rel="stylesheet">
<script id="quarto-search-options" type="application/json">{
"location": "navbar",
"copy-button": false,
Expand Down Expand Up @@ -172,7 +136,7 @@
<header id="title-block-header" class="quarto-title-block default page-columns page-full">
<div class="quarto-title-banner page-columns page-full">
<div class="quarto-title column-body">
<h1 class="title">Data sets</h1>
<h1 class="title">Data</h1>
<p class="subtitle lead">Short descriptions of the datasets used in the tutorials.</p>
</div>
</div>
Expand All @@ -194,9 +158,9 @@ <h1 class="title">Data sets</h1>
<h2 id="toc-title">On this page</h2>

<ul>
<li><a href="#covid" id="toc-install-docker" class="nav-link active" data-scroll-target="#install-docker"><span class="header-section-number">1</span> Covid data</a>
<li><a href="#hematopoesis" id="toc-test-installation" class="nav-link" data-scroll-target="#test-installation"><span class="header-section-number">2</span> Hematopoesis data</a></li>
<li><a href="#spatial" id="toc-allocate-resources" class="nav-link" data-scroll-target="#allocate-resources"><span class="header-section-number">3</span> Spatial transcriptomics data</a></li>
<li><a href="#covid-19-data" id="toc-covid-19-data" class="nav-link active" data-scroll-target="#covid-19-data"><span class="header-section-number">1</span> Covid-19 data</a></li>
<li><a href="#hematopoesis-data" id="toc-hematopoesis-data" class="nav-link" data-scroll-target="#hematopoesis-data"><span class="header-section-number">2</span> Hematopoiesis data</a></li>
<li><a href="#spatial-transcriptomics-data" id="toc-spatial-transcriptomics-data" class="nav-link" data-scroll-target="#spatial-transcriptomics-data"><span class="header-section-number">3</span> Spatial transcriptomics data</a></li>
</ul>
</nav>
</div>
Expand All @@ -207,31 +171,21 @@ <h2 id="toc-title">On this page</h2>



<section id="covid" class="level2" data-number="1">
<h2 data-number="1" class="anchored" data-anchor-id="covid"><span class="header-section-number">1</span> Covid-19 data</h2>
<p>The data we are using in the first 6 tutorials is 10x data of peripheral blood mononuclear cells (PBMCs) from Covid patients and healthy controls from the paper "Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19" in <a href="https://www.science.org/doi/10.1126/sciimmunol.abd1554">Science</a>. </p>

<p> A peripheral blood mononuclear cell (PBMC) is any peripheral blood cell having a round nucleus. These cells consist of lymphocytes (T cells, B cells, NK cells), monocytes and dendritic cells, whereas erythrocytes and platelets have no nuclei, and granulocytes (neutrophils, basophils, and eosinophils) have multi-lobed nuclei. </p>


<p> Data was downloaded from GEO GSE149689 entry. For the tutorials we have selected 4 of the severe patients and 4 controls from that dataset. Each donor was then downsampled to 1500 cells per individual just to speed up the processing times in the labs. The script used to select samples and downsampling can be found at our <a href="https://raw.githubusercontent.com/NBISweden/workshop-scRNAseq/master/scripts/data_processing/subsample_covid_data.Rmd">Github</a> </p>

<section id="hematopoesis" class="level2" data-number="2">
<h2 data-number="2" class="anchored" data-anchor-id="hematopoesis"><span class="header-section-number">2</span> Hematopoesis data</h2>
<p>In the trajectory exercise we continue with immune cells but to get the full development of the different lineages we need to have bone marrow data. The dataset we are using is an integrated object with bone marrow data from multiple studies. </p>


<p> The data was integrated with Harmony and saved as a Seurat object. We already have subsetted the dataset (with 6688 cells and 3585 genes). In addition there was some manual filtering done to remove clusters that are disconnected and cells that are hard to cluster, which can be seen in this <a href="https://raw.githubusercontent.com/NBISweden/workshop-scRNAseq/master/scripts/data_processing/slingshot_preprocessing.Rmd">script</a> </p>



<section id="spatial" class="level2" data-number="3">
<h2 data-number="3" class="anchored" data-anchor-id="spatial"><span class="header-section-number">3</span> Spatial transcriptomics data</h2>
<p>For the spatial transcriptomics tutorial we are using public Visium data from the 10x website that has been included in the data resources for the Seurat and Scanpy packages. We are using tow sections of the mouse brain (Sagittal). </p>

<p> The single cell data that we are using for mapping of celltypes onto the spatial data is a mouse cortex dataset from Allen brain institute. </p>


<section id="covid-19-data" class="level2" data-number="1">
<h2 data-number="1" class="anchored" data-anchor-id="covid-19-data"><span class="header-section-number">1</span> Covid-19 data</h2>
<p>The data used in the first 6 tutorials is 10x data of peripheral blood mononuclear cells (PBMCs) from COVID-19 patients and healthy controls, taken from the paper “Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19” in <a href="https://www.science.org/doi/10.1126/sciimmunol.abd1554">Science Immunology</a>.</p>
<p>A peripheral blood mononuclear cell (PBMC) is any peripheral blood cell having a round nucleus. These cells consist of lymphocytes (T cells, B cells, NK cells), monocytes and dendritic cells, whereas erythrocytes and platelets have no nuclei, and granulocytes (neutrophils, basophils, and eosinophils) have multi-lobed nuclei.</p>
<p>The data was downloaded from GEO entry GSE149689. For the tutorials we selected 4 of the severe patients and 4 controls from that dataset. Each donor was then downsampled to 1500 cells to speed up processing in the labs. The script used to select and downsample the samples can be found on our <a href="https://raw.githubusercontent.com/NBISweden/workshop-scRNAseq/master/scripts/data_processing/subsample_covid_data.Rmd">GitHub</a>.</p>
</section>
<section id="hematopoesis-data" class="level2" data-number="2">
<h2 data-number="2" class="anchored" data-anchor-id="hematopoesis-data"><span class="header-section-number">2</span> Hematopoiesis data</h2>
<p>In the trajectory exercise we continue with immune cells, but to capture the full development of the different lineages we need bone marrow data. The dataset we are using is an integrated object with bone marrow data from multiple studies.</p>
<p>The data was integrated with Harmony and saved as a Seurat object. We have already subsetted the dataset (with 6688 cells and 3585 genes). In addition, some manual filtering was done to remove disconnected clusters and cells that are hard to cluster, as can be seen in this <a href="https://raw.githubusercontent.com/NBISweden/workshop-scRNAseq/master/scripts/data_processing/slingshot_preprocessing.Rmd">script</a>.</p>
</section>
<section id="spatial-transcriptomics-data" class="level2" data-number="3">
<h2 data-number="3" class="anchored" data-anchor-id="spatial-transcriptomics-data"><span class="header-section-number">3</span> Spatial transcriptomics data</h2>
<p>For the spatial transcriptomics tutorial we are using public Visium data from the 10x website that is included in the data resources for the Seurat and Scanpy packages. We are using two sagittal sections of the mouse brain.</p>
<p>The single-cell data that we use for mapping cell types onto the spatial data is a mouse cortex dataset from the Allen Institute for Brain Science.</p>


</section>
Expand Down Expand Up @@ -485,4 +439,4 @@ <h2 data-number="3" class="anchored" data-anchor-id="spatial"><span class="heade


<script src="../site_libs/quarto-html/zenscroll-min.js"></script>
</body></html>
</body></html>
37 changes: 37 additions & 0 deletions other/data.qmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
---
title: Data
subtitle: "Short descriptions of the datasets used in the tutorials."
date: ""
author: ""
code-tools: false
format: html
description: ""
execute:
eval: false
engine: knitr
---

## Covid-19 data

The data used in the first 6 tutorials is 10x data of peripheral blood mononuclear cells (PBMCs) from COVID-19 patients and healthy controls, taken from the paper "Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19" in <a href="https://www.science.org/doi/10.1126/sciimmunol.abd1554">Science Immunology</a>.

A peripheral blood mononuclear cell (PBMC) is any peripheral blood cell having a round nucleus. These cells consist of lymphocytes (T cells, B cells, NK cells), monocytes and dendritic cells, whereas erythrocytes and platelets have no nuclei, and granulocytes (neutrophils, basophils, and eosinophils) have multi-lobed nuclei.

The data was downloaded from GEO entry GSE149689. For the tutorials we selected 4 of the severe patients and 4 controls from that dataset. Each donor was then downsampled to 1500 cells to speed up processing in the labs. The script used to select and downsample the samples can be found on our <a href="https://raw.githubusercontent.com/NBISweden/workshop-scRNAseq/master/scripts/data_processing/subsample_covid_data.Rmd">GitHub</a>.
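
The selection and downsampling step described above can be sketched roughly as follows. This is only an illustrative Python sketch, not the workshop's actual procedure (that lives in the linked R Markdown script); the function name `downsample_per_donor`, the donor labels, and the fixed seed are assumptions made for the example.

```python
import random
from collections import defaultdict


def downsample_per_donor(cell_donors, keep_donors, n_cells=1500, seed=42):
    """Keep only cells from the chosen donors, at most n_cells per donor.

    cell_donors: dict mapping cell ID -> donor label (hypothetical input format).
    keep_donors: set of donor labels to retain (e.g. 4 severe patients, 4 controls).
    """
    rng = random.Random(seed)  # fixed seed so the subsample is reproducible

    # Group the cells of the selected donors by donor.
    by_donor = defaultdict(list)
    for cell_id, donor in cell_donors.items():
        if donor in keep_donors:
            by_donor[donor].append(cell_id)

    selected = []
    for donor in sorted(by_donor):
        cells = sorted(by_donor[donor])
        # Donors with fewer than n_cells cells are kept in full.
        if len(cells) > n_cells:
            cells = rng.sample(cells, n_cells)
        selected.extend(cells)
    return selected


# Toy input with made-up donor labels and cell counts.
toy_counts = [("covid_1", 2000), ("ctrl_1", 1200), ("flu_1", 1800)]
cell_donors = {
    f"{donor}_cell{i}": donor for donor, n in toy_counts for i in range(n)
}
kept = downsample_per_donor(cell_donors, keep_donors={"covid_1", "ctrl_1"})
```

In this toy example the donor with 2000 cells is reduced to 1500, the donor with 1200 cells is kept whole, and the unselected donor is dropped entirely.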



## Hematopoiesis data {#hematopoesis-data}

In the trajectory exercise we continue with immune cells, but to capture the full development of the different lineages we need bone marrow data. The dataset we are using is an integrated object with bone marrow data from multiple studies.

The data was integrated with Harmony and saved as a Seurat object. We have already subsetted the dataset (with 6688 cells and 3585 genes). In addition, some manual filtering was done to remove disconnected clusters and cells that are hard to cluster, as can be seen in this <a href="https://raw.githubusercontent.com/NBISweden/workshop-scRNAseq/master/scripts/data_processing/slingshot_preprocessing.Rmd">script</a>.

## Spatial transcriptomics data

For the spatial transcriptomics tutorial we are using public Visium data from the 10x website that is included in the data resources for the Seurat and Scanpy packages. We are using two sagittal sections of the mouse brain.

The single-cell data that we use for mapping cell types onto the spatial data is a mouse cortex dataset from the Allen Institute for Brain Science.


