Skip to content

Pipeline for analysing ATAC-seq data from raw fastq files to called genomic peaks

Notifications You must be signed in to change notification settings

focyte/Bash-ATACseq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ATAC-seq

Pipeline for analysing ATAC-seq data from raw fastq files to called genomic peaks


Table of Contents

  1. Overview
  2. Pipeline Stages
  3. How to Run
  4. Results

Overview

Pipeline Stages

Set up Conda Environment

Load the yml file

conda env create -f atacseq.yml

Quality Control

Run FastQC

fastqc -t 8 *fastq.gz -o ./fastqc_initial

Run MultiQC

cd ./fastqc_initial/
multiqc .
firefox multiqc_report.html

Trim Reads

cd ../trimming
./trim-reads-paired.sh

Run FastQC on Trimmed Reads

fastqc -t 8 *fq.gz -o fastqc_trimming/
cd ./fastqc_trimming/
multiqc .
firefox multiqc_report.html

Mapping

Index Reference Genome

Download index files from https://benlangmead.github.io/aws-indexes/bowtie and add them to the mapping folder.

cd ../../../mapping

Run Bowtie2 Alignment

./bowtie2-mapping-paired.sh

Convert SAM to BAM, Sort, and Index BAM

./sam-to-bam.sh

Peak Calling

Mark Duplicates and Filter

cd ../peak_call
./filter.sh

Use MACS2 to Call Peaks

./peak_call.sh

Open IGV and View Peaks

igv

In the IGV GUI choose file > Load from File

How To Run

Results

Figure describing results generated using this pipeline

About

Pipeline for analysing ATAC-seq data from raw fastq files to called genomic peaks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages