-
Notifications
You must be signed in to change notification settings - Fork 0
/
ChIPSeq_documentation.Rmd
43 lines (34 loc) · 1.39 KB
/
ChIPSeq_documentation.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
---
title: "ChIPSeq data analysis and visualisation"
output: github_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
## motif_location_on_sequence
This function helps you to locate motifs on the reference fasta sequences provided by the user.
As a prerequisite user have to provide the list of motifs and reference fasta sequence.
User have to mention if he wants to display percentile location i.e. in case of ChIPSeq peaks when the binding site have variable lengths displaying percent position makes more sense. Or you can just plot the start of the motif on the sequence.
This function takes into account the reverse complement of the motif while finding the motif so no need to worry about the motifs on complementary strands.
Input files
```
1. fastaSequence file
>GeneA
GTGCAACGCGTATAAACCTTTGGGCCTCTCTCACGATGAATAGGGGCAGTAGCATTACACTCTGGCATAACAATAGGGCCCTGGCAATAGGGCTCTGGCAATAGGGCTCTGGCAATAGACTGTGGGAGGGCCATACTG
>GeneB
TAGAAGAAAAAAAGACAAGCAAAACAAAATCACACAACACATAATCAAAAAGCATTACACTCTGGCATAACAATAGGGCCCTGGCAATAGGGCTCTGGCAATAGGGCTCTGGCAATAGACTGTGGGAGGGCCATACTG
2. mymotifs (can be copied from clipboard)
TCAATAGGGGATT
TGAATAGGGGC
ACAATAGGGCC
GCAATAGGGCT
GCAATAGGGCT
```
Run the function as:
```
motif_location_on_sequence(fastaSequence, mymotifs,"outfile",x = "start")
```
## Output
1. outfile_GenesWithMotifs.txt
2. Motif location visualisation
![](motif_location_on_sequence.png)