Skip to content

Commit c06c75d

Browse files
committed
usage of metagenomics pipeline
1 parent c8ef4c2 commit c06c75d

File tree

1 file changed

+76
-2
lines changed

1 file changed

+76
-2
lines changed

README.md

+76-2
Original file line numberDiff line numberDiff line change
@@ -3,13 +3,87 @@ This pipeline is to ease my pressure for Multiple omics analysis. In this versio
33

44
# Get Start
55

6-
若要使用Metagenomics分析流程,请将主程序添加到环境变量:
6+
本项目目前仅包含**Metagenomics宏基因组学标准流程****Lipidomics脂质组学分析流程**
7+
8+
### Metagenomics pipeline
9+
10+
在集群环境使用本流程,请将主程序添加到环境变量:
711

812
```
913
ln -s /ifs1/ST_MD/PMO/script/flow/Omics_pipeline/MetaGenomics/CHAOmics_MetaGenomics_v0.1_Init.pl ~/bin/
1014
```
1115

12-
如果你所在的环境访问不了我的主程序,请`clone`本仓库并将主程序添加到环境变量中。
16+
如果你所在的环境访问不了我的主程序,请在集群工作目录下`clone`本仓库并将主程序添加到环境变量中。
17+
18+
```
19+
cd /your/dir/
20+
clone [email protected]:Fangchao/Omics_pipeline.git
21+
ln -s /your/dir/Omics_pipeline/MetaGenomics/CHAOmics_MetaGenomics_v0.1_Init.pl ~/bin/
22+
```
23+
24+
#### Usage:
25+
26+
```
27+
CHAOmics_MetaGenomics_Init.pl #直接执行本程序可以查看使用说明
28+
usage:
29+
perl /share/bin/CHAOmics_MetaGenomics_v0.1_Init.pl <pe|se> [options]
30+
pattern
31+
pe|se pair end | single end
32+
options:
33+
-p|path :[essential]sample path file (SampleID|fqID|fqPath)
34+
-i|ins :[essential for pair-end seq]insert info file
35+
-s|step :functions,default 1234
36+
1 trim+filter
37+
2 remove host genomic reads
38+
3 soap mapping to microbiotic genomics
39+
4 combine samples' abun into a single profile table
40+
-o|outdir :output directory path. Conatins the results and scripts.
41+
-c|config :provide a configure file including needed database and parameters for each setp, default below:
42+
Qt ||= 20 Qvalue for trim
43+
l ||= 10 bp length for trim
44+
N ||= 1 tolerance number of N for filter
45+
Qf ||= 15 Qvalue for filter. The reads which more than half of the bytes lower than Qf will be discarded.
46+
lf ||= 0 left fq length. The minimum
47+
q ||= "st.q" queue for qsub
48+
P ||= "st_ms" Project id for qsub
49+
pro ||= 8 process number for qsub
50+
vf1 ||= "0.3G" virtual free for qsub in step 1 (trim & filter)
51+
vf2 ||= "8G" virtual free for qsub in step 2 (remove host genes)
52+
vf3 ||= "16G" virtual free for qsub in step 3 (aligned to gene set)
53+
vf4 ||= "10G" virtual free for qsub in step 4 (calculate soap results to abundance)
54+
m ||= 99 job number submitted each for qsub
55+
r ||= 1 repeat time when job failed or interrupted
56+
-h|help :show help info
57+
-v|version :show version and author info.
58+
```
59+
60+
**path file**: 用于记录raw data文件位置和id信息的文件,每行三列分别记录下**样本编号**, **数据编号****fq文件路径**
61+
62+
- `样本编号`:生物学,统计学意义上的样本个体,用于后续分析的基本个体
63+
- `数据编号`:如果同一个样本进行多次测序,则会产生多个数据,此时需要用数据编号来区分(可以使文库号,日期,批次,等等)。`注意`:拥有相同`样本编号`的多个数据会被最终合并计算相对丰度。
64+
- `fastq路径`: 必须是工作环境可以访问到的路径位置
65+
66+
上述配置文件准备完毕后,运行本脚本可以生成工作目录:
67+
68+
```
69+
CHAOmics_MetaGenomics_v0.1_Init.pl se -p sample.path.file -o demo
70+
```
71+
72+
随后进入工作目录,检查脚本无误后可以启动执行脚本:
73+
74+
```
75+
cd demo
76+
sh qsub_all.sh # 模式一,选择其一即可
77+
sh linear.1234.sh # 模式二,选择其一即可,本模式会产生较多进程
78+
```
79+
80+
完成后可以执行`sh REPORT.sh`打印报告表格。
81+
82+
若中途出现错误,可以进入`script`目录对个别脚本进行调试。
83+
84+
--------------
85+
86+
### Lipidomics pipeline
1387

1488
### What do I wanna perform?
1589
As a pipeline, I plan to orgainze the workshop directory like this:

0 commit comments

Comments
 (0)