Refactoring #172

Merged 133 commits on Feb 28, 2024
Changes from 1 commit
133 commits
ff0c46a
size reduction
erinyoung Jan 24, 2024
a405ede
moving to nf-core compatible
erinyoung Jan 26, 2024
a1fb5e8
sample to prefix
erinyoung Jan 27, 2024
ab20500
adding fastas
erinyoung Jan 27, 2024
c730624
update kraken2 and mash ref
erinyoung Jan 29, 2024
e670381
added version
erinyoung Jan 29, 2024
7450ad8
initial script for multiqc versions
erinyoung Jan 29, 2024
6b982b7
updated tag
erinyoung Jan 29, 2024
9a352df
updated tag
erinyoung Jan 29, 2024
ae39200
updated tag
erinyoung Jan 29, 2024
5d0f563
added versions
erinyoung Jan 29, 2024
541f8f0
added versions
erinyoung Jan 30, 2024
b6f335c
added versions
erinyoung Jan 30, 2024
468cfc7
read in fastas from path
erinyoung Jan 30, 2024
c2d90a3
change module path
erinyoung Jan 30, 2024
55cf6eb
change module path
erinyoung Jan 30, 2024
0b65ccb
moving to panaroo
erinyoung Jan 30, 2024
c71da15
change module path
erinyoung Jan 30, 2024
f4d29b1
adjusted versions
erinyoung Jan 31, 2024
686ae17
adjusted versions
erinyoung Jan 31, 2024
e889c90
simplified inputs
erinyoung Jan 31, 2024
bf4609c
testing gbk files
erinyoung Jan 31, 2024
2ad22d4
adjusted label
erinyoung Jan 31, 2024
f240700
fixed summary
erinyoung Jan 31, 2024
f42f689
simplified multiqc
erinyoung Jan 31, 2024
37381c8
version to container
erinyoung Jan 31, 2024
a5a80af
adjusted labels
erinyoung Jan 31, 2024
7baf838
uncommented error strategy
erinyoung Jan 31, 2024
3eaa067
added version
erinyoung Jan 31, 2024
934d6ec
changed to gff
erinyoung Jan 31, 2024
4ec7332
added header
erinyoung Jan 31, 2024
ff688c1
moved sorted_matrix
erinyoung Jan 31, 2024
175e84b
adjusted header
erinyoung Jan 31, 2024
787e19f
fixed input
erinyoung Jan 31, 2024
becf61e
fixed inputs
erinyoung Jan 31, 2024
dc64a6b
updated to 2.2.2.7
erinyoung Jan 31, 2024
8597e39
moved back to gff
erinyoung Jan 31, 2024
39002a4
removed meta
erinyoung Jan 31, 2024
9e641b6
adjusted label resource management
erinyoung Jan 31, 2024
cc2bef3
added versions channel
erinyoung Jan 31, 2024
6178f00
added versions channel
erinyoung Jan 31, 2024
014656a
added versions channel
erinyoung Jan 31, 2024
5414448
initial versions added
erinyoung Jan 31, 2024
cbe8398
initial versions added
erinyoung Jan 31, 2024
94c5f59
python script for files
erinyoung Jan 31, 2024
c0831ba
adjusting mash
erinyoung Feb 3, 2024
638c0bd
added core genome
erinyoung Feb 3, 2024
f040bec
changed text
erinyoung Feb 3, 2024
a35f226
remove genome_accessions
erinyoung Feb 3, 2024
bb016dd
made results optional
erinyoung Feb 3, 2024
d853ee3
fixed version parsing
erinyoung Feb 3, 2024
105c081
fixed version parsing
erinyoung Feb 3, 2024
99f6081
fixed version parsing
erinyoung Feb 3, 2024
76a4dbb
fixed version parsing
erinyoung Feb 3, 2024
d6645ea
for drprg
erinyoung Feb 3, 2024
6d5fcb8
fixed prefix
erinyoung Feb 3, 2024
ec17d2d
updated to 1.19
erinyoung Feb 3, 2024
38f5472
added length file
erinyoung Feb 3, 2024
dfad59a
fixed version parsing
erinyoung Feb 3, 2024
3a9d917
added all
erinyoung Feb 3, 2024
67e16d6
added mqc_prep and json_convert
erinyoung Feb 3, 2024
619763e
fixed version parsing
erinyoung Feb 3, 2024
0ea874b
added fastani_len_summary
erinyoung Feb 3, 2024
18c45b4
removed first
erinyoung Feb 3, 2024
06a5dca
removed first
erinyoung Feb 3, 2024
0db7349
added TODO
erinyoung Feb 3, 2024
7a1fc67
removed first
erinyoung Feb 3, 2024
07384d9
added mqc_prep
erinyoung Feb 3, 2024
b006977
removed first
erinyoung Feb 3, 2024
2bc640a
added drprg elgato emmtyper etc
erinyoung Feb 13, 2024
83c1d79
added kaptive
erinyoung Feb 13, 2024
bdda46a
fixed emit
erinyoung Feb 13, 2024
d3841de
added first
erinyoung Feb 13, 2024
c21001f
added first
erinyoung Feb 13, 2024
75fcbd0
added first
erinyoung Feb 13, 2024
84d8477
added first
erinyoung Feb 13, 2024
63b0d96
added first
erinyoung Feb 13, 2024
5469c79
removed comment
erinyoung Feb 13, 2024
48da847
added kaptive, fixed drprg summary
erinyoung Feb 13, 2024
d1c701f
adjusting memory reqs
erinyoung Feb 13, 2024
9408d6d
added config file and adjusted results
erinyoung Feb 13, 2024
f45d9e6
uncommented errorstrategy
erinyoung Feb 13, 2024
250b4cb
added path to files
erinyoung Feb 13, 2024
a8b8f7e
reduced memory
erinyoung Feb 13, 2024
5c01154
added fasta list
erinyoung Feb 13, 2024
7ae81e1
testing adding fastani ref
erinyoung Feb 13, 2024
fdee4f5
testing sample sheet and skip extras
erinyoung Feb 13, 2024
6029fda
fixed skipping extras
erinyoung Feb 13, 2024
5c64e68
fixed typos
erinyoung Feb 14, 2024
95a1900
check contigs instead of summary
erinyoung Feb 14, 2024
e312055
removed retry
erinyoung Feb 14, 2024
38c5cfe
find kleborate
erinyoung Feb 14, 2024
c327600
removed todo
erinyoung Feb 14, 2024
b37e492
redid fastani
erinyoung Feb 14, 2024
88b631d
unused
erinyoung Feb 14, 2024
2b476aa
added sra accessions
erinyoung Feb 14, 2024
eb31418
added versions removed script
erinyoung Feb 14, 2024
fb543e5
now keeping cseq and useq
erinyoung Feb 14, 2024
01bfe4c
download files from ena
erinyoung Feb 14, 2024
b75196f
moved where ch_versions
erinyoung Feb 14, 2024
657d983
removed legsta
erinyoung Feb 14, 2024
eed4372
removed todo
erinyoung Feb 14, 2024
5c1ecb2
added sra accessions
erinyoung Feb 14, 2024
11c7bf8
redid tests, added test0
erinyoung Feb 14, 2024
62e4a9c
added meta to channel
erinyoung Feb 14, 2024
b799941
removed roary
erinyoung Feb 15, 2024
e32c5a7
removed cat final file
erinyoung Feb 15, 2024
2c2392e
fixed tabs
erinyoung Feb 15, 2024
9cec6b2
update to checkout v4
erinyoung Feb 15, 2024
bf4ae0e
increased to process high
erinyoung Feb 15, 2024
f2f3f01
fixed tab
erinyoung Feb 15, 2024
4072652
add if to mashtree
erinyoung Feb 15, 2024
f4133db
added circulocov and blobtools
erinyoung Feb 22, 2024
0aad9b4
added circulocov and blobtools
erinyoung Feb 22, 2024
5dfd679
added blast
erinyoung Feb 22, 2024
430ba20
added version
erinyoung Feb 22, 2024
e7a5a3e
added version
erinyoung Feb 22, 2024
7c3c4e0
added blobtools and quality assessment
erinyoung Feb 22, 2024
5847a41
added first
erinyoung Feb 22, 2024
2613756
removed process not species specific
erinyoung Feb 22, 2024
1c329a8
non-species processes
erinyoung Feb 22, 2024
6ec4f6c
added circulocov
erinyoung Feb 22, 2024
c393c12
fixed typos for NA
erinyoung Feb 22, 2024
6bbdc5d
adding information back in
erinyoung Feb 22, 2024
3f112d4
added space between letters
erinyoung Feb 22, 2024
6a33722
adjusted hidden
erinyoung Feb 22, 2024
5cd9634
removed UPHLICA
erinyoung Feb 22, 2024
60f1752
added groupTuple back
erinyoung Feb 22, 2024
c7e0f8c
uncommented errorStrategy
erinyoung Feb 22, 2024
ea6eaab
adjusted params
erinyoung Feb 22, 2024
3d4c000
added param file for config
erinyoung Feb 22, 2024
175a4fe
added nextflow -version
erinyoung Feb 23, 2024
5f83be9
added tools
erinyoung Feb 23, 2024
sample to prefix
erinyoung committed Jan 27, 2024
commit a1fb5e8dfd7442d756a8768cca3bbaf7d393e003
65 changes: 28 additions & 37 deletions modules/local/amrfinderplus.nf
@@ -3,60 +3,51 @@ process amrfinderplus {
label "process_medium"
publishDir params.outdir, mode: 'copy'
container 'staphb/ncbi-amrfinderplus:3.11.26-2023-11-15.1'
maxForks 10
//#UPHLICA errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}
//#UPHLICA cpus 7
//#UPHLICA memory 26.GB
//#UPHLICA pod annotation: 'scheduler.illumina.com/presetSize', value: 'standard-large'
//#UPHLICA time '10m'
//errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}'
time '10m'

input:
tuple val(sample), file(contigs), val(genus), val(species)
tuple val(meta), file(contigs), val(genus), val(species)

output:
path "ncbi-AMRFinderplus/${sample}_amrfinder_plus.txt" , emit: collect
path "logs/${task.process}/${sample}.${workflow.sessionId}.log", emit: log
path "versions.yml" , emit: versions
path "ncbi-AMRFinderplus/*_amrfinder_plus.txt", emit: collect
path "logs/${task.process}/*.log", emit: log
path "versions.yml", emit: versions

when:
task.ext.when == null || task.ext.when

shell:
def args = task.ext.args ?: ''
def prefix = task.ext.prefix ?: "${meta.id}"
'''
mkdir -p ncbi-AMRFinderplus logs/!{task.process}
log_file=logs/!{task.process}/!{sample}.!{workflow.sessionId}.log
def args = task.ext.args ?: ''
def prefix = task.ext.prefix ?: "${meta.id}"
"""
mkdir -p ncbi-AMRFinderplus logs/${task.process}
log_file=logs/${task.process}/${prefix}.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
amrfinder --version >> $log_file
echo "container : !{task.container}" >> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

organism=$(amrfinder -l | tr " " "\\n" | grep -i !{genus} | grep -i !{species} | sed 's/,//g' | head -n 1 )
if [ -z "$organism" ] ; then organism=$(amrfinder -l | tr " " "\\n" | grep -i !{genus} | sed 's/,//g' | head -n 1 ) ; fi
if [ -n "$organism" ]
organism=\$(amrfinder -l | tr " " "\\n" | grep -i ${genus} | grep -i ${species} | sed 's/,//g' | head -n 1 )
if [ -z "\$organism" ] ; then organism=$(amrfinder -l | tr " " "\\n" | grep -i ${genus} | sed 's/,//g' | head -n 1 ) ; fi
if [ -n "\$organism" ]
then
organism_check="--organism $organism"
echo "Top organism result of !{genus} !{species} matched with $organism" >> $log_file
elif [ "!{genus}" == "Shigella" ]
echo "Top organism result of ${genus} ${species} matched with $organism" >> \$log_file
elif [ "${genus}" == "Shigella" ]
then
organism_check="--organism Escherichia"
echo "--organism Escherichia with be used because of top organism result of !{genus}" >> $log_file
echo "--organism Escherichia with be used because of top organism result of ${genus}" >> \$log_file
else
organism_check=''
echo "Top organism result of !{genus} !{species} did not match any of the organisms" >> $log_file
echo "Top organism result of ${genus} ${species} did not match any of the organisms" >> \$log_file
fi

amrfinder !{params.amrfinderplus_options} \
--nucleotide !{contigs} \
--threads !{task.cpus} \
--name !{sample} \
--output ncbi-AMRFinderplus/!{sample}_amrfinder_plus.txt \
$organism_check \
amrfinder ${args} \
--nucleotide ${contigs} \
--threads ${task.cpus} \
--name ${prefix} \
--output ncbi-AMRFinderplus/${prefix}_amrfinder_plus.txt \
\$organism_check \
--plus \
| tee -a $log_file
'''
| tee -a \$log_file

exit 1
"""
}
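
Review note: the organism lookup in this module falls back in three steps (genus plus species, then genus alone, then the Shigella special case). Below is a minimal standalone sketch of that logic, not the module itself. The `organism_list` string is a hypothetical stand-in for the output of `amrfinder -l`, and `pick_organism` is an illustrative helper name that does not exist in the pipeline.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for `amrfinder -l` output (assumption: the real list
# depends on the AMRFinderPlus database version in the container).
organism_list="Available --organism options: Acinetobacter_baumannii, Escherichia, Klebsiella_pneumoniae, Salmonella"

# Illustrative helper (not part of the pipeline): print the --organism
# argument for a genus/species pair, or nothing when no organism applies.
pick_organism() {
  local genus="$1" species="$2" organism

  # Step 1: entry matching both genus and species.
  organism=$(echo "$organism_list" | tr " " "\n" | grep -i "$genus" | grep -i "$species" | sed 's/,//g' | head -n 1 || true)

  # Step 2: fall back to the first entry matching the genus alone.
  if [ -z "$organism" ]; then
    organism=$(echo "$organism_list" | tr " " "\n" | grep -i "$genus" | sed 's/,//g' | head -n 1 || true)
  fi

  if [ -n "$organism" ]; then
    echo "--organism $organism"
  elif [ "$genus" = "Shigella" ]; then
    # Step 3: Shigella has no entry of its own, so the Escherichia set is used.
    echo "--organism Escherichia"
  fi
}
```

With the stand-in list above, `pick_organism Shigella sonnei` prints `--organism Escherichia`, and `pick_organism Bacillus subtilis` prints nothing, which corresponds to the empty `organism_check` branch in the module.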
2 changes: 1 addition & 1 deletion modules/local/bbduk.nf
@@ -3,7 +3,7 @@ process bbduk {
label "process_medium"
publishDir params.outdir, mode: 'copy'
container 'staphb/bbtools:39.01'
//errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}
errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}
time '10m'

input:
49 changes: 20 additions & 29 deletions modules/local/blast.nf
@@ -3,44 +3,35 @@ process blastn {
label "process_medium"
publishDir params.outdir, mode: 'copy'
container 'staphb/blast:2.15.0'
maxForks 10
//#UPHLICA errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}
//#UPHLICA pod annotation: 'scheduler.illumina.com/presetSize' , value: 'himem-medium'
//#UPHLICA cpus 8
//#UPHLICA memory 100.GB
//#UPHLICA time '2h'
//errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}
time '2h'

input:
tuple val(sample), file(contig), path(blastdb)
tuple val(meta), file(contig), path(blastdb)

output:
tuple val(sample), file("blastn/${sample}.tsv") , emit: blastn
path "logs/${task.process}/${sample}.${workflow.sessionId}.log", emit: log
path "versions.yml" , emit: versions
tuple val(meta), file("blastn/*.tsv"), emit: blastn
path "logs/${task.process}/*.${workflow.sessionId}.log", emit: log
path "versions.yml", emit: versions

when:
task.ext.when == null || task.ext.when

shell:
def args = task.ext.args ?: '-max_target_seqs 10 -max_hsps 1 -evalue 1e-25'
def prefix = task.ext.prefix ?: "${meta.id}"
'''
mkdir -p blastn logs/!{task.process}
log_file=logs/!{task.process}/!{sample}.!{workflow.sessionId}.log
def args = task.ext.args ?: '-max_target_seqs 10 -max_hsps 1 -evalue 1e-25'
def prefix = task.ext.prefix ?: "${meta.id}"
"""
mkdir -p blastn logs/${task.process}
log_file=logs/${task.process}/${prefix}.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
echo "container : !{task.container}" >> $log_file
blastn -version >> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

blastn -query !{contig} \
-out blastn/!{sample}.tsv \
-num_threads !{task.cpus} \
-db !{blastdb}/!{params.blast_db_type} \
blastn -query ${contig} \
-out blastn/${prefix}.tsv \
-num_threads ${task.cpus} \
-db ${blastdb}/${params.blast_db_type} \
-outfmt '6 qseqid staxids bitscore std' \
!{params.blastn_options} \
| tee -a $log_file
'''
${args} \
| tee -a \$log_file

exit 1
"""
}
96 changes: 46 additions & 50 deletions modules/local/blobtools.nf
@@ -3,20 +3,16 @@ process blobtools_create {
label "process_medium"
publishDir params.outdir, mode: 'copy'
container 'chrishah/blobtools:v1.1.1'
maxForks 10
//#UPHLICA errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}
//#UPHLICA pod annotation: 'scheduler.illumina.com/presetSize', value: 'standard-xlarge'
//#UPHLICA cpus 14
//#UPHLICA memory 60.GB
//#UPHLICA time '45m'
time '45m'
//errorStrategy { task.attempt < 2 ? 'retry' : 'ignore'}

input:
tuple val(sample), file(contig), file(blastn), file(bam)
tuple val(meta), file(contig), file(blastn), file(bam)

output:
tuple val(sample), file("blobtools/${sample}.blobDB.json") , emit: json
path "blobtools/${sample}.${sample}*.bam.cov" , emit: files
path "logs/${task.process}/${sample}.${workflow.sessionId}.log", emit: log
tuple val(meta), file("blobtools/${prefix}.blobDB.json") , emit: json
path "blobtools/${prefix}.${prefix}*.bam.cov" , emit: files
path "logs/${task.process}/${prefix}.${workflow.sessionId}.log", emit: log
path "versions.yml" , emit: versions

when:
@@ -25,28 +21,28 @@ process blobtools_create {
shell:
def args = task.ext.args ?: ''
def prefix = task.ext.prefix ?: "${meta.id}"
'''
mkdir -p blobtools logs/!{task.process}
log_file=logs/!{task.process}/!{sample}.!{workflow.sessionId}.log
"""
mkdir -p blobtools logs/${task.process}
log_file=logs/${task.process}/${prefix}.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
echo "container : !{task.container}" >> $log_file
echo "container : ${task.container}" >> $log_file
echo "blobtools version $(blobtools -v)" >> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

blobtools create !{params.blobtools_create_options} \
-o blobtools/!{sample} \
-i !{contig} \
-b !{bam[0]} \
-t !{blastn} \
blobtools create ${params.blobtools_create_options} \
-o blobtools/${prefix} \
-i ${contig} \
-b ${bam[0]} \
-t ${blastn} \
| tee -a $log_file
'''
"""
}

process blobtools_view {
tag "${sample}"
tag "${prefix}"
publishDir params.outdir, mode: 'copy'
container 'chrishah/blobtools:v1.1.1'
maxForks 10
@@ -57,34 +53,34 @@ process blobtools_view {
//#UPHLICA time '10m'

input:
tuple val(sample), file(json)
tuple val(meta), file(json)

output:
tuple val(sample), file("blobtools/${sample}.blobDB.table.txt"), emit: file
path "logs/${task.process}/${sample}.${workflow.sessionId}.log", emit: log
tuple val(meta), file("blobtools/${prefix}.blobDB.table.txt"), emit: file
path "logs/${task.process}/${prefix}.${workflow.sessionId}.log", emit: log
path "versions.yml" , emit: versions

shell:
'''
mkdir -p blobtools logs/!{task.process}
log_file=logs/!{task.process}/!{sample}.!{workflow.sessionId}.log
"""
mkdir -p blobtools logs/${task.process}
log_file=logs/${task.process}/${prefix}.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
echo "container : !{task.container}" >> $log_file
echo "container : ${task.container}" >> $log_file
echo "blobtools version $(blobtools -v)" >> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

blobtools view !{params.blobtools_view_options} \
-i !{json} \
blobtools view ${params.blobtools_view_options} \
-i ${json} \
-o blobtools/ \
| tee -a $log_file
'''
"""
}

process blobtools_plot {
tag "${sample}"
tag "${prefix}"
publishDir params.outdir, mode: 'copy'
container 'chrishah/blobtools:v1.1.1'
maxForks 10
@@ -95,43 +91,43 @@ process blobtools_plot {
//#UPHLICA time '10m'

input:
tuple val(sample), file(json)
tuple val(meta), file(json)

output:
path "blobtools/${sample}.*" , emit: files
tuple val(sample), file("blobtools/${sample}_blobtools.txt") , emit: results
path "blobtools/${sample}_summary.txt" , emit: collect
path "logs/${task.process}/${sample}.${workflow.sessionId}.log", emit: log
path "blobtools/${prefix}.*" , emit: files
tuple val(meta), file("blobtools/${prefix}_blobtools.txt") , emit: results
path "blobtools/${prefix}_summary.txt" , emit: collect
path "logs/${task.process}/${prefix}.${workflow.sessionId}.log", emit: log
path "versions.yml" , emit: versions

shell:
'''
mkdir -p blobtools logs/!{task.process}
log_file=logs/!{task.process}/!{sample}.!{workflow.sessionId}.log
"""
mkdir -p blobtools logs/${task.process}
log_file=logs/${task.process}/${prefix}.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
echo "container : !{task.container}" >> $log_file
echo "container : ${task.container}" >> $log_file
echo "blobtools version $(blobtools -v)" >> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

blobtools plot !{params.blobtools_plot_options} \
-i !{json} \
blobtools plot ${params.blobtools_plot_options} \
-i ${json} \
-o blobtools/ \
| tee -a $log_file

grep "^# " blobtools/!{sample}*.stats.txt | \
grep "^# " blobtools/${prefix}*.stats.txt | \
sed 's/# //g' | \
awk '{print "sample\t" $0 }' > blobtools/!{sample}_summary.txt
awk '{print "sample\t" $0 }' > blobtools/${prefix}_summary.txt

grep -v "^#" blobtools/!{sample}*.stats.txt | \
grep -v "^#" blobtools/${prefix}*.stats.txt | \
sed 's/%//g' | \
tr " " "_" | \
awk -v sample=!{sample} '{if ($13 >= 5.0 ) print sample "\\t" $0}' | \
awk -v sample=${prefix} '{if ($13 >= 5.0 ) print sample "\\t" $0}' | \
tr " " "\\t" | \
sort -k 14rn,14 >> blobtools/!{sample}_summary.txt
sort -k 14rn,14 >> blobtools/${prefix}_summary.txt

grep -vw all blobtools/!{sample}_summary.txt > blobtools/!{sample}_blobtools.txt
'''
grep -vw all blobtools/${prefix}_summary.txt > blobtools/${prefix}_blobtools.txt
"""
}
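
Review note: the summary step in `blobtools_plot` strips percent signs, keeps rows above a 5% threshold, prefixes each row with the sample name, sorts by percentage, and drops the `all` row. The sketch below reproduces that pattern outside Nextflow on a deliberately simplified table. The two-column layout, the file contents, and the name `sample1` are assumptions for illustration; the real stats file is wider and the module filters on column 13.

```shell
#!/usr/bin/env bash
workdir=$(mktemp -d)

# Hypothetical, simplified stats table: tab-separated name and percentage.
printf '# name\tpct\nall\t100.0%%\nEscherichia coli\t96.2%%\nHomo sapiens\t1.3%%\n' > "$workdir/stats.txt"

# Same pipeline shape as the module, with the threshold applied to column 2
# and the sort applied to column 3 (after the sample prefix is added).
grep -v "^#" "$workdir/stats.txt" | \
  sed 's/%//g' | \
  tr " " "_" | \
  awk -v sample=sample1 '{if ($2 >= 5.0) print sample "\t" $0}' | \
  sort -k 3rn,3 > "$workdir/summary.txt"

# Drop the aggregate "all" row to get the per-taxon result file.
result=$(grep -vw all "$workdir/summary.txt")
echo "$result"
```

Note that inside a Nextflow script block the tab escapes are written `"\\t"` so that bash receives `\t`; in a plain shell script a single backslash is enough.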
24 changes: 12 additions & 12 deletions modules/local/datasets.nf
@@ -24,19 +24,19 @@ process datasets_summary {
shell:
def args = task.ext.args ?: ''
def prefix = task.ext.prefix ?: "${meta.id}"
'''
mkdir -p datasets logs/!{task.process}
log_file=logs/!{task.process}/!{taxon}.!{workflow.sessionId}.log
"""
mkdir -p datasets logs/${task.process}
log_file=logs/${task.process}/${taxon}.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
echo "container : !{task.container}" >> $log_file
echo "container : ${task.container}" >> $log_file
datasets --version 2>> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

python3 !{script} !{taxon} !{params.datasets_max_genomes}
'''
python3 ${script} ${taxon} ${params.datasets_max_genomes}
"""
}

// It is faster if datasets can download the entire list at a time, but there is a 20 minute timeout for downloading.
Expand Down Expand Up @@ -66,18 +66,18 @@ process datasets_download {
task.ext.when == null || task.ext.when

shell:
'''
mkdir -p datasets genomes logs/!{task.process}
log_file=logs/!{task.process}/datasets_download.!{workflow.sessionId}.log
"""
mkdir -p datasets genomes logs/${task.process}
log_file=logs/${task.process}/datasets_download.${workflow.sessionId}.log

# time stamp + capturing tool versions
date > $log_file
echo "container : !{task.container}" >> $log_file
echo "container : ${task.container}" >> $log_file
datasets --version 2>> $log_file
echo "Nextflow command : " >> $log_file
cat .command.sh >> $log_file

grep -h -v accession !{ids} | cut -f 1 -d , | sort | uniq > this_run.txt
grep -h -v accession ${ids} | cut -f 1 -d , | sort | uniq > this_run.txt

cat all_runs.txt this_run.txt | sort | uniq > id_list.txt

@@ -96,5 +96,5 @@ process datasets_download {
done

rm -rf genomes/*:_*
'''
"""
}
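
Review note: `datasets_download` builds its accession list by dropping the header row from each summary CSV, keeping the first comma-separated column, and de-duplicating. A small standalone sketch of that step follows. The CSV contents and accession values are made up for illustration, and the file names `a.csv` and `b.csv` are assumptions standing in for the staged `ids` files.

```shell
#!/usr/bin/env bash
workdir=$(mktemp -d)

# Hypothetical summary files, each with a header row containing "accession".
printf 'accession,organism\nGCF_000001.1,Escherichia coli\nGCF_000002.1,Escherichia coli\n' > "$workdir/a.csv"
printf 'accession,organism\nGCF_000002.1,Escherichia coli\nGCF_000003.1,Shigella sonnei\n' > "$workdir/b.csv"

# -h suppresses file-name prefixes when grep reads several files;
# -v accession removes the header rows; cut keeps the accession column.
grep -h -v accession "$workdir/a.csv" "$workdir/b.csv" | cut -f 1 -d , | sort | uniq > "$workdir/this_run.txt"

cat "$workdir/this_run.txt"
```

Because the header is removed by matching the word `accession`, any data row that happens to contain that word would also be dropped; that seems unlikely for this table but is worth keeping in mind.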