You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I run VirGenA with option "assembling using reference, without msa" on some reads cleaned with alignment on a reference genome (Bowtie2+Samtools). Some tools are ok with my fastq obtnained, like fastqc, fastqscreen, DNAstar, but with VirGenA I have this issue :
java.io.IOException: File {sample}.fastq.gz have incorrect sequence identifier string
Somes parameters :
_> Mode:
Reference Selector: false
Use Major: true
Update: No more issue using option : samtools fastq -N (Always add either '/1' or '/2' to the end of read names, even when put into different files), with now information about who is R1 and R2 inside file.fastq.gz, I guess. :)
You are right! VirGenaA searches for the first ' ' or '/' and then trims the ending to match read names in R1 and R2 files. Bowtie + samtools seem to do the same, trimming the ending. Adding '/1' or '/2' or '1:N:0:4' back fixes this as you already found yourself.
Dear Fedonin,
I run VirGenA with option "assembling using reference, without msa" on some reads cleaned with alignment on a reference genome (Bowtie2+Samtools). Some tools are ok with my fastq obtnained, like fastqc, fastqscreen, DNAstar, but with VirGenA I have this issue :
java.io.IOException: File {sample}.fastq.gz have incorrect sequence identifier string
Somes parameters :
_> Mode:
Reference Selector: false
Use Major: true
My fastq format (head) before and after cleaning :
BEFORE
AFTER
It's the last missing part of the header missing (1:N:0:4) ?
Or maybe something else ?
Thank you very much,
Nicolas
The text was updated successfully, but these errors were encountered: