Skip to content

Commit cc06f4f

Browse files
committed
GSvar: added check that WES/WGS samples were loaded completely.
1 parent 29798e9 commit cc06f4f

6 files changed

+79
-42
lines changed

doc/tools/CnvFilterAnnotations.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
### CnvFilterAnnotations tool help
2-
CnvFilterAnnotations (2020_03-184-g27235379)
2+
CnvFilterAnnotations (2021_06-89-gbbd16264)
33

44
Filter a copy-number variant list in TSV format based on variant annotations.
55

@@ -27,7 +27,7 @@
2727
mode - Compound-heterozygotes detection mode. [default=n/a] [valid=n/a,CNV-CNV,CNV-SNV/INDEL]
2828
CNV copy-number Filter for CNV copy number.
2929
Parameters:
30-
cn - Copy number [default=n/a] [valid=n/a,0,1,2,3,4+]
30+
cn - Copy number [valid=0,1,2,3,4,5+] [non-empty]
3131
CNV gene constraint Filter based on gene constraint (gnomAD o/e score for LOF variants).
3232
Note that gene constraint is most helpful for early-onset severe diseases.
3333
For details on gnomAD o/e, see https://macarthurlab.org/2018/10/17/gnomad-v2-1/
@@ -82,7 +82,7 @@
8282
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
8383

8484
### CnvFilterAnnotations changelog
85-
CnvFilterAnnotations 2020_03-184-g27235379
85+
CnvFilterAnnotations 2021_06-89-gbbd16264
8686

8787
2020-05-25 Initial version of the tool. Based on VariantFilterAnnotations.
8888
[back to ngs-bits](https://github.com/imgag/ngs-bits)

doc/tools/MappingQC.md

+4-2
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
### MappingQC tool help
2-
MappingQC (2020_12-85-g5ff87d17)
2+
MappingQC (2021_06-89-gbbd16264)
33

44
Calculates QC metrics based on mapped NGS reads.
55

@@ -28,6 +28,8 @@
2828
Valid: 'hg19,hg38'
2929
-ref <file> Reference genome FASTA file. If unset 'reference_genome' from the 'settings.ini' file is used.
3030
Default value: ''
31+
-cfdna Add additional QC parameters for cfDNA samples. Only supported mit '-roi'.
32+
Default value: 'false'
3133

3234
Special parameters:
3335
--help Shows this help and exits.
@@ -36,7 +38,7 @@
3638
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
3739

3840
### MappingQC changelog
39-
MappingQC 2020_12-85-g5ff87d17
41+
MappingQC 2021_06-89-gbbd16264
4042

4143
2021-02-09 Added new QC metrics for uniformity of coverage (QC:2000057-QC:2000061).
4244
2020-11-27 Added CRAM support.

doc/tools/NGSDExportAnnotationData.md

+22-20
Original file line numberDiff line numberDiff line change
@@ -1,34 +1,36 @@
11
### NGSDExportAnnotationData tool help
2-
NGSDExportAnnotationData (2021_06-48-gfc326851)
2+
NGSDExportAnnotationData (2021_06-89-gbbd16264)
33

44
Generates a VCF file with all variants and annotations from the NGSD and a BED file containing the gene information of the NGSD.
55

66
Mandatory parameters:
7-
-variants <file> Output variant list as VCF.
7+
-variants <file> Output variant list as VCF.
88

99
Optional parameters:
10-
-genes <file> Optional BED file containing the genes and the gene info (only germline).
11-
Default value: ''
12-
-reference <file> Reference genome FASTA file. If unset 'reference_genome' from the 'settings.ini' file is used.
13-
Default value: ''
14-
-test Uses the test database instead of on the production database.
15-
Default value: 'false'
16-
-max_af <float> Maximum allel frequency of exported variants (default: 0.05).
17-
Default value: '0.05'
18-
-gene_offset <int> Defines the number of bases by which the region of each gene is extended.
19-
Default value: '5000'
20-
-mode <enum> Determines the database which is exported.
21-
Default value: 'germline'
22-
Valid: 'germline,somatic'
10+
-genes <file> Optional BED file containing the genes and the gene info (only germline).
11+
Default value: ''
12+
-reference <file> Reference genome FASTA file. If unset 'reference_genome' from the 'settings.ini' file is used.
13+
Default value: ''
14+
-test Uses the test database instead of on the production database.
15+
Default value: 'false'
16+
-max_af <float> Maximum allel frequency of exported variants (default: 0.05).
17+
Default value: '0.05'
18+
-gene_offset <int> Defines the number of bases by which the region of each gene is extended.
19+
Default value: '5000'
20+
-mode <enum> Determines the database which is exported.
21+
Default value: 'germline'
22+
Valid: 'germline,somatic'
23+
-vicc_config_details Includes details about VICC interpretation. Works only in somatic mode.
24+
Default value: 'false'
2325

2426
Special parameters:
25-
--help Shows this help and exits.
26-
--version Prints version and exits.
27-
--changelog Prints changeloge and exits.
28-
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
27+
--help Shows this help and exits.
28+
--version Prints version and exits.
29+
--changelog Prints changeloge and exits.
30+
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
2931

3032
### NGSDExportAnnotationData changelog
31-
NGSDExportAnnotationData 2021_06-48-gfc326851
33+
NGSDExportAnnotationData 2021_06-89-gbbd16264
3234

3335
2021-07-19 Code and parameter refactoring.
3436
2021-07-19 Added support for 'germline_het' and 'germline_hom' columns in 'variant' table.

doc/tools/VcfAnnotateFromBed.md

+24-14
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
### VcfAnnotateFromBed tool help
2-
VcfAnnotateFromBed (2021_03-100-g09f25a25)
2+
VcfAnnotateFromBed (2021_06-89-gbbd16264)
33

44
Annotates the INFO column of a VCF with data from a BED file.
55

@@ -9,26 +9,36 @@
99
-> %0A; -> %0D; -> %20; , -> %2C; ; -> %3B; = -> %3D;
1010

1111
Mandatory parameters:
12-
-bed <file> BED file used as source of annotations (name column).
13-
-name <string> Annotation name in INFO column of output VCF file.
12+
-bed <file> BED file used as source of annotations (name column).
13+
-name <string> Annotation name in INFO column of output VCF file.
1414

1515
Optional parameters:
16-
-in <file> Input VCF file. If unset, reads from STDIN.
17-
Default value: ''
18-
-out <file> Output VCF list. If unset, writes to STDOUT.
19-
Default value: ''
20-
-sep <string> Separator used if there are several matches for one variant.
21-
Default value: ':'
16+
-in <file> Input VCF file. If unset, reads from STDIN.
17+
Default value: ''
18+
-out <file> Output VCF list. If unset, writes to STDOUT.
19+
Default value: ''
20+
-sep <string> Separator used if there are several matches for one variant.
21+
Default value: ':'
22+
-threads <int> The number of threads used to read, process and write files.
23+
Default value: '1'
24+
-block_size <int> Number of lines processed in one chunk.
25+
Default value: '5000'
26+
-prefetch <int> Maximum number of chunks that may be pre-fetched into memory.
27+
Default value: '64'
28+
-debug <int> Enables debug output at the given interval in milliseconds (disabled by default, cannot be combined with writing to STDOUT).
29+
Default value: '-1'
2230

2331
Special parameters:
24-
--help Shows this help and exits.
25-
--version Prints version and exits.
26-
--changelog Prints changeloge and exits.
27-
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
32+
--help Shows this help and exits.
33+
--version Prints version and exits.
34+
--changelog Prints changeloge and exits.
35+
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
2836

2937
### VcfAnnotateFromBed changelog
30-
VcfAnnotateFromBed 2021_03-100-g09f25a25
38+
VcfAnnotateFromBed 2021_06-89-gbbd16264
3139

40+
2021-09-18 Prefetch only part of input file (to save memory).
41+
2021-08-24 Added multithread support.
3242
2021-06-15 Added 'sep' parameter.
3343
2019-12-06 Added URL encoding for INFO values.
3444
2017-03-14 Initial implementation.

doc/tools/VcfAnnotateFromVcf.md

+5-2
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
### VcfAnnotateFromVcf tool help
2-
VcfAnnotateFromVcf (2020_03-184-g27235379)
2+
VcfAnnotateFromVcf (2021_06-89-gbbd16264)
33

44
Annotates the INFO column of a VCF with data from another VCF file (or multiple VCF files if config file is provided).
55

@@ -24,6 +24,8 @@
2424
Default value: '1'
2525
-block_size <int> Number of lines processed in one chunk.
2626
Default value: '5000'
27+
-prefetch <int> Maximum number of chunks that may be pre-fetched into memory.
28+
Default value: '64'
2729

2830
Special parameters:
2931
--help Shows this help and exits.
@@ -32,8 +34,9 @@
3234
--tdx Writes a Tool Definition Xml file. The file name is the application name with the suffix '.tdx'.
3335

3436
### VcfAnnotateFromVcf changelog
35-
VcfAnnotateFromVcf 2020_03-184-g27235379
37+
VcfAnnotateFromVcf 2021_06-89-gbbd16264
3638

39+
2021-09-20 Prefetch only part of input file (to save memory).
3740
2020-04-11 Added multithread support by Julian Fratte.
3841
2019-08-19 Added support for multiple annotations files through config file.
3942
2019-08-14 Added VCF.GZ support.

src/GSvar/MainWindow.cpp

+21-1
Original file line numberDiff line numberDiff line change
@@ -2834,10 +2834,30 @@ void MainWindow::checkVariantList(QStringList messages)
28342834
}
28352835
}
28362836

2837+
//check data was loaded completely
2838+
if (germlineReportSupported(true))
2839+
{
2840+
NGSD db;
2841+
int sys_id = db.processingSystemIdFromProcessedSample(germlineReportSample());
2842+
ProcessingSystemData sys_data = db.getProcessingSystemData(sys_id);
2843+
if (sys_data.type=="WES" || sys_data.type=="WGS")
2844+
{
2845+
QSet<Chromosome> chromosomes;
2846+
for(int i=0; i<variants_.count(); ++i)
2847+
{
2848+
chromosomes << variants_[i].chr();
2849+
}
2850+
if (chromosomes.size()<23)
2851+
{
2852+
messages << ("Variants detected on " + QString::number(chromosomes.size()) + " chromosomes only! Expected variants on at least 23 chromosomes for WES/WGS data!");
2853+
}
2854+
}
2855+
}
2856+
28372857
//show messages
28382858
if (!messages.empty())
28392859
{
2840-
QMessageBox::warning(this, "GSvar file outdated", "The GSvar file contains the following error(s):\n -" + messages.join("\n -") + "\n\nTo ensure that GSvar works as expected, re-run the annotation steps for the analysis!");
2860+
QMessageBox::warning(this, "GSvar file problems", "The GSvar file contains the following problems:\n -" + messages.join("\n -"));
28412861
}
28422862
}
28432863

0 commit comments

Comments
 (0)