comparison tools/cgatools17/snpdiff_v17.xml @ 1:3a2e0f376f26 draft

Minor change to tv2vcf.xml to allow for workflow automation
author dgdekoning
date Wed, 21 Oct 2015 10:09:15 -0400
parents
children
comparison
equal deleted inserted replaced
0:751b62d30ae1 1:3a2e0f376f26
1 <tool id="cg_snpdiff" name="SNPDiff" version="1.7.1">
2
3 <description>compares snp calls to a Complete Genomics variant file.</description>
4
5 <requirements>
6 <requirement type="package" version="1">cgatools17</requirement>
7 </requirements>
8
9 <command>
10 cgatools | head -1;
11 cgatools snpdiff
12 --reference ${crr.fields.crr_path}
13 --variants $varfile
14 --genotypes $genotype
15 --output-prefix cg_
16 --reports `echo ${report1} ${report2} ${report3} | sed 's/ */,/g'`
17 </command>
18
19 <inputs>
20 <param name="crr" type="select" label="Reference build">
21 <options from_data_table="cg_anno_files" />
22 </param>
23 <param name="varfile" type="data" format="cg_var" label="Var file"/>
24
25 <param name="genotype" type="data" format="tabular" label="Genotypes file with SNP calls" help="The genotypes file is a tab-delimited file with at least the following columns (additional columns may be given): Chromosome (Required), Offset0Based (Required), GenotypesStrand (Optional), Genotypes (Optional)"/>
26
27 <param name="report1" type="select" label="Create report Output">
28 <option value="">no</option>
29 <option value="Output">yes</option>
30 </param>
31 <param name="report2" type="select" label="Create report Verbose">
32 <option value="">no</option>
33 <option value="Verbose">yes</option>
34 </param>
35 <param name="report3" type="select" label="Create report Stats">
36 <option value="">no</option>
37 <option value="Stats">yes</option>
38 </param>
39 <param name="fname" type="text" value="" label="Prefix for your output file" help="Optional"/>
40 </inputs>
41
42 <outputs>
43 <data format="tabular" name="output1" from_work_dir="cg_Output.tsv" label="$fname ${tool.name} on ${on_string}: Output">
44 <filter>(report1 == 'Output')</filter>
45 </data>
46 <data format="tabular" name="output2" from_work_dir="cg_Verbose.tsv" label="$fname ${tool.name} on ${on_string}: Verbose">
47 <filter>(report2 == 'Verbose')</filter>
48 </data>
49 <data format="tabular" name="output3" from_work_dir="cg_Stats.tsv" label="$fname ${tool.name} on ${on_string}: Stats">
50 <filter>(report3 == 'Stats')</filter>
51 </data>
52 </outputs>
53 <help>
54 **What it does**
55
56 This tool ompares snp calls to a Complete Genomics variant file.
57
58 **cgatools 1.7.1 Documentation**
59
60 Userguide: http://cgatools.sourceforge.net/docs/1.7.1/cgatools-user-guide.pdf
61
62 Release notes: http://cgatools.sourceforge.net/docs/1.7.1/cgatools-release-notes.pdf
63
64 **Command line reference**::
65
66 COMMAND NAME
67 snpdiff - Compares snp calls to a Complete Genomics variant file.
68
69 DESCRIPTION
70 Compares the snp calls in the "genotypes" file to the calls in a Complete
71 Genomics variant file. The genotypes file is a tab-delimited file with at
72 least the following columns (additional columns may be given):
73
74 Chromosome (Required) The name of the chromosome.
75 Offset0Based (Required) The 0-based offset in the chromosome.
76 GenotypesStrand (Optional) The strand of the calls in the Genotypes
77 column (+ or -, defaults to +).
78 Genotypes (Optional) The calls, one per allele. The following
79 calls are recognized:
80 A,C,G,T A called base.
81 N A no-call.
82 - A deleted base.
83 . A non-snp variation.
84
85 The output is a tab-delimited file consisting of the columns of the
86 original genotypes file, plus the following additional columns:
87
88 Reference The reference base at the given position.
89 VariantFile The calls made by the variant file, one per allele.
90 The character codes are the same as is described for
91 the Genotypes column.
92 DiscordantAlleles (Only if Genotypes is present) The number of
93 Genotypes alleles that are discordant with calls in
94 the VariantFile. If the VariantFile is described as
95 haploid at the given position but the Genotypes is
96 diploid, then each genotype allele is compared
97 against the haploid call of the VariantFile.
98 NoCallAlleles (Only if Genotypes is present) The number of
99 Genotypes alleles that were no-called by the
100 VariantFile. If the VariantFile is described as
101 haploid at the given position but the Genotypes is
102 diploid, then a VariantFile no-call is counted twice.
103
104 The verbose output is a tab-delimited file consisting of the columns of the
105 original genotypes file, plus the following additional columns:
106
107 Reference The reference base at the given position.
108 VariantFile The call made by the variant file for one allele (there is
109 a line in this file for each allele). The character codes
110 are the same as is described for the Genotypes column.
111 [CALLS] The rest of the columns are pasted in from the VariantFile,
112 describing the variant file line used to make the call.
113
114 The stats output is a comma-separated file with several tables describing
115 the results of the snp comparison, for each diploid genotype. The tables
116 all describe the comparison result (column headers) versus the genotype
117 classification (row labels) in different ways. The "Locus classification"
118 tables have the most detailed match classifications, while the "Locus
119 concordance" tables roll these match classifications up into "discordance"
120 and "no-call". A locus is considered discordant if it is discordant for
121 either allele. A locus is considered no-call if it is concordant for both
122 alleles but has a no-call on either allele. The "Allele concordance"
123 describes the comparison result on a per-allele basis.
124
125 OPTIONS
126 -h [ --help ]
127 Print this help message.
128
129 --reference arg
130 The input crr file.
131
132 --variants arg
133 The input variant file.
134
135 --genotypes arg
136 The input genotypes file.
137
138 --output-prefix arg
139 The path prefix for all output reports.
140
141 --reports arg (=Output,Verbose,Stats)
142 Comma-separated list of reports to generate. A report is one of:
143 Output The output genotypes file.
144 Verbose The verbose output file.
145 Stats The stats output file.
146
147 SUPPORTED FORMAT_VERSION
148 0.3 or later
149 </help>
150 </tool>