annotate variant_apply_recalibration.xml @ 0:350a4d0d1678 draft default tip

Imported from capsule None
author devteam
date Tue, 01 Apr 2014 10:48:32 -0400
<tool id="gatk_variant_apply_recalibration" name="Apply Variant Recalibration" version="0.0.4">
  <description></description>
  <requirements>
    <requirement type="package" version="1.4">gatk</requirement>
  </requirements>
  <macros>
    <import>gatk_macros.xml</import>
  </macros>
  <command interpreter="python">gatk_wrapper.py
   --max_jvm_heap_fraction "1"
   --stdout "${output_log}"
   #for $var_count, $variant in enumerate( $reference_source.variants ):
       -d "--input:input_${var_count},%(file_type)s" "${variant.input_variants}" "${variant.input_variants.ext}" "input_variants_${var_count}"
   #end for
   -p 'java
    -jar "\$JAVA_JAR_PATH/GenomeAnalysisTK.jar"
    -T "ApplyRecalibration"
    ##--num_threads 4 ##hard coded, for now
    -et "NO_ET" ##ET no phone home
    #if $reference_source.reference_source_selector != "history":
        -R "${reference_source.ref_file.fields.path}"
    #end if
    --recal_file "${reference_source.input_recal}"
    --tranches_file "${reference_source.input_tranches}"
    --out "${output_variants}"
   '

   #include source=$standard_gatk_options#

   ##start analysis specific options
   -p '
    --mode "${mode}"

    #for $ignore_filter in $ignore_filters:
        #set $ignore_filter_name = str( $ignore_filter.ignore_filter_type.ignore_filter_type_selector )
        #if $ignore_filter_name == "custom":
            #set $ignore_filter_name = str( $ignore_filter.ignore_filter_type.filter_name )
        #end if
        --ignore_filter "${ignore_filter_name}"
    #end for
    --ts_filter_level "${ts_filter_level}"
   '
  </command>
  <inputs>
    <conditional name="reference_source">
      <expand macro="reference_source_selector_param" />
      <when value="cached">
        <repeat name="variants" title="Variant" min="1" help="-input,--input &amp;lt;input&amp;gt;">
          <param name="input_variants" type="data" format="vcf" label="Variant file to annotate"/>
        </repeat>
        <param name="input_recal" type="data" format="gatk_recal" label="Variant Recalibration file" help="-recalFile,--recal_file &amp;lt;recal_file&amp;gt;" />
        <param name="input_tranches" type="data" format="gatk_tranche" label="Variant Tranches file" help="-tranchesFile,--tranches_file &amp;lt;tranches_file&amp;gt;" />
        <param name="ref_file" type="select" label="Using reference genome" help="-R,--reference_sequence &amp;lt;reference_sequence&amp;gt;">
          <options from_data_table="gatk_picard_indexes">
            <!-- <filter type="data_meta" key="dbkey" ref="variants[0].input_variants" column="dbkey"/> -->
          </options>
          <validator type="no_options" message="A built-in reference genome is not available for the build associated with the selected input file"/>
        </param>
      </when>
      <when value="history"> <!-- FIX ME!!!! -->
        <repeat name="variants" title="Variant" min="1" help="-input,--input &amp;lt;input&amp;gt;">
          <param name="input_variants" type="data" format="vcf" label="Variant file to annotate" />
        </repeat>
        <param name="input_recal" type="data" format="gatk_recal" label="Variant Recalibration file" help="-recalFile,--recal_file &amp;lt;recal_file&amp;gt;" />
        <param name="input_tranches" type="data" format="gatk_tranche" label="Variant Tranches file" help="-tranchesFile,--tranches_file &amp;lt;tranches_file&amp;gt;" />
        <param name="ref_file" type="data" format="fasta" label="Using reference file" help="-R,--reference_sequence &amp;lt;reference_sequence&amp;gt;" />
      </when>
    </conditional>

    <expand macro="gatk_param_type_conditional" />

    <param name="mode" type="select" label="Recalibration mode" help="-mode,--mode &amp;lt;mode&amp;gt;">
      <option value="SNP" selected="True">SNP</option>
      <option value="INDEL">INDEL</option>
      <option value="BOTH">BOTH</option>
    </param>
    <repeat name="ignore_filters" title="Ignore Filter" help="-ignoreFilter,--ignore_filter &amp;lt;ignore_filter&amp;gt;">
      <conditional name="ignore_filter_type">
        <param name="ignore_filter_type_selector" type="select" label="Filter Type">
          <option value="HARD_TO_VALIDATE">HARD_TO_VALIDATE</option>
          <option value="LowQual">LowQual</option>
          <option value="custom" selected="True">Other</option>
        </param>
        <when value="custom">
          <param name="filter_name" type="text" value="" label="Filter name"/>
        </when>
        <when value="HARD_TO_VALIDATE" />
        <when value="LowQual" />
      </conditional>
    </repeat>
    <param name="ts_filter_level" type="float" label="Truth sensitivity level at which to start filtering" value="99.0" help="-ts_filter_level,--ts_filter_level &amp;lt;ts_filter_level&amp;gt;"/>
  </inputs>
  <outputs>
    <data format="vcf" name="output_variants" label="${tool.name} on ${on_string} (Variants File)" />
    <data format="txt" name="output_log" label="${tool.name} on ${on_string} (log)" />
  </outputs>
  <tests>
    <!-- ADD TESTS -->
  </tests>
  <help>
**What it does**

Applies cuts to the input VCF file (by adding filter lines) to achieve the desired novel FDR levels that were specified during VariantRecalibration.

For more information on using the ApplyRecalibration module, see this `tool specific page &lt;http://www.broadinstitute.org/gsa/wiki/index.php/Variant_quality_score_recalibration&gt;`_.

To learn about best practices for variant detection using GATK, see this `overview &lt;http://www.broadinstitute.org/gsa/wiki/index.php/Best_Practice_Variant_Detection_with_the_GATK_v3&gt;`_.

If you encounter errors, please view the `GATK FAQ &lt;http://www.broadinstitute.org/gsa/wiki/index.php/Frequently_Asked_Questions&gt;`_.

------

**Inputs**

GenomeAnalysisTK: ApplyRecalibration accepts a variant input file, a recalibration file, and a tranches file.


**Outputs**

The output is in VCF format.


Go `here &lt;http://www.broadinstitute.org/gsa/wiki/index.php/Input_files_for_the_GATK&gt;`_ for details on GATK file formats.

-------

**Settings**::


 recal_file         The input recal file used by ApplyRecalibration
 tranches_file      The input tranches file describing where to cut the data
 out                The output filtered, recalibrated VCF file
 ts_filter_level    The truth sensitivity level at which to start filtering
 ignore_filter      If specified, the optimizer will use variants even if the specified filter name is marked in the input VCF file
 mode               Recalibration mode to employ: 1.) SNP for recalibrating only SNPs (emitting indels untouched in the output VCF); 2.) INDEL for indels; and 3.) BOTH for recalibrating both SNPs and indels simultaneously. (SNP|INDEL|BOTH)

@CITATION_SECTION@
  </help>
</tool>
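
The analysis-specific part of the `<command>` template in the listing above loops over `ignore_filters` and swaps in the user-typed name whenever the selector is `custom`. As a rough illustration only, the same logic can be sketched in plain Python. The function name `build_analysis_options` and the dictionary layout used for each filter entry are assumptions made for this sketch; they are not part of `gatk_wrapper.py` or of Galaxy.

```python
def build_analysis_options(mode, ignore_filters, ts_filter_level):
    """Mirror the analysis-specific options of the <command> template.

    Hypothetical helper: each entry of ignore_filters is assumed to be a
    dict with a "selector" key and, when the selector is "custom", a
    "filter_name" key holding the name typed by the user.
    """
    args = ["--mode", mode]
    for entry in ignore_filters:
        name = entry["selector"]
        # "custom" corresponds to the "Other" option in the tool form.
        if name == "custom":
            name = entry["filter_name"]
        args += ["--ignore_filter", name]
    args += ["--ts_filter_level", str(ts_filter_level)]
    return args


if __name__ == "__main__":
    example = build_analysis_options(
        "SNP",
        [
            {"selector": "LowQual"},
            {"selector": "custom", "filter_name": "MyFilter"},
        ],
        99.0,
    )
    print(" ".join(example))
```

Returning a list of arguments rather than one joined string keeps quoting concerns out of the sketch; the real template instead emits a single quoted `-p '...'` argument for the wrapper.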