comparison rgPicardInsertSize.xml @ 4:f4d018471628 draft default tip

Uploaded
author jpruab
date Tue, 13 Aug 2013 12:09:14 -0400
parents
children
comparison
equal deleted inserted replaced
3:08b477977410 4:f4d018471628
1 <tool name="Insertion size metrics" id="PicardInsertSize" version="1.56.0">
2 <description>for PAIRED data</description>
3 <requirements><requirement type="package" version="1.56.0">picard</requirement></requirements>
4 <command interpreter="python">
5 picard_wrapper.py -i "${input_file}" -n "${out_prefix}" --tmpdir "${__new_file_path__}" --deviations "${deviations}"
6 --histwidth "${histWidth}" --minpct "${minPct}" --malevel "${malevel}"
7 -j "\$JAVA_JAR_PATH/CollectInsertSizeMetrics.jar" -d "${html_file.files_path}" -t "${html_file}"
8 </command>
9 <inputs>
10 <param format="bam,sam" name="input_file" type="data" label="SAM/BAM dataset to generate statistics for"
11 help="If empty, upload or import a SAM/BAM dataset."/>
12 <param name="out_prefix" value="Insertion size metrics" type="text"
13 label="Title for the output file" help="Use this remind you what the job was for" size="120" />
14 <param name="deviations" value="10.0" type="float"
15 label="Deviations" size="5"
16 help="See Picard documentation: Generate mean, sd and plots by trimming the data down to MEDIAN + DEVIATIONS*MEDIAN_ABSOLUTE_DEVIATION" />
17 <param name="histWidth" value="0" type="integer"
18 label="Histogram width" size="5"
19 help="Explicitly sets the histogram width option - leave 0 to ignore" />
20 <param name="minPct" value="0.05" type="float"
21 label="Minimum percentage" size="5"
22 help="Discard any data categories (out of FR, TANDEM, RF) that have fewer than this percentage of overall reads" />
23 <param name="malevel" value="0" type="select" multiple="true" label="Metric Accumulation Level"
24 help="Level(s) at which metrics will be accumulated">
25 <option value="ALL_READS" selected="true">All reads (default)</option>
26 <option value="SAMPLE" default="true">Sample</option>
27 <option value="LIBRARY" default="true">Library</option>
28 <option value="READ_GROUP" default="true">Read group</option>
29 </param>
30 </inputs>
31 <outputs>
32 <data format="html" name="html_file" label="InsertSize_${out_prefix}.html"/>
33 </outputs>
34 <tests>
35 <test>
36 <param name="input_file" value="picard_input_tiny.sam" />
37 <param name="out_prefix" value="Insertion size metrics" />
38 <param name="deviations" value="10.0" />
39 <param name="histWidth" value="0" />
40 <param name="minPct" value="0.01" />
41 <param name="malevel" value="ALL_READS" />
42 <output name="html_file" file="picard_output_insertsize_tinysam.html" ftype="html" compare="contains" lines_diff="40" />
43 </test>
44 </tests>
45 <help>
46
47
48 .. class:: infomark
49
50 **Purpose**
51
52 Reads a SAM or BAM file and describes the distribution
53 of insert size (excluding duplicates) with metrics and a histogram plot.
54
55 **Picard documentation**
56
57 This is a Galaxy wrapper for CollectInsertSizeMetrics, a part of the external package Picard-tools_.
58
59 .. _Picard-tools: http://www.google.com/search?q=picard+samtools
60
61 .. class:: warningmark
62
63 **Useful for paired data only**
64
65 This tool works for paired data only and can be expected to fail for single end data.
66
67 -----
68
69 .. class:: infomark
70
71 **Inputs, outputs, and parameters**
72
73 Picard documentation says (reformatted for Galaxy):
74
75 .. csv-table::
76 :header-rows: 1
77
78 Option,Description
79 "INPUT=File","SAM or BAM file Required."
80 "OUTPUT=File","File to write insert size metrics to Required."
81 "HISTOGRAM_FILE=File","File to write insert size histogram chart to Required."
82 "TAIL_LIMIT=Integer","When calculating mean and stdev stop when the bins in the tail of the distribution contain fewer than mode/TAIL_LIMIT items. This also limits how much data goes into each data category of the histogram."
83 "HISTOGRAM_WIDTH=Integer","Explicitly sets the histogram width, overriding the TAIL_LIMIT option. Also, when calculating mean and stdev, only bins LE HISTOGRAM_WIDTH will be included. "
84 "MINIMUM_PCT=Float","When generating the histogram, discard any data categories (out of FR, TANDEM, RF) that have fewer than this percentage of overall reads. (Range: 0 to 1) Default value: 0.01."
85 "STOP_AFTER=Integer","Stop after processing N reads, mainly for debugging. Default value: 0."
86 "CREATE_MD5_FILE=Boolean","Whether to create an MD5 digest for any BAM files created. Default value: false."
87
88 .. class:: warningmark
89
90 **Warning on SAM/BAM quality**
91
92 Many SAM/BAM files produced externally and uploaded to Galaxy do not fully conform to SAM/BAM specifications. Galaxy deals with this by using the **LENIENT**
93 flag when it runs Picard, which allows reads to be discarded if they're empty or don't map. This appears
94 to be the only way to deal with SAM/BAM that cannot be parsed.
95
96 </help>
97 </tool>