annotate SMART/galaxy/WrappGetLetterDistribution.xml @ 68:85e80c21b1f7 draft

Uploaded
author m-zytnicki
date Mon, 16 Nov 2015 12:00:32 -0500
parents 2c0c0a89fad7
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
38
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
1 <tool id="getLetterDistribution1" name="get letter distribution">
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
2 <description>Calculate distribution for each nucleotide per position for all short reads</description>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
3 <requirements>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
4 <requirement type="set_environment">PYTHONPATH</requirement>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
5 </requirements>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
6 <command interpreter="python">
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
7 WrappGetLetterDistribution.py -i $inputFileName
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
8 #if $formatType.FormatInputFileName == 'fasta':
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
9 -f fasta
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
10 #else :
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
11 -f fastq
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
12 #end if
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
13 -c $ouputFileNameCSV -a $ouputFileNamePNG1 -b $ouputFileNamePNG2
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
14 </command>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
15 <inputs>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
16 <conditional name="formatType">
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
17 <param name="FormatInputFileName" type="select" label="Input File Format">
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
18 <option value="fasta">fasta</option>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
19 <option value="fastq" selected="true">fastq</option>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
20 </param>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
21 <when value="fasta">
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
22 <param name="inputFileName" format="fasta" type="data" label="Fasta Input File"/>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
23 </when>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
24 <when value="fastq">
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
25 <param name="inputFileName" format="fastq" type="data" label="Fastq Input File"/>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
26 </when>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
27 </conditional>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
28 </inputs>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
29
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
30 <outputs>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
31 <data name="ouputFileNameCSV" format="tabular" label="[get letter distribution] CSV file"/>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
32 <data name="ouputFileNamePNG1" format="png" label="[get letter distribution] PNG file 1"/>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
33 <data name="ouputFileNamePNG2" format="png" label="[get letter distribution] PNG file 2"/>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
34 </outputs>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
35 <tests>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
36 <test>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
37 <param name="FormatInputFileName" value="fastq" />
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
38 <param name="inputFileName" value="short_fastq.fastq" />
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
39 <output name="outputFileNameCSV" file="exp_getletterdistribution_short_fastq.csv" />
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
40 </test>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
41 </tests>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
42
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
43 <help>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
44 The script gets the nucleotide distribution of the input sequence list. It outputs two files. The first file shows the nucleotide distribution of the data. More precisely, a point (*x*, *y*) on the curve **A** shows that *y* sequences have *x* % of **A**.
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
45
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
46 The second plot shows the average nucleotide distribution for each position of the read. You can use it to detect a bias in the first nucleotides, for instance. A point *x*, *y* on the curve **A** shows that at the position *x*, there are *y*% of **A**. A point (*x*, *y*) on the curve **#** tells you that *y* % of the sequences contain not less than *x* nucleotides. By definition, this latter line is a decreasing function. It usually explains why the tail of the other curves are sometimes erratic: there are few sequences.
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
47 </help>
2c0c0a89fad7 Uploaded
m-zytnicki
parents:
diff changeset
48 </tool>