Mercurial > repos > yufei-luo > s_mart
diff SMART/galaxy/WrappGetLetterDistribution.xml @ 15:440ceca58672
Uploaded
author | m-zytnicki |
---|---|
date | Mon, 22 Apr 2013 11:08:07 -0400 |
parents | 769e306b7933 |
children | 94ab73e8a190 |
line wrap: on
line diff
--- a/SMART/galaxy/WrappGetLetterDistribution.xml Fri Apr 19 10:13:11 2013 -0400 +++ b/SMART/galaxy/WrappGetLetterDistribution.xml Mon Apr 22 11:08:07 2013 -0400 @@ -1,5 +1,5 @@ <tool id="getLetterDistribution1" name="Get Letter Distribution"> - <description>Calculate distribution for each nucleotide per position for all short reads (S-MART)</description> + <description>Calculate distribution for each nucleotide per position for all short reads</description> <command interpreter="python"> WrappGetLetterDistribution.py -i $inputFileName #if $formatType.FormatInputFileName == 'fasta': @@ -28,6 +28,18 @@ <data name="ouputFileNameCSV" format="tabular" label="[getLetterDistribution] CSV File"/> <data name="ouputFileNamePNG1" format="png" label="[getLetterDistribution] PNG File 1"/> <data name="ouputFileNamePNG2" format="png" label="[getLetterDistribution] PNG File 2"/> - </outputs> + </outputs> + <tests> + <test> + <param name="FormatInputFileName" value="fastq" /> + <param name="inputFileName" value="short_fastq.fastq" /> + <output name="outputFileNameCSV" file="exp_getletterdistribution_short_fastq.csv" /> + </test> + </tests> + + <help> +The script gets the nucleotide distribution of the input sequence list. It outputs two files. The first file shows the nucleotide distribution of the data. More precisely, a point (*x*, *y*) on the curve **A** shows that *y* sequences have *x*% of **A**. + +The second plot shows the average nucleotide distribution for each position of the read. You can use it to detect a bias in the first nucleotides, for instance. A point *x*, *y* on the curve **A** shows that at the position *x*, there are *y*% of **A**. A point (*x*, *y*) on the curve **#** tells you that *y*% of the sequences contain not less than *x* nucleotides. By definition, this latter line is a decreasing function. It usually explains why the tail of the other curves are sometimes erratic: there are few sequences. + </help> </tool> -