annotate fasta_clipping_histogram.xml @ 1:f666895cbebd draft

planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
author devteam
date Wed, 11 Nov 2015 12:36:37 -0500
parents f2ab5b44870d
children 9db07fd39f85
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
1 <tool id="cshl_fasta_clipping_histogram" name="Length Distribution" version="1.0.0">
1
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
2 <description>chart</description>
0
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
3 <requirements>
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
4 <requirement type="package" version="0.0.13">fastx_toolkit</requirement>
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
5 </requirements>
1
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
6 <command>fasta_clipping_histogram.pl $input $outfile</command>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
7
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
8 <inputs>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
9 <param format="fasta" name="input" type="data" label="Library to analyze" />
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
10 </inputs>
0
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
11
1
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
12 <outputs>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
13 <data format="png" name="outfile" metadata_source="input" />
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
14 </outputs>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
15 <tests>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
16 </tests>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
17 <help>
0
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
18 **What it does**
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
19
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
20 This tool creates a histogram image of sequence lengths distribution in a given fasta dataset file.
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
21
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
22 **TIP:** Use this tool after clipping your library (with **FASTX Clipper tool**), to visualize the clipping results.
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
23
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
24 -----
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
25
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
26 **Output Examples**
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
27
1
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
28 In the following library, most sequences are 24-mers to 27-mers.
0
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
29 This could indicate an abundance of endo-siRNAs (depending of course of what you've tried to sequence in the first place).
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
30
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
31 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_1.png
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
32
1
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
33 In the following library, most sequences are 19,22 or 23-mers.
0
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
34 This could indicate an abundance of miRNAs (depending of course of what you've tried to sequence in the first place).
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
35
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
36 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_2.png
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
37
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
38 -----
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
39
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
40 **Input Formats**
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
41
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
42 This tool accepts short-reads FASTA files. The reads don't have to be short, but they do have to be on a single line, like so::
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
43
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
44 >sequence1
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
45 AGTAGTAGGTGATGTAGAGAGAGAGAGAGTAG
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
46 >sequence2
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
47 GTGTGTGTGGGAAGTTGACACAGTA
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
48 >sequence3
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
49 CCTTGAGATTAACGCTAATCAAGTAAAC
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
50
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
51 If the sequences span over multiple lines::
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
52
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
53 >sequence1
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
54 CAGCATCTACATAATATGATCGCTATTAAACTTAAATCTCCTTGACGGAG
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
55 TCTTCGGTCATAACACAAACCCAGACCTACGTATATGACAAAGCTAATAG
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
56 aactggtctttacctTTAAGTTG
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
57
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
58 Use the **FASTA Width Formatter** tool to re-format the FASTA into a single-lined sequences::
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
59
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
60 >sequence1
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
61 CAGCATCTACATAATATGATCGCTATTAAACTTAAATCTCCTTGACGGAGTCTTCGGTCATAACACAAACCCAGACCTACGTATATGACAAAGCTAATAGaactggtctttacctTTAAGTTG
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
62
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
63 -----
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
64
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
65 **Multiplicity counts (a.k.a reads-count)**
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
66
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
67 If the sequence identifier (the text after the '>') contains a dash and a number, it is treated as a multiplicity count value (i.e. how many times that individual sequence repeated in the original FASTA file, before collapsing).
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
68
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
69 Example 1 - The following FASTA file *does not* have multiplicity counts::
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
70
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
71 >seq1
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
72 GGATCC
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
73 >seq2
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
74 GGTCATGGGTTTAAA
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
75 >seq3
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
76 GGGATATATCCCCACACACACACAC
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
77
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
78 Each sequence is counts as one, to produce the following chart:
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
79
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
80 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_3.png
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
81
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
82 Example 2 - The following FASTA file have multiplicity counts::
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
83
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
84 >seq1-2
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
85 GGATCC
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
86 >seq2-10
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
87 GGTCATGGGTTTAAA
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
88 >seq3-3
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
89 GGGATATATCCCCACACACACACAC
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
90
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
91 The first sequence counts as 2, the second as 10, the third as 3, to produce the following chart:
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
92
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
93 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_4.png
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
94
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
95 Use the **FASTA Collapser** tool to create FASTA files with multiplicity counts.
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
96
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
97 ------
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
98
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
99 This tool is based on `FASTX-toolkit`__ by Assaf Gordon.
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
100
f2ab5b44870d Uploaded tool tarball.
devteam
parents:
diff changeset
101 .. __: http://hannonlab.cshl.edu/fastx_toolkit/
1
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
102 </help>
f666895cbebd planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tool_collections/fastx_toolkit/fasta_clipping_histogram commit a1517c9d22029095120643bbe2c8fa53754dd2b7
devteam
parents: 0
diff changeset
103 </tool>