annotate tools/ncbi_blast_plus/blastxml_to_top_descr.xml @ 6:8a0771c5e236 draft
Uploaded v0.0.5, reports number of queries with hits in stdout (peek text).
author | peterjc |
---|---|
date | Wed, 17 Apr 2013 10:53:58 -0400 |
parents | 662fea0fe6b2 |
children | a7add03a44fd |
rev | line source |
---|---|
<tool id="blastxml_to_top_descr" name="BLAST top hit descriptions" version="0.0.5">
    <description>Make a table from BLAST XML</description>
    <version_command interpreter="python">blastxml_to_top_descr.py --version</version_command>
    <command interpreter="python">
5:662fea0fe6b2 peterjc (parents: 0): Uploaded v0.0.4, quotes filenames in case the contain spaces
        blastxml_to_top_descr.py "${blastxml_file}" "${tabular_file}" ${topN}
    </command>
    <inputs>
        <param name="blastxml_file" type="data" format="blastxml" label="BLAST results as XML"/>
        <param name="topN" type="integer" min="1" max="100" optional="false" label="Number of descriptions" value="3"/>
    </inputs>
    <outputs>
        <data name="tabular_file" format="tabular" label="Top $topN descriptions from $blastxml_file.name" />
    </outputs>
    <requirements>
    </requirements>
    <tests>
        <test>
            <param name="blastxml_file" value="blastp_four_human_vs_rhodopsin.xml" ftype="blastxml" />
            <param name="topN" value="3" />
            <output name="tabular_file" file="blastp_four_human_vs_rhodopsin_top3.tabular" ftype="tabular" />
        </test>
    </tests>
    <help>

**What it does**

NCBI BLAST+ (and the older NCBI 'legacy' BLAST) can output in a range of
formats including text, tabular and a more detailed XML format. You can
do a lot of things with tabular files in Galaxy (sorting, filtering, joins,
etc.); however, the BLAST tabular output currently omits the hit descriptions
found in the other output formats.

This tool turns a BLAST XML file into a simple tabular file with one row
per query sequence, giving the query identifier followed by the top hit
descriptions (three by default). If a query has fewer hits than that,
the remaining entries are left blank.

**Example Usage**

One simple usage would be to take a transcriptome assembly or set of
gene predictions, run a BLAST search against the NCBI NR database, and
then use this tool to make a table of the top three BLAST hits. This
can give you a 'quick and dirty' annotation, potentially enough
to spot some problems (e.g. bacterial contamination could be very
obvious).

    </help>
</tool>
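
The conversion described in the help text (one row per query, the query identifier followed by the top N hit descriptions, blank-padded when there are fewer hits) can be sketched in Python as below. This is a minimal illustration and not the contents of blastxml_to_top_descr.py: the function names top_descriptions and write_table are hypothetical, and taking the first word of Iteration_query-def as the query identifier is an assumption. The element names Iteration, Iteration_query-def, Hit and Hit_def are those used in NCBI BLAST XML output.

```python
import xml.etree.ElementTree as ET


def top_descriptions(blastxml, top_n=3):
    """Yield one row per query: [query_id, descr_1, ..., descr_top_n].

    `blastxml` is a filename or a binary file-like object.
    Hypothetical helper, not the tool's actual implementation.
    """
    # Stream the file so large BLAST XML outputs need not fit in memory.
    for _event, elem in ET.iterparse(blastxml, events=("end",)):
        if elem.tag != "Iteration":
            continue
        # Assumption: the query identifier is the first word of the
        # query definition line.
        query_def = elem.findtext("Iteration_query-def", default="")
        query_id = query_def.split(None, 1)[0] if query_def.strip() else ""
        # Hits appear in the XML in ranked order; keep the first top_n.
        descriptions = [hit.findtext("Hit_def", default="")
                        for hit in elem.iter("Hit")][:top_n]
        # Pad with blanks when the query has fewer than top_n hits.
        descriptions += [""] * (top_n - len(descriptions))
        yield [query_id] + descriptions
        # Release the finished iteration's children.
        elem.clear()


def write_table(blastxml, out_handle, top_n=3):
    """Write the rows as tab-separated text; return the number of queries."""
    count = 0
    for row in top_descriptions(blastxml, top_n):
        out_handle.write("\t".join(row) + "\n")
        count += 1
    return count
```

Under these assumptions, calling write_table("blast_results.xml", sys.stdout, 3) (after import sys, and with a hypothetical input filename) would print one tab-separated line per query.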