Mercurial > repos > damion > versioned_data
comparison versioned_data.xml @ 1:5c5027485f7d draft
Uploaded correct file
| author | damion |
|---|---|
| date | Sun, 09 Aug 2015 16:07:50 -0400 |
| parents | |
| children |
comparison
equal
deleted
inserted
replaced
| 0:d31a1bd74e63 | 1:5c5027485f7d |
|---|---|
| 1 <tool id="versioned_data" name="Versioned data retrieval" version="0.1.03"> | |
| 2 <description>Retrieve versioned sequence files and/or their blast, bowtie, etc. database indexes</description> | |
| 3 <macros> | |
| 4 <token name="@BINARY@">versioned_data.py</token> | |
| 5 <import>bccdc_macros.xml</import> | |
| 6 </macros> | |
| 7 <expand macro="requirements" /> | |
| 8 <command interpreter="python"> | |
| 9 #assert $__user__, Exception( 'You must be logged in to use this tool.' ) | |
| 10 versioned_data.py | |
| 11 #if $globalRetrievalDate.strip() > '' | |
| 12 -d "$globalRetrievalDate" | |
| 13 #end if | |
| 14 -r | |
| 15 " | |
| 16 #for $v in $versions: | |
| 17 ${v.database}, | |
| 18 #for $r in $v.retrieval: | |
| 19 ${r.retrievalId} | |
| 20 #end for | |
| 21 , | |
| 22 #for $w in $v.workflows: | |
| 23 ${w.workflow} | |
| 24 #end for | |
| 25 | | |
| 26 #end for | |
| 27 " | |
| 28 -o "$log" | |
| 29 -O "$__app__.security.encode_id($log.id)" | |
| 30 --api_info_path "$api_info_path" ##Actually a file path to configfile that holds api key | |
| 31 </command> | |
| 32 <!-- #:$log.hid:$log.id dataset_id --> | |
| 33 <expand macro="stdio" /> | |
| 34 | |
| 35 <inputs> | |
| 36 <!-- Implement as datepicker? http://www.learnfaceit.org/for-developers/adding-parameter-types-to-tool --> | |
| 37 <param name="globalRetrievalDate" type="text" label="Global retrieval date [YYYY-MM-DD]" help="The recall system will use this date to try to select the appropriate versions below. Leave empty to select current versions." size="25" /> | |
| 38 | |
| 39 <param name="api_info" display="radio" type="drill_down" label="For user with Galaxy API Key" dynamic_options="vdb_init_tool_user(__trans__)" /> | |
| 40 | |
| 41 <repeat name="versions" title="Data Source" min="1" max="15"> | |
| 42 | |
| 43 <param name="database" type="select" label="Data" dynamic_options="vdb_get_databases()" multiple="false" /> | |
| 44 | |
| 45 <repeat name="retrieval" title="Retrieval" min="0" max="1"> | |
| 46 <param name="retrievalId" label="Version date/id" type="select" dynamic_options="vdb_get_versions(database, globalRetrievalDate)"/> | |
| 47 </repeat> | |
| 48 | |
| 49 <repeat name="workflows" title="Workflow" min="0" max="5" > | |
| 50 <param name="workflow" type="select" label="Name" dynamic_options="vdb_get_workflows(database)" /> | |
| 51 </repeat> | |
| 52 | |
| 53 </repeat> | |
| 54 | |
| 55 </inputs> | |
| 56 | |
| 57 <configfiles> | |
| 58 <configfile name="api_info_path">${__user__.api_keys[0].key} | |
| 59 $api_info | |
| 60 </configfile> | |
| 61 </configfiles> | |
| 62 | |
| 63 <outputs> | |
| 64 <data name="log" format="txt" label="Versioned Data Retrieval" /> | |
| 65 </outputs> | |
| 66 | |
| 67 <code file="versioned_data_form.py" /> | |
| 68 | |
| 69 <tests> | |
| 70 <test> | |
| 71 <param name="db_type" value="nucl"/> | |
| 72 <!-- ... --> | |
| 73 </test> | |
| 74 </tests> | |
| 75 | |
| 76 <help> | |
| 77 | |
| 78 .. class:: infomark | |
| 79 | |
| 80 | |
| 81 **What it does** | |
| 82 | |
| 83 This tool retrieves links to current or past versions of fasta or other types of | |
| 84 data from a cache kept in the Galaxy data library called "Versioned Data". It then places | |
| 85 them into one's current history so that subsequent tools can work with that data. | |
| 86 | |
| 87 For example, after using this tool to select a version of the NCBI nt database, a blast search can be carried out on it by selecting "BLAST database from your history" from the "Subject database/sequences" field of the NCBI BLAST+ search tool. | |
| 88 | |
| 89 You can select one or more files or databases by version date or id. This list | |
| 90 is supplied from the Shared Data > Data Libraries > Versioned Data folder that has | |
| 91 been set up by an administrator. | |
| 92 | |
| 93 The Workflows section allows you to select one or more pre-defined workflows | |
| 94 to execute on the versioned data. The results are placed in your history for use | |
| 95 by other tools or workflows. | |
| 96 | |
| 97 A caching system exists to cache the versioned data or workflow data that the tool generates. | |
| 98 If you request versioned data or derivative data that isn't cached, it may take time to regenerate. | |
| 99 | |
| 100 The top-level "Global retrieval date [YYYY-MM-DD]" field that the form starts with will be applied to | |
| 101 all selected databases. This can be overriden by a retrieval date or version that | |
| 102 you supply for a particular database. Leave it and any "Retrievals" inputs empty if you just need the latest version of selected databases. | |
| 103 | |
| 104 ------- | |
| 105 | |
| 106 .. class:: warningmark | |
| 107 | |
| 108 **Note** | |
| 109 | |
| 110 Again, some past database versions can take time to regenerate if there is no cached version available, for example NCBI nt is a 50+ gigabyte file that needs to be read through to get a fasta version, and a makeblastdb workflow on top of that can take hours on the first call. Access to cached versions is immediate. | |
| 111 | |
| 112 Setup of versioned data sources and workflow options can only be done by a Galaxy administrator. | |
| 113 | |
| 114 ------- | |
| 115 | |
| 116 **References** | |
| 117 | |
| 118 If you use this Galaxy tool in work leading to a scientific publication please | |
| 119 cite the following paper: | |
| 120 | |
| 121 *Reference coming soon...* | |
| 122 | |
| 123 </help> | |
| 124 </tool> |
