annotate README.md @ 42:b938475235e3 draft

Uploaded
author fubar
date Sun, 16 Aug 2020 08:33:09 -0400
Note as at August 8 2020

*WARNING before you start*

Install this tool on a private Galaxy ONLY.
Please NEVER install it on a public or production instance.

Please cite the resource at
http://bioinformatics.oxfordjournals.org/cgi/reprint/bts573?ijkey=lczQh1sWrMwdYWJ&keytype=ref
if you use this tool in your published work.

**Short Story**

This is an unusual Galaxy tool capable of generating new Galaxy tools.
It works by exposing *unrestricted* and therefore extremely dangerous scripting
to all designated administrators of the host Galaxy server, allowing them to
run scripts in R, python, sh and perl over multiple selected input data sets,
writing a single new data set as output.

*You have a working r/python/perl/bash script or any executable with positional or argparse style parameters*

It can be turned into an ordinary Galaxy tool in minutes, using a Galaxy tool.

**Automated generation of new Galaxy tools for installation into any Galaxy**

A test is generated using small sample test data inputs and parameter settings you supply.
Once the test case outputs have been produced, they can be used to build a
new Galaxy tool. The supplied script or executable is baked as a requirement
into a new, ordinary Galaxy tool, fully workflow compatible out of the box.
Generated tools are installed via a tool shed by an administrator
and work exactly like all other Galaxy tools for your users.

**More Detail**

To use the ToolFactory, you should have prepared a script to paste into a
text box, or have a package in mind and a small test input example ready to select from your history
to test your new script.

```planemo test --no_cleanup --no_dependency_resolution --skip_venv --galaxy_root ~/galaxy ~/rossgit/toolfactory``` works for me

There is an example in each scripting language on the Tool Factory form. You
can just cut and paste these to try it out - remember to select the right
interpreter please. You'll also need to create a small test data set using
the Galaxy history add new data tool.

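As a rough illustration of the kind of script that suits this workflow, here is a minimal Python sketch that takes an input path and an output path as positional parameters and writes each input line reversed. The function name `reverse_lines` and the two-positional-argument convention are assumptions made for this example only; the examples on the Tool Factory form remain the reference for how parameters actually reach your script.

```python
import sys


def reverse_lines(in_path, out_path):
    """Write each line of in_path to out_path with its characters reversed."""
    with open(in_path) as src, open(out_path, "w") as dst:
        for line in src:
            dst.write(line.rstrip("\n")[::-1] + "\n")


# Assumed convention: first positional argument is the input file,
# second is the output file. Nothing runs unless both are given.
if __name__ == "__main__" and len(sys.argv) == 3:
    reverse_lines(sys.argv[1], sys.argv[2])
```

A small text file with two or three lines is enough test data for a script like this.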
If the script fails somehow, use the "redo" button on the tool output in
your history to recreate the form complete with broken script. Fix the bug
and execute again. Rinse, wash, repeat.

Once the script runs successfully, a new Galaxy tool that runs your script
can be generated. Select the "generate" option and supply some help text and
names. The new tool will be generated in the form of a new Galaxy datatype
*tgz* - as the name suggests, it's an archive ready to upload to a
Galaxy ToolShed as a new tool repository.

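Before uploading a generated archive, it can be worth listing what is inside it. The sketch below uses only Python's standard `tarfile` module; the function name `list_archive` and any archive file name you pass to it are hypothetical and not part of the ToolFactory.

```python
import sys
import tarfile


def list_archive(path):
    """Return the member names of a gzipped tar archive, sorted."""
    # "r:gz" opens a gzip-compressed tar archive for reading.
    with tarfile.open(path, "r:gz") as archive:
        return sorted(archive.getnames())


if __name__ == "__main__":
    # Print the contents of every archive named on the command line.
    for archive_path in sys.argv[1:]:
        for name in list_archive(archive_path):
            print(name)
```

Run it with the path of the downloaded archive as its only argument to see the tool XML, script and test files it contains.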
Once it's in a ToolShed, it can be installed into any local Galaxy server
from the server administrative interface.

Once the new tool is installed, local users can run it - each time, the script
that was supplied when it was built will be executed with the input chosen
from the user's history. In other words, the tools you generate with the
ToolFactory run just like any other Galaxy tool, but run your script every time.

Tool factory tools are perfect for workflow components. One input, one output,
no variables.

*To fully and safely exploit the awesome power* of this tool,
Galaxy and the ToolShed, you should be a developer installing this
tool on a private/personal/scratch local instance where you are an
admin_user. Then, if you break it, you get to keep all the pieces. See
https://bitbucket.org/fubar/galaxytoolfactory/wiki/Home

**Installation**

This is a Galaxy tool. You can install it most conveniently using the
administrative "Search and browse tool sheds" link. Find the Galaxy Main
toolshed at https://toolshed.g2.bx.psu.edu/ and search for the toolfactory
repository. Open it, review the code, and select the option to install it.

If you can't get the tool that way, the xml and py files here need to be
copied into a new tools subdirectory such as tools/toolfactory. Your
tool_conf.xml needs a new entry pointing to the xml file - something like::

    <section name="Tool building tools" id="toolbuilders">
        <tool file="toolfactory/rgToolFactory.xml"/>
    </section>

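If you would rather script the tool_conf.xml change than edit it by hand, something along these lines could work. It is only a sketch: the helper name `add_toolfactory_section` is made up for this example, it assumes the file's root element is `<toolbox>`, and because the standard `xml.etree.ElementTree` parser discards comments, you should work on a backup copy.

```python
import xml.etree.ElementTree as ET


def add_toolfactory_section(tool_conf_xml):
    """Return tool_conf XML text with the ToolFactory section added if absent."""
    root = ET.fromstring(tool_conf_xml)
    tool_file = "toolfactory/rgToolFactory.xml"
    # Leave the text untouched if a <tool> already points at the wrapper.
    if any(tool.get("file") == tool_file for tool in root.iter("tool")):
        return tool_conf_xml
    section = ET.SubElement(
        root, "section", {"name": "Tool building tools", "id": "toolbuilders"}
    )
    ET.SubElement(section, "tool", {"file": tool_file})
    return ET.tostring(root, encoding="unicode")
```

Calling the function a second time on its own output returns the text unchanged, so it is safe to rerun.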
If not already there, please add:

    <datatype extension="toolshed.gz" type="galaxy.datatypes.binary:Binary"
        mimetype="multipart/x-gzip" subclass="True" />

to your local datatypes_conf.xml.

**Restricted execution**

The tool factory tool itself will then be usable ONLY by admin users -
people with IDs in admin_users in universe_wsgi.ini. **Yes, that's right. ONLY
admin_users can run this tool.** Think about it for a moment. If allowed to
run any arbitrary script on your Galaxy server, the only thing that would
impede a miscreant bent on destroying all your Galaxy data would probably
be lack of appropriate technical skills.

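To see who would be able to run the tool on a given server, you could read the admin_users setting directly. The sketch below is not part of the ToolFactory; it assumes the setting lives in an `[app:main]` section and holds a comma-separated list of user e-mail addresses, and the function name `admin_users` is chosen for this example.

```python
import configparser


def admin_users(ini_text, section="app:main"):
    """Return the set of admin e-mail addresses listed in the ini text."""
    # Interpolation is disabled so that '%' characters in other settings
    # do not trip up the parser.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)
    raw = parser.get(section, "admin_users", fallback="")
    # The value is treated as a comma-separated list; blanks are ignored.
    return {user.strip() for user in raw.split(",") if user.strip()}
```

Pass it the contents of universe_wsgi.ini as a string; an empty set means nobody is configured as an administrator.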
**What it does**

This is a tool factory for simple scripts in python, R and
perl currently. Functional tests are automatically generated.

LIMITED to simple scripts that read one input from the history. A script can
optionally write one new history dataset, and optionally collect any number of
outputs into links on an autogenerated HTML index page for the user to navigate -
useful if the script writes images and output files. PDF outputs are shown
as thumbnails and R's bloated PDFs are shrunk with ghostscript, so ghostscript
and ImageMagick need to be available.

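A quick way to check for those two external programs is to look for them on the PATH. This sketch assumes the executables are named `gs` (ghostscript) and `convert` (ImageMagick), which is common but not guaranteed on every system; `missing_programs` is a name invented for this example.

```python
import shutil


def missing_programs(names=("gs", "convert")):
    """Return the names from `names` that are not found on PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_programs()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("ghostscript and ImageMagick both appear to be installed")
```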
Generated tools can be edited and enhanced like any Galaxy tool, so start
small and build up since a generated script gets you a serious leg up to a
more complex one.

**What you do**

You paste and run your script, you fix the syntax errors and
eventually it runs. You can use the redo button and edit the script before
trying to rerun it as you debug - it works pretty well.

Once the script works on some test data, you can generate a toolshed compatible
gzip file containing your script ready to run as an ordinary Galaxy tool in
a repository on your local toolshed. That means safe and largely automated
installation in any production Galaxy configured to use your toolshed.

**Generated tool Security**

Once you install a generated tool, it's just
another tool - assuming the script is safe. Generated tools run normally and
their users cannot do anything unusually insecure, but please practice safe toolshed.
Read the code before you install any tool. Especially this one - it is really scary.

**Send Code**

Patches and suggestions are welcome as Bitbucket issues, please.

**Attribution**

Creating re-usable tools from scripts: The Galaxy Tool Factory
Ross Lazarus; Antony Kaspi; Mark Ziemann; The Galaxy Team
Bioinformatics 2012; doi: 10.1093/bioinformatics/bts573

http://bioinformatics.oxfordjournals.org/cgi/reprint/bts573?ijkey=lczQh1sWrMwdYWJ&keytype=ref

**Licensing**

Copyright Ross Lazarus 2010
ross lazarus at g mail period com

All rights reserved.

Licensed under the LGPL

**Obligatory screenshot**

http://bitbucket.org/fubar/galaxytoolmaker/src/fda8032fe989/images/dynamicScriptTool.png