annotate toolfactory/README.md @ 28:ad564ab3cf7b draft

Uploaded
author fubar
date Fri, 31 Jul 2020 23:00:31 -0400
*WARNING before you start*

Install this tool on a private Galaxy ONLY.
Please NEVER install it on a public or production instance.

Updated August 2014 by John Chilton, adding citation support.

Updated August 8 2014 to fix bugs reported by Marius van den Beek.

Please cite the resource at
http://bioinformatics.oxfordjournals.org/cgi/reprint/bts573?ijkey=lczQh1sWrMwdYWJ&keytype=ref
if you use this tool in your published work.

**Short Story**

This is an unusual Galaxy tool capable of generating new Galaxy tools.
It works by exposing *unrestricted*, and therefore extremely dangerous, scripting
to all designated administrators of the host Galaxy server, allowing them to
run scripts in R, Python, sh and Perl over multiple selected input data sets,
writing a single new data set as output.

*You have a working R/Python/Perl/bash script, or any executable with positional or argparse-style parameters.*

It can be turned into an ordinary Galaxy tool in minutes, using a Galaxy tool.

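As a rough illustration of the kind of script meant above, here is a small Python script with argparse-style parameters that reads one input file and writes one output file. The parameter names `--input` and `--output` and the line-reversing logic are assumptions made for this example, not something the ToolFactory requires. Calling `main(["--input", "in.txt", "--output", "out.txt"])` runs it on a file; a command-line entry point would simply call `main()`.

```python
import argparse


def reverse_lines(text):
    """Return text with every line reversed (a stand-in for real work)."""
    return "\n".join(line[::-1] for line in text.splitlines()) + "\n"


def build_parser():
    # Hypothetical parameter names, chosen only for this illustration.
    parser = argparse.ArgumentParser(description="Reverse every line of a text file")
    parser.add_argument("--input", required=True, help="path to the input data set")
    parser.add_argument("--output", required=True, help="path for the single output data set")
    return parser


def main(argv=None):
    """Parse the arguments, read the input file and write the output file."""
    args = build_parser().parse_args(argv)
    with open(args.input) as handle:
        result = reverse_lines(handle.read())
    with open(args.output, "w") as handle:
        handle.write(result)
    return 0
```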
**Automated generation of new Galaxy tools for installation into any Galaxy**

A test is generated using small sample test data inputs and parameter settings you supply.
Once the test case outputs have been produced, they can be used to build a
new Galaxy tool. The supplied script or executable is baked as a requirement
into a new, ordinary Galaxy tool, fully workflow compatible out of the box.
Generated tools are installed via a tool shed by an administrator
and work exactly like all other Galaxy tools for your users.

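One way to picture the generated test, as a minimal sketch rather than the ToolFactory's actual mechanism: the output of a first run on the sample input is recorded, and later runs are compared against it. `upper_case` is a hypothetical stand-in for the supplied script, and the helper names are invented for this example.

```python
import difflib


def run_case(script_func, sample_input):
    """Run the wrapped logic on the sample input and return its output."""
    return script_func(sample_input)


def check_case(script_func, sample_input, expected_output):
    """Return [] if the output matches, otherwise the unified diff lines."""
    actual = run_case(script_func, sample_input)
    return list(
        difflib.unified_diff(
            expected_output.splitlines(),
            actual.splitlines(),
            fromfile="expected",
            tofile="actual",
            lineterm="",
        )
    )


def upper_case(text):
    # Hypothetical stand-in for the script supplied by the tool author.
    return text.upper()


sample = "gene\tcount\n"
expected = run_case(upper_case, sample)  # the first run records the expected output
assert check_case(upper_case, sample, expected) == []  # later runs must match it
```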
**More Detail**

To use the ToolFactory, you should have a script ready to paste into a
text box, or a package in mind, plus a small test input example in your history
to test your new script with.

```planemo test rgToolFactory2.xml --galaxy_root ~/galaxy --test_data ~/galaxy/tools/tool_makers/toolfactory/test-data``` works for me.

There is an example in each scripting language on the Tool Factory form. You
can just cut and paste these to try them out - remember to select the right
interpreter, please. You'll also need to create a small test data set using
the Galaxy history add new data tool.

If the script fails somehow, use the "redo" button on the tool output in
your history to recreate the form, complete with the broken script. Fix the bug
and execute again. Rinse, wash, repeat.

Once the script runs successfully, a new Galaxy tool that runs your script
can be generated. Select the "generate" option and supply some help text and
names. The new tool will be generated in the form of a new Galaxy datatype,
*toolshed.gz* - as the name suggests, it's an archive ready to upload to a
Galaxy ToolShed as a new tool repository.

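To make the idea of the archive concrete, the sketch below packs a tool definition and its script into a gzip-compressed tar archive held in memory. It assumes the archive is a plain tar.gz; the file names and contents are invented for illustration, and the real layout produced by the ToolFactory may well differ.

```python
import io
import tarfile


def build_archive(files):
    """Pack {name: text} into an in-memory gzip-compressed tar archive."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        for name, text in sorted(files.items()):
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)  # the size must be set before adding the member
            archive.addfile(info, io.BytesIO(data))
    return buffer.getvalue()


def list_archive(blob):
    """Return the member names stored in an archive built above."""
    with tarfile.open(fileobj=io.BytesIO(blob), mode="r:gz") as archive:
        return archive.getnames()


# Hypothetical file names and contents, for illustration only.
blob = build_archive({
    "mytool/mytool.xml": '<tool id="mytool" name="My tool"/>\n',
    "mytool/mytool.py": "print('hello')\n",
})
print(list_archive(blob))
```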
Once it's in a ToolShed, it can be installed into any local Galaxy server
from the server administrative interface.

Once the new tool is installed, local users can run it - each time, the script
that was supplied when it was built will be executed with the input chosen
from the user's history. In other words, the tools you generate with the
ToolFactory run just like any other Galaxy tool, but run your script every time.

Tool factory tools are perfect for workflow components. One input, one output,
no variables.

*To fully and safely exploit the awesome power* of this tool,
Galaxy and the ToolShed, you should be a developer installing this
tool on a private/personal/scratch local instance where you are an
admin_user. Then, if you break it, you get to keep all the pieces. See
https://bitbucket.org/fubar/galaxytoolfactory/wiki/Home

**Installation**

This is a Galaxy tool. You can install it most conveniently using the
administrative "Search and browse tool sheds" link. Find the Galaxy Main
toolshed at https://toolshed.g2.bx.psu.edu/ and search for the toolfactory
repository. Open it, review the code, and select the option to install it.

If you can't get the tool that way, the xml and py files here need to be
copied into a new tools subdirectory such as tools/toolfactory.
Your tool_conf.xml needs a new entry pointing to the xml file - something like::

    <section name="Tool building tools" id="toolbuilders">
      <tool file="toolfactory/rgToolFactory.xml"/>
    </section>

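For those who prefer to script this manual step, the sketch below adds the same entry with Python's standard `xml.etree.ElementTree`. It assumes `tool_conf.xml` has a `<toolbox>` root element and works on text rather than on the real file; `add_tool_section` is a name invented for this example, so adapt the reading and writing to your own installation.

```python
import xml.etree.ElementTree as ET


def add_tool_section(conf_xml, section_id, section_name, tool_file):
    """Return tool_conf XML text with the tool listed under the given section."""
    root = ET.fromstring(conf_xml)
    section = root.find(f"./section[@id='{section_id}']")
    if section is None:
        # Create the section only when it is not already present.
        section = ET.SubElement(root, "section", {"name": section_name, "id": section_id})
    if section.find(f"./tool[@file='{tool_file}']") is None:
        # Avoid adding a duplicate <tool> entry on repeated runs.
        ET.SubElement(section, "tool", {"file": tool_file})
    return ET.tostring(root, encoding="unicode")


# A minimal stand-in for an existing tool_conf.xml (assumed <toolbox> root).
original = "<toolbox></toolbox>"
updated = add_tool_section(
    original, "toolbuilders", "Tool building tools", "toolfactory/rgToolFactory.xml"
)
print(updated)
```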
If not already there, please add::

    <datatype extension="toolshed.gz" type="galaxy.datatypes.binary:Binary"
              mimetype="multipart/x-gzip" subclass="True" />

to your local datatypes_conf.xml.

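Whether the datatype is "already there" can be checked before editing anything. The following sketch looks for a `<datatype>` element with a given extension; the configuration text is a trimmed, hypothetical example and `has_datatype` is a name invented here.

```python
import xml.etree.ElementTree as ET


def has_datatype(conf_xml, extension):
    """Return True if any <datatype> element declares the given extension."""
    root = ET.fromstring(conf_xml)
    return any(
        element.get("extension") == extension for element in root.iter("datatype")
    )


# Hypothetical, heavily trimmed configuration text for illustration.
sample_conf = """
<datatypes>
  <registration>
    <datatype extension="txt" type="galaxy.datatypes.data:Text"/>
  </registration>
</datatypes>
"""
print(has_datatype(sample_conf, "toolshed.gz"))  # False: the entry still needs adding
```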
**Restricted execution**

The tool factory tool itself will then be usable ONLY by admin users -
people with IDs in admin_users in universe_wsgi.ini. **Yes, that's right. ONLY
admin_users can run this tool.** Think about it for a moment. If allowed to
run any arbitrary script on your Galaxy server, the only thing that would
impede a miscreant bent on destroying all your Galaxy data would probably
be lack of appropriate technical skills.

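The restriction amounts to a membership test against the `admin_users` setting. The sketch below is not Galaxy's own code; it assumes an `[app:main]` section holding a comma-separated `admin_users` value, and the addresses and function names are made up for the example.

```python
import configparser


def admin_emails(ini_text, section="app:main"):
    """Return the set of entries listed in admin_users (comma separated)."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)
    raw = parser.get(section, "admin_users", fallback="")
    return {item.strip().lower() for item in raw.split(",") if item.strip()}


def may_run_toolfactory(user_email, ini_text):
    """Only users named in admin_users pass the check."""
    return user_email.strip().lower() in admin_emails(ini_text)


# Hypothetical configuration and addresses.
sample_ini = """
[app:main]
admin_users = alice@example.org, bob@example.org
"""
print(may_run_toolfactory("alice@example.org", sample_ini))    # True
print(may_run_toolfactory("mallory@example.org", sample_ini))  # False
```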
**What it does**

This is a tool factory for simple scripts in Python, R and
Perl currently. Functional tests are automatically generated. How cool is that.

It is LIMITED to simple scripts that read one input from the history. A script
can optionally write one new history dataset, and can optionally collect any
number of outputs into links on an autogenerated HTML index page for the user
to navigate, which is useful if the script writes images and output files.
PDF outputs are shown as thumbnails and R's bloated PDFs are shrunk with
ghostscript, so ghostscript and ImageMagick need to be available.

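A sketch of what building such an index page could look like in Python follows. It is not the ToolFactory's own implementation; the function names and page layout are assumptions, and thumbnail generation and PDF shrinking are left out.

```python
import html
import os
from urllib.parse import quote


def build_index(title, file_names):
    """Return a minimal HTML page with one link per output file."""
    items = []
    for name in sorted(file_names):
        href = html.escape(quote(name), quote=True)  # URL-encode, then escape
        label = html.escape(name)
        items.append(f'<li><a href="{href}">{label}</a></li>')
    return (
        "<!DOCTYPE html>\n"
        f"<html><head><title>{html.escape(title)}</title></head>\n"
        "<body>\n<ul>\n" + "\n".join(items) + "\n</ul>\n</body></html>\n"
    )


def index_directory(title, directory):
    """Build the index for every regular file found in a directory."""
    names = [
        entry
        for entry in os.listdir(directory)
        if os.path.isfile(os.path.join(directory, entry))
    ]
    return build_index(title, names)


# Hypothetical output file names.
print(build_index("Script outputs", ["plot.pdf", "table 1.tsv"]))
```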
Generated tools can be edited and enhanced like any Galaxy tool, so start
small and build up: a generated script gets you a serious leg up towards a
more complex one.

**What you do**

You paste and run your script, you fix the syntax errors, and
eventually it runs. You can use the redo button and edit the script before
trying to rerun it as you debug - it works pretty well.

Once the script works on some test data, you can generate a toolshed-compatible
gzip file containing your script, ready to run as an ordinary Galaxy tool in
a repository on your local toolshed. That means safe and largely automated
installation in any production Galaxy configured to use your toolshed.

**Generated tool Security**

Once you install a generated tool, it's just
another tool - assuming the script is safe. Generated tools just run normally and their
users cannot do anything unusually insecure, but please practice safe toolshed.
Read the code before you install any tool. Especially this one - it is really scary.

**Send Code**

Patches and suggestions are welcome as Bitbucket issues, please.

**Attribution**

Creating re-usable tools from scripts: The Galaxy Tool Factory
Ross Lazarus; Antony Kaspi; Mark Ziemann; The Galaxy Team
Bioinformatics 2012; doi: 10.1093/bioinformatics/bts573

http://bioinformatics.oxfordjournals.org/cgi/reprint/bts573?ijkey=lczQh1sWrMwdYWJ&keytype=ref

**Licensing**

Copyright Ross Lazarus 2010
ross lazarus at g mail period com

All rights reserved.

Licensed under the LGPL

**Obligatory screenshot**

http://bitbucket.org/fubar/galaxytoolmaker/src/fda8032fe989/images/dynamicScriptTool.png