comparison versioned_data.xml @ 1:5c5027485f7d draft

Uploaded correct file
author damion
date Sun, 09 Aug 2015 16:07:50 -0400
parents
children
comparison
equal deleted inserted replaced
0:d31a1bd74e63 1:5c5027485f7d
1 <tool id="versioned_data" name="Versioned data retrieval" version="0.1.03">
2 <description>Retrieve versioned sequence files and/or their blast, bowtie, etc. database indexes</description>
3 <macros>
4 <token name="@BINARY@">versioned_data.py</token>
5 <import>bccdc_macros.xml</import>
6 </macros>
7 <expand macro="requirements" />
8 <command interpreter="python">
9 #assert $__user__, Exception( 'You must be logged in to use this tool.' )
10 versioned_data.py
11 #if $globalRetrievalDate.strip() > ''
12 -d "$globalRetrievalDate"
13 #end if
14 -r
15 "
16 #for $v in $versions:
17 ${v.database},
18 #for $r in $v.retrieval:
19 ${r.retrievalId}
20 #end for
21 ,
22 #for $w in $v.workflows:
23 ${w.workflow}
24 #end for
25 |
26 #end for
27 "
28 -o "$log"
29 -O "$__app__.security.encode_id($log.id)"
30 --api_info_path "$api_info_path" ##Actually a file path to configfile that holds api key
31 </command>
32 <!-- #:$log.hid:$log.id dataset_id -->
33 <expand macro="stdio" />
34
35 <inputs>
36 <!-- Implement as datepicker? http://www.learnfaceit.org/for-developers/adding-parameter-types-to-tool -->
37 <param name="globalRetrievalDate" type="text" label="Global retrieval date [YYYY-MM-DD]" help="The recall system will use this date to try to select the appropriate versions below. Leave empty to select current versions." size="25" />
38
39 <param name="api_info" display="radio" type="drill_down" label="For user with Galaxy API Key" dynamic_options="vdb_init_tool_user(__trans__)" />
40
41 <repeat name="versions" title="Data Source" min="1" max="15">
42
43 <param name="database" type="select" label="Data" dynamic_options="vdb_get_databases()" multiple="false" />
44
45 <repeat name="retrieval" title="Retrieval" min="0" max="1">
46 <param name="retrievalId" label="Version date/id" type="select" dynamic_options="vdb_get_versions(database, globalRetrievalDate)"/>
47 </repeat>
48
49 <repeat name="workflows" title="Workflow" min="0" max="5" >
50 <param name="workflow" type="select" label="Name" dynamic_options="vdb_get_workflows(database)" />
51 </repeat>
52
53 </repeat>
54
55 </inputs>
56
57 <configfiles>
58 <configfile name="api_info_path">${__user__.api_keys[0].key}
59 $api_info
60 </configfile>
61 </configfiles>
62
63 <outputs>
64 <data name="log" format="txt" label="Versioned Data Retrieval" />
65 </outputs>
66
67 <code file="versioned_data_form.py" />
68
69 <tests>
70 <test>
71 <param name="db_type" value="nucl"/>
72 <!-- ... -->
73 </test>
74 </tests>
75
76 <help>
77
78 .. class:: infomark
79
80
81 **What it does**
82
83 This tool retrieves links to current or past versions of fasta or other types of
84 data from a cache kept in the Galaxy data library called "Versioned Data". It then places
85 them into one's current history so that subsequent tools can work with that data.
86
87 For example, after using this tool to select a version of the NCBI nt database, a blast search can be carried out on it by selecting "BLAST database from your history" from the "Subject database/sequences" field of the NCBI BLAST+ search tool.
88
89 You can select one or more files or databases by version date or id. This list
90 is supplied from the Shared Data > Data Libraries > Versioned Data folder that has
91 been set up by an administrator.
92
93 The Workflows section allows you to select one or more pre-defined workflows
94 to execute on the versioned data. The results are placed in your history for use
95 by other tools or workflows.
96
97 A caching system exists to cache the versioned data or workflow data that the tool generates.
98 If you request versioned data or derivative data that isn't cached, it may take time to regenerate.
99
100 The top-level "Global retrieval date [YYYY-MM-DD]" field that the form starts with will be applied to
101 all selected databases. This can be overriden by a retrieval date or version that
102 you supply for a particular database. Leave it and any "Retrievals" inputs empty if you just need the latest version of selected databases.
103
104 -------
105
106 .. class:: warningmark
107
108 **Note**
109
110 Again, some past database versions can take time to regenerate if there is no cached version available, for example NCBI nt is a 50+ gigabyte file that needs to be read through to get a fasta version, and a makeblastdb workflow on top of that can take hours on the first call. Access to cached versions is immediate.
111
112 Setup of versioned data sources and workflow options can only be done by a Galaxy administrator.
113
114 -------
115
116 **References**
117
118 If you use this Galaxy tool in work leading to a scientific publication please
119 cite the following paper:
120
121 *Reference coming soon...*
122
123 </help>
124 </tool>