comparison affiliation_postprocess.xml @ 0:59bc96331073 draft default tip

planemo upload for repository https://github.com/geraldinepascal/FROGS-wrappers/tree/v3.1.0 commit 08296fc88e3e938c482c631bd515b3b7a0499647
author frogs
date Thu, 28 Feb 2019 10:14:49 -0500
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:59bc96331073
1 <?xml version="1.0"?>
2 <!--
3 # Copyright (C) 2015 INRA
4 #
5 # This program is free software: you can redistribute it and/or modify
6 # it under the terms of the GNU General Public License as published by
7 # the Free Software Foundation, either version 3 of the License, or
8 # (at your option) any later version.
9 #
10 # This program is distributed in the hope that it will be useful,
11 # but WITHOUT ANY WARRANTY; without even the implied warranty of
12 # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
13 # GNU General Public License for more details.
14 #
15 # You should have received a copy of the GNU General Public License
16 # along with this program. If not, see <http://www.gnu.org/licenses/>.
17 -->
18 <tool id="FROGS_affiliation_postprocess" name="FROGS Affiliation postprocess" version="3.1">
19 <description>Optionnal step to resolve inclusive amplicon ambiguities and to aggregate OTUs based on alignment metrics</description>
20 <requirements>
21 <requirement type="package" version="3.1.0">frogs</requirement>
22 </requirements>
23 <stdio>
24 <exit_code range="1:" />
25 <exit_code range=":-1" />
26 </stdio>
27 <command>
28 affiliation_postprocess.py
29 #if $is_HVL.HVL_amplicon
30 #set $reference_filename = str( $is_HVL.ref_file.fields.path )
31 --reference "${reference_filename}"
32 #end if
33 --identity $identity
34 --coverage $coverage
35 --input-biom $biom_in
36 --input-fasta $fasta_in
37 --output-biom $biom_out
38 --output-fasta $fasta_out
39 --output-compo $compo_out
40 </command>
41 <inputs>
42 <!-- Files -->
43 <param format="biom1" name="biom_in" type="data" label="Abundance file of affiliated OTUs" help="Abundances of affiliated OTUs (format: BIOM)."/>
44 <param format="fasta" name="fasta_in" type="data" label="OTU seed sequences" help="OTU sequences (format: fasta)."/>
45 <conditional name="is_HVL">
46 <param name="HVL_amplicon" type="boolean" label="Is this an hyper variable in length amplicon ?" help="Multi-affiliation tag may be resolved by selecting the shortest amplicon reference. For this you need the reference fasta file of your kind of amplicon."/>
47 <when value="true">
48 <param name="ref_file" type="select" label="Using reference database" help="Select reference from the list">
49 <options from_data_table="HVL_db"></options>
50 <validator type="no_options" message="A built-in database is not available" />
51 <!--column name="name" index="0"/>
52 <column name="value" index="1"/-->
53 </param>
54 </when>
55 <when value="false"/>
56 </conditional>
57
58 <param name="identity" type="float" min="0.0" max="100.0" value="99.0" label="minimum identity for aggregation" help="OTUs will be aggregated if they share the same taxonomy with at least X% identity." />
59 <param name="coverage" type="float" min="0.0" max="100.0" value="99.0" label="minimum coverage for aggregation" help="OTUs will be aggregated if they share the same taxonomy with at least X% alignment coverage." />
60 </inputs>
61 <outputs>
62 <data format="biom1" name="biom_out" label="${tool.name}: affiliation.biom" from_work_dir="affiliation.biom" />
63 <data format="fasta" name="fasta_out" label="${tool.name}: sequences.fasta" from_work_dir="sequences.fasta" />
64 <data format="tabular" name="compo_out" label="${tool.name}: aggregation_composition.txt" from_work_dir="aggregation_composition.txt" />
65 </outputs>
66 <tests>
67 <test>
68 <param name="biom_in" value="references/06-affiliation.biom"/>
69 <param name="fasta_in" value="references/04-filters.fasta"/>
70 <conditional name="is_HVL">
71 <param name="HVL_amplicon" value="true"/>
72 <param name="ref_file" value="Unite_extract_ITS1_test"/>
73 </conditional>
74 <output name="compo_out" file="references/07-affiliation_postprocessed.compo.tsv"/>
75 </test>
76 </tests>
77 <help>
78
79 .. image:: static/images/frogs_images/FROGS_logo.png
80 :height: 144
81 :width: 110
82
83
84 .. class:: infomark page-header h2
85
86 What it does
87
88 Resolves multi-hit ambiguities if exact amplicon length are available and aggregrated OTUs sharing same taxonomy based on alignment metrics thresholds
89
90
91 .. class:: infomark page-header h2
92
93 Inputs/outputs
94
95 .. class:: h3
96
97 Inputs
98
99 **Abundance file**:
100
101 The abundance of each OTU in each sample (format `BIOM &lt;http://biom-format.org/&gt;`_) with taxonomic affiliations metadata.
102
103 **Sequence file**:
104
105 The sequences (format `FASTA &lt;https://en.wikipedia.org/wiki/FASTA_format&gt;`_) of each OTU seed.
106
107 **Reference file** (optionnal):
108
109 The exact amplicon reference sequences (format `FASTA &lt;https://en.wikipedia.org/wiki/FASTA_format&gt;`_).
110
111
112 .. class:: h3
113
114 Outputs
115
116 **Abundance file**:
117
118 The abundance file of OTUs and aggregated OTUs, with their affiliation (format `BIOM &lt;http://biom-format.org/&gt;`_) and with potentially less ambiguities.
119
120 **Sequence file**:
121
122 The sequences (format `FASTA &lt;https://en.wikipedia.org/wiki/FASTA_format&gt;`_) of each aggregated OTU seed.
123
124 **Composition file**:
125
126 The aggregation composition file (format text) describing the composition of each resulting OTU.
127
128 .. class:: infomark page-header h2
129
130 How it works
131
132 If a reference fasta file is provided, for each OTU with multiaffiliation, among the different possible affiliations, we only keep the affiliation of the sequence with the shorter length. The aim is to resolve ambiguities due to potential inclusive sequences such as ITS.
133
134 Second step is the OTUs aggregation that share the same taxonomy inferred on alignment metrics.
135 We start with the most abundant OTU. If an OTU shares at least one affiliation with another OTU with at least I% of identity and C% of alignment coverage, so the OTUs are aggregated together (The different affiliations, which then generate the multi-affiliation tag, are merged, abundance counts are summed). The seed of the most abundant OTU is kept.
136 ----
137
138 **Contact**
139
140 Support: please contact first your galaxy support team.
141
142 Contacts: frogs@inra.fr
143
144 Repository: https://github.com/geraldinepascal/FROGS
145 website: http://frogs.toulouse.inra.fr/
146
147 Please cite the **FROGS article**: *Escudie F., et al. Bioinformatics, 2018. FROGS: Find, Rapidly, OTUs with Galaxy Solution.*
148
149 </help>
150 <citations>
151 <citation type="doi">10.1093/bioinformatics/btx791</citation>
152 </citations>
153
154 </tool>