comparison uclust2otutable.xml @ 0:e85e7ba38aff draft

"planemo upload for repository https://github.com/QFAB-Bioinformatics/metaDEGalaxy/tree/master/uc2otutable commit 0db3cb4e9a87400bb2f8402ffc23334e24ad4b4e-dirty"
author qfabrepo
date Mon, 14 Sep 2020 04:52:36 +0000
parents
children 046085ec595c
comparison
equal deleted inserted replaced
-1:000000000000 0:e85e7ba38aff
1 <tool id="uclust2otutable" name="OTUTable" version="1.0.0">
2 <description>Convert UCLUST format from Vsearch to OTU Table</description>
3 <version_command>
4 python ${__tool_directory__}/uclust2otutable.py --version
5 </version_command>
6 <command detect_errors="aggressive">
7 python ${__tool_directory__}/uclust2otutable.py
8 -i '$inputfile'
9 -o '$output'
10 </command>
11 <inputs>
12 <param format="tabular" name="inputfile" type="data" label="UCLUST from Vsearch" />
13 </inputs>
14 <outputs>
15 <data format="tabular" name="output" label="OTU_TABLE_${inputfile.display_name}"/>
16 </outputs>
17 <tests>
18 <test>
19 <param name="inputfile" value="uc_input.txt"/>
20 <output name="output" file="uc_output.txt"/>
21 </test>
22 </tests>
23
24 <help>
25 ** what it does **
26
27 Converts UCLUST format (.uc) output from Vsearch search into raw count table. The description of UCLUST format is based on the information that can be found on UCLUST_ documentation page.
28
29 .. _UCLUST: http://www.drive5.com/uclust/uclust_userguide_1_1_579.html
30
31 --------
32
33 =======
34 Example
35 =======
36
37 Some example records:
38 ---------------------
39
40 ==== ======= ==== ==== ====== === === ========= ========== ======
41 Type Cluster Size %Id Strand Qlo Tlo Alignment Query Target
42 ---- ------- ---- ---- ------ --- --- --------- ---------- ------
43 S 0 292 '*' '*' '*' '*' '*' AH70_12410 '*'
44 H 0 292 99.7 '+' 0 0 292M AH70_12410 '*'
45 S 0 292 '*' '*' '*' '*' '*' AH70_12410 '*'
46 H 0 292 98.2 '+' 0 0 292M AH70_12410 '*'
47 ==== ======= ==== ==== ====== === === ========= ========== ======
48
49 Each record has ten fields, separated by tabs:
50 ----------------------------------------------
51
52 ========= ===========================================
53 Column Description
54 --------- -------------------------------------------
55 Type Record type
56 Cluster Cluster number
57 Size Sequence length or cluster size
58 %Id Identity to the seed(as a percentage), or * if this is a seed.
59 Strand '+' plus strand, '-' minus strand, or '.' amino acids.
60 Qlo 0-based coordinate of alignment start in the query sequence.
61 Tlo 0-based coordinate of alignment start in target (seed) sequence. If minus strand, Tlo is relative to start of reverse-complement target.
62 Alignment Compressed representation of alignment to the seed(see below), or '*' if a seed.
63 Query FASTA label of query sequence
64 Target FASTA label of target(seed / library / database) sequence. or '*' if a seed.
65 ========= ===========================================
66
67 Record Types are:
68 -----------------
69
70 ====== ===========================================
71 Column Description
72 ------ -------------------------------------------
73 L Library seed(generated only if a match if found to this seed).
74 S New seed.
75 H Hit, also known as an accept; i.e. a successful match.
76 D Library cluster.
77 C New cluster.
78 N Not matched (a sequence that didn't match library with --libonly specified).
79 R Reject (generated only if --output_rejects is specified)
80 ====== ===========================================
81
82 The alignment is compressed using run-length encoding, as follows. Each column in the alignment is classified as M,D or I:
83 --------------------------------------------------------------------------------------------------------------------------
84
85 ==== ====== ============== =============
86 Code Name Query sequence Seed sequence
87 ---- ------ -------------- -------------
88 M Match Letter Letter
89 D Delete Gap Letter
90 I Insert Letter Gap
91 ==== ====== ============== =============
92
93 --------
94
95
96 </help>
97
98 </tool>