comparison gff3_rebase.xml @ 0:e54940ea270c draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gff3_rebase commit 350ab347625ed5941873ba0deb3e1cf219d60052
author iuc
date Thu, 20 Apr 2017 08:12:49 -0400
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:e54940ea270c
1 <tool id="gff3.rebase" name="Rebase GFF3 features" version="1.2">
2 <description>against parent features</description>
3 <macros>
4 <import>macros.xml</import>
5 </macros>
6 <expand macro="requirements"/>
7 <version_command>python gff3_rebase.py --version</version_command>
8 <command detect_errors="aggressive"><![CDATA[
9 python '$__tool_directory__/gff3_rebase.py'
10 '$parent'
11 '$child'
12
13 $interpro
14 $protein2dna
15 > '$output']]>
16 </command>
17 <inputs>
18 <param label="Parent GFF3 annotations" name="parent" format="gff3" type="data"/>
19 <param label="Child GFF3 annotations to rebase against parent" name="child" format="gff3" type="data"/>
20
21 <param label="Interpro specific modifications" name="interpro" type="boolean" truevalue="--interpro" falsevalue=""/>
22 <param label="Map protein translated results to original DNA data" name="protein2dna" type="boolean" truevalue="--protein2dna" falsevalue=""/>
23 </inputs>
24 <outputs>
25 <data format="gff3" name="output"/>
26 </outputs>
27 <tests>
28 <test>
29 <param name="parent" value="parent.gff"/>
30 <param name="child" value="child.gff"/>
31 <param name="protein2dna" value="True" />
32 <output name="output" file="proteins.gff"/>
33 </test>
34 </tests>
35 <help><![CDATA[
36 **What it does**
37
38 Often the genomic data processing/analysis process requires a workflow like the following:
39
40 - select some features from a genome
41 - export the sequences associated with those regions
42 - analyse those exports with some tool like Blast
43
44 For display, especially in software like JBrowse, it is convenient to know
45 where in the original genome the analysis results would fall. E.g. if a
46 transmembrane domain is detected at bases 10-20 of an analysed protein, where
47 should this be displayed relative to the parent genome?
48
49 This tool helps fill that gap, by rebasing some analysis results against the
50 parent features which were originally analysed.
51
52 **Example Inputs**
53
54 For a "child" set of annotations like::
55
56 #gff-version 3
57 cds42 blastp match_part 1 50 1e-40 . . ID=m00001;Notes=RNAse A Protein
58
59 And a parent set of annotations like::
60
61 #gff-version 3
62 PhageBob maker cds 300 600 . + . ID=cds42
63
64 One could imagine that during the analysis process, the user had exported the parent annotation into some sequence::
65
66 >cds42
67 M......
68
69 and then analysed those results, producing the "child" annotation file. This
70 tool will then localize the results properly against the parent::
71
72 #gff-version 3
73 PhageBob blastp match_part 300 449 1e-40 + . ID=m00001;Notes=RNAse A Protein
74
75 which will allow you to display the results in the correct location in visualizations.
76
77 **Options**
78
79 There are two optional flags which can be passed.
80
81 The Interpro flag selectively ignores features which shouldn't be included in
82 the output (i.e. ``polypeptide``), and a couple of qualifiers that aren't
83 useful (``status``, ``Target``)
84
85 The "Map Protein..." flag says that you translated the sequences during the
86 genomic export process, analysing protein sequences. This indicates to the
87 software that the bases should be multiplied by three to obtain the correct DNA
88 locations.
89 ]]></help>
90 <citations>
91 <citation type="doi">10.1093/bioinformatics/btp163</citation>
92 </citations>
93 </tool>