comparison bp_genbank2gff3.xml @ 0:f79bcd53b9a3 draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bioperl commit 799339e22181d28cb2b145454d353d6025779636
author iuc
date Fri, 09 Oct 2015 09:19:49 -0400
parents
children 792a280ebeea
comparison
equal deleted inserted replaced
-1:000000000000 0:f79bcd53b9a3
1 <tool id="bp_genbank2gff3" name="Genbank to GFF3" version="1.0">
2 <description>converter</description>
3 <macros>
4 <import>macros.xml</import>
5 </macros>
6 <expand macro="stdio" />
7 <command><![CDATA[
8 bp_genbank2gff3.pl
9 $noinfer
10 #if str($sofile.sofile) != "__none__":
11 --sofile
12 #if str($sofile.sofile) == "url":
13 "${sofile.so_url}"
14 #else:
15 live
16 #end if
17 #end if
18 --outdir -
19 --ethresh $ethresh
20 $model
21 --typesource "${typesource}"
22
23 $genbank
24 > $gff3]]></command>
25 <inputs>
26 <param label="Genbank file" name="genbank" type="data" format="gb"/>
27 <param name="noinfer" truevalue="" falsevalue="--noinfer" checked="true" type="boolean" label="Infer exon/mRNA subfeatures"/>
28 <conditional name="sofile" label="Sequence Ontology">
29 <param name="sofile" label="Sequence Ontology File" type="select">
30 <option value="__none__" selected="True">None specified</option>
31 <option value="live">Latest Sequence Ontology</option>
32 <option value="url">User Specified</option>
33 </param>
34 <when value="__none__" />
35 <when value="live" />
36 <when value="url">
37 <param name="so_url" label="Sequence Ontology URL" type="text"/>
38 </when>
39 </conditional>
40 <param name="ethresh" label="Error threshold for unflattener" type="select">
41 <option value="0">Strict</option>
42 <option value="1" selected="True">Medium</option>
43 <option value="2">Loose</option>
44 <option value="3">Ignore Errors</option>
45 </param>
46 <param name="model" label="Gene Model" type="select">
47 <option value="--CDS" selected="True">Default GFF gene model</option>
48 <option value="--noCDS">Alternate gene-RNA-protein-exon model</option>
49 </param>
50 <param name="typesource" label="Sequence Ontology type for landmark feature" help="E.g. chromosome, region, contig" value="contig" type="text" />
51 </inputs>
52 <outputs>
53 <data format="gff3" name="gff3" label="${genbank.name} as GFF3"/>
54 </outputs>
55 <tests>
56 <test>
57 <param name="genbank" value="seq.gb" />
58 <param name="noinfer" value="True" />
59 <output name="gff3" file="seq.gb.0.gff" ftype="gff3" lines_diff="6"/>
60 </test>
61 <test>
62 <param name="genbank" value="seq.gb" />
63 <output name="gff3" file="seq.gb.1.gff" ftype="gff3" lines_diff="4"/>
64 </test>
65 </tests>
66 <help><![CDATA[
67 **What it does**:
68
69 This tool uses Bio::SeqFeature::Tools::Unflattener and
70 Bio::Tools::GFF to convert GenBank flatfiles to GFF3 with gene
71 containment hierarchies mapped for optimal display in gbrowse.
72
73 The input files are assumed to be gzipped GenBank flatfiles for refseq
74 contigs. The files may contain multiple GenBank records.
75
76 **Designed for RefSeq**
77
78 This script is designed for RefSeq genomic sequence entries. It may
79 work for third party annotations but this has not been tested.
80 But see below, Uniprot/Swissprot works, EMBL and possibly EMBL/Ensembl
81 if you don't mind some gene model unflattener errors (dgg).
82
83 **G-R-P-E Gene Model**
84
85 Don Gilbert worked this over with needs to produce GFF3 suited to
86 loading to GMOD Chado databases.
87
88 This writes GFF with an alternate, but useful Gene model,
89 instead of the consensus model for GFF3
90
91 [ gene > mRNA> (exon,CDS,UTR) ]
92
93 This alternate is
94
95 gene > mRNA > polypeptide > exon
96
97 means the only feature with dna bases is the exon. The others
98 specify only location ranges on a genome. Exon of course is a child
99 of mRNA and protein/peptide.
100
101 The protein/polypeptide feature is an important one, having all the
102 annotations of the GenBank CDS feature, protein ID, translation, GO
103 terms, Dbxrefs to other proteins.
104
105 UTRs, introns, CDS-exons are all inferred from the primary exon bases
106 inside/outside appropriate higher feature ranges. Other special gene
107 model features remain the same.
108
109 **Authors**
110
111 Sheldon McKay (mckays@cshl.edu)
112
113 Copyright (c) 2004 Cold Spring Harbor Laboratory.
114
115 **Author of hacks for GFF2Chado loading**
116
117 Don Gilbert (gilbertd@indiana.edu)
118 ]]></help>
119 <citations>
120 <citation type="doi">10.1101/gr.361602</citation>
121 </citations>
122 </tool>