comparison gbsx_digest.xml @ 22:062399f3787f draft

new version of gbsx: v1.2 with barcodeGenerator, demultiplex and digest
author koen
date Tue, 26 Jan 2016 07:38:36 -0500
parents
children 3b325560887b
comparison
equal deleted inserted replaced
21:32ee036c156a 22:062399f3787f
1 <tool id="gbsx_digest" name="GBSX: Digest" version="1.1.5">
2 <description>simulates a digest for the selection of the best enzymes</description>
3 <requirements>
4 <requirement type="package" version="1.1.5">GBSX</requirement>
5 </requirements>
6 <command interpreter="sh">
7 gbsx_digest.sh $genome $read_length $enzyme_combination $adv.min_size $adv.max_size $digest_bed $digest_result
8 </command>
9 <inputs>
10 <param name="genome" format="fasta" type="data" label="The reference genome" />
11 <param name="read_length" size="20" type="integer" value="150" min="1" max="1000" label="The length of the reads to generate" />
12 <param name="enzyme_combination" size="4" type="select" label="The enzyme to use" >
13 <option value="ApeKI">ApeKI</option>
14 <option value="PstI">PstI</option>
15 <option value="EcoT22I">EcoT22I</option>
16 <option value="PasI">PasI</option>
17 <option value="HpaII">HpaII</option>
18 <option value="MspI">MspI</option>
19 <option value="PstI-EcoT22I">PstI-EcoT22I</option>
20 <option value="PstI-MspI">PstI-MspI</option>
21 <option value="PstI-TaqI">PstI-TaqI</option>
22 <option value="SbfI-MspI">SbfI-MspI</option>
23 <option value="AsiSI-MspI">AsiSI-MspI</option>
24 <option value="BssHII-MspI">BssHII-MspI</option>
25 <option value="FseI-MspI">FseI-MspI</option>
26 <option value="SalI-MspI">SalI-MspI</option>
27 <option value="ApoI">ApoI</option>
28 <option value="BamHI">BamHI</option>
29 <option value="MseI">MseI</option>
30 <option value="Sau3AI">Sau3AI</option>
31 <option value="RBSTA">RBSTA</option>
32 <option value="RBSCG">RBSCG</option>
33 <option value="NspI">NspI</option>
34 <option value="AvaII">AvaII</option>
35 </param>
36 <section name="adv" title="Advanced Options" expanded="False">
37 <param name="min_size" size="20" type="integer" value="100" min="1" max="1000" label="The minimum size of the digest fragments to keep" />
38 <param name="max_size" size="20" type="integer" value="1000" min="1" max="10000" label="The minimum size of the digest fragments to keep" />
39 </section>
40 </inputs>
41 <outputs>
42 <data format="bed" name="digest_bed" label="GBSX: digest: fragments" />
43 <data format="txt" name="digest_result" label="GBSX: digest: results" />
44 </outputs>
45
46 <tests>
47 <test>
48 <param name="genome" value="chr21.fa"/>
49 <param name="read_length" value="150"/>
50 <param name="enzyme" value="ApeKI"/>
51 <output name="digest_bed" file="chr21.l150.apeki.bed"/>
52 <output name="digest_result" file="chr21.l150.apeki.txt"/>
53 </test>
54 </tests>
55
56 <help>
57
58 .. class:: infomark
59
60 **Overview**
61
62 Genotyping by Sequencing is an emerging technology for cost effective variant discovery and genotyping. However, current analysis tools do not fulfill all experimental design and analysis needs.
63
64 GBSX is a package of tools to first aid in experimental design, including choice of enzymes and barcode design. Secondly, it provides a first analysis step to demultiplex samples using in-line barcodes, providing fastq files that can easily be plugged into existing variant analysis pipelines.
65
66 ----
67
68 This tool simulates the restriction digest. The output is a file with all possible sequenced reads, and an overview file.
69
70 This will generate 2 files: genome.Enzyme.100nt.digest.bed and genome.Enzyme.100nt.digest_results.
71 The bed file contains all start-stop locations of the sequenced portion (based on a read length of 100bp) of all fragments that are above or equal to 100bp and below or equal to 1000bp in length. The results file give a brief summary of the fragments, including the total number of fragments and the number of fragments between 100-1000bp. For fragments between 100-1000bp, the results file also indicates the number of fragments per chromosome, the distribution of distances between fragments, and the distribution of fragment lengths.
72
73 When using this digest-tool for research, it is possible to determine the SNPs that will be sequenced. This generated bed file can be used with bedtools, dbSNP, and can be uploaded in ucsc as a custom track.
74
75 Only the most used options are included. For more options, please use the command line version of GBSX.
76
77 ----
78
79 **Created by:**
80
81 Koen Herten,
82
83 Genomics Core Leuven,
84
85 Belgium
86
87
88 ----
89
90 **Example:**
91
92 The outcome of this tool will always be different.
93
94 ----
95
96 **Project links:**
97
98 https://github.com/GenomicsCoreLeuven/GBSX
99
100 ----
101
102 **Integrated by:**
103
104 Koen Herten
105
106 Genomics Core
107
108 Center for Human Genetics
109
110 UZ – KU Leuven
111
112 Herestraat 49 PO box 602
113
114 B-3000 Leuven, Belgium
115
116
117
118 If you use this tool in Galaxy, please cite:
119
120
121 </help>
122 <citations>
123 <citation type="doi">10.1186/s12859-015-0514-3</citation>
124
125 <citation type="bibtex">@INPROCEEDINGS{
126 author = {Herten, Koen and Hestand, Matthew S. and Vermeesch, Joris R. and Van Houdt, Jeroen KJ},
127 doi = {10.1186/s12859-015-0514-3},
128 issn = {1471-2105},
129 journal = {BMC Bioinformatics},
130 number = {1},
131 pages = {1–6},
132 title = {GBSX: a toolkit for experimental design and demultiplexing genotyping by sequencing experiments},
133 url = {http://dx.doi.org/10.1186/s12859-015-0514-3},
134 volume = {16},
135 year = {2015}
136 }</citation>
137 </citations>
138
139
140 </tool>