Mercurial > repos > geert-vandeweyer > dc_genotyper
annotate DC_Genotyper.xml @ 13:08c4fb95c3a9 draft
changed dep to samtools .19 to include bcftools
author | geert-vandeweyer |
---|---|
date | Sat, 27 Sep 2014 07:10:10 -0400 |
parents | 845a87ad254a |
children | 36cc147395ad |
rev | line source |
---|---|
11 | 1 <tool id="DC_Genotyper" name="DC Genotyper" version='0.0.1'> |
2 <description></description> | |
3 <requirements> | |
4 <requirement type='package' version='3.0.2'>R_3_0_2</requirement> | |
13
08c4fb95c3a9
changed dep to samtools .19 to include bcftools
geert-vandeweyer
parents:
11
diff
changeset
|
5 <requirement type='package' version='0.1.19'>samtools</requirement> |
11 | 6 <requirement type='package' version='0.2.6'>tabix</requirement> |
7 <requirement type='package' version='latest'>blat_server</requirement> | |
8 <requirement type='package' version='1.92'>perl_module_threads</requirement> | |
9 <requirement type='package' version='1.46'>perl_module_threads_shared</requirement> | |
10 <requirement type='package' version='3.02'>perl_module_Thread_Queue</requirement> | |
11 <requirement type='package' version='2.3.32'>igvtools</requirement> | |
12 </requirements> | |
13 <command interpreter="perl">DC_Genotyper.pl | |
14 -t "$targets" | |
15 -b "$bamfile" | |
16 -R "${ref.fields.path}" | |
17 -p "\${GALAXY_SLOTS:-4}" | |
18 #if $dbsnp.source == "history": | |
19 -s "${dbsnp.ownFile}" | |
20 #else | |
21 -s "${dbsnp.indices.fields.path}" | |
22 #end if | |
23 -m $mincov | |
24 -P $ploidy | |
25 | |
26 -a $output1 | |
27 -v $output2 | |
28 </command> | |
29 | |
30 <inputs> | |
31 <param name="bamfile" type="data" format="bam" label="Sample BAM file" /> | |
32 <param name="targets" type="data" format="bed" label="Enrichment BED file" /> | |
33 <param name="ref" type="select" label="Select a reference genome"> | |
34 <options from_data_table="DC_Genotyper_indexes"> | |
35 <filter type="sort_by" column="2" /> | |
36 <validator type="no_options" message="No indexes are available" /> | |
37 </options> | |
38 </param> | |
39 <conditional name="dbsnp"> | |
40 <param name="source" type="select" label="Will you select a dbSNP file from your history, or use a built in version (which is faster)"> | |
41 <option value="indexed">Use a built-in version</option> | |
42 <option value="history">Use one from the history</option> | |
43 </param> | |
44 <when value="indexed"> | |
45 <param name="indices" type="select" label="Select a dbSNP version"> | |
46 <options from_data_table="dbsnp_indexes"> | |
47 <filter type="sort_by" column="2" /> | |
48 <validator type="no_options" message="No indexes are available" /> | |
49 </options> | |
50 </param> | |
51 </when> | |
52 <when value="history"> | |
53 <param name="ownFile" type="data" format="vcf,bcf" label="Select a dbSNP file from history"/> | |
54 </when> | |
55 </conditional> | |
56 <param name="mincov" value="400" type="integer" label="Minimal Coverage Depth" /> | |
57 <param name="ploidy" type="integer" value='10' label="Expected Sample Ploidy" /> | |
58 </inputs> | |
59 | |
60 <outputs> | |
61 <data format='txt' name="output1" label="${tool.name} on ${on_string}: Allele Fraction Distributions"/> | |
62 <data format='vcf' name='output2' label="${tool.name} on ${on_string}: VCF file" /> | |
63 </outputs> | |
64 <help> | |
65 | |
66 **What it does** | |
67 | |
68 1. get allele counts on all positions in specified targets (bed) using igvtools. Only SNPs !! | |
69 2. remove known dbsnp positions (bcf file) | |
70 3. Get distribution of background noise (pcr/sequencing errors), by modelling allele fractions as normal distributions. | |
71 4. Based on these distributions, check each position for significant change from the reference allele (based on allele fraction) | |
72 5. For abberant positions, check each alternate allele to see if it passes the background signal. | |
73 6. Generate VCF file. | |
74 | |
75 | |
76 **Information** | |
77 | |
78 This tools is created by Geert Vandeweyer. It is a very early version with several limitations. Current limitations are : no support for indels, no plotting of the noise-models, incorrect syntax in for multi-allelic sites in the VCF file. | |
79 | |
80 Any feedback is welcome. | |
81 | |
82 </help> | |
83 </tool> |