annotate EMS_VariantDensityMapping.xml @ 12:5cc9b35b5a52 draft

Uploaded
author gregory-minevich
date Mon, 08 Oct 2012 16:18:36 -0400
parents
children
<tool id="ems_variant_density_mapping" name="CloudMap: EMS Variant Density Mapping">
  <description>Map a mutation by linkage to regions of high mutation density using WGS data</description>
  <command interpreter="python">EMS_VariantDensityMapping.py --snp_vcf "$snp_vcf" --ylim "$ylim" --hist_color "$hist_color" --standardize "$standardize" --ems "$ems" --output "$output" </command>
  <inputs>
    <param name="snp_vcf" type="data" format="vcf" label="VCF of SNPs" help="Takes a VCF file of WGS variants present in a C. elegans mutant strain that has been backcrossed to its (pre-mutagenesis) starting strain"/>
    <param name="ylim" size="15" type="integer" value="200" label="Y-axis upper limit"/>
    <param name="hist_color" size="15" type="text" value="darkgray" label="Color for 1Mb bins" help="See below for list of supported colors"/>
    <param name="standardize" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Standardize X-axis" help="Frequency plots from separate chromosomes will have uniform X-axis spacing for comparison"/>
    <param name="ems" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Filter for most common EMS-induced variants (G/C --&gt; A/T)"/>
  </inputs>
  <outputs>
    <data name="output" type="text" format="pdf" />
  </outputs>
  <requirements>
    <requirement type="python-module">sys</requirement>
    <requirement type="python-module">optparse</requirement>
    <requirement type="python-module">csv</requirement>
    <requirement type="python-module">re</requirement>
    <requirement type="python-module">decimal</requirement>
    <requirement type="python-module">rpy</requirement>
  </requirements>
  <tests>
    <param name="snp_vcf" value="" />
    <output name="output" file="" />
  </tests>
  <help>
**What it does:**

This tool is part of the CloudMap pipeline for analysis of mutant genome sequences. For further details, please see `Gregory Minevich, Danny S. Park, Daniel Blankenberg, Richard J. Poole, and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (Genetics 2012 In Press)`__

.. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html

CloudMap workflows, shared histories and reference datasets are available at the `CloudMap Galaxy page`__

.. __: http://usegalaxy.org/cloudmap

Following the approach detailed in Zuryn et al., Genetics 2010, this tool plots histograms of variant density in a mutant C. elegans strain that has been backcrossed to its (pre-mutagenesis) starting strain. Common (i.e. non-phenotype-causing) variants present in multiple WGS strains **with the same background** should first be subtracted using the GATK tool *Select Variants*.

Sample output in which LG III shows linkage to the causal mutation is shown below. (In this example, common variants from another strain have been subtracted and the remaining variants have been filtered for the most common EMS-induced mutations, i.e. G/C --> A/T.)

.. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/EMS_Variant_Density_750px.png

The experimental approach is detailed in Figure 1a from Zuryn et al., Genetics 2010:

.. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/Zuryn_2010_Genetics_Fig1a.pdf

Subtracting common (non-phenotype-causing) variants from additional whole-genome-sequenced strains (using the GATK tool *Select Variants*) will result in less noise and a tighter mapping region. Additional backcrosses will also result in a smaller mapping region.

------

**Settings:**

.. class:: infomark

Supported colors for the histogram bins:

http://www.stat.columbia.edu/~tzheng/files/Rcolor.pdf

http://research.stowers-institute.org/efg/R/Color/Chart/ColorChart.pdf

.. class:: warningmark

This tool requires that the statistical programming environment R has been installed on the system hosting Galaxy (http://www.r-project.org/). If you are accessing this tool on Galaxy via the Cloud, this does not apply to you.

------

**Citation:**

This tool is part of the CloudMap package from the Hobert Lab. If you use this tool, please cite `Gregory Minevich, Danny S. Park, Daniel Blankenberg, Richard J. Poole, and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (Genetics 2012 In Press)`__

.. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html

Correspondence to gm2123@columbia.edu (G.M.) or or38@columbia.edu (O.H.)
  </help>
</tool>
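
The EMS filter and 1Mb binning described in the help text can be sketched in Python roughly as follows. This is an illustration only, not the code in EMS_VariantDensityMapping.py: the names `is_ems_variant` and `bin_variant_counts` are hypothetical, reading "G/C --> A/T" as G to A and C to T is an assumption, and records with multi-allelic ALT fields are simply not matched by the filter.

```python
from collections import Counter

# Assumption: "G/C --> A/T" is read as the transitions G->A and C->T.
EMS_CHANGES = {("G", "A"), ("C", "T")}
BIN_SIZE = 1000000  # 1 Mb bins, as in the "Color for 1Mb bins" setting


def is_ems_variant(ref, alt):
    """Return True if REF/ALT is one of the assumed EMS-type changes."""
    return (ref.upper(), alt.upper()) in EMS_CHANGES


def bin_variant_counts(vcf_lines, ems_only=True, bin_size=BIN_SIZE):
    """Count variants per (chromosome, bin index) from VCF text lines.

    Header lines (starting with '#'), blank lines and records with fewer
    than five tab-separated columns are skipped.
    """
    counts = Counter()
    for line in vcf_lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue
        chrom, pos, ref, alt = fields[0], int(fields[1]), fields[3], fields[4]
        if ems_only and not is_ems_variant(ref, alt):
            continue
        # VCF positions are 1-based, so position 1000000 falls in bin 0.
        counts[(chrom, (pos - 1) // bin_size)] += 1
    return counts
```

The per-bin counts for one chromosome are what a histogram like the sample output above would display; the plotting itself is done in R by the actual tool and is not reproduced here.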