annotate CloudMap_InSilico.xml @ 1:5389dc5a0be3 draft default tip

Uploaded
author gregory-minevich
date Thu, 01 Nov 2012 19:26:05 -0400
<tool id="in_silico_complementation" name="CloudMap: in silico complementation">
    <description>Perform in silico complementation analysis on multiple tabular snpEff output files</description>
    <command interpreter="python">
        CloudMap_InSilico.py -s "$summary_output_file" -o "$data_output_file"
        -i
        #for $input_files in $input_series:
            "${input_files.input_files}"
        #end for
        -n
        #for $input_files in $input_series:
            "${input_files.sample_names}"
        #end for

    </command>
    <inputs>
        <repeat min="1" name="input_series" title="Input file">
            <param name="input_files" type="data" format="tabular" label="snpEff annotated variants (tabular file)"/>
            <param name="sample_names" type="text" value="" size="20" label="Sample name" help="Name should correspond to sample file">
                <validator type="length" min="1" message="You must provide a unique sample name" />
            </param>
        </repeat>
    </inputs>

    <outputs>
        <data name="summary_output_file" format="tabular"/>
        <data name="data_output_file" format="tabular"/>
    </outputs>
    <tests>
        <test>
            <param name="input_files" value="1.txt"/>
            <param name="sample_names" value="2.txt"/>
            <output name="summary_output_file" file="summary_output_file.txt"/>
            <output name="data_output_file" file="data_output_file.txt"/>

        </test>
    </tests>
    <help>

.. class:: warningmark

**What it does**

This tool is part of the CloudMap pipeline for analysis of mutant genome sequences. For further details, please see `Gregory Minevich, Danny S. Park, Daniel Blankenberg, Richard J. Poole and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (Genetics 2012 In Press)`__

.. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html

CloudMap workflows, shared histories and reference datasets are available at the `CloudMap Galaxy page`__.

.. __: http://usegalaxy.org/cloudmap

If performed on a large scale, forward genetic screens usually yield multiple alleles of individual loci, which define specific complementation groups. The traditional way to identify such complementation groups is via complementation tests performed by genetic crosses. If screens have revealed dozens of mutants, comprehensive complementation testing can be time-consuming and labor-intensive. Moreover, complementation tests are impossible to perform with dominant alleles and are sometimes subject to misleading results (such as allelic complementation or non-allelic non-complementation). With the decreasing costs of whole genome sequencing, it is now possible to simply sequence many mutants that result from a screen and determine in silico which mutants carry variants in the same locus. To allow such analysis, we developed the CloudMap “in silico Complementation Test” tool, which compares tabular lists of annotated variants from the program snpEff for shared gene hits (alleles). The input lists should already have been filtered for quality (see Materials and Methods of the paper cited above) and had common variants subtracted.

This tool creates two output files:

1. A summary file of the number of shared gene hits among the sequenced mutants, sorted from most to fewest:

.. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/Supp.Fig.1_in-silico_compSumm.png

2. A corresponding file of the snpEff annotated alleles from each sample, also sorted from most to fewest:

.. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/Supp.Fig.2_in-silico_compOut.png

------

**Citation:**

This tool is part of the CloudMap package from the Hobert Lab. If you use this tool, please cite `Gregory Minevich, Danny S. Park, Daniel Blankenberg, Richard J. Poole and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (Genetics 2012 In Press)`__

.. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html

Correspondence to gm2123@columbia.edu (G.M.) or or38@columbia.edu (O.H.)

The annotated variant files used as input to this in silico complementation tool are generated by the snpEff program:

CINGOLANI, P., A. PLATTS, L. WANG LE, M. COON, T. NGUYEN et al., 2012 A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin) 6: 80-92.
    </help>
</tool>
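
The comparison described in the help text (counting, per gene, how many samples carry a variant in it, then sorting from most to fewest) can be sketched roughly as follows. This is not the code in CloudMap_InSilico.py, which is not shown here. The function name `shared_gene_hits`, the in-memory row format, and the position of the gene identifier column are all assumptions made for illustration.

```python
from collections import defaultdict


def shared_gene_hits(samples, gene_column=0):
    """Count how many samples carry at least one variant in each gene.

    samples: dict mapping a sample name to a list of tab-separated rows.
    gene_column: index of the gene identifier column. This is an assumption;
    a real snpEff tabular file keeps the gene in a different column.
    """
    hits = defaultdict(set)
    for name, rows in samples.items():
        for row in rows:
            if not row.strip() or row.startswith("#"):
                continue  # skip blank lines and header/comment lines
            fields = row.rstrip("\n").split("\t")
            hits[fields[gene_column]].add(name)
    # Sort genes from most to fewest samples hit, then by gene name.
    return sorted(
        ((gene, sorted(names)) for gene, names in hits.items()),
        key=lambda item: (-len(item[1]), item[0]),
    )


# Hypothetical toy data: three mutants, two genes.
example = {
    "mutant_a": ["geneX\tI\t100", "geneY\tII\t200"],
    "mutant_b": ["geneX\tI\t150"],
    "mutant_c": ["geneX\tI\t175", "geneY\tII\t210"],
}
summary = shared_gene_hits(example)
for gene, names in summary:
    print(gene, len(names), ",".join(names))
```

Each sample is collected into a set per gene, so several variants in one gene from the same mutant count as a single hit for that mutant. Whether the real tool counts hits the same way is not stated in the help text.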