annotate regionFitness.xml @ 4:eddf4584cf85 draft

Uploaded
author antmarge
date Tue, 28 Mar 2017 10:53:58 -0400
parents
children 48c8d3ed441a
rev   line source
4
eddf4584cf85 Uploaded
antmarge
parents:
diff changeset
1 <tool id="regionFitness" name="Region Fitness" version="0.1.0">
2
3 <!-- Margaret Antonio 17.01.08 -->
4
5 <requirements>
6 <!--<requirement type="set_environment">LINKYX_PATH</requirement>-->
7 <requirement type="package" version="5.18.1">perl</requirement>
8 <requirement type="package" version="0.25">perl_list_binarysearch</requirement>
9 <requirement type="package" version="2.45">perl_getopt_long</requirement>
10 <requirement type="package" version="0.12">perl_data_random</requirement>
11 </requirements>
12
13 <description>assess fitness effect of mutations in a region (sliding window or custom)</description>
14
15 <command interpreter="perl">
16 regionFitness.pl
17 -f $fasta
18 -r $ref_genome
19 -c $cutoff
20 -n $run
21 -m $max
22 #if $region.define == "c":
23 -u $region.custom
24 #end if
25 #if $region.define == "s":
26 -size $region.size
27 -step $region.step
28 #end if
29 #if $weight.algorithm == "yes":
30 -w
31 -wc $weight.ceiling
32 #end if
33
34 -f1 $allTAsites
35 -f2 $nullDist
36 -f3 $slidingWindows
37 -f4 $fitWindowscsv
38 -f5 $fitWindowswig
39 -f6 $fitWindowstxt
40
41 $input
42 #for $a in $additionalcsv
43 ${a.input2}
44 #end for
45
46 </command>
47
48 <inputs>
49 <param name="input" type="data" label="CSV Fitness File(s)"/>
50 <repeat name="additionalcsv" title="Additional csv fitness file(s)">
51 <param name="input2" type="data" label="Select" />
52 </repeat>
53 <param format="fasta" name="fasta" type="data" label="Fasta file"/>
54 <param name="ref_genome" type="data" label="GenBank reference genome"/>
55
56 <conditional name="region">
57 <param name="define" type="select" label="Define regions: custom or sliding?">
58 <option value="s">Sliding Windows</option>
59 <option value="c">Custom</option>
60 </param>
61 <when value="s">
62 <param name="size" type="integer" value="500" label="Sliding window size"/>
63 <param name="step" type="integer" value="10" label="Sliding window intervals"/>
64 </when>
65 <when value="c">
66 <param name="custom" type="data" label="File with custom regions" />
67 </when>
68 </conditional>
69
70 <conditional name="weight">
71 <param name="algorithm" type="select" label="Use weighted algorithms?">
72 <option value="yes">Yes</option>
73 <option value="no">No</option>
74 </param>
75 <when value="yes">
76 <param name="ceiling" type="integer" value="50" label="Weight ceiling"/>
77 </when>
78 </conditional>
79
80 <param name="cutoff" type="integer" value="10" label="Cutoff"/>
81 <param name="max" type="integer" value="100" label="Highest # insertions in region"/>
82 <param name="run" type="text" value="run1" label="Name of run (will be appended to output files)"/>
83
84 </inputs>
85
86 <outputs>
87 <data format="txt" name="allTAsites" label="${run}_allTAsites" />
88 <data format="txt" name="unmatched" label="${run}_unmatched"/>
89 <data format="txt" name="nullDist" label="${run}_nullDist"/>
90 <data format="csv" name="slidingWindows" label="${run}_slidingWindows"/>
91 <data format="csv" name="fitWindowscsv" label="${run}_fitWindows_csv"/>
92 <data format="wig" name="fitWindowswig" label="${run}_fitWindows_wig"/>
93 <data format="txt" name="fitWindowstxt" label="${run}_fitWindows_txt"/>
94 </outputs>
95
96 <help>
97 **Tool Description**
98
99 This tool takes a non-gene-centric approach to assessing the importance of regions (user-defined or sliding windows) to organismal fitness.
100
101 **Options**
102
103 *The csv fitness file(s)*: These are the CSV (comma-separated values) files containing the fitness values that will be used in downstream analyses. They should have been produced by the "Calculate Fitness" tool, so each line after the header should describe one insertion location with the following fields: position,strand,count_1,count_2,ratio,mt_freq_t1,mt_freq_t2,pop_freq_t1,pop_freq_t2,gene,D,W,nW
104
105 *Fasta file*: The FASTA file for the genome of the organism.
106
107 *GenBank reference genome*: The reference genome of the organism you are working with, which needs to be in standard GenBank format. For more on that format, see the GenBank website.
108
109 *Region definition*: Define the regions with a custom file or with sliding windows of a set size and step. If using the custom region option, provide a tab-delimited file with the start and end coordinates of each region, one region per line. If choosing the sliding window option, specify the size of the window over which assessments will be made and the step, which is how far the window advances at each assessment.
110
111 *Max*: The maximum number of insertions expected in a window. This is used to create a null distribution against which the significance of regional essentiality is assessed. An error will be produced if this value is lower than the actual maximum number of insertions in a window. Run the Data Overview tool to find the real maximum. This option is planned for removal once the value can be determined from the data.
112
113 *Weight ceiling*: The upper limit on the weights applied to fitness values. It is only relevant if you are using weighted algorithms.
114
115 *Cutoff*: Fitness scores are ignored for any insertion location whose average count (the sum of the counts from t1 and t2, divided by 2) is less than this value.
116
117 *Run name*: The name of the run, to be appended to the end of every output file.
118
119
120 </help>
121
122 </tool>
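
The sliding-window and cutoff options described in the help text can be illustrated with a minimal sketch. This is not taken from regionFitness.pl: the function names, the 1-based inclusive coordinates, and the choice to drop a trailing partial window are all assumptions made for illustration. The defaults mirror the wrapper's defaults (size 500, step 10, cutoff 10).

```python
from typing import Iterator, Tuple


def sliding_windows(genome_length: int, size: int = 500,
                    step: int = 10) -> Iterator[Tuple[int, int]]:
    """Yield (start, end) windows of `size` positions, advancing by `step`.

    Assumptions (hypothetical, not confirmed by the tool): coordinates are
    1-based and inclusive, and a window that would run past the end of the
    genome is dropped rather than truncated.
    """
    if size <= 0 or step <= 0:
        raise ValueError("size and step must be positive integers")
    start = 1
    while start + size - 1 <= genome_length:
        yield (start, start + size - 1)
        start += step


def passes_cutoff(count_1: float, count_2: float, cutoff: float = 10) -> bool:
    """Return True when the average of the t1 and t2 counts meets the cutoff.

    Per the help text, insertion locations whose average count is less than
    the cutoff are ignored, so a location is kept when the average is at
    least the cutoff.
    """
    return (count_1 + count_2) / 2 >= cutoff


if __name__ == "__main__":
    windows = list(sliding_windows(1000))
    print(windows[0], windows[1], windows[-1], len(windows))
    print(passes_cutoff(5, 14), passes_cutoff(10, 10))
```

With a smaller step, consecutive windows overlap more and more windows are assessed. With a step equal to the window size, the windows tile the genome without overlap.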