annotate aggregate.xml @ 5:74ef799815e5 draft

Uploaded
author kaymccoy
date Thu, 11 Aug 2016 18:30:39 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
5
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
1 <tool id="aggregate" name="Aggregate">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
2 <description>fitness calculations by gene</description>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
3 <requirements>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
4 <requirement type="package" version="1.64">biopython</requirement>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
5 </requirements>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
6 <command interpreter="python">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
7 aggregate.py
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
8 #if $mark.certain == "yes":
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
9 -m $mark.genes
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
10 #end if
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
11 #if $weighted.algorithms == "yes":
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
12 -w 1
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
13 #end if
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
14 -x $cutoff
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
15 -l $weightceiling
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
16 -b $blanks
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
17 -c $ref
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
18 -o $output
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
19 $input
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
20 #for $a in $additionalcsv
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
21 ${a.input2}
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
22 #end for
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
23 </command>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
24 <inputs>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
25 <param name="input" type="data" label="csv fitness file"/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
26 <repeat name="additionalcsv" title="Additional csv fitness file(s)">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
27 <param name="input2" type="data" label="Select" />
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
28 </repeat>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
29 <param name="ref" type="data" label="GenBank reference genome"/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
30 <conditional name="mark">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
31 <param name="certain" type="select" label="Mark certain genes?">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
32 <option value="no">No</option>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
33 <option value="yes">Yes</option>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
34 </param>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
35 <when value="no">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
36 <!-- do nothing -->
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
37 </when>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
38 <when value="yes">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
39 <param name="genes" type="data" label="Genes to mark" />
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
40 </when>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
41 </conditional>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
42 <conditional name="weighted">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
43 <param name="algorithms" type="select" label="Use weighted algorithms?">
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
44 <option value="no">No</option>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
45 <option value="yes">Yes</option>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
46 </param>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
47 <when value="-w 1 "/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
48 <when value=""/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
49 </conditional>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
50 <param name="weightceiling" type="float" value="50.0" label="Weight ceiling"/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
51 <param name="cutoff" type="float" value="10.0" label="Cutoff"/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
52 <param name="blanks" type="data" label="txt output from Calc_fit or Consol_fit"/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
53 </inputs>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
54 <outputs>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
55 <data name="output" format="csv"/>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
56 </outputs>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
57 <help>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
58
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
59 **What it does**
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
60
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
61 This tool calculates the aggregate fitness values of mutations by gene.
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
62
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
63 **The options explained**
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
64
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
65 The csv fitness file(s): These are the csv (comma separated values) files containing the fitness values you want to aggregate by gene. Since they should have been produced by the "Calculate Fitness" tool, each line besides the header should represent the following information for an insertion location: position,strand,count_1,count_2,ratio,mt_freq_t1,mt_freq_t2,pop_freq_t1,pop_freq_t2,gene,D,W,nW
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
66
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
67 GenBank reference genome: the reference genome of whatever model you're working with, which needs to be in standard genbank format. For more on that format see the genbank website.
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
68
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
69 Marking certain genes: If you chose to mark certain genes, those genes will have an "M" under the M column of the resulting aggregate file.
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
70
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
71 Using weighted algorithms: Recommended. If you chose to use weighted algorithms, scores will be weighted by the number of reads their insertion location has, as insertions with more reads tend to be more accurate.
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
72
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
73 Weight ceiling: This value lets you set a weight ceiling for the weights of fitness values. It's only relevant if you're using weighted algorithms.
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
74
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
75 Cutoff: This value lets you ignore the fitness scores of any insertion locations with an average count (the number of counts from t1 and t2 divided by 2) less than it.
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
76
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
77 Blanks: This value lets you exclude a % of blank fitness scores (scores with a fitness of 0) from your calculations. It should be entered as a float (e.g. 0.10 would be 10%). This can be useful when / for !!!
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
78
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
79 The name of your output file: self-explanatory. Remember to have it end in ".csv".
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
80
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
81 **Additional notes**
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
82
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
83 The output file should have each line (besides the header) represent the following information for a particular gene: locus,mean,var,sd,se,gene,Total,Blank,Not Blank,Blank Removed,M
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
84
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
85 </help>
74ef799815e5 Uploaded
kaymccoy
parents:
diff changeset
86 </tool>