comparison calculate_priors_from_ase_count_tables.xml @ 0:e979cb57a5d5 draft default tip

"planemo upload for repository https://github.com/McIntyre-Lab/BayesASE/tree/main/galaxy commit 9b70598ef46a73632d9e0fa0c6ce6776fb5e9d6a"
author malex
date Thu, 14 Jan 2021 21:51:36 +0000
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:e979cb57a5d5
1 <tool id="calculate_priors_from_ase_count_tables" name="Calculate Prior Probability Estimates" version="21.1.13">
2 <description>using ASE Count Tables</description>
3 <macros>
4 <import>macros.xml</import>
5 </macros>
6 <expand macro="requirements"/>
7 <command><![CDATA[
8 mkdir outputs;
9 cd outputs;
10 calculate_priors_ase_count_tables.py
11 --output=`pwd`
12 --design=$design
13 --collection_identifiers="${",".join($collection.keys())}"
14 --collection_filenames="${",".join(map(str, $collection))}"
15
16 ]]></command>
17 <inputs>
18 <param name="design" type="data" format="tabular,tsv" label="Priors Design File" help="Select the Priors Design file [created by the Reformat Sample Design File Tool]"/>
19 <param name="collection" type="data_collection" collection_type="list" label="Select dataset collection for Prior Calculations" help="Datasets in this collection can be generated from simulated or DNA reads using the Align and Count' and 'Summarize counts' Workflows."/>
20 </inputs>
21 <outputs>
22 <collection name="split_output" type="list" label="${tool.name} on ${on_string}: Calculate Prior Probabilities">
23 <discover_datasets pattern="(?P&lt;designation&gt;.*)" ext="tsv" directory="outputs"/>
24 </collection>
25 </outputs>
26 <tests>
27 <test>
28 <param name="design" value="summarize_counts_testdata/sample_design_file.tabular" ftype="tsv"/>
29 <param name="collection">
30 <collection type="list">
31 <element name="FEATURE_ID" value="summarize_counts_testdata/filtered_ASE_counts_tables_BASE"/>
32 </collection>
33 </param>
34 <output_collection name="split_output" type="list">
35 <element name="FEATURE_ID">
36 <assert_contents>
37 <has_text_matching expression="Calculated_priors" />
38 </assert_contents>
39 </element>
40 </output_collection>
41 </test>
42 </tests>
43 <help><![CDATA[
44 **Tool Description**
45
46 The Calculate Prior Probability Estimates tool calculates prior probability estimates for each of the comparates.
47 Prior Probability is a part of Bayesian statistical inference, and is useful for ruling out whether alleles aligning better to one parent is due to allelic imbalance,
48 or if it is really due to mapping biases or technical errors in the data.
49
50 The output file contains computations for three variables, prior_[comparate name]_g1 and prior_[comparate name]_g2, which represent the probability that the feature will align to either parental genome, and prior_[comparate name]_both, which are the chances the feature will align to equally well to both genomes.
51
52
53 Calculation::
54
55 prior_{comparate)_both= [# reads aligning equally to both genomes] prior_{comparate)_{G1/G2}= 1-(prior_both)
56 ----------------------------------------- --------------
57 [total uniquely mapped reads] 2
58
59
60
61
62 --------------------------------------------------------------------------------------------------------------------
63
64 **Inputs**
65
66 **Prior Design File [REQUIRED]**
67
68 (1) This tool requires the Priors Design file, which can be created using the *Reformat Sample Design File* tool.
69
70 The Priors Design File must be in the following format and contain the correct header order:
71
72 1. G1- This is the name of the paternal genome
73 2. G2- The name of the maternal genome
74 3. comparate - contains comparate conditions. Exclude replicate number.
75
76
77
78 Example of input design file::
79
80 +-------+------+-----------+
81 | G1 | G2 | comparate |
82 +-------+------+-----------+
83 | W1118 | W55 | W55_M |
84 +-------+------+-----------+
85 | W1118 | W55 | W55_M |
86 +-------+------+-----------+
87
88
89 **Input for Priors Calculations - Summed ASE Counts Tables [Required]**
90
91 The collection of datasets in Summarized and Filtered Format.
92 These datasets can be generated by running simulated reads or DNA reads through the 'Align and Count' and 'Summarize Counts' Workflows.
93
94 Example Input File::
95
96 FEATURE_ID g1 g2 W55_M_flag_analyze W55_M_num_reps W55_M_g1_total_rep1 W55_M_g2_total_rep1 W55_M_both_total_rep1 W55_M_flag_apn_rep1 W55_M_APN_total_reads_rep1 W55_M_APN_both_rep1 W55_M_g1_total_rep2 W55_M_g2_total_rep2 W55_M_both_total_rep2 W55_M_flag_apn_rep2 W55_M_APN_total_reads_rep2 W55_M_APN_both_rep2
97 l(1)G0196 W1118 W55 1 2 691 519 5020 1 29.073464805232 23.4243873865079 1075 812 7481 1 43.7266913990042 34.9168212437762
98 CG8920 W1118 W55 1 2 29 62 647 1 10.3878993081113 9.10716914470779 38 126 920 1 15.2470189901369 12.9534815250994
99 CG10932 W1118 W55 1 2 163 122 1112 1 89.9299663299663 71.5858585858586 237 134 1881 1 144.974410774411 121.086195286195
100
101
102
103 ----------------------------------------------------------------------------------------------------
104
105 **Tool Outputs**
106
107 -One TSV file containing the calculated prior probabilities for reads that align preferentially to G1, G2, and those that align equally to both.
108
109 Example Priors Calculation File::
110
111 +--------------+------------------+-----------------+---------------+
112 | FEATURE_ID | prior_W55_M_both | prior_W55_M_g1 |prior_W55_M_g2 |
113 +==============+==================+=================+===============+
114 | l(1)G0196 | 0.799907266902 | 0.1183611532625 | 0.08173157983 |
115 +--------------+------------------+-----------------+---------------+
116 | CG10932 | 0.853881278538 | 0.0597412480974 | 0.08637747336 |
117 +--------------+------------------+-----------------+---------------+
118 | CG8920 | 0.80895522388 | 0.1233830845771 | 0.06766169154 |
119 +--------------+------------------+-----------------+---------------+
120 | Mapmodulin | 0.8987341772151 | 0.0855389336401 | 0.01572688914 |
121 +--------------+------------------+-----------------+---------------+
122
123 Header descriptions::
124
125 prior_{comparate}_both: prior probability estimate for the total number of reads that mapped equally well to both updated parental genomes
126 prior_{comparate}_g1: prior probability estimate for the total number of reads that mapped to updated parental genome 1
127 prior_{comparate}_g2: prior probability estimate for the total number of reads that mapped to updated parental genome 2
128
129
130 ]]></help>
131 <citations>
132 <citation type="bibtex">@ARTICLE{Miller20BASE,
133 author = {Brecca Miller, Alison M. Morse, Elyse Borgert, Zihao Liu, Kelsey Sinclair, Gavin Gamble, Fei Zou, Jeremy Newman, Luis Leon Novello, Fabio Marroni, Lauren M. McIntyre},
134 title = {Testcrosses are an efficient strategy for identifying cis regulatory variation: Bayesian analysis of allele imbalance among conditions (BASE)},
135 journal = {????},
136 year = {submitted for publication}
137 }</citation>
138 </citations>
139 </tool>