Mercurial > repos > malex > bayesase
comparison calculate_priors_from_ase_count_tables.xml @ 0:e979cb57a5d5 draft default tip
"planemo upload for repository https://github.com/McIntyre-Lab/BayesASE/tree/main/galaxy commit 9b70598ef46a73632d9e0fa0c6ce6776fb5e9d6a"
author | malex |
---|---|
date | Thu, 14 Jan 2021 21:51:36 +0000 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:e979cb57a5d5 |
---|---|
1 <tool id="calculate_priors_from_ase_count_tables" name="Calculate Prior Probability Estimates" version="21.1.13"> | |
2 <description>using ASE Count Tables</description> | |
3 <macros> | |
4 <import>macros.xml</import> | |
5 </macros> | |
6 <expand macro="requirements"/> | |
7 <command><![CDATA[ | |
8 mkdir outputs; | |
9 cd outputs; | |
10 calculate_priors_ase_count_tables.py | |
11 --output=`pwd` | |
12 --design=$design | |
13 --collection_identifiers="${",".join($collection.keys())}" | |
14 --collection_filenames="${",".join(map(str, $collection))}" | |
15 | |
16 ]]></command> | |
17 <inputs> | |
18 <param name="design" type="data" format="tabular,tsv" label="Priors Design File" help="Select the Priors Design file [created by the Reformat Sample Design File Tool]"/> | |
19 <param name="collection" type="data_collection" collection_type="list" label="Select dataset collection for Prior Calculations" help="Datasets in this collection can be generated from simulated or DNA reads using the Align and Count' and 'Summarize counts' Workflows."/> | |
20 </inputs> | |
21 <outputs> | |
22 <collection name="split_output" type="list" label="${tool.name} on ${on_string}: Calculate Prior Probabilities"> | |
23 <discover_datasets pattern="(?P<designation>.*)" ext="tsv" directory="outputs"/> | |
24 </collection> | |
25 </outputs> | |
26 <tests> | |
27 <test> | |
28 <param name="design" value="summarize_counts_testdata/sample_design_file.tabular" ftype="tsv"/> | |
29 <param name="collection"> | |
30 <collection type="list"> | |
31 <element name="FEATURE_ID" value="summarize_counts_testdata/filtered_ASE_counts_tables_BASE"/> | |
32 </collection> | |
33 </param> | |
34 <output_collection name="split_output" type="list"> | |
35 <element name="FEATURE_ID"> | |
36 <assert_contents> | |
37 <has_text_matching expression="Calculated_priors" /> | |
38 </assert_contents> | |
39 </element> | |
40 </output_collection> | |
41 </test> | |
42 </tests> | |
43 <help><![CDATA[ | |
44 **Tool Description** | |
45 | |
46 The Calculate Prior Probability Estimates tool calculates prior probability estimates for each of the comparates. | |
47 Prior Probability is a part of Bayesian statistical inference, and is useful for ruling out whether alleles aligning better to one parent is due to allelic imbalance, | |
48 or if it is really due to mapping biases or technical errors in the data. | |
49 | |
50 The output file contains computations for three variables, prior_[comparate name]_g1 and prior_[comparate name]_g2, which represent the probability that the feature will align to either parental genome, and prior_[comparate name]_both, which are the chances the feature will align to equally well to both genomes. | |
51 | |
52 | |
53 Calculation:: | |
54 | |
55 prior_{comparate)_both= [# reads aligning equally to both genomes] prior_{comparate)_{G1/G2}= 1-(prior_both) | |
56 ----------------------------------------- -------------- | |
57 [total uniquely mapped reads] 2 | |
58 | |
59 | |
60 | |
61 | |
62 -------------------------------------------------------------------------------------------------------------------- | |
63 | |
64 **Inputs** | |
65 | |
66 **Prior Design File [REQUIRED]** | |
67 | |
68 (1) This tool requires the Priors Design file, which can be created using the *Reformat Sample Design File* tool. | |
69 | |
70 The Priors Design File must be in the following format and contain the correct header order: | |
71 | |
72 1. G1- This is the name of the paternal genome | |
73 2. G2- The name of the maternal genome | |
74 3. comparate - contains comparate conditions. Exclude replicate number. | |
75 | |
76 | |
77 | |
78 Example of input design file:: | |
79 | |
80 +-------+------+-----------+ | |
81 | G1 | G2 | comparate | | |
82 +-------+------+-----------+ | |
83 | W1118 | W55 | W55_M | | |
84 +-------+------+-----------+ | |
85 | W1118 | W55 | W55_M | | |
86 +-------+------+-----------+ | |
87 | |
88 | |
89 **Input for Priors Calculations - Summed ASE Counts Tables [Required]** | |
90 | |
91 The collection of datasets in Summarized and Filtered Format. | |
92 These datasets can be generated by running simulated reads or DNA reads through the 'Align and Count' and 'Summarize Counts' Workflows. | |
93 | |
94 Example Input File:: | |
95 | |
96 FEATURE_ID g1 g2 W55_M_flag_analyze W55_M_num_reps W55_M_g1_total_rep1 W55_M_g2_total_rep1 W55_M_both_total_rep1 W55_M_flag_apn_rep1 W55_M_APN_total_reads_rep1 W55_M_APN_both_rep1 W55_M_g1_total_rep2 W55_M_g2_total_rep2 W55_M_both_total_rep2 W55_M_flag_apn_rep2 W55_M_APN_total_reads_rep2 W55_M_APN_both_rep2 | |
97 l(1)G0196 W1118 W55 1 2 691 519 5020 1 29.073464805232 23.4243873865079 1075 812 7481 1 43.7266913990042 34.9168212437762 | |
98 CG8920 W1118 W55 1 2 29 62 647 1 10.3878993081113 9.10716914470779 38 126 920 1 15.2470189901369 12.9534815250994 | |
99 CG10932 W1118 W55 1 2 163 122 1112 1 89.9299663299663 71.5858585858586 237 134 1881 1 144.974410774411 121.086195286195 | |
100 | |
101 | |
102 | |
103 ---------------------------------------------------------------------------------------------------- | |
104 | |
105 **Tool Outputs** | |
106 | |
107 -One TSV file containing the calculated prior probabilities for reads that align preferentially to G1, G2, and those that align equally to both. | |
108 | |
109 Example Priors Calculation File:: | |
110 | |
111 +--------------+------------------+-----------------+---------------+ | |
112 | FEATURE_ID | prior_W55_M_both | prior_W55_M_g1 |prior_W55_M_g2 | | |
113 +==============+==================+=================+===============+ | |
114 | l(1)G0196 | 0.799907266902 | 0.1183611532625 | 0.08173157983 | | |
115 +--------------+------------------+-----------------+---------------+ | |
116 | CG10932 | 0.853881278538 | 0.0597412480974 | 0.08637747336 | | |
117 +--------------+------------------+-----------------+---------------+ | |
118 | CG8920 | 0.80895522388 | 0.1233830845771 | 0.06766169154 | | |
119 +--------------+------------------+-----------------+---------------+ | |
120 | Mapmodulin | 0.8987341772151 | 0.0855389336401 | 0.01572688914 | | |
121 +--------------+------------------+-----------------+---------------+ | |
122 | |
123 Header descriptions:: | |
124 | |
125 prior_{comparate}_both: prior probability estimate for the total number of reads that mapped equally well to both updated parental genomes | |
126 prior_{comparate}_g1: prior probability estimate for the total number of reads that mapped to updated parental genome 1 | |
127 prior_{comparate}_g2: prior probability estimate for the total number of reads that mapped to updated parental genome 2 | |
128 | |
129 | |
130 ]]></help> | |
131 <citations> | |
132 <citation type="bibtex">@ARTICLE{Miller20BASE, | |
133 author = {Brecca Miller, Alison M. Morse, Elyse Borgert, Zihao Liu, Kelsey Sinclair, Gavin Gamble, Fei Zou, Jeremy Newman, Luis Leon Novello, Fabio Marroni, Lauren M. McIntyre}, | |
134 title = {Testcrosses are an efficient strategy for identifying cis regulatory variation: Bayesian analysis of allele imbalance among conditions (BASE)}, | |
135 journal = {????}, | |
136 year = {submitted for publication} | |
137 }</citation> | |
138 </citations> | |
139 </tool> |