annotate execute_dwt_cor_aVb_all.xml @ 0:0f2eda4ea8dc draft

Imported from capsule None
author devteam
date Mon, 27 Jan 2014 09:26:52 -0500
parents
children 8564f6927b87
0
0f2eda4ea8dc Imported from capsule None
devteam
parents:
diff changeset
<tool id="compute_p-values_correlation_coefficients_featureA_featureB_occurrences_between_two_datasets_using_discrete_wavelet_transfom" name="Compute P-values and Correlation Coefficients for Occurrences of Two Set of Features" version="1.0.0">
  <description>between two datasets using Discrete Wavelet Transforms</description>

  <command interpreter="perl">
    execute_dwt_cor_aVb_all.pl $inputFile1 $inputFile2 $outputFile1 $outputFile2
  </command>

  <inputs>
    <param format="tabular" name="inputFile1" type="data" label="Select the first input file"/>
    <param format="tabular" name="inputFile2" type="data" label="Select the second input file"/>
  </inputs>

  <outputs>
    <data format="tabular" name="outputFile1"/>
    <data format="pdf" name="outputFile2"/>
  </outputs>

  <help>

.. class:: infomark

**What it does**

This program generates plots and computes a table of correlation coefficients and p-values at multiple scales for the correlation between the occurrences of features in one dataset and their occurrences in another, using multiscale wavelet analysis.

The program assumes that the user has two sets of DNA sequences, S1 and S2, each of which consists of one or more sequences of equal length. Each sequence in each set is divided into the same number of intervals n, such that n = 2^k, where k is a positive integer (k >= 1). Thus, n can be any value in the set {2, 4, 8, 16, 32, 64, 128, ...}. k represents the number of scales.
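
As an illustration of the constraint on n, the following minimal Python sketch checks that an interval count is a power of two and returns the corresponding number of scales k. The function name `num_scales` is hypothetical and is not part of this tool.

```python
def num_scales(n_intervals):
    """Return k such that n_intervals == 2**k with k >= 1.

    Raises ValueError when n_intervals is not in {2, 4, 8, 16, ...}.
    """
    # For an exact power of two, bit_length() - 1 is the exponent.
    k = n_intervals.bit_length() - 1
    if n_intervals >= 2 and 2 ** k == n_intervals:
        return k
    raise ValueError("number of intervals must be 2^k with k >= 1")
```

For the 16-interval example used in this help, `num_scales(16)` returns 4, while a count such as 12 is rejected.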

The program has two input files, obtained as follows:

For a given set of features, say motifs, the user counts the number of occurrences of each feature in each interval of each sequence in S1 and S2, and builds two tabular files representing the counts in each interval of S1 and S2. These are the input files of the program.
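
One way to build such a count table is sketched below in Python. It is only an illustration under simplifying assumptions that this tool does not state: a single sequence, features given as literal strings, overlapping matches counted, and matches that straddle an interval boundary ignored. The names `count_overlapping`, `count_table`, and `to_tabular` are hypothetical.

```python
def count_overlapping(text, motif):
    """Count occurrences of motif in text, allowing overlaps."""
    count = 0
    pos = text.find(motif)
    while pos != -1:
        count += 1
        pos = text.find(motif, pos + 1)
    return count


def count_table(sequence, motifs, n_intervals):
    """Return one row of counts per interval, one column per motif.

    motifs maps a column name to the literal string to search for.
    Trailing bases that do not fill a whole interval are dropped.
    """
    size = len(sequence) // n_intervals
    rows = []
    for i in range(n_intervals):
        chunk = sequence[i * size:(i + 1) * size]
        rows.append([count_overlapping(chunk, m) for m in motifs.values()])
    return rows


def to_tabular(motifs, rows):
    """Format a header line plus one tab-separated line per interval."""
    lines = ["\t".join(motifs)]
    lines.extend("\t".join(str(c) for c in row) for row in rows)
    return "\n".join(lines) + "\n"
```

Calling `count_table` once for S1 and once for S2 with the same motifs and the same number of intervals would give two tables with matching columns and row counts.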

The program gives two output files:

- The first output file is a TABULAR format file representing the correlation coefficients and p-values for each feature at each scale.
- The second output file is a PDF file consisting of as many figures as the number of features, such that each figure represents the values of the correlation coefficients for that feature at every scale.

-----

.. class:: warningmark

**Note**

To obtain empirical p-values, the program runs a random permutation test. As a result, it gives slightly different results each time it is run on the same input files.
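
The run-to-run variation can be seen in a generic permutation test, sketched below in Python. This is not the tool's implementation: the Pearson statistic, the default of 1000 permutations, and the names `pearson` and `permutation_pvalue` are assumptions chosen for illustration.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def permutation_pvalue(x, y, n_perm=1000, rng=None):
    """Two-sided empirical p-value for the correlation of x and y.

    y is shuffled n_perm times; the p-value is the fraction of shuffles
    whose absolute correlation is at least the observed one.
    """
    if rng is None:
        rng = random.Random()  # unseeded: results vary between runs
    observed = abs(pearson(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(pearson(x, shuffled)) >= observed:
            hits += 1
    return hits / n_perm
```

Passing a seeded generator, for example `rng=random.Random(1)`, makes a run repeatable; without a seed, repeated runs on the same data can return slightly different p-values.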

-----

**Example**

Counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S1 gives the following tabular file::

  deletionHoptspot  insertionHoptspot  dnaPolPauseFrameshift  topoisomeraseCleavageSite  translinTarget
  82                162                158                    79                         459
  111               196                154                    75                         459
  98                178                160                    79                         475
  113               201                170                    113                        436
  113               173                147                    95                         446
  107               150                155                    84                         436
  106               166                175                    96                         448
  113               176                135                    106                        514
  113               170                152                    87                         450
  95                152                167                    93                         467
  91                171                169                    118                        426
  84                139                160                    100                        459
  92                154                164                    104                        440
  100               145                154                    98                         472
  91                161                152                    71                         461
  117               164                139                    97                         463

And counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S2 gives the following tabular file::

  deletionHoptspot  insertionHoptspot  dnaPolPauseFrameshift  topoisomeraseCleavageSite  translinTarget
  269               366                330                    238                        1129
  239               328                327                    283                        1188
  254               351                358                    297                        1151
  262               371                355                    256                        1107
  254               361                352                    234                        1192
  265               354                367                    240                        1182
  255               359                333                    235                        1217
  271               389                387                    272                        1241
  240               305                341                    249                        1159
  272               351                337                    257                        1169
  275               351                337                    233                        1158
  305               331                361                    253                        1172
  277               341                343                    253                        1113
  266               362                355                    267                        1162
  235               326                329                    241                        1230
  254               335                360                    251                        1172

We notice that the number of scales here is 4 because 16 = 2^4. Running the program on the above input files gives the following output:

The first output file::

  motif1                     motif2                     1_cor    1_pval   2_cor    2_pval   3_cor    3_pval   4_cor    4_pval

  deletionHoptspot           insertionHoptspot          -0.1     0.346    -0.214   0.338    1        0.127    1        0.467
  deletionHoptspot           dnaPolPauseFrameshift      0.167    0.267    -0.214   0.334    1        0.122    1        0.511
  deletionHoptspot           topoisomeraseCleavageSite  0.167    0.277    0.143    0.412    -0.667   0.243    1        0.521
  deletionHoptspot           translinTarget             0        0.505    0.0714   0.441    1        0.124    1        0.518
  insertionHoptspot          dnaPolPauseFrameshift      -0.202   0.238    0.143    0.379    -1       0.122    1        0.517
  insertionHoptspot          topoisomeraseCleavageSite  -0.0336  0.457    0.214    0.29     0.667    0.252    1        0.503
  insertionHoptspot          translinTarget             0.0672   0.389    0.429    0.186    -1       0.119    1        0.506
  dnaPolPauseFrameshift      topoisomeraseCleavageSite  -0.353   0.101    0.357    0.228    0        0.612    -1       0.49
  dnaPolPauseFrameshift      translinTarget             -0.151   0.303    -0.571   0.09     -0.333   0.37     -1       1
  topoisomeraseCleavageSite  translinTarget             -0.37    0.077    -0.222   0.297    0.667    0.234    -1       0.471

The second output file:

.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_1.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_2.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_3.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_4.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_5.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_6.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_7.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_8.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_9.png
.. image:: ${static_path}/operation_icons/dwt_cor_aVb_all_10.png

  </help>

</tool>