annotate execute_dwt_cor_aVa_perClass.xml @ 0:6708501767b6 draft

Imported from capsule None
author devteam
date Mon, 27 Jan 2014 09:29:25 -0500
parents
children a0defff5cf89
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
1 <tool id="compute_p-values_correlation_coefficients_feature_occurrences_between_two_datasets_using_discrete_wavelet_transfom" name="Compute P-values and Correlation Coefficients for Feature Occurrences" version="1.0.0">
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
2 <description>between two datasets using Discrete Wavelet Transfoms</description>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
3
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
4 <command interpreter="perl">
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
5 execute_dwt_cor_aVa_perClass.pl $inputFile1 $inputFile2 $outputFile1 $outputFile2
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
6 </command>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
7
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
8 <inputs>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
9 <param format="tabular" name="inputFile1" type="data" label="Select the first input file"/>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
10 <param format="tabular" name="inputFile2" type="data" label="Select the second input file"/>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
11 </inputs>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
12
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
13 <outputs>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
14 <data format="tabular" name="outputFile1"/>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
15 <data format="pdf" name="outputFile2"/>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
16 </outputs>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
17
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
18 <help>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
19
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
20 .. class:: infomark
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
21
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
22 **What it does**
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
23
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
24 This program generates plots and computes table matrix of coefficient correlations and p-values at multiple scales for the correlation between the occurrences of features in one dataset and their occurrences in another using multiscale wavelet analysis technique.
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
25
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
26 The program assumes that the user has two sets of DNA sequences, S1 and S1, each of which consists of one or more sequences of equal length. Each sequence in each set is divided into the same number of multiple intervals n such that n = 2^k, where k is a positive integer and k >= 1. Thus, n could be any value of the set {2, 4, 8, 16, 32, 64, 128, ...}. k represents the number of scales.
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
27
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
28 The program has two input files obtained as follows:
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
29
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
30 For a given set of features, say motifs, the user counts the number of occurrences of each feature in each interval of each sequence in S1 and S1, and builds two tabular files representing the count results in each interval of S1 and S1. These are the input files of the program.
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
31
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
32 The program gives two output files:
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
33
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
34 - The first output file is a TABULAR format file representing the coefficient correlations and p-values for each feature at each scale.
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
35 - The second output file is a PDF file consisting of as many figures as the number of features, such that each figure represents the values of the coefficient correlation for that feature at every scale.
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
36
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
37 -----
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
38
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
39 .. class:: warningmark
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
40
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
41 **Note**
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
42
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
43 In order to obtain empirical p-values, a random perumtation test is implemented by the program, which results in the fact that the program gives slightly different results each time it is run on the same input file.
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
44
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
45 -----
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
46
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
47 **Example**
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
48
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
49 Counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S1 gives the following tabular file::
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
50
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
51 deletionHoptspot insertionHoptspot dnaPolPauseFrameshift topoisomeraseCleavageSite translinTarget
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
52 269 366 330 238 1129
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
53 239 328 327 283 1188
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
54 254 351 358 297 1151
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
55 262 371 355 256 1107
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
56 254 361 352 234 1192
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
57 265 354 367 240 1182
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
58 255 359 333 235 1217
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
59 271 389 387 272 1241
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
60 240 305 341 249 1159
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
61 272 351 337 257 1169
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
62 275 351 337 233 1158
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
63 305 331 361 253 1172
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
64 277 341 343 253 1113
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
65 266 362 355 267 1162
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
66 235 326 329 241 1230
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
67 254 335 360 251 1172
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
68
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
69 And counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S2 gives the following tabular file::
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
70
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
71 deletionHoptspot insertionHoptspot dnaPolPauseFrameshift topoisomeraseCleavageSite translinTarget
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
72 104 146 142 113 478
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
73 89 146 151 94 495
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
74 100 176 151 88 435
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
75 96 163 128 114 468
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
76 99 138 144 91 513
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
77 112 126 162 106 468
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
78 86 127 145 83 491
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
79 104 145 171 110 496
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
80 91 121 147 104 469
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
81 103 141 145 98 458
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
82 92 134 142 117 468
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
83 97 146 145 107 471
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
84 115 121 136 109 470
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
85 113 135 138 101 491
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
86 111 150 138 102 451
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
87 94 128 151 138 481
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
88
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
89
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
90 We notice that the number of scales here is 4 because 16 = 2^4. Running the program on the above input files gives the following output:
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
91
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
92 The first output file::
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
93
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
94 motif 1_cor 1_pval 2_cor 2_pval 3_cor 3_pval 4_cor 4_pval
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
95
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
96 deletionHoptspot 0.4 0.072 0.143 0.394 -0.667 0.244 1 0.491
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
97 insertionHoptspot 0.343 0.082 -0.0714 0.446 -1 0.12 1 0.502
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
98 dnaPolPauseFrameshift 0.617 0.004 -0.5 0.13 0.667 0.234 1 0.506
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
99 topoisomeraseCleavageSite -0.183 0.242 -0.286 0.256 0.333 0.353 -1 0.489
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
100 translinTarget 0.0167 0.503 -0.0714 0.469 1 0.136 1 0.485
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
101
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
102 The second output file:
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
103
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
104 .. image:: ${static_path}/operation_icons/dwt_cor_aVa_1.png
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
105 .. image:: ${static_path}/operation_icons/dwt_cor_aVa_2.png
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
106 .. image:: ${static_path}/operation_icons/dwt_cor_aVa_3.png
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
107 .. image:: ${static_path}/operation_icons/dwt_cor_aVa_4.png
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
108 .. image:: ${static_path}/operation_icons/dwt_cor_aVa_5.png
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
109
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
110 </help>
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
111
6708501767b6 Imported from capsule None
devteam
parents:
diff changeset
112 </tool>