Mercurial > repos > devteam > dwt_cor_ava_perclass
comparison execute_dwt_cor_aVa_perClass.xml @ 2:b87bbe6bc044 draft default tip
"planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tools/dwt_cor_ava_perclass commit f929353ffb0623f2218d7dec459c7da62f3b0d24"
author | devteam |
---|---|
date | Mon, 06 Jul 2020 20:28:54 -0400 |
parents | a0defff5cf89 |
children |
comparison
equal
deleted
inserted
replaced
1:a0defff5cf89 | 2:b87bbe6bc044 |
---|---|
1 <tool id="compute_p-values_correlation_coefficients_feature_occurrences_between_two_datasets_using_discrete_wavelet_transfom" name="Compute P-values and Correlation Coefficients for Feature Occurrences" version="1.0.0"> | 1 <tool id="compute_p-values_correlation_coefficients_feature_occurrences_between_two_datasets_using_discrete_wavelet_transfom" name="Compute P-values and Correlation Coefficients for Feature Occurrences" version="1.0.1"> |
2 <description>between two datasets using Discrete Wavelet Transfoms</description> | 2 <description>between two datasets using Discrete Wavelet Transfoms</description> |
3 | 3 <requirements> |
4 <command interpreter="perl"> | 4 <requirement type="package" version="1.7.5">r-waveslim</requirement> |
5 execute_dwt_cor_aVa_perClass.pl $inputFile1 $inputFile2 $outputFile1 $outputFile2 | 5 <requirement type="package" version="4.6.8">r-wavethresh</requirement> |
6 </requirements> | |
7 <command detect_errors="exit_code"> | |
8 Rscript --vanilla '$__tool_directory__/execute_dwt_cor_aVa_perClass.R' | |
9 '$inputFile1' | |
10 '$inputFile2' | |
11 '$outputFile2' | |
12 '$outputFile1' | |
6 </command> | 13 </command> |
7 | |
8 <inputs> | 14 <inputs> |
9 <param format="tabular" name="inputFile1" type="data" label="Select the first input file"/> | 15 <param format="tabular" name="inputFile1" type="data" label="Select the first input file"/> |
10 <param format="tabular" name="inputFile2" type="data" label="Select the second input file"/> | 16 <param format="tabular" name="inputFile2" type="data" label="Select the second input file"/> |
11 </inputs> | 17 </inputs> |
12 | |
13 <outputs> | 18 <outputs> |
14 <data format="tabular" name="outputFile1"/> | 19 <data format="tabular" name="outputFile1" label="${tool.name} on ${on_string}: statistics"/> |
15 <data format="pdf" name="outputFile2"/> | 20 <data format="pdf" name="outputFile2" label="${tool.name} on ${on_string}: pdf"/> |
16 </outputs> | 21 </outputs> |
17 | 22 <tests> |
18 <help> | 23 <test> |
19 | 24 <param ftype="tabular" name="inputFile1" value="in1.tsv"/> |
25 <param ftype="tabular" name="inputFile2" value="in2.tsv"/> | |
26 <output name="outputFile1" ftype="tabular"> | |
27 <assert_contents><has_line_matching expression="^translinTarget.*"/></assert_contents> | |
28 <assert_contents><has_line_matching expression="^deletionHoptspot.*" /></assert_contents> | |
29 </output> | |
30 <output name="outputFile2" ftype="pdf" file="out2.pdf" compare="sim_size"/> | |
31 </test> | |
32 </tests> | |
33 <help> | |
20 .. class:: infomark | 34 .. class:: infomark |
21 | 35 |
22 **What it does** | 36 **What it does** |
23 | 37 |
24 This program generates plots and computes table matrix of coefficient correlations and p-values at multiple scales for the correlation between the occurrences of features in one dataset and their occurrences in another using multiscale wavelet analysis technique. | 38 This program generates plots and computes table matrix of coefficient correlations and p-values at multiple scales for the correlation between the occurrences of features in one dataset and their occurrences in another using multiscale wavelet analysis technique. |
25 | 39 |
26 The program assumes that the user has two sets of DNA sequences, S1 and S1, each of which consists of one or more sequences of equal length. Each sequence in each set is divided into the same number of multiple intervals n such that n = 2^k, where k is a positive integer and k >= 1. Thus, n could be any value of the set {2, 4, 8, 16, 32, 64, 128, ...}. k represents the number of scales. | 40 The program assumes that the user has two sets of DNA sequences, S1 and S1, each of which consists of one or more sequences of equal length. Each sequence in each set is divided into the same number of multiple intervals n such that n = 2^k, where k is a positive integer and k >= 1. Thus, n could be any value of the set {2, 4, 8, 16, 32, 64, 128, ...}. k represents the number of scales. |
27 | 41 |
28 The program has two input files obtained as follows: | 42 The program has two input files obtained as follows: |
29 | 43 |
30 For a given set of features, say motifs, the user counts the number of occurrences of each feature in each interval of each sequence in S1 and S1, and builds two tabular files representing the count results in each interval of S1 and S1. These are the input files of the program. | 44 For a given set of features, say motifs, the user counts the number of occurrences of each feature in each interval of each sequence in S1 and S1, and builds two tabular files representing the count results in each interval of S1 and S1. These are the input files of the program. |
31 | 45 |
32 The program gives two output files: | 46 The program gives two output files: |
33 | 47 |
34 - The first output file is a TABULAR format file representing the coefficient correlations and p-values for each feature at each scale. | 48 - The first output file is a TABULAR format file representing the coefficient correlations and p-values for each feature at each scale. |
35 - The second output file is a PDF file consisting of as many figures as the number of features, such that each figure represents the values of the coefficient correlation for that feature at every scale. | 49 - The second output file is a PDF file consisting of as many figures as the number of features, such that each figure represents the values of the coefficient correlation for that feature at every scale. |
38 | 52 |
39 .. class:: warningmark | 53 .. class:: warningmark |
40 | 54 |
41 **Note** | 55 **Note** |
42 | 56 |
43 In order to obtain empirical p-values, a random perumtation test is implemented by the program, which results in the fact that the program gives slightly different results each time it is run on the same input file. | 57 In order to obtain empirical p-values, a random perumtation test is implemented by the program, which results in the fact that the program gives slightly different results each time it is run on the same input file. |
44 | 58 |
45 ----- | 59 ----- |
46 | 60 |
47 **Example** | 61 **Example** |
48 | 62 |
84 115 121 136 109 470 | 98 115 121 136 109 470 |
85 113 135 138 101 491 | 99 113 135 138 101 491 |
86 111 150 138 102 451 | 100 111 150 138 102 451 |
87 94 128 151 138 481 | 101 94 128 151 138 481 |
88 | 102 |
89 | 103 |
90 We notice that the number of scales here is 4 because 16 = 2^4. Running the program on the above input files gives the following output: | 104 We notice that the number of scales here is 4 because 16 = 2^4. Running the program on the above input files gives the following output: |
91 | 105 |
92 The first output file:: | 106 The first output file:: |
93 | 107 |
94 motif 1_cor 1_pval 2_cor 2_pval 3_cor 3_pval 4_cor 4_pval | 108 motif 1_cor 1_pval 2_cor 2_pval 3_cor 3_pval 4_cor 4_pval |
105 .. image:: dwt_cor_aVa_2.png | 119 .. image:: dwt_cor_aVa_2.png |
106 .. image:: dwt_cor_aVa_3.png | 120 .. image:: dwt_cor_aVa_3.png |
107 .. image:: dwt_cor_aVa_4.png | 121 .. image:: dwt_cor_aVa_4.png |
108 .. image:: dwt_cor_aVa_5.png | 122 .. image:: dwt_cor_aVa_5.png |
109 | 123 |
110 </help> | 124 </help> |
111 | 125 |
112 </tool> | 126 </tool> |