annotate tools/regVariation/compute_motif_frequencies_for_all_motifs.xml @ 0:9071e359b9a3

Uploaded
author xuebing
date Fri, 09 Mar 2012 19:37:19 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
1 <tool id="compute_motif_frequencies_for_all_motifs" name="Compute Motif Frequencies For All Motifs" version="1.0.0">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
2 <description>motif by motif</description>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
3
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
4 <command interpreter="perl">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
5 compute_motif_frequencies_for_all_motifs.pl $inputFile1 $inputFile2 $inputWindowSize3 $outputFile1
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
6 </command>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
7
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
8 <inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
9 <param format="tabular" name="inputFile1" type="data" label="Select the motifs file"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
10 <param format="tabular" name="inputFile2" type="data" label="Select the indel flanking sequences windows file"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
11 <param type="integer" name="inputWindowSize3" size="6" value="0" label="What is the number of 10bp windows in which the motif frequencies will be computed?" help="'0' = one window only"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
12 </inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
13
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
14 <outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
15 <data format="tabular" name="outputFile1"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
16 </outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
17
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
18 <tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
19 <test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
20 <param name="inputFile1" value="motifs2.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
21 <param name="inputFile2" value="flankingSequencesWindows10_2.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
22 <param name="inputWindowSize3" value="0" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
23 <output name="outputFile1" file="motifFrequencies_every_indels0.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
24 </test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
25
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
26 <test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
27 <param name="inputFile1" value="motifs2.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
28 <param name="inputFile2" value="flankingSequencesWindows10_2.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
29 <param name="inputWindowSize3" value="4" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
30 <output name="outputFile1" file="motifFrequencies_every_indels4.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
31 </test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
32 </tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
33
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
34 <help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
35
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
36 .. class:: infomark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
37
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
38 **What it does**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
39
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
40 This program computes the frequencies of each motif at a window size, determined by the user, in both upstream and downstream sequences flanking indels in all chromosomes.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
41
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
42 - The first input is a TABULAR format file containing the motif names and sequences, one line per motif, such that the file consists of two columns:
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
43
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
44 - The left column represents the motif names
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
45 - The right column represents the motif sequence, as follows::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
46
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
47 dnaPolPauseFrameshift1 GAG
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
48 dnaPolPauseFrameshift2 ACG
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
49 xSites1 CCG
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
50
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
51 - The second input is a TABULAR format file representing the windows of both upstream and downstream flanking sequences. It consists of multiple left columns representing the windows of the upstream flanking sequences, followed by one column representing the indels, then followed by multiple right columns representing the windows of the downstream flanking sequences, as follows::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
52
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
53 cgaggtcagg agatcgagac catcctggct aacatggtga aatcccgtct ctactaaaaa indel aaatttatat ttataaacaa ttttaataca cctatgttta ttatacattt
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
54 GCCAGTTTAT GGTCTAACAA GGAGAGAAAC AGGGGGCTGA AGGGGTTTCT TAACCTCCAG indel TTCCGGGCTC TGTCCCTAAC CCCCAGCTAG GTAAGTGGCA AAGCACTTCT
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
55 CAGTGGGACC AAGCACTGAA CCACTTTGGG GAGAATCTCA CACTGGGGCC CTCTGACACC indel tatatatttt tttttttttt tttttttttt tttttttttg agatggtgtc
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
56 AGAGCAGCAG CACCCACTTT TGCAGTGTGT GACGTTGGTG GAGCCATCGA AGTCTGTGCT indel GAGCCCTCCC CAGTGCTCCG AGGAGCTGCT GTTCCCCCTG GAGCTCAGAA
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
57
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
58 - The third input is an integer number representing the number of windows to be considered starting from the indel and leftward for the upstream flanking sequence and, starting from the indel and rightward for the downstream flanking sequence.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
59
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
60 - The output is a TABULAR format file consisting of three columns:
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
61
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
62 - The left column represents the motif name
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
63 - The middle column represents the motif frequency in the specified windows of the upstream sequence flanking an indel
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
64 - The right column represents the motif frequency in the specified windows of the downstream sequence flanking an indel
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
65
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
66 There is line per indel in the output file, such that the total number of lines in the output file = number of motifs x number of indels.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
67
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
68 Note: The number of windows entered by the user must be a positive integer >= 1. if negative integer or 0 is entered by the user, the program will consider it as 1.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
69
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
70 </help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
71
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
72 </tool>