annotate mytools/random_interval.xml @ 7:f0dc65e7f6c0

Uploaded
author xuebing
date Fri, 09 Mar 2012 19:59:07 -0500
parents
children
rev   line source
7
f0dc65e7f6c0 Uploaded
xuebing
parents:
diff changeset
1 <tool id="randominterval" name="shuffle intervals">
2 <description>weight chromosome by length</description>
3 <command interpreter="python">random_interval.py $input $output $within $genome </command>
4 <inputs>
5 <param name="input" format="interval" type="data" label="reference interval file to mimic"/>
6 <param name="within" label="randomize within chromosome" help="If checked, each original interval is moved to a random position in the SAME chromosome. By default, it is moved to any chromosome (with probability proportional to chromosome size)." type="boolean" truevalue="within" falsevalue="across" checked="False"/>
7 <param name="genome" type="select" label="Select genome">
8 <option value="/Users/xuebing/galaxy-dist/tool-data/genome/chrsize/mouse.mm9.genome" selected="true">mm9</option>
9 <option value="/Users/xuebing/galaxy-dist/tool-data/genome/chrsize/mouse.mm8.genome">mm8</option>
10 <option value="/Users/xuebing/galaxy-dist/tool-data/genome/chrsize/human.hg18.genome">hg18</option>
11 <option value="/Users/xuebing/galaxy-dist/tool-data/genome/chrsize/human.hg19.genome">hg19</option>
12 </param>
13 </inputs>
14 <outputs>
15 <data format="interval" name="output" />
16 </outputs>
17 <help>
18
19
20 **What it does**
21
22 This tool generates a set of intervals randomly distributed in the genome, mimicking the size distribution of the reference set. The output contains the same number of intervals as the reference set.
23
24
25 **How it works**
26
27 For each interval in the reference set, the script picks a random position in the genome as the new start, and then picks the end such that the random interval has the same size as the original one. By default, the interval can move to any chromosome, with probability proportional to the length of the chromosome. Alternatively, you can have the script pick a random position in the same chromosome, so that each chromosome in the randomized set has the same number of intervals as in the reference set. The size of each chromosome can be either learned from the reference set (chromosome size = max(interval end)) or read from a chromosome size file. When sizes are learned from the reference set, only regions spanned by reference intervals are used to generate random intervals. Regions (possibly an entire chromosome) not covered by the reference set will not appear in the output.
28
29 **Chromosome size file**
30
31 Chromosome size files for hg18, hg19, mm8, and mm9 can be found in 'Shared Data'. To use one of these files, select it and import it into the history; the file will then be listed in the drop-down menu of this tool. You can also make your own chromosome size file, in which each line gives the name and size of one chromosome (tab-delimited):
32
33 chr1 92394392
34
35 chr2 232342342
36
37
38 You can use the following script from the UCSC Genome Browser to download chromosome size files for other genomes:
39
40 http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/fetchChromSizes
41
42
43 </help>
44
45 </tool>
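
The wrapped script random_interval.py is not part of this changeset, so the following is only a minimal sketch of the procedure the help text describes, not the author's implementation. The function names `read_chrom_sizes` and `shuffle_intervals`, the tuple layout `(chrom, start, end)`, and the rule that a shuffled interval must fit inside its chromosome are all assumptions made for illustration.

```python
import bisect
import random
from itertools import accumulate


def read_chrom_sizes(lines):
    """Parse 'chrom<TAB>size' lines into a {chrom: size} dict (hypothetical helper)."""
    sizes = {}
    for line in lines:
        fields = line.split()
        if len(fields) >= 2:
            sizes[fields[0]] = int(fields[1])
    return sizes


def shuffle_intervals(intervals, sizes, within=False, rng=random):
    """Return one random interval per input (chrom, start, end) tuple.

    Every output interval keeps the length of its input interval. With
    within=True the chromosome is kept; otherwise it is drawn with
    probability proportional to chromosome length.
    """
    chroms = sorted(sizes)
    # Running totals of chromosome lengths, e.g. [1000, 4000] for 1000 + 3000.
    cumulative = list(accumulate(sizes[c] for c in chroms))
    shuffled = []
    for chrom, start, end in intervals:
        length = end - start
        if not within:
            # Draw a genome-wide coordinate, then find the chromosome holding it.
            point = rng.randrange(cumulative[-1])
            chrom = chroms[bisect.bisect_right(cumulative, point)]
        # Assumption: the new interval should fit inside the chromosome; an
        # interval longer than the chromosome is simply placed at position 0.
        new_start = rng.randint(0, max(sizes[chrom] - length, 0))
        shuffled.append((chrom, new_start, new_start + length))
    return shuffled
```

For example, `shuffle_intervals([("chr1", 10, 60)], read_chrom_sizes(["chr1\t1000", "chr2\t3000"]))` returns a single 50-base interval on chr1 or chr2, with chr2 three times as likely. Passing `within=True` keeps it on chr1, which corresponds to the `within` / `across` values the tool passes on the command line.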