0
|
1 <tool id="gops_coverage_1" name="Coverage">
|
|
2 <description>of a set of intervals on second set of intervals</description>
|
|
3 <command interpreter="python">gops_coverage.py $input1 $input2 $output -1 ${input1.metadata.chromCol},${input1.metadata.startCol},${input1.metadata.endCol},${input1.metadata.strandCol} -2 ${input2.metadata.chromCol},${input2.metadata.startCol},${input2.metadata.endCol},${input2.metadata.strandCol}</command>
|
|
4 <inputs>
|
|
5 <param format="interval" name="input1" type="data" help="First dataset">
|
|
6 <label>What portion of</label>
|
|
7 </param>
|
|
8 <param format="interval" name="input2" type="data" help="Second dataset">
|
|
9 <label>is covered by</label>
|
|
10 </param>
|
|
11 </inputs>
|
|
12 <outputs>
|
|
13 <data format="interval" name="output" metadata_source="input1" />
|
|
14 </outputs>
|
|
15 <code file="operation_filter.py"/>
|
|
16 <tests>
|
|
17 <test>
|
|
18 <param name="input1" value="1.bed" />
|
|
19 <param name="input2" value="2.bed" />
|
|
20 <output name="output" file="gops_coverage_out.interval" />
|
|
21 </test>
|
|
22 <test>
|
|
23 <param name="input1" value="1.bed" />
|
|
24 <param name="input2" value="2_mod.bed" ftype="interval"/>
|
|
25 <output name="output" file="gops_coverage_out_diffCols.interval" />
|
|
26 </test>
|
|
27 <test>
|
|
28 <param name="input1" value="gops_bigint.interval" />
|
|
29 <param name="input2" value="gops_bigint2.interval" />
|
|
30 <output name="output" file="gops_coverage_out2.interval" />
|
|
31 </test>
|
|
32 </tests>
|
|
33 <help>
|
|
34
|
|
35 .. class:: infomark
|
|
36
|
|
37 **TIP:** If your dataset does not appear in the pulldown menu -> it is not in interval format. Use "edit attributes" to set chromosome, start, end, and strand columns.
|
|
38
|
|
39 Find the coverage of intervals in the first dataset on intervals in the second dataset. The coverage is added as two columns, the first being bases covered, and the second being the fraction of bases covered by that interval.
|
|
40
|
|
41 -----
|
|
42
|
|
43 **Screencasts!**
|
|
44
|
|
45 See Galaxy Interval Operation Screencasts_ (right click to open this link in another window).
|
|
46
|
|
47 .. _Screencasts: http://wiki.g2.bx.psu.edu/Learn/Interval%20Operations
|
|
48
|
|
49 -----
|
|
50
|
|
51 **Example**
|
|
52
|
|
53
|
|
54 if **First dataset** are genes ::
|
|
55
|
|
56 chr11 5203271 5204877 NM_000518 0 -
|
|
57 chr11 5210634 5212434 NM_000519 0 -
|
|
58 chr11 5226077 5227663 NM_000559 0 -
|
|
59 chr11 5226079 5232587 BC020719 0 -
|
|
60 chr11 5230996 5232587 NM_000184 0 -
|
|
61
|
|
62 and **Second dataset** are repeats::
|
|
63
|
|
64 chr11 5203895 5203991 L1MA6 500 +
|
|
65 chr11 5204163 5204239 A-rich 219 +
|
|
66 chr11 5211034 5211167 (CATATA)n 245 +
|
|
67 chr11 5211642 5211673 AT_rich 24 +
|
|
68 chr11 5226551 5226606 (CA)n 303 +
|
|
69 chr11 5228782 5228825 (TTTTTG)n 208 +
|
|
70 chr11 5229045 5229121 L1PA11 440 +
|
|
71 chr11 5229133 5229319 MER41A 1106 +
|
|
72 chr11 5229374 5229485 L2 244 -
|
|
73 chr11 5229751 5230083 MLT1A 913 -
|
|
74 chr11 5231469 5231526 (CA)n 330 +
|
|
75
|
|
76 the Result is the coverage density of repeats in the genes::
|
|
77
|
|
78 chr11 5203271 5204877 NM_000518 0 - 172 0.107098
|
|
79 chr11 5210634 5212434 NM_000519 0 - 164 0.091111
|
|
80 chr11 5226077 5227663 NM_000559 0 - 55 0.034678
|
|
81 chr11 5226079 5232587 BC020719 0 - 860 0.132145
|
|
82 chr11 5230996 5232587 NM_000184 0 - 57 0.035827
|
|
83
|
|
84 For example, the following line of output::
|
|
85
|
|
86 chr11 5203271 5204877 NM_000518 0 - 172 0.107098
|
|
87
|
|
88 implies that 172 nucleotides accounting for 10.7% of the this interval (chr11:5203271-5204877) overlap with repetitive elements.
|
|
89
|
|
90 </help>
|
|
91 </tool> |