annotate gencode_partition.xml @ 0:410a2d3c7b6c draft default tip

Uploaded
author test0004
date Thu, 30 Aug 2012 17:03:24 -0400
parents
children
rev   line source
0
410a2d3c7b6c Uploaded
test0004
parents:
diff changeset
1 <tool id="gencode_partition1" name="Gencode Partition">
2 <description>an interval file</description>
3 <command interpreter="python">split_by_partitions.py ${GALAXY_DATA_INDEX_DIR} $input1 $out_file1 ${input1.metadata.chromCol} ${input1.metadata.startCol} ${input1.metadata.endCol} ${input1.metadata.strandCol}</command>
4 <inputs>
5 <param name="input1" type="data" format="interval" label="File to Partition"/>
6 </inputs>
7 <outputs>
8 <data name="out_file1" format="bed"/>
9 </outputs>
10 <tests>
11 <test>
12 <param name="input1" value="encode_1.bed"/>
13 <output name="out_file1" file="gencode_partition_out.bed"/>
14 </test>
15 </tests>
16 <help>
17 For detailed information about partitioning, click here_.
18
19 .. _here: http://genome.imim.es/gencode/wiki/index.php/Collecting_Feature_Sets_from_All_Analysis_Groups
20
21 Datasets are partitioned according to the protocol below:
22
23 The partition scheme, similar to the one previously used with TARs/TRANSFRAGs, classifies any feature into one of the following 7 categories:
24 1. **Coding** -- coding exons defined from the GENCODE experimentally verified coding set (coding in any transcript)
25 2. **5UTR** -- 5' UTR exons defined from the GENCODE experimentally verified coding set (5' UTR in some transcript but never coding in any other)
26 3. **3UTR** -- 3' UTR exons defined from the GENCODE experimentally verified coding set (3' UTR in some transcript but never coding in any other)
27 4. **Intronic Proximal** -- intronic and no more than 5kb away from an exon.
28 5. **Intergenic Proximal** -- between genes and no more than 5kb away from an exon.
29 6. **Intronic Distal** -- intronic and greater than 5kb away from an exon.
30 7. **Intergenic Distal** -- between genes and greater than 5kb away from an exon.
31
32 -----
33
34 .. class:: infomark
35
36 **Note:** A feature that overlaps more than one partition is assigned to the lowest-numbered of those partitions.
37
38 ------
39
40 **Citation**
41
42 If you use this tool, please cite `Blankenberg D, Taylor J, Schenck I, He J, Zhang Y, Ghent M, Veeraraghavan N, Albert I, Miller W, Makova KD, Hardison RC, Nekrutenko A. A framework for collaborative analysis of ENCODE data: making large-scale analyses biologist-friendly. Genome Res. 2007 Jun;17(6):960-4. &lt;http://www.ncbi.nlm.nih.gov/pubmed/17568012&gt;`_
43
44 </help>
45 </tool>
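
The help text says that a feature overlapping several partitions takes the identity of the lower-numbered one. A minimal sketch of that precedence rule follows. It is not the code of split_by_partitions.py, which is not shown here. The names `PARTITION_PRIORITY`, `overlaps`, `classify`, the in-memory layout of the partition regions, and the use of half-open intervals are all assumptions made for illustration.

```python
# Hypothetical sketch of the precedence rule from the help text.
# Partition names are listed in the order the help text numbers them.
PARTITION_PRIORITY = [
    "Coding",
    "5UTR",
    "3UTR",
    "Intronic Proximal",
    "Intergenic Proximal",
    "Intronic Distal",
    "Intergenic Distal",
]


def overlaps(a_start, a_end, b_start, b_end):
    """Return True if half-open intervals [a_start, a_end) and [b_start, b_end) overlap."""
    return a_start < b_end and b_start < a_end


def classify(chrom, start, end, partitions):
    """Return the lowest-numbered partition that the feature overlaps, or None.

    `partitions` is an assumed layout: a dict mapping a partition name to a
    list of (chrom, start, end) regions.
    """
    for name in PARTITION_PRIORITY:
        for p_chrom, p_start, p_end in partitions.get(name, []):
            if p_chrom == chrom and overlaps(start, end, p_start, p_end):
                return name
    return None


# Toy regions, invented for the example only.
toy_partitions = {
    "Coding": [("chr1", 100, 200)],
    "5UTR": [("chr1", 50, 100)],
    "Intergenic Distal": [("chr1", 10000, 20000)],
}

# This feature spans the 5UTR and Coding regions; Coding is listed first, so it wins.
label = classify("chr1", 90, 110, toy_partitions)
```

Walking the partitions in priority order and returning on the first hit keeps the tie-break explicit. A real implementation over whole-genome data would more likely use an interval index than this nested scan.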