annotate format_repeat_library.xml @ 12:755a4d643184 draft default tip

planemo upload commit a61591d548f42ff417781e7fe7418dc2901ccc23
author petr-novak
date Tue, 26 Sep 2023 07:28:04 +0000
parents 5366d5ea04bc
children
rev   line source
0
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
1 <tool id="annotate_contigs" name="Format Repeat Library" version="0.1.0" python_template_version="3.5">
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
2 <requirements>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
3 <requirement type="package" version="2.60.0">bioconductor-biostrings</requirement>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
4 </requirements>
11
5366d5ea04bc planemo upload commit 9d1b19f98d8b7f0a0d1baf2da63a373d155626f8-dirty
petr-novak
parents: 0
diff changeset
5 <required_files>
5366d5ea04bc planemo upload commit 9d1b19f98d8b7f0a0d1baf2da63a373d155626f8-dirty
petr-novak
parents: 0
diff changeset
6 <include type="literal" path="annotate_contigs.R"/>
5366d5ea04bc planemo upload commit 9d1b19f98d8b7f0a0d1baf2da63a373d155626f8-dirty
petr-novak
parents: 0
diff changeset
7 </required_files>
0
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
8 <command detect_errors="exit_code"><![CDATA[
11
5366d5ea04bc planemo upload commit 9d1b19f98d8b7f0a0d1baf2da63a373d155626f8-dirty
petr-novak
parents: 0
diff changeset
9 Rscript $__tool_directory__/annotate_contigs.R '$contigs' '$cluster_table' '$annotated_contigs'
0
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
10 ]]></command>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
11 <inputs>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
12 <param type="data" name="contigs" format="fasta" label="Contigs - Library of Repeats from TAREAN/RepeatExplorer pipeline" />
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
13 <param type="data" name="cluster_table" format="txt" label="CLUSTER_TABLE from RepeatExplorer pipeline" help="CLUSTER_TABLE which contains the annotation of clusters from the RepeatExplorer pipeline"/>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
14 </inputs>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
15 <outputs>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
16 <data name="annotated_contigs" format="fasta" label="Annotated Repeat Library based on ${contigs.hid} and ${cluster_table.hid}" />
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
17 </outputs>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
18 <help><![CDATA[
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
19 **What this tool does**
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
20
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
21 Contigs from the RepeatExplorer archive are annotated based on the classification of repeats in the CLUSTER_TABLE.
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
22
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
23 A preformatted CLUSTER_TABLE can be extracted from the RepeatExplorer archive and modified as needed. By default, the "Final_annotation" column is used to append annotations to contigs in the repeat library (matched by cluster ID). If the "Final_annotation" column is incomplete, the "Automatic_annotation" column is used instead.
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
24
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
25 Example of a tab-delimited CLUSTER_TABLE::
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
26
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
27
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
28 "Number_of_reads_in_clusters" 3886
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
29 "Number_of_clusters" 822
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
30 "Number_of_superclusters" 821
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
31 "Number_of_singlets" 6114
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
32 "Number_of_analyzed_reads" 10000
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
33
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
34 "Cluster" "Supercluster" "Size" "Size_adjusted" "Automatic_annotation" "TAREAN_annotation" "Final_annotation"
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
35 1 1 260 260 "All/repeat/satellite" "Putative satellites (low confidence)" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
36 2 2 157 157 "All/repeat/satellite" "Putative satellites (low confidence)" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
37 3 4 100 100 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
38 4 5 83 83 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
39 5 3 77 77 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
40 6 3 65 65 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
41 7 6 61 61 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
42 8 7 58 58 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
43 9 8 53 53 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
44 10 9 53 53 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
45 11 10 51 51 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
46 12 11 45 45 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
47 13 12 44 44 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
48 14 13 44 44 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
49 15 14 39 39 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
50 16 15 37 37 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
51 17 16 30 30 "All/repeat/satellite" "Putative satellites (low confidence)" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
52 18 17 28 28 "All/repeat/satellite" "Putative satellites (low confidence)" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
53 19 18 26 26 "All/repeat/satellite" "Putative satellites (high confidence)" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
54 20 19 23 23 "All/repeat/../CRM" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
55 21 20 21 21 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
56 22 21 21 21 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
57 23 22 21 21 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
58 24 23 21 21 "All" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
59 25 24 20 20 "All/repeat/../Ogre" "Other" ""
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
60
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
61
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
62 Only the Cluster and Automatic_annotation/Final_annotation columns are mandatory.
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
63
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
64 Clusters with numbers higher than those in the CLUSTER_TABLE are removed from the repeat library.
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
65
11
5366d5ea04bc planemo upload commit 9d1b19f98d8b7f0a0d1baf2da63a373d155626f8-dirty
petr-novak
parents: 0
diff changeset
66 Contigs are provided in the following format::
0
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
67
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
68
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
69 >CL25Contig1
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
70 AGATCAAGATGGCGCCGGAGGACATGGAGAAAACGACGTTTATCACTCCCTGGGGAACATTTTGCTACAAGGTAATGCCT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
71 TTCGGTCTGAAGAACGCAGGGGCCACTTACCAACGAGCAATGGTAACTT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
72 >CL1Contig4#All/repeat/satellite
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
73 ACCCGAAGGCCGGCTCAACCCGAAGTTGAGAAGAACATCTGACCTCGCCGTCAGGCATCTGTTAAACAAACAGGCATCGA
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
74 A
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
75 >CL1Contig5
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
76 TGAGAAGAACATCTGACCTCGCCGTCAGGCATCTGTTAAACAAACAGGCATCGAACCCGAAGGCCGGCTCAACCCGAAGT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
77 TGATAAGAACATCTGACCTCGCCGTCAGGCATCTGTTAAACAAACAGGCATCGAACCCGAAGGCCGGCTCAACACGAAGT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
78 TGAGAGGAACATCTGACCTCGCCGTCAGGCATCTGTTAAA
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
79
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
80
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
81 The resulting repeat library will have the following format::
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
82
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
83 >CL25Contig1#All/repeat/mobile_element/Class_I/LTR/Ty3_gypsy/non-chromovirus/OTA/Tat/Ogre
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
84 AGATCAAGATGGCGCCGGAGGACATGGAGAAAACGACGTTTATCACTCCCTGGGGAACATTTTGCTACAAGGTAATGCCT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
85 TTCGGTCTGAAGAACGCAGGGGCCACTTACCAACGAGCAATGGTAACTT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
86 >CL1Contig4#All/repeat/satellite
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
87 ACCCGAAGGCCGGCTCAACCCGAAGTTGAGAAGAACATCTGACCTCGCCGTCAGGCATCTGTTAAACAAACAGGCATCGA
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
88 A
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
89 >CL1Contig5#All/repeat/satellite
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
90 TGAGAAGAACATCTGACCTCGCCGTCAGGCATCTGTTAAACAAACAGGCATCGAACCCGAAGGCCGGCTCAACCCGAAGT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
91 TGATAAGAACATCTGACCTCGCCGTCAGGCATCTGTTAAACAAACAGGCATCGAACCCGAAGGCCGGCTCAACACGAAGT
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
92 TGAGAGGAACATCTGACCTCGCCGTCAGGCATCTGTTAAA
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
93
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
94
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
95
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
96
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
97 ]]></help>
ea6a3059a6af Uploaded
petr-novak
parents:
diff changeset
98 </tool>
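
The annotation step described in the help text can be sketched in Python as follows. This is only an illustration, not the contents of annotate_contigs.R, which is not shown here. The function names `read_cluster_annotations` and `annotate_fasta` are hypothetical, and the sketch makes three assumptions: the fallback to "Automatic_annotation" is applied per row whenever "Final_annotation" is empty, any existing `#annotation` suffix on a contig name is replaced, and contigs whose cluster does not appear in the table are dropped.

```python
import csv
import re

# Matches contig headers such as ">CL25Contig1" and captures the cluster number.
HEADER = re.compile(r"^>CL(\d+)Contig\d+")


def read_cluster_annotations(table_text):
    """Return {cluster_number: annotation} parsed from CLUSTER_TABLE text."""
    lines = table_text.splitlines()
    # Skip the summary block; the column header row starts with "Cluster".
    start = next(
        i for i, line in enumerate(lines)
        if line.replace('"', "").startswith("Cluster\t")
    )
    reader = csv.DictReader(lines[start:], delimiter="\t", quotechar='"')
    annotations = {}
    for row in reader:
        final = (row.get("Final_annotation") or "").strip()
        automatic = (row.get("Automatic_annotation") or "").strip()
        # Assumption: fall back to Automatic_annotation row by row.
        annotations[int(row["Cluster"])] = final or automatic
    return annotations


def annotate_fasta(fasta_text, annotations):
    """Append '#annotation' to contig names; drop clusters not in the table."""
    out = []
    keep = False
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            match = HEADER.match(line)
            cluster = int(match.group(1)) if match else None
            keep = cluster in annotations
            if keep:
                # Assumption: an existing '#...' suffix is replaced.
                name = line.split("#", 1)[0]
                out.append(f"{name}#{annotations[cluster]}")
        elif keep:
            # Sequence lines follow the fate of their header.
            out.append(line)
    return "\n".join(out) + "\n"
```

Calling `annotate_fasta(fasta_text, read_cluster_annotations(table_text))` on the contents of the two input files would produce a library in the format shown in the help text, under the assumptions listed above.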