comparison fsd_regions.xml @ 0:b82fdb006304 draft

planemo upload for repository https://github.com/monikaheinzl/galaxyProject/tree/master/tools/fsd_regions commit 6055f8c5c052f528ff85fb5e0d43b4500830637a
author mheinzl
date Thu, 10 May 2018 07:28:39 -0400
parents
children 9ce2b4089c1b
comparison
equal deleted inserted replaced
-1:000000000000 0:b82fdb006304
1 <?xml version="1.0" encoding="UTF-8"?>
2 <tool id="fsd_regions" name="Duplex Sequencing Analysis:" version="0.0.1">
3 <requirements>
4 <requirement type="package" version="2.7">python</requirement>
5 <requirement type="package" version="1.4">matplotlib</requirement>
6 </requirements>
7 <description>Family size distribution (FSD) of aligned tags to reference genome</description>
8 <command>
9 python2 $__tool_directory__/fsd_regions.py --inputFile "$file1" --inputName1 "$file1.name" --ref_genome "$file2" --sep $separator --output_csv $output_csv --output_pdf $output_pdf
10 </command>
11 <inputs>
12 <param name="file1" type="data" format="tabular" label="Dataset 1: input tags of whole dataset" optional="false" help="Input in tabular format with the family size, tags and the direction of the strand ('ab' or 'ba') for each family."/>
13 <param name="file2" type="data" format="txt" label="Dataset 2: input tags aligned to the reference genome" help="Input in txt format with the regions and the tags, which were aligned to the reference genome."/>
14 <param name="separator" type="text" label="Separator of the CSV file." help="can be a single character" value=","/>
15 </inputs>
16 <outputs>
17 <data name="output_pdf" format="pdf" />
18 <data name="output_csv" format="csv"/>
19 </outputs>
20 <help> <![CDATA[
21
22 **What it does**
23
24 This tool will create a distribution of family sizes of all tags, which were aligned to the reference genome. The distribution is separated after the regions of the reference genome.
25
26
27 **Input**
28
29 This tools expects a tabular file with the tags of all families, their sizes and information about forward (ab) and reverse (ba) strands.
30
31 +-----+----------------------------+----+
32 | 1 | AAAAAAAAAAAATGTTGGAATCTT | ba |
33 +-----+----------------------------+----+
34 | 10 | AAAAAAAAAAAGGCGGTCCACCCC | ab |
35 +-----+----------------------------+----+
36 | 28 | AAAAAAAAAAATGGTATGGACCGA | ab |
37 +-----+----------------------------+----+
38
39 In addition, a TXT file with the regions and all tags that were aligned to the reference genome is required. This file can obtained from a different tool.
40
41 +-----------+------------------------------+
42 | 87_636 | AAATCAAAGTATGAATGAAGTTGCCT |
43 +-----------+------------------------------+
44 | 87_636 | AAATTCATAGCATTAATTTCAACGGG |
45 +-----------+------------------------------+
46 | 656_1143 | GGGGCAGCCATATTGGCAATTATCAT |
47 +-----------+------------------------------+
48
49 **Output**
50
51 The output is a PDF file with the plot and a CSV with the data of the plot.
52
53
54 **About Author**
55
56 Author: Monika Heinzl
57
58 Department: Institute of Bioinformatics, Johannes Kepler University Linz, Austria
59
60 Contact: monika.heinzl@edumail.at
61
62 ]]>
63
64 </help>
65 <citations>
66 <citation type="bibtex">
67 @misc{duplex,
68 author = {Heinzl, Monika},
69 year = {2018},
70 title = {Development of algorithms for the analysis of duplex sequencing data}
71 }
72 </citation>
73 </citations>
74 </tool>