annotate tools/picard/rgPicardInsertSize.xml @ 0:9071e359b9a3

Uploaded
author xuebing
date Fri, 09 Mar 2012 19:37:19 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
1 <tool name="Insertion size metrics" id="PicardInsertSize" version="0.3.0">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
2 <description>for PAIRED data</description>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
3 <requirements><requirement type="package">picard</requirement></requirements>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
4 <command interpreter="python">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
5 picard_wrapper.py -i "$input_file" -n "$out_prefix" --tmpdir "${__new_file_path__}" --taillimit "$tailLimit"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
6 --histwidth "$histWidth" --minpct "$minPct"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
7 -j "${GALAXY_DATA_INDEX_DIR}/shared/jars/CollectInsertSizeMetrics.jar" -d "$html_file.files_path" -t "$html_file"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
8 </command>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
9 <inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
10 <param format="bam,sam" name="input_file" type="data" label="SAM/BAM dataset to generate statistics for"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
11 help="If empty, upload or import a SAM/BAM dataset."/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
12 <param name="out_prefix" value="Insertion size metrics" type="text"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
13 label="Title for the output file" help="Use this remind you what the job was for" size="120" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
14 <param name="tailLimit" value="10000" type="integer"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
15 label="Tail limit" size="5"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
16 help="When calculating mean and stdev stop when the bins in the tail of the distribution contain fewer than mode/TAIL_LIMIT items" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
17 <param name="histWidth" value="0" type="integer"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
18 label="Histogram width" size="5"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
19 help="Explicitly sets the histogram width, overriding the TAIL_LIMIT option - leave 0 to ignore" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
20 <param name="minPct" value="0.01" type="float"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
21 label="Minimum percentage" size="5"
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
22 help="Discard any data categories (out of FR, TANDEM, RF) that have fewer than this percentage of overall reads" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
23 </inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
24 <outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
25 <data format="html" name="html_file" label="InsertSize_${out_prefix}.html"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
26 </outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
27 <tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
28 <test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
29 <param name="input_file" value="picard_input_tiny.sam" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
30 <param name="out_prefix" value="Insertion size metrics" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
31 <param name="tailLimit" value="10000" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
32 <param name="histWidth" value="0" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
33 <param name="minPct" value="0.01" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
34 <output name="html_file" file="picard_output_insertsize_tinysam.html" ftype="html" compare="contains" lines_diff="40" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
35 </test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
36 </tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
37 <help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
38
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
39
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
40 .. class:: infomark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
41
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
42 **Purpose**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
43
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
44 Reads a SAM or BAM file and describes the distribution
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
45 of insert size (excluding duplicates) with metrics and a histogram plot.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
46
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
47 **Picard documentation**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
48
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
49 This is a Galaxy wrapper for CollectInsertSizeMetrics, a part of the external package Picard-tools_.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
50
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
51 .. _Picard-tools: http://www.google.com/search?q=picard+samtools
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
52
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
53 .. class:: warningmark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
54
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
55 **Useful for paired data only**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
56
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
57 This tool works for paired data only and can be expected to fail for single end data.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
58
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
59 -----
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
60
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
61 .. class:: infomark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
62
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
63 **Inputs, outputs, and parameters**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
64
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
65 Picard documentation says (reformatted for Galaxy):
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
66
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
67 .. csv-table::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
68 :header-rows: 1
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
69
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
70 Option,Description
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
71 "INPUT=File","SAM or BAM file Required."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
72 "OUTPUT=File","File to write insert size metrics to Required."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
73 "HISTOGRAM_FILE=File","File to write insert size histogram chart to Required."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
74 "TAIL_LIMIT=Integer","When calculating mean and stdev stop when the bins in the tail of the distribution contain fewer than mode/TAIL_LIMIT items. This also limits how much data goes into each data category of the histogram."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
75 "HISTOGRAM_WIDTH=Integer","Explicitly sets the histogram width, overriding the TAIL_LIMIT option. Also, when calculating mean and stdev, only bins LE HISTOGRAM_WIDTH will be included. "
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
76 "MINIMUM_PCT=Float","When generating the histogram, discard any data categories (out of FR, TANDEM, RF) that have fewer than this percentage of overall reads. (Range: 0 to 1) Default value: 0.01."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
77 "STOP_AFTER=Integer","Stop after processing N reads, mainly for debugging. Default value: 0."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
78 "CREATE_MD5_FILE=Boolean","Whether to create an MD5 digest for any BAM files created. Default value: false."
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
79
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
80 .. class:: warningmark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
81
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
82 **Warning on SAM/BAM quality**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
83
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
84 Many SAM/BAM files produced externally and uploaded to Galaxy do not fully conform to SAM/BAM specifications. Galaxy deals with this by using the **LENIENT**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
85 flag when it runs Picard, which allows reads to be discarded if they're empty or don't map. This appears
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
86 to be the only way to deal with SAM/BAM that cannot be parsed.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
87
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
88 </help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
89 </tool>