Purpose
Program to chart the distribution of potential sequencing "single nucleotide mutation" artifacts in a SAM or BAM file.
Dataset collections - processing large numbers of datasets at once
This will be added shortly
Inputs, outputs, and parameters
Either a SAM file or a BAM file must be supplied. Galaxy automatically coordinate-sorts all uploaded BAM files.
From Picard documentation( http://broadinstitute.github.io/picard/):
ASSUME_SORTED=Boolean If true (default), then the sort order in the header file will be ignored.
Default: True
CONTEXT_SIZE=integer The number of context bases to include on each side of the assayed base.
CONTEXT_SIZE_TO_PRINT=String If specified, only print results for these contexts in the detail metrics output.
However, the summary metrics output will still take all contexts into consideration.
DB_SNP=text file VCF format dbSNP file, used to exclude regions around known polymorphisms from analysis.
INCLUDE_DUPLICATES=Boolean Include duplicate reads. If set to true then all reads flagged as duplicates will be included as well.
INCLUDE_UNPAIRED=Boolean Include unpaired reads. If set to true then all paired reads will be included as well -
MINIMUM_INSERT_SIZE and MAXIMUM_INSERT_SIZE will be ignored.
MAXIMUM_INSERT_SIZE=Integer The maximum insert size for a read to be included in analysis. Set to 0 to have no maximum.
Default = 600
MINIMUM_INSERT_SIZE=Integer The minimum insert size for a read to be included in analysis. Default = 60
MINIMUM_MAPPING_QUALITY The minimum mapping quality score for a base to be included in analysis. Default = 30
MINIMUM_QUALITY_SCORE The minimum base quality score for a base to be included in analysis. Default = 20
Additional information
Additional information about Picard tools is available from Picard web site at http://broadinstitute.github.io/picard/ .