Purpose
Randomly down-sample a SAM or BAM file to retain a random subset of the reads. Mate pairs are either both kept or both discarded. Reads marked as not primary alignments are all discarded. Each read is retained with probability P; runs on the exact same input, in the same order, and with the same value of RANDOM_SEED will produce the same results.
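The behaviour described above can be illustrated with a minimal sketch. This is not Picard's implementation: the function name downsample, the (name, is_primary) tuple layout, and the use of Python's random module are all assumptions made for illustration. It only shows how a seeded generator and one decision per read name give reproducible results and keep mates together.

```python
import random


def downsample(reads, probability, seed=1):
    """Return the retained reads from `reads`.

    `reads` is an iterable of (name, is_primary) tuples. This layout is
    hypothetical; real SAM/BAM records carry many more fields.
    """
    rng = random.Random(seed)   # fixed seed -> same decisions on same input order
    decisions = {}              # read name -> keep or discard, shared by both mates
    kept = []
    for name, is_primary in reads:
        if not is_primary:
            continue            # non-primary alignments are always discarded
        if name not in decisions:
            # One draw per read name, so mates share the same outcome.
            decisions[name] = rng.random() < probability
        if decisions[name]:
            kept.append((name, is_primary))
    return kept
```

Because the generator is consumed in input order, reordering the input changes which reads are retained even when the seed is unchanged.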
Dataset collections - processing large numbers of datasets at once
This will be added shortly
Inputs, outputs, and parameters
Either a SAM file or a BAM file must be supplied. Galaxy automatically coordinate-sorts all uploaded BAM files.
From the Picard documentation (http://broadinstitute.github.io/picard/):
INPUT=File
I=File
    The input SAM or BAM file to downsample. Required.
OUTPUT=File
O=File
    The output, downsampled, SAM or BAM file to write. Required.
RANDOM_SEED=Long
R=Long
    Random seed to use if reproducibility is desired. Setting to null will cause multiple invocations to produce different results.
PROBABILITY=Double
P=Double
    The probability of keeping any individual read, between 0 and 1.
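As a rough illustration of how these parameters fit together, the sketch below assembles a Picard-style argument list from the four options listed above. The helper name build_downsample_command, the default jar path picard.jar, and the "java -jar ... DownsampleSam" launch form are assumptions about a local installation; Galaxy builds the actual command itself.

```python
def build_downsample_command(input_path, output_path, probability,
                             random_seed=None, picard_jar="picard.jar"):
    """Build an argument list for a hypothetical local Picard install."""
    if not 0.0 <= probability <= 1.0:
        raise ValueError("PROBABILITY must be between 0 and 1")
    # Per the parameter description, a null seed gives non-reproducible runs.
    seed_value = "null" if random_seed is None else str(int(random_seed))
    return [
        "java", "-jar", picard_jar, "DownsampleSam",
        "INPUT=" + input_path,
        "OUTPUT=" + output_path,
        "PROBABILITY=" + str(probability),
        "RANDOM_SEED=" + seed_value,
    ]
```

For example, build_downsample_command("in.bam", "out.bam", 0.1, random_seed=1) yields a list that could be passed to subprocess.run, assuming Java and the Picard jar are available at that path.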
Additional information
Additional information about Picard tools is available from the Picard web site at http://broadinstitute.github.io/picard/.