sharplabtool: tools/sr_mapping/bwa_color

annotate tools/sr_mapping/bwa_color_wrapper.xml @ 1:cdcb0ce84a1b

Uploaded

author	xuebing
date	Fri, 09 Mar 2012 19:45:15 -0500
parents	9071e359b9a3
children

rev	line source
0 9071e359b9a3 Uploaded xuebing parents: diff changeset	1 <tool id="bwa_color_wrapper" name="Map with BWA for SOLiD" version="1.0.1">
9071e359b9a3 Uploaded xuebing parents: diff changeset	2 <description></description>
9071e359b9a3 Uploaded xuebing parents: diff changeset	3 <parallelism method="basic"></parallelism>
9071e359b9a3 Uploaded xuebing parents: diff changeset	4 <command interpreter="python">
9071e359b9a3 Uploaded xuebing parents: diff changeset	5 bwa_wrapper.py
9071e359b9a3 Uploaded xuebing parents: diff changeset	6 --threads="4"
9071e359b9a3 Uploaded xuebing parents: diff changeset	7 --color-space
9071e359b9a3 Uploaded xuebing parents: diff changeset	8
9071e359b9a3 Uploaded xuebing parents: diff changeset	9 ## reference source
9071e359b9a3 Uploaded xuebing parents: diff changeset	10 --fileSource=$genomeSource.refGenomeSource
9071e359b9a3 Uploaded xuebing parents: diff changeset	11 #if $genomeSource.refGenomeSource == "history":
9071e359b9a3 Uploaded xuebing parents: diff changeset	12 ##build index on the fly
9071e359b9a3 Uploaded xuebing parents: diff changeset	13 --ref="${genomeSource.ownFile}"
9071e359b9a3 Uploaded xuebing parents: diff changeset	14 --dbkey=$dbkey
9071e359b9a3 Uploaded xuebing parents: diff changeset	15 #else:
9071e359b9a3 Uploaded xuebing parents: diff changeset	16 ##use precomputed indexes
9071e359b9a3 Uploaded xuebing parents: diff changeset	17 --ref="${ filter( lambda x: str( x[0] ) == str( $genomeSource.indices ), $__app__.tool_data_tables[ 'bwa_indexes_color' ].get_fields() )[0][-1] }"
9071e359b9a3 Uploaded xuebing parents: diff changeset	18 --do_not_build_index
9071e359b9a3 Uploaded xuebing parents: diff changeset	19 #end if
9071e359b9a3 Uploaded xuebing parents: diff changeset	20
9071e359b9a3 Uploaded xuebing parents: diff changeset	21 ## input file(s)
9071e359b9a3 Uploaded xuebing parents: diff changeset	22 --input1=$paired.input1
9071e359b9a3 Uploaded xuebing parents: diff changeset	23 #if $paired.sPaired == "paired":
9071e359b9a3 Uploaded xuebing parents: diff changeset	24 --input2=$paired.input2
9071e359b9a3 Uploaded xuebing parents: diff changeset	25 #end if
9071e359b9a3 Uploaded xuebing parents: diff changeset	26
9071e359b9a3 Uploaded xuebing parents: diff changeset	27 ## output file
9071e359b9a3 Uploaded xuebing parents: diff changeset	28 --output=$output
9071e359b9a3 Uploaded xuebing parents: diff changeset	29
9071e359b9a3 Uploaded xuebing parents: diff changeset	30 ## run parameters
9071e359b9a3 Uploaded xuebing parents: diff changeset	31 --genAlignType=$paired.sPaired
9071e359b9a3 Uploaded xuebing parents: diff changeset	32 --params=$params.source_select
9071e359b9a3 Uploaded xuebing parents: diff changeset	33 #if $params.source_select != "pre_set":
9071e359b9a3 Uploaded xuebing parents: diff changeset	34 --maxEditDist=$params.maxEditDist
9071e359b9a3 Uploaded xuebing parents: diff changeset	35 --fracMissingAligns=$params.fracMissingAligns
9071e359b9a3 Uploaded xuebing parents: diff changeset	36 --maxGapOpens=$params.maxGapOpens
9071e359b9a3 Uploaded xuebing parents: diff changeset	37 --maxGapExtens=$params.maxGapExtens
9071e359b9a3 Uploaded xuebing parents: diff changeset	38 --disallowLongDel=$params.disallowLongDel
9071e359b9a3 Uploaded xuebing parents: diff changeset	39 --disallowIndel=$params.disallowIndel
9071e359b9a3 Uploaded xuebing parents: diff changeset	40 --seed=$params.seed
9071e359b9a3 Uploaded xuebing parents: diff changeset	41 --maxEditDistSeed=$params.maxEditDistSeed
9071e359b9a3 Uploaded xuebing parents: diff changeset	42 --mismatchPenalty=$params.mismatchPenalty
9071e359b9a3 Uploaded xuebing parents: diff changeset	43 --gapOpenPenalty=$params.gapOpenPenalty
9071e359b9a3 Uploaded xuebing parents: diff changeset	44 --gapExtensPenalty=$params.gapExtensPenalty
9071e359b9a3 Uploaded xuebing parents: diff changeset	45 --suboptAlign=$params.suboptAlign
9071e359b9a3 Uploaded xuebing parents: diff changeset	46 --noIterSearch=$params.noIterSearch
9071e359b9a3 Uploaded xuebing parents: diff changeset	47 --outputTopN=$params.outputTopN
9071e359b9a3 Uploaded xuebing parents: diff changeset	48 --outputTopNDisc=$params.outputTopNDisc
9071e359b9a3 Uploaded xuebing parents: diff changeset	49 --maxInsertSize=$params.maxInsertSize
9071e359b9a3 Uploaded xuebing parents: diff changeset	50 --maxOccurPairing=$params.maxOccurPairing
9071e359b9a3 Uploaded xuebing parents: diff changeset	51 #if $params.readGroup.specReadGroup == "yes"
9071e359b9a3 Uploaded xuebing parents: diff changeset	52 --rgid="$params.readGroup.rgid"
9071e359b9a3 Uploaded xuebing parents: diff changeset	53 --rgcn="$params.readGroup.rgcn"
9071e359b9a3 Uploaded xuebing parents: diff changeset	54 --rgds="$params.readGroup.rgds"
9071e359b9a3 Uploaded xuebing parents: diff changeset	55 --rgdt="$params.readGroup.rgdt"
9071e359b9a3 Uploaded xuebing parents: diff changeset	56 --rgfo="$params.readGroup.rgfo"
9071e359b9a3 Uploaded xuebing parents: diff changeset	57 --rgks="$params.readGroup.rgks"
9071e359b9a3 Uploaded xuebing parents: diff changeset	58 --rglb="$params.readGroup.rglb"
9071e359b9a3 Uploaded xuebing parents: diff changeset	59 --rgpg="$params.readGroup.rgpg"
9071e359b9a3 Uploaded xuebing parents: diff changeset	60 --rgpi="$params.readGroup.rgpi"
9071e359b9a3 Uploaded xuebing parents: diff changeset	61 --rgpl="$params.readGroup.rgpl"
9071e359b9a3 Uploaded xuebing parents: diff changeset	62 --rgpu="$params.readGroup.rgpu"
9071e359b9a3 Uploaded xuebing parents: diff changeset	63 --rgsm="$params.readGroup.rgsm"
9071e359b9a3 Uploaded xuebing parents: diff changeset	64 #end if
9071e359b9a3 Uploaded xuebing parents: diff changeset	65 #end if
9071e359b9a3 Uploaded xuebing parents: diff changeset	66
9071e359b9a3 Uploaded xuebing parents: diff changeset	67 ## suppress output SAM header
9071e359b9a3 Uploaded xuebing parents: diff changeset	68 --suppressHeader=$suppressHeader
9071e359b9a3 Uploaded xuebing parents: diff changeset	69 </command>
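<!--
A minimal sketch, in Python 3, of the lookup that the "use precomputed indexes" branch of the command template performs: it scans the rows of the bwa_indexes_color data table for the entry whose first column equals the selected value and takes the last column of that row as the index path. The function name, the column meanings and the sample rows are hypothetical; they only illustrate the row layout the template expression appears to rely on.

```python
def lookup_index_path(rows, selected):
    """Return the last field of the first row whose first field matches `selected`."""
    for row in rows:
        if str(row[0]) == str(selected):
            return row[-1]
    raise LookupError("no index entry for %r" % (selected,))


# Hypothetical rows, assumed layout: (value, dbkey, display name, path)
example_rows = [
    ("equCab2chrM", "equCab2", "Horse chrM", "/indexes/equCab2chrM_cs.fa"),
    ("hg19chrX_midpart", "hg19", "Human chrX midpart", "/indexes/hg19chrX_midpart_cs.fa"),
]
print(lookup_index_path(example_rows, "equCab2chrM"))
```

The template's filter(...)[0] idiom depends on Python 2, where filter returns a list. The loop above also runs on Python 3 and reports a missing entry explicitly.
-->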
9071e359b9a3 Uploaded xuebing parents: diff changeset	70 <requirements>
9071e359b9a3 Uploaded xuebing parents: diff changeset	71 <requirement type="package">bwa</requirement>
9071e359b9a3 Uploaded xuebing parents: diff changeset	72 </requirements>
9071e359b9a3 Uploaded xuebing parents: diff changeset	73 <inputs>
9071e359b9a3 Uploaded xuebing parents: diff changeset	74 <conditional name="genomeSource">
9071e359b9a3 Uploaded xuebing parents: diff changeset	75 <param name="refGenomeSource" type="select" label="Will you select a reference genome from your history or use a built-in index?">
9071e359b9a3 Uploaded xuebing parents: diff changeset	76 <option value="indexed">Use a built-in index</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	77 <option value="history">Use one from the history</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	78 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	79 <when value="indexed">
9071e359b9a3 Uploaded xuebing parents: diff changeset	80 <param name="indices" type="select" label="Select a reference genome">
9071e359b9a3 Uploaded xuebing parents: diff changeset	81 <options from_data_table="bwa_indexes_color">
9071e359b9a3 Uploaded xuebing parents: diff changeset	82 <filter type="sort_by" column="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	83 <validator type="no_options" message="No indexes are available for the selected input dataset" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	84 </options>
9071e359b9a3 Uploaded xuebing parents: diff changeset	85 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	86 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	87 <when value="history">
9071e359b9a3 Uploaded xuebing parents: diff changeset	88 <param name="ownFile" type="data" format="fasta" metadata_name="dbkey" label="Select a reference from history" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	89 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	90 </conditional>
9071e359b9a3 Uploaded xuebing parents: diff changeset	91 <conditional name="paired">
9071e359b9a3 Uploaded xuebing parents: diff changeset	92 <param name="sPaired" type="select" label="Is this library mate-paired?">
9071e359b9a3 Uploaded xuebing parents: diff changeset	93 <option value="single">Single-end</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	94 <option value="paired">Paired-end</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	95 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	96 <when value="single">
9071e359b9a3 Uploaded xuebing parents: diff changeset	97 <param name="input1" type="data" format="fastqcssanger" label="FASTQ file (Nucleotide-space recoded from color-space)">
9071e359b9a3 Uploaded xuebing parents: diff changeset	98 <help>Convert color-space data to nucleotide-space (see help section below for steps). Must have Sanger-scaled quality values with ASCII offset 33</help>
9071e359b9a3 Uploaded xuebing parents: diff changeset	99 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	100 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	101 <when value="paired">
9071e359b9a3 Uploaded xuebing parents: diff changeset	102 <param name="input1" type="data" format="fastqcssanger" label="Forward FASTQ file (Nucleotide-space recoded from color-space)">
9071e359b9a3 Uploaded xuebing parents: diff changeset	103 <help>Convert color-space data to nucleotide-space (see help section below for steps). Must have Sanger-scaled quality values with ASCII offset 33</help>
9071e359b9a3 Uploaded xuebing parents: diff changeset	104 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	105 <param name="input2" type="data" format="fastqcssanger" label="Reverse FASTQ file (Nucleotide-space recoded from color-space)">
9071e359b9a3 Uploaded xuebing parents: diff changeset	106 <help>Convert color-space data to nucleotide-space (see help section below for steps). Must have Sanger-scaled quality values with ASCII offset 33</help>
9071e359b9a3 Uploaded xuebing parents: diff changeset	107 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	108 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	109 </conditional>
9071e359b9a3 Uploaded xuebing parents: diff changeset	110 <conditional name="params">
9071e359b9a3 Uploaded xuebing parents: diff changeset	111 <param name="source_select" type="select" label="BWA settings to use" help="For most mapping needs use Commonly Used settings. If you want full control use Full Parameter List">
9071e359b9a3 Uploaded xuebing parents: diff changeset	112 <option value="pre_set">Commonly Used</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	113 <option value="full">Full Parameter List</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	114 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	115 <when value="pre_set" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	116 <when value="full">
9071e359b9a3 Uploaded xuebing parents: diff changeset	117 <param name="maxEditDist" type="integer" value="0" label="Maximum edit distance (aln -n)" help="Enter this value OR a fraction of missing alignments, not both" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	118 <param name="fracMissingAligns" type="float" value="0.04" label="Fraction of missing alignments given 2% uniform base error rate (aln -n)" help="Enter this value OR maximum edit distance, not both" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	119 <param name="maxGapOpens" type="integer" value="1" label="Maximum number of gap opens (aln -o)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	120 <param name="maxGapExtens" type="integer" value="-1" label="Maximum number of gap extensions (aln -e)" help="-1 for k-difference mode (disallowing long gaps)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	121 <param name="disallowLongDel" type="integer" value="16" label="Disallow long deletion within [value] bp towards the 3'-end (aln -d)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	122 <param name="disallowIndel" type="integer" value="5" label="Disallow insertion/deletion within [value] bp towards the end (aln -i)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	123 <param name="seed" type="integer" value="-1" label="Number of first subsequences to take as seed (aln -l)" help="Enter -1 for infinity" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	124 <param name="maxEditDistSeed" type="integer" value="2" label="Maximum edit distance in the seed (aln -k)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	125 <param name="mismatchPenalty" type="integer" value="3" label="Mismatch penalty (aln -M)" help="BWA will not search for suboptimal hits with a score lower than (bestScore - [value])" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	126 <param name="gapOpenPenalty" type="integer" value="11" label="Gap open penalty (aln -O)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	127 <param name="gapExtensPenalty" type="integer" value="4" label="Gap extension penalty (aln -E)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	128 <param name="suboptAlign" type="boolean" truevalue="true" falsevalue="false" checked="no" label="Proceed with suboptimal alignments even if the top hit is a repeat (aln -R)" help="For paired-end reads only. By default, BWA only searches for suboptimal alignments if the top hit is unique. Using this option has no effect on accuracy for single-end reads. It is mainly designed for improving the alignment accuracy of paired-end reads. However, the pairing procedure will be slowed down, especially for very short reads (~32bp)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	129 <param name="noIterSearch" type="boolean" truevalue="true" falsevalue="false" checked="no" label="Disable iterative search (aln -N)" help="All hits with no more than maxDiff differences will be found. This mode is much slower than the default" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	130 <param name="outputTopN" type="integer" value="3" label="Maximum number of alignments to output in the XA tag for reads paired properly (samse/sampe -n)" help="If a read has more than INT hits, the XA tag will not be written" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	131 <param name="outputTopNDisc" type="integer" value="10" label="Maximum number of alignments to output in the XA tag for discordant read pairs (excluding singletons) (sampe -N)" help="For paired-end reads only. If a read has more than INT hits, the XA tag will not be written" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	132 <param name="maxInsertSize" type="integer" value="500" label="Maximum insert size for a read pair to be considered as being mapped properly (sampe -a)" help="For paired-end reads only. Only used when there are not enough good alignments to infer the distribution of insert sizes" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	133 <param name="maxOccurPairing" type="integer" value="100000" label="Maximum occurrences of a read for pairing (sampe -o)" help="For paired-end reads only. A read with more occurrences will be treated as a single-end read. Reducing this parameter speeds up pairing" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	134 <conditional name="readGroup">
9071e359b9a3 Uploaded xuebing parents: diff changeset	135 <param name="specReadGroup" type="select" label="Specify the read group for this file? (samse/sampe -r)">
9071e359b9a3 Uploaded xuebing parents: diff changeset	136 <option value="yes">Yes</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	137 <option value="no" selected="True">No</option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	138 </param>
9071e359b9a3 Uploaded xuebing parents: diff changeset	139 <when value="yes">
9071e359b9a3 Uploaded xuebing parents: diff changeset	140 <param name="rgid" type="text" size="25" label="Read group identifier (ID). Each @RG line must have a unique ID. The value of ID is used in the RG
9071e359b9a3 Uploaded xuebing parents: diff changeset	141 tags of alignment records. Must be unique among all read groups in header section." help="Required if RG specified. Read group
9071e359b9a3 Uploaded xuebing parents: diff changeset	142 IDs may be modified when merging SAM files in order to handle collisions." />
9071e359b9a3 Uploaded xuebing parents: diff changeset	143 <param name="rgcn" type="text" size="25" label="Sequencing center that produced the read (CN)" help="Optional" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	144 <param name="rgds" type="text" size="25" label="Description (DS)" help="Optional" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	145 <param name="rgdt" type="text" size="25" label="Date that run was produced (DT)" help="Optional. ISO8601 format date or date/time, like YYYY-MM-DD" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	146 <param name="rgfo" type="text" size="25" label="Flow order (FO). The array of nucleotide bases that correspond to the nucleotides used for each
9071e359b9a3 Uploaded xuebing parents: diff changeset	147 flow of each read." help="Optional. Multi-base flows are encoded in IUPAC format, and non-nucleotide flows by
9071e359b9a3 Uploaded xuebing parents: diff changeset	148 various other characters. Format : /\*\|[ACMGRSVTWYHKDBN]+/" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	149 <param name="rgks" type="text" size="25" label="The array of nucleotide bases that correspond to the key sequence of each read (KS)" help="Optional" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	150 <param name="rglb" type="text" size="25" label="Library name (LB)" help="Required if RG specified" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	151 <param name="rgpg" type="text" size="25" label="Programs used for processing the read group (PG)" help="Optional" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	152 <param name="rgpi" type="text" size="25" label="Predicted median insert size (PI)" help="Optional" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	153 <param name="rgpl" type="text" size="25" label="Platform/technology used to produce the reads (PL)" help="Required if RG specified. Valid values : CAPILLARY, LS454, ILLUMINA,
9071e359b9a3 Uploaded xuebing parents: diff changeset	154 SOLID, HELICOS, IONTORRENT and PACBIO" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	155 <param name="rgpu" type="text" size="25" label="Platform unit (PU)" help="Optional. Unique identifier (e.g. flowcell-barcode.lane for Illumina or slide for SOLiD)" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	156 <param name="rgsm" type="text" size="25" label="Sample (SM)" help="Required if RG specified. Use pool name where a pool is being sequenced" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	157 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	158 <when value="no" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	159 </conditional>
9071e359b9a3 Uploaded xuebing parents: diff changeset	160 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	161 </conditional>
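<!--
The help texts for maxEditDist and fracMissingAligns ask for one value or the other because both map to the single "aln -n" option. The Python sketch here shows one way a wrapper could resolve the pair. It assumes that a maxEditDist of 0 means "not set", which is consistent with the full-parameter single-end test in this file (maxEditDist 0, fracMissingAligns 0.04, documented command "bwa aln -n 0.04"). The code of bwa_wrapper.py is not shown in this listing and may handle this differently; the function name is hypothetical.

```python
def aln_n_value(max_edit_dist, frac_missing_aligns):
    """Pick the value for `bwa aln -n`: the integer edit distance if set, else the fraction."""
    if int(max_edit_dist) > 0:
        # An explicit edit distance takes precedence (assumption).
        return str(int(max_edit_dist))
    # Otherwise fall back to the fraction of missing alignments.
    return str(float(frac_missing_aligns))


print(aln_n_value(0, 0.04))
print(aln_n_value(3, 0.04))
```
-->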
9071e359b9a3 Uploaded xuebing parents: diff changeset	162 <param name="suppressHeader" type="boolean" truevalue="true" falsevalue="false" checked="False" label="Suppress the header in the output SAM file" help="BWA produces SAM with several lines of header information" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	163 </inputs>
9071e359b9a3 Uploaded xuebing parents: diff changeset	164 <outputs>
9071e359b9a3 Uploaded xuebing parents: diff changeset	165 <data format="sam" name="output" label="${tool.name} on ${on_string}: mapped reads">
9071e359b9a3 Uploaded xuebing parents: diff changeset	166 <actions>
9071e359b9a3 Uploaded xuebing parents: diff changeset	167 <conditional name="genomeSource.refGenomeSource">
9071e359b9a3 Uploaded xuebing parents: diff changeset	168 <when value="indexed">
9071e359b9a3 Uploaded xuebing parents: diff changeset	169 <action type="metadata" name="dbkey">
9071e359b9a3 Uploaded xuebing parents: diff changeset	170 <option type="from_data_table" name="bwa_indexes_color" column="1">
9071e359b9a3 Uploaded xuebing parents: diff changeset	171 <filter type="param_value" column="0" value="#" compare="startswith" keep="False"/>
9071e359b9a3 Uploaded xuebing parents: diff changeset	172 <filter type="param_value" ref="genomeSource.indices" column="0" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	173 </option>
9071e359b9a3 Uploaded xuebing parents: diff changeset	174 </action>
9071e359b9a3 Uploaded xuebing parents: diff changeset	175 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	176 <when value="history">
9071e359b9a3 Uploaded xuebing parents: diff changeset	177 <action type="metadata" name="dbkey">
9071e359b9a3 Uploaded xuebing parents: diff changeset	178 <option type="from_param" name="genomeSource.ownFile" param_attribute="dbkey" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	179 </action>
9071e359b9a3 Uploaded xuebing parents: diff changeset	180 </when>
9071e359b9a3 Uploaded xuebing parents: diff changeset	181 </conditional>
9071e359b9a3 Uploaded xuebing parents: diff changeset	182 </actions>
9071e359b9a3 Uploaded xuebing parents: diff changeset	183 </data>
9071e359b9a3 Uploaded xuebing parents: diff changeset	184 </outputs>
9071e359b9a3 Uploaded xuebing parents: diff changeset	185 <tests>
9071e359b9a3 Uploaded xuebing parents: diff changeset	186 <test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	187 <!--
9071e359b9a3 Uploaded xuebing parents: diff changeset	188 BWA commands:
9071e359b9a3 Uploaded xuebing parents: diff changeset	189 cp test-data/hg19chrX_midpart.fasta hg19chrX_midpart.fasta
9071e359b9a3 Uploaded xuebing parents: diff changeset	190 bwa index -c -a is hg19chrX_midpart.fasta
9071e359b9a3 Uploaded xuebing parents: diff changeset	191 bwa aln -t 4 -c hg19chrX_midpart.fasta test-data/bwa_wrapper_in4.fastqcssanger > bwa_wrapper_out4.sai
9071e359b9a3 Uploaded xuebing parents: diff changeset	192 bwa samse hg19chrX_midpart.fasta bwa_wrapper_out4.sai test-data/bwa_wrapper_in4.fastqcssanger > bwa_wrapper_out4.u.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	193 hg19chrX_midpart.fasta is the prefix for the reference files (hg19chrX_midpart.fasta.amb, hg19chrX_midpart.fasta.ann, hg19chrX_midpart.fasta.bwt, ...)
9071e359b9a3 Uploaded xuebing parents: diff changeset	194 It's just part of hg19 chrX, from the middle of the chromosome
9071e359b9a3 Uploaded xuebing parents: diff changeset	195 plain old sort doesn't handle underscores like python:
9071e359b9a3 Uploaded xuebing parents: diff changeset	196 python -c "import sys; lines=file(sys.argv[1],'rb').readlines(); lines.sort(); file(sys.argv[2],'wb').write(''.join(lines))" bwa_wrapper_out4.u.sam bwa_wrapper_out4.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	197 -->
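<!--
The sort one-liner in the comment above uses the Python 2 built-in file(), which does not exist in Python 3. A minimal Python 3 sketch of the same step: read the SAM lines as bytes, sort them by byte value, and write them back out. The function name is hypothetical.

```python
def sort_lines_bytewise(src_path, dst_path):
    """Sort the lines of src_path by raw byte value and write them to dst_path."""
    with open(src_path, "rb") as src:
        lines = src.readlines()
    lines.sort()  # bytes objects compare byte by byte, independent of locale
    with open(dst_path, "wb") as dst:
        dst.writelines(lines)
```

For the first test this would be called as sort_lines_bytewise("bwa_wrapper_out4.u.sam", "bwa_wrapper_out4.sam"). Sorting bytes places "_" (0x5F) between the uppercase and lowercase letters regardless of locale, whereas a locale-aware command-line sort may order underscores differently.
-->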
9071e359b9a3 Uploaded xuebing parents: diff changeset	198 <param name="refGenomeSource" value="history" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	199 <param name="ownFile" value="hg19chrX_midpart.fasta" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	200 <param name="sPaired" value="single" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	201 <param name="input1" value="bwa_wrapper_in4.fastqcssanger" ftype="fastqcssanger" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	202 <param name="source_select" value="pre_set" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	203 <param name="suppressHeader" value="false" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	204 <output name="output" file="bwa_wrapper_out4.sam" ftype="sam" sort="True" lines_diff="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	205 </test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	206 <test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	207 <!--
9071e359b9a3 Uploaded xuebing parents: diff changeset	208 BWA commands:
9071e359b9a3 Uploaded xuebing parents: diff changeset	209 bwa aln -t 4 -c equCab2chrM_cs.fa test-data/bwa_wrapper_in5.fastqcssanger > bwa_wrapper_out5a.sai
9071e359b9a3 Uploaded xuebing parents: diff changeset	210 bwa aln -t 4 -c equCab2chrM_cs.fa test-data/bwa_wrapper_in6.fastqcssanger > bwa_wrapper_out5b.sai
9071e359b9a3 Uploaded xuebing parents: diff changeset	211 bwa sampe equCab2chrM_cs.fa bwa_wrapper_out5a.sai bwa_wrapper_out5b.sai test-data/bwa_wrapper_in5.fastqcssanger test-data/bwa_wrapper_in6.fastqcssanger > bwa_wrapper_out5.u.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	212 equCab2chrM_cs.fa is the prefix of the index files (equCab2chrM_cs.fa.amb, equCab2chrM_cs.fa.ann, ...)
9071e359b9a3 Uploaded xuebing parents: diff changeset	213 remove the comment lines (beginning with '@') from the resulting sam file
9071e359b9a3 Uploaded xuebing parents: diff changeset	214 plain old sort doesn't handle underscores like python:
9071e359b9a3 Uploaded xuebing parents: diff changeset	215 python -c "import sys; lines=file(sys.argv[1],'rb').readlines(); lines.sort(); file(sys.argv[2],'wb').write(''.join(lines))" bwa_wrapper_out5.u.sam bwa_wrapper_out5.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	216 -->
9071e359b9a3 Uploaded xuebing parents: diff changeset	217 <param name="refGenomeSource" value="indexed" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	218 <param name="indices" value="equCab2chrM" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	219 <param name="sPaired" value="paired" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	220 <param name="input1" value="bwa_wrapper_in5.fastqcssanger" ftype="fastqcssanger" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	221 <param name="input2" value="bwa_wrapper_in6.fastqcssanger" ftype="fastqcssanger" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	222 <param name="source_select" value="pre_set" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	223 <param name="suppressHeader" value="true" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	224 <output name="output" file="bwa_wrapper_out5.sam" ftype="sam" sort="True" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	225 </test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	226 <test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	227 <!--
9071e359b9a3 Uploaded xuebing parents: diff changeset	228 BWA commands:
9071e359b9a3 Uploaded xuebing parents: diff changeset	229 bwa aln -n 0.04 -o 1 -e -1 -d 16 -i 5 -k 2 -t 4 -M 3 -O 11 -E 4 -R -N -c hg19chrX_midpart.fasta test-data/bwa_wrapper_in4.fastqcssanger > bwa_wrapper_out6.sai
9071e359b9a3 Uploaded xuebing parents: diff changeset	230 bwa samse -n 3 -r "@RG\tID:474747\tDS:description\tDT:2011-03-14\tLB:lib-child-1-A\tPI:200\tPL:SOLID\tSM:child-1" hg19chrX_midpart.fasta bwa_wrapper_out6.sai test-data/bwa_wrapper_in4.fastqcssanger > bwa_wrapper_out6.u.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	231 hg19chrX_midpart_cs.fa is the prefix of the index files (hg19chrX_midpart.fa.amb, hg19chrX_midpart.fa.ann, ...)
9071e359b9a3 Uploaded xuebing parents: diff changeset	232 (It's just part of hg19 chrX, from the middle of the chromosome)
9071e359b9a3 Uploaded xuebing parents: diff changeset	233 plain old sort doesn't handle underscores like python:
9071e359b9a3 Uploaded xuebing parents: diff changeset	234 python -c "import sys; lines=file(sys.argv[1],'rb').readlines(); lines.sort(); file(sys.argv[2],'wb').write(''.join(lines))" bwa_wrapper_out6.u.sam bwa_wrapper_out6.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	235 -->
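<!--
The samse command in the comment above passes a read group line that contains only the fields given non-empty values in this test (ID, DS, DT, LB, PI, PL, SM); the empty CN, FO, KS, PG and PU fields are left out. A minimal Python sketch of assembling such a string is given here. It illustrates that observation only and is not the code of bwa_wrapper.py; the function name and the tag tuple are hypothetical.

```python
# Tags in the order the rg* parameters are declared in this tool.
RG_TAGS = ("ID", "CN", "DS", "DT", "FO", "KS", "LB", "PG", "PI", "PL", "PU", "SM")


def build_read_group(values):
    """Join non-empty read group fields into an @RG line with literal backslash-t separators."""
    parts = ["@RG"]
    for tag in RG_TAGS:
        value = values.get(tag, "")
        if value:  # skip fields that were left blank
            parts.append("%s:%s" % (tag, value))
    return "\\t".join(parts)


example = {
    "ID": "474747", "CN": "", "DS": "description", "DT": "2011-03-14",
    "FO": "", "KS": "", "LB": "lib-child-1-A", "PG": "", "PI": "200",
    "PL": "SOLID", "PU": "", "SM": "child-1",
}
print(build_read_group(example))
```
-->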
9071e359b9a3 Uploaded xuebing parents: diff changeset	236 <param name="refGenomeSource" value="indexed" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	237 <param name="indices" value="hg19chrX_midpart" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	238 <param name="sPaired" value="single" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	239 <param name="input1" value="bwa_wrapper_in4.fastqcssanger" ftype="fastqcssanger" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	240 <param name="source_select" value="full" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	241 <param name="maxEditDist" value="0" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	242 <param name="fracMissingAligns" value="0.04" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	243 <param name="maxGapOpens" value="1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	244 <param name="maxGapExtens" value="-1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	245 <param name="disallowLongDel" value="16" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	246 <param name="disallowIndel" value="5" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	247 <param name="seed" value="-1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	248 <param name="maxEditDistSeed" value="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	249 <param name="mismatchPenalty" value="3" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	250 <param name="gapOpenPenalty" value="11" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	251 <param name="gapExtensPenalty" value="4" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	252 <param name="suboptAlign" value="true" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	253 <param name="noIterSearch" value="true" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	254 <param name="outputTopN" value="3" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	255 <param name="outputTopNDisc" value="10" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	256 <param name="maxInsertSize" value="500" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	257 <param name="maxOccurPairing" value="100000" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	258 <param name="specReadGroup" value="yes" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	259 <param name="rgid" value="474747" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	260 <param name="rgcn" value="" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	261 <param name="rgds" value="description" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	262 <param name="rgdt" value="2011-03-14" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	263 <param name="rgfo" value="" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	264 <param name="rgks" value="" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	265 <param name="rglb" value="lib-child-1-A" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	266 <param name="rgpg" value="" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	267 <param name="rgpi" value="200" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	268 <param name="rgpl" value="SOLID" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	269 <param name="rgpu" value="" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	270 <param name="rgsm" value="child-1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	271 <param name="suppressHeader" value="false" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	272 <output name="output" file="bwa_wrapper_out6.sam" ftype="sam" sort="True" lines_diff="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	273 </test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	274 <test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	275 <!--
9071e359b9a3 Uploaded xuebing parents: diff changeset	276 BWA commands:
9071e359b9a3 Uploaded xuebing parents: diff changeset	277 cp test-data/chr_m.fasta chr_m.fasta
9071e359b9a3 Uploaded xuebing parents: diff changeset	278 bwa index -c -a is chr_m.fasta
9071e359b9a3 Uploaded xuebing parents: diff changeset	279 bwa aln -n 0.04 -o 1 -e -1 -d 16 -i 5 -k 2 -t 4 -M 3 -O 11 -E 4 -R -N -c chr_m.fasta test-data/bwa_wrapper_in5.fastqcssanger > bwa_wrapper_out7a.sai
9071e359b9a3 Uploaded xuebing parents: diff changeset	280 bwa aln -n 0.04 -o 1 -e -1 -d 16 -i 5 -k 2 -t 4 -M 3 -O 11 -E 4 -R -N -c chr_m.fasta test-data/bwa_wrapper_in6.fastqcssanger > bwa_wrapper_out7b.sai
9071e359b9a3 Uploaded xuebing parents: diff changeset	281 bwa sampe -a 100 -o 2 -n 3 -N 10 chr_m.fasta bwa_wrapper_out7a.sai bwa_wrapper_out7b.sai test-data/bwa_wrapper_in5.fastqcssanger test-data/bwa_wrapper_in6.fastqcssanger > bwa_wrapper_out7.u.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	282 chr_m.fasta is the prefix of the index files (chr_m.fasta.amb, chr_m.fasta.ann, ...)
9071e359b9a3 Uploaded xuebing parents: diff changeset	283 plain old sort doesn't handle underscores like python:
9071e359b9a3 Uploaded xuebing parents: diff changeset	284 python -c "import sys; lines=open(sys.argv[1],'rb').readlines(); lines.sort(); open(sys.argv[2],'wb').write(b''.join(lines))" bwa_wrapper_out7.u.sam bwa_wrapper_out7.sam
9071e359b9a3 Uploaded xuebing parents: diff changeset	285 -->
9071e359b9a3 Uploaded xuebing parents: diff changeset	286 <param name="refGenomeSource" value="history" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	287 <param name="ownFile" value="chr_m.fasta" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	288 <param name="sPaired" value="paired" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	289 <param name="input1" value="bwa_wrapper_in5.fastqcssanger" ftype="fastqcssanger" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	290 <param name="input2" value="bwa_wrapper_in6.fastqcssanger" ftype="fastqcssanger" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	291 <param name="source_select" value="full" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	292 <param name="maxEditDist" value="0" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	293 <param name="fracMissingAligns" value="0.04" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	294 <param name="maxGapOpens" value="1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	295 <param name="maxGapExtens" value="-1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	296 <param name="disallowLongDel" value="16" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	297 <param name="disallowIndel" value="5" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	298 <param name="seed" value="-1" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	299 <param name="maxEditDistSeed" value="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	300 <param name="mismatchPenalty" value="3" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	301 <param name="gapOpenPenalty" value="11" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	302 <param name="gapExtensPenalty" value="4" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	303 <param name="suboptAlign" value="true" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	304 <param name="noIterSearch" value="true" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	305 <param name="outputTopN" value="3" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	306 <param name="outputTopNDisc" value="10" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	307 <param name="maxInsertSize" value="100" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	308 <param name="maxOccurPairing" value="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	309 <param name="specReadGroup" value="no" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	310 <param name="suppressHeader" value="false" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	311 <output name="output" file="bwa_wrapper_out7.sam" ftype="sam" sort="True" lines_diff="2" />
9071e359b9a3 Uploaded xuebing parents: diff changeset	312 </test>
9071e359b9a3 Uploaded xuebing parents: diff changeset	313 </tests>
9071e359b9a3 Uploaded xuebing parents: diff changeset	314 <help>
9071e359b9a3 Uploaded xuebing parents: diff changeset	315
9071e359b9a3 Uploaded xuebing parents: diff changeset	316 What it does
9071e359b9a3 Uploaded xuebing parents: diff changeset	317
9071e359b9a3 Uploaded xuebing parents: diff changeset	318 BWA is a fast, lightweight tool that aligns relatively short sequences (queries) to a large sequence database, such as the human reference genome. It was developed by Heng Li at the Sanger Institute. Li H. and Durbin R. (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics, 25, 1754-60.
9071e359b9a3 Uploaded xuebing parents: diff changeset	319
9071e359b9a3 Uploaded xuebing parents: diff changeset	320 ------
9071e359b9a3 Uploaded xuebing parents: diff changeset	321
9071e359b9a3 Uploaded xuebing parents: diff changeset	322 Know what you are doing
9071e359b9a3 Uploaded xuebing parents: diff changeset	323
9071e359b9a3 Uploaded xuebing parents: diff changeset	324 .. class:: warningmark
9071e359b9a3 Uploaded xuebing parents: diff changeset	325
9071e359b9a3 Uploaded xuebing parents: diff changeset	326 There is no such thing (yet) as an automated gearshift in short read mapping. It is all like stick-shift driving in San Francisco. In other words, running this tool with default parameters will probably not give you meaningful results. A way to deal with this is to understand the parameters by carefully reading the `documentation`__ and experimenting. Fortunately, Galaxy makes experimenting easy.
9071e359b9a3 Uploaded xuebing parents: diff changeset	327
9071e359b9a3 Uploaded xuebing parents: diff changeset	328 .. __: http://bio-bwa.sourceforge.net/
9071e359b9a3 Uploaded xuebing parents: diff changeset	329
9071e359b9a3 Uploaded xuebing parents: diff changeset	330 ------
9071e359b9a3 Uploaded xuebing parents: diff changeset	331
9071e359b9a3 Uploaded xuebing parents: diff changeset	332 Input formats
9071e359b9a3 Uploaded xuebing parents: diff changeset	333
9071e359b9a3 Uploaded xuebing parents: diff changeset	334 BWA accepts files in Sanger FASTQ format. Use the FASTQ Groomer to prepare your files, set to either FASTQ Sanger or FASTQ Color Space Sanger as appropriate.
9071e359b9a3 Uploaded xuebing parents: diff changeset	335
9071e359b9a3 Uploaded xuebing parents: diff changeset	336 If you have Color Space Sanger, it must be converted to nucleotide-space first. To do this, use the Manipulate FASTQ tool under NGS: QC and manipulation, with the following settings:
9071e359b9a3 Uploaded xuebing parents: diff changeset	337 Manipulate reads on Sequence Content, choosing Change Adapter Base, and having the text box empty.
9071e359b9a3 Uploaded xuebing parents: diff changeset	338 Manipulate reads on Sequence Content, doing a String Translate from "01234." to "ACGTN".
9071e359b9a3 Uploaded xuebing parents: diff changeset	339
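The two Manipulate FASTQ steps above can be pictured with a small Python sketch. It is only an illustration, not the code Galaxy runs: the function name ``double_encode`` is hypothetical, and it assumes that when the target string is shorter than the source string, the last target character is reused, so that both "4" and "." become "N".

```python
def double_encode(color_read, source="01234.", target="ACGTN"):
    """Drop the adapter base, then translate color calls to letters."""
    # Assumption: if target is shorter than source, its last character
    # is reused for the remaining source characters ('.' becomes 'N').
    padded = target + target[-1] * (len(source) - len(target))
    table = str.maketrans(source, padded)
    # The first character of a color-space read is the adapter base.
    return color_read[1:].translate(table)


print(double_encode("T0123.10"))  # ACGTNCA
```

The result is a letter encoding of the color calls, not a real nucleotide sequence.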
9071e359b9a3 Uploaded xuebing parents: diff changeset	340
9071e359b9a3 Uploaded xuebing parents: diff changeset	341 ------
9071e359b9a3 Uploaded xuebing parents: diff changeset	342
9071e359b9a3 Uploaded xuebing parents: diff changeset	343 A Note on Built-in Reference Genomes
9071e359b9a3 Uploaded xuebing parents: diff changeset	344
9071e359b9a3 Uploaded xuebing parents: diff changeset	345 Some genomes have multiple variants. If only one "type" of genome is listed, it is the Full version, which contains everything that came in the original genome data download (possibly with mitochondrial and plasmid DNA added if it wasn't already included). The Full version is available for every genome. Some genomes also come in the Canonical variant, which contains only the "canonical" (well-defined) chromosomes or segments, such as chr1-chr22, chrX, chrY, and chrM for human. Other variations include gender. These come in the canonical form only, so the general Canonical variant is actually Canonical Male and the other is Canonical Female (identical to Male excluding chrY).
9071e359b9a3 Uploaded xuebing parents: diff changeset	346
9071e359b9a3 Uploaded xuebing parents: diff changeset	347 ------
9071e359b9a3 Uploaded xuebing parents: diff changeset	348
9071e359b9a3 Uploaded xuebing parents: diff changeset	349 Outputs
9071e359b9a3 Uploaded xuebing parents: diff changeset	350
9071e359b9a3 Uploaded xuebing parents: diff changeset	351 The output is in SAM format, and has the following columns::
9071e359b9a3 Uploaded xuebing parents: diff changeset	352
9071e359b9a3 Uploaded xuebing parents: diff changeset	353 Column Description
9071e359b9a3 Uploaded xuebing parents: diff changeset	354 -------- --------------------------------------------------------
9071e359b9a3 Uploaded xuebing parents: diff changeset	355 1 QNAME Query (pair) NAME
9071e359b9a3 Uploaded xuebing parents: diff changeset	356 2 FLAG bitwise FLAG
9071e359b9a3 Uploaded xuebing parents: diff changeset	357 3 RNAME Reference sequence NAME
9071e359b9a3 Uploaded xuebing parents: diff changeset	358 4 POS 1-based leftmost POSition/coordinate of clipped sequence
9071e359b9a3 Uploaded xuebing parents: diff changeset	359 5 MAPQ MAPping Quality (Phred-scaled)
9071e359b9a3 Uploaded xuebing parents: diff changeset	360 6 CIGAR extended CIGAR string
9071e359b9a3 Uploaded xuebing parents: diff changeset	361 7 MRNM Mate Reference sequence NaMe ('=' if same as RNAME)
9071e359b9a3 Uploaded xuebing parents: diff changeset	362 8 MPOS 1-based Mate POSition
9071e359b9a3 Uploaded xuebing parents: diff changeset	363 9 ISIZE Inferred insert SIZE
9071e359b9a3 Uploaded xuebing parents: diff changeset	364 10 SEQ query SEQuence on the same strand as the reference
9071e359b9a3 Uploaded xuebing parents: diff changeset	365 11 QUAL query QUALity (ASCII-33 gives the Phred base quality)
9071e359b9a3 Uploaded xuebing parents: diff changeset	366 12 OPT variable OPTional fields in the format TAG:VTYPE:VALUE
9071e359b9a3 Uploaded xuebing parents: diff changeset	367
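As a small illustration of the QUAL column, the sketch below subtracts 33 from each character's ASCII code to recover the Phred base qualities. The function name ``phred_scores`` is hypothetical and is not part of BWA or Galaxy.

```python
def phred_scores(qual):
    """Convert a SAM QUAL string to a list of Phred base qualities."""
    # ASCII code minus 33 gives the Phred score of each base.
    return [ord(ch) - 33 for ch in qual]


print(phred_scores("!5I"))  # [0, 20, 40]
```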
9071e359b9a3 Uploaded xuebing parents: diff changeset	368 The flags are as follows::
9071e359b9a3 Uploaded xuebing parents: diff changeset	369
9071e359b9a3 Uploaded xuebing parents: diff changeset	370 Flag Description
9071e359b9a3 Uploaded xuebing parents: diff changeset	371 ------ -------------------------------------
9071e359b9a3 Uploaded xuebing parents: diff changeset	372 0x0001 the read is paired in sequencing
9071e359b9a3 Uploaded xuebing parents: diff changeset	373 0x0002 the read is mapped in a proper pair
9071e359b9a3 Uploaded xuebing parents: diff changeset	374 0x0004 the query sequence itself is unmapped
9071e359b9a3 Uploaded xuebing parents: diff changeset	375 0x0008 the mate is unmapped
9071e359b9a3 Uploaded xuebing parents: diff changeset	376 0x0010 strand of the query (1 for reverse)
9071e359b9a3 Uploaded xuebing parents: diff changeset	377 0x0020 strand of the mate
9071e359b9a3 Uploaded xuebing parents: diff changeset	378 0x0040 the read is the first read in a pair
9071e359b9a3 Uploaded xuebing parents: diff changeset	379 0x0080 the read is the second read in a pair
9071e359b9a3 Uploaded xuebing parents: diff changeset	380 0x0100 the alignment is not primary
9071e359b9a3 Uploaded xuebing parents: diff changeset	381
9071e359b9a3 Uploaded xuebing parents: diff changeset	382 It looks like this (scroll sideways to see the entire example)::
9071e359b9a3 Uploaded xuebing parents: diff changeset	383
9071e359b9a3 Uploaded xuebing parents: diff changeset	384 QNAME FLAG RNAME POS MAPQ CIGAR MRNM MPOS ISIZE SEQ QUAL OPT
9071e359b9a3 Uploaded xuebing parents: diff changeset	385 HWI-EAS91_1_30788AAXX:1:1:1761:343 4 * 0 0 * * 0 0 AAAAAAANNAAAAAAAAAAAAAAAAAAAAAAAAAAACNNANNGAGTNGNNNNNNNGCTTCCCACAGNNCTGG hhhhhhh;;hhhhhhhhhhh^hOhhhhghhhfhhhgh;;h;;hhhh;h;;;;;;;hhhhhhghhhh;;Phhh
9071e359b9a3 Uploaded xuebing parents: diff changeset	386 HWI-EAS91_1_30788AAXX:1:1:1578:331 4 * 0 0 * * 0 0 GTATAGANNAATAAGAAAAAAAAAAATGAAGACTTTCNNANNTCTGNANNNNNNNTCTTTTTTCAGNNGTAG hhhhhhh;;hhhhhhhhhhhhhhhhhhhhhhhhhhhh;;h;;hhhh;h;;;;;;;hhhhhhhhhhh;;hhVh
9071e359b9a3 Uploaded xuebing parents: diff changeset	387
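To make the column and flag tables concrete, here is a minimal Python sketch that splits one tab-separated SAM line into the twelve columns and lists the flag bits that are set. The helper names ``parse_sam_line`` and ``decode_flag`` and the sample read are made up for illustration; a dedicated SAM library is a better choice for real work.

```python
SAM_FLAGS = [
    (0x0001, "the read is paired in sequencing"),
    (0x0002, "the read is mapped in a proper pair"),
    (0x0004, "the query sequence itself is unmapped"),
    (0x0008, "the mate is unmapped"),
    (0x0010, "strand of the query (1 for reverse)"),
    (0x0020, "strand of the mate"),
    (0x0040, "the read is the first read in a pair"),
    (0x0080, "the read is the second read in a pair"),
    (0x0100, "the alignment is not primary"),
]

COLUMNS = ["QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
           "MRNM", "MPOS", "ISIZE", "SEQ", "QUAL"]


def decode_flag(flag):
    """Return the descriptions of all bits set in a SAM FLAG value."""
    # (flag // bit) % 2 equals 1 exactly when that single bit is set.
    return [text for bit, text in SAM_FLAGS if (flag // bit) % 2 == 1]


def parse_sam_line(line):
    """Split one alignment line into a dict keyed by column name."""
    fields = line.rstrip("\n").split("\t")
    record = dict(zip(COLUMNS, fields[:11]))
    record["OPT"] = fields[11:]  # zero or more TAG:VTYPE:VALUE fields
    for key in ("FLAG", "POS", "MAPQ", "MPOS", "ISIZE"):
        record[key] = int(record[key])
    return record


# A made-up unmapped read, shaped like the example rows above.
record = parse_sam_line("read_1\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII")
print(decode_flag(record["FLAG"]))  # ['the query sequence itself is unmapped']
```

A flag of 4, as in the example rows above, has only the 0x0004 bit set, so the read is unmapped.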
9071e359b9a3 Uploaded xuebing parents: diff changeset	388 -------
9071e359b9a3 Uploaded xuebing parents: diff changeset	389
9071e359b9a3 Uploaded xuebing parents: diff changeset	390 BWA settings
9071e359b9a3 Uploaded xuebing parents: diff changeset	391
9071e359b9a3 Uploaded xuebing parents: diff changeset	392 All of the options have a default value. You can change any of them. All of the options in BWA have been implemented here.
9071e359b9a3 Uploaded xuebing parents: diff changeset	393
9071e359b9a3 Uploaded xuebing parents: diff changeset	394 ------
9071e359b9a3 Uploaded xuebing parents: diff changeset	395
9071e359b9a3 Uploaded xuebing parents: diff changeset	396 BWA parameter list
9071e359b9a3 Uploaded xuebing parents: diff changeset	397
9071e359b9a3 Uploaded xuebing parents: diff changeset	398 This is an exhaustive list of BWA options:
9071e359b9a3 Uploaded xuebing parents: diff changeset	399
9071e359b9a3 Uploaded xuebing parents: diff changeset	400 For aln::
9071e359b9a3 Uploaded xuebing parents: diff changeset	401
9071e359b9a3 Uploaded xuebing parents: diff changeset	402 -n NUM Maximum edit distance if the value is INT, or the fraction of missing
9071e359b9a3 Uploaded xuebing parents: diff changeset	403 alignments given 2% uniform base error rate if FLOAT. In the latter
9071e359b9a3 Uploaded xuebing parents: diff changeset	404 case, the maximum edit distance is automatically chosen for different
9071e359b9a3 Uploaded xuebing parents: diff changeset	405 read lengths. [0.04]
9071e359b9a3 Uploaded xuebing parents: diff changeset	406 -o INT Maximum number of gap opens [1]
9071e359b9a3 Uploaded xuebing parents: diff changeset	407 -e INT Maximum number of gap extensions, -1 for k-difference mode
9071e359b9a3 Uploaded xuebing parents: diff changeset	408 (disallowing long gaps) [-1]
9071e359b9a3 Uploaded xuebing parents: diff changeset	409 -d INT Disallow a long deletion within INT bp towards the 3'-end [16]
9071e359b9a3 Uploaded xuebing parents: diff changeset	410 -i INT Disallow an indel within INT bp towards the ends [5]
9071e359b9a3 Uploaded xuebing parents: diff changeset	411 -l INT Take the first INT subsequence as seed. If INT is larger than the
9071e359b9a3 Uploaded xuebing parents: diff changeset	412 query sequence, seeding will be disabled. For long reads, this option
9071e359b9a3 Uploaded xuebing parents: diff changeset	413 typically ranges from 25 to 35 for '-k 2'. [inf]
9071e359b9a3 Uploaded xuebing parents: diff changeset	414 -k INT Maximum edit distance in the seed [2]
9071e359b9a3 Uploaded xuebing parents: diff changeset	415 -t INT Number of threads (multi-threading mode) [1]
9071e359b9a3 Uploaded xuebing parents: diff changeset	416 -M INT Mismatch penalty. BWA will not search for suboptimal hits with a score
9071e359b9a3 Uploaded xuebing parents: diff changeset	417 lower than (bestScore-misMsc). [3]
9071e359b9a3 Uploaded xuebing parents: diff changeset	418 -O INT Gap open penalty [11]
9071e359b9a3 Uploaded xuebing parents: diff changeset	419 -E INT Gap extension penalty [4]
9071e359b9a3 Uploaded xuebing parents: diff changeset	420 -c Reverse query but not complement it, which is required for alignment
9071e359b9a3 Uploaded xuebing parents: diff changeset	421 in the color space.
9071e359b9a3 Uploaded xuebing parents: diff changeset	422 -R Proceed with suboptimal alignments even if the top hit is a repeat. By
9071e359b9a3 Uploaded xuebing parents: diff changeset	423 default, BWA only searches for suboptimal alignments if the top hit is
9071e359b9a3 Uploaded xuebing parents: diff changeset	424 unique. Using this option has no effect on accuracy for single-end
9071e359b9a3 Uploaded xuebing parents: diff changeset	425 reads. It is mainly designed for improving the alignment accuracy of
9071e359b9a3 Uploaded xuebing parents: diff changeset	426 paired-end reads. However, the pairing procedure will be slowed down,
9071e359b9a3 Uploaded xuebing parents: diff changeset	427 especially for very short reads (~32bp).
9071e359b9a3 Uploaded xuebing parents: diff changeset	428 -N Disable iterative search. All hits with no more than maxDiff
9071e359b9a3 Uploaded xuebing parents: diff changeset	429 differences will be found. This mode is much slower than the default.
9071e359b9a3 Uploaded xuebing parents: diff changeset	430
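One way to read the FLOAT form of -n is as a simple error model: if base errors are independent at a 2% rate, the number of errors in a read is roughly Poisson-distributed, and the maximum edit distance is the smallest value that leaves fewer than the given fraction of reads above it. The sketch below illustrates that reading only. The function name ``max_edit_distance`` is hypothetical and the Poisson model is an assumption, so BWA itself may differ in detail.

```python
import math


def max_edit_distance(read_length, error_rate=0.02, missing_fraction=0.04):
    """Smallest k for which P(errors > k) drops below missing_fraction."""
    lam = read_length * error_rate   # expected number of errors per read
    term = math.exp(-lam)            # Poisson probability of 0 errors
    cumulative = term
    k = 0
    while 1.0 - cumulative >= missing_fraction:
        k += 1
        term *= lam / k              # Poisson probability of k errors
        cumulative += term
    return k


for length in (37, 38, 100):
    print(length, max_edit_distance(length))
```

Under these assumptions a 37 bp read gets a maximum edit distance of 2 and a 38 bp read gets 3, which shows how the threshold grows with read length.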
9071e359b9a3 Uploaded xuebing parents: diff changeset	431 For samse::
9071e359b9a3 Uploaded xuebing parents: diff changeset	432
9071e359b9a3 Uploaded xuebing parents: diff changeset	433 -n INT Maximum number of alignments to output in the XA tag for reads paired
9071e359b9a3 Uploaded xuebing parents: diff changeset	434 properly. If a read has more than INT hits, the XA tag will not be
9071e359b9a3 Uploaded xuebing parents: diff changeset	435 written. [3]
9071e359b9a3 Uploaded xuebing parents: diff changeset	436 -r STR Specify the read group in a format like '@RG\tID:foo\tSM:bar' [null]
9071e359b9a3 Uploaded xuebing parents: diff changeset	437
9071e359b9a3 Uploaded xuebing parents: diff changeset	438 For sampe::
9071e359b9a3 Uploaded xuebing parents: diff changeset	439
9071e359b9a3 Uploaded xuebing parents: diff changeset	440 -a INT Maximum insert size for a read pair to be considered as being mapped
9071e359b9a3 Uploaded xuebing parents: diff changeset	441 properly. Since version 0.4.5, this option is only used when there
9071e359b9a3 Uploaded xuebing parents: diff changeset	442 are not enough good alignments to infer the distribution of insert
9071e359b9a3 Uploaded xuebing parents: diff changeset	443 sizes. [500]
9071e359b9a3 Uploaded xuebing parents: diff changeset	444 -n INT Maximum number of alignments to output in the XA tag for reads paired
9071e359b9a3 Uploaded xuebing parents: diff changeset	445 properly. If a read has more than INT hits, the XA tag will not be
9071e359b9a3 Uploaded xuebing parents: diff changeset	446 written. [3]
9071e359b9a3 Uploaded xuebing parents: diff changeset	447 -N INT Maximum number of alignments to output in the XA tag for discordant
9071e359b9a3 Uploaded xuebing parents: diff changeset	448 read pairs (excluding singletons). If a read has more than INT hits,
9071e359b9a3 Uploaded xuebing parents: diff changeset	449 the XA tag will not be written. [10]
9071e359b9a3 Uploaded xuebing parents: diff changeset	450 -o INT Maximum occurrences of a read for pairing. A read with more
9071e359b9a3 Uploaded xuebing parents: diff changeset	451 occurrences will be treated as a single-end read. Reducing this
9071e359b9a3 Uploaded xuebing parents: diff changeset	452 parameter speeds up pairing. [100000]
9071e359b9a3 Uploaded xuebing parents: diff changeset	453 -r STR Specify the read group in a format like '@RG\tID:foo\tSM:bar' [null]
9071e359b9a3 Uploaded xuebing parents: diff changeset	454
9071e359b9a3 Uploaded xuebing parents: diff changeset	455 For specifying the read group in samse or sampe, use the following::
9071e359b9a3 Uploaded xuebing parents: diff changeset	456
9071e359b9a3 Uploaded xuebing parents: diff changeset	457 @RG Read group. Unordered multiple @RG lines are allowed.
9071e359b9a3 Uploaded xuebing parents: diff changeset	458 ID Read group identifier. Each @RG line must have a unique ID. The value of
9071e359b9a3 Uploaded xuebing parents: diff changeset	459 ID is used in the RG tags of alignment records. Must be unique among all
9071e359b9a3 Uploaded xuebing parents: diff changeset	460 read groups in header section. Read group IDs may be modified when
9071e359b9a3 Uploaded xuebing parents: diff changeset	461 merging SAM files in order to handle collisions.
9071e359b9a3 Uploaded xuebing parents: diff changeset	462 CN Name of sequencing center producing the read.
9071e359b9a3 Uploaded xuebing parents: diff changeset	463 DS Description.
9071e359b9a3 Uploaded xuebing parents: diff changeset	464 DT Date the run was produced (ISO8601 date or date/time).
9071e359b9a3 Uploaded xuebing parents: diff changeset	465 FO Flow order. The array of nucleotide bases that correspond to the
9071e359b9a3 Uploaded xuebing parents: diff changeset	466 nucleotides used for each flow of each read. Multi-base flows are encoded
9071e359b9a3 Uploaded xuebing parents: diff changeset	467 in IUPAC format, and non-nucleotide flows by various other characters.
9071e359b9a3 Uploaded xuebing parents: diff changeset	468 Format : /\*\|[ACMGRSVTWYHKDBN]+/
9071e359b9a3 Uploaded xuebing parents: diff changeset	469 KS The array of nucleotide bases that correspond to the key sequence of each read.
9071e359b9a3 Uploaded xuebing parents: diff changeset	470 LB Library.
9071e359b9a3 Uploaded xuebing parents: diff changeset	471 PG Programs used for processing the read group.
9071e359b9a3 Uploaded xuebing parents: diff changeset	472 PI Predicted median insert size.
9071e359b9a3 Uploaded xuebing parents: diff changeset	473 PL Platform/technology used to produce the reads. Valid values : CAPILLARY,
9071e359b9a3 Uploaded xuebing parents: diff changeset	474 LS454, ILLUMINA, SOLID, HELICOS, IONTORRENT and PACBIO.
9071e359b9a3 Uploaded xuebing parents: diff changeset	475 PU Platform unit (e.g. flowcell-barcode.lane for Illumina or slide for
9071e359b9a3 Uploaded xuebing parents: diff changeset	476 SOLiD). Unique identifier.
9071e359b9a3 Uploaded xuebing parents: diff changeset	477 SM Sample. Use pool name where a pool is being sequenced.
9071e359b9a3 Uploaded xuebing parents: diff changeset	478
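As a sketch of how the tags above combine into the string passed to -r, the Python below joins tab-separated TAG:value fields after @RG and skips empty ones. The helper name ``read_group_header`` is hypothetical, and the values are the ones used in this tool's read group test case.

```python
def read_group_header(rg_id, sample, **optional):
    """Build an '@RG' header line from an ID, a sample and optional tags."""
    fields = ["@RG", "ID:%s" % rg_id, "SM:%s" % sample]
    for tag in sorted(optional):
        value = optional[tag]
        if value:  # tags with empty values are left out
            fields.append("%s:%s" % (tag.upper(), value))
    return "\t".join(fields)


header = read_group_header(
    "474747", "child-1",
    lb="lib-child-1-A", pl="SOLID", pi="200",
    ds="description", dt="2011-03-14", cn="",
)
# Show the tabs as a literal backslash-t, the way they are typed for -r.
print(header.replace("\t", "\\t"))
```

Only ID and SM appear in the -r example above; the other tags are added only when they have a value.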
9071e359b9a3 Uploaded xuebing parents: diff changeset	479 </help>
9071e359b9a3 Uploaded xuebing parents: diff changeset	480 </tool>
9071e359b9a3 Uploaded xuebing parents: diff changeset	481
9071e359b9a3 Uploaded xuebing parents: diff changeset	482
