comparison samtools_view.xml @ 1:6ade12d94f9a draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/samtools/samtools_view commit c7bb87a32997810fd9f28572fc5af628c8c4ffa3
author iuc
date Tue, 25 Sep 2018 04:21:24 -0400
parents 00d1f08bdcdc
children 02afb5ea2fce
comparison
equal deleted inserted replaced
0:00d1f08bdcdc 1:6ade12d94f9a
81 81
82 ## additional reference data 82 ## additional reference data
83 #if $reffa!=None: 83 #if $reffa!=None:
84 -T '$reffa' 84 -T '$reffa'
85 -t '$reffai' 85 -t '$reffai'
86 #else: 86 #end if
87
88 ##currently reference based CRAM is disabled (see https://github.com/galaxyproject/tools-iuc/pull/1963)
89 #if $outtype=='cram':
87 --output-fmt-option no_ref 90 --output-fmt-option no_ref
88 #end if 91 #end if
89 92
90 infile 93 infile
91 94
187 </when> 190 </when>
188 <when value="no"/> 191 <when value="no"/>
189 </conditional> 192 </conditional>
190 <conditional name="addref_cond"> 193 <conditional name="addref_cond">
191 <param name="addref_select" type="select" label="Reference data"> 194 <param name="addref_select" type="select" label="Reference data">
192 <help>Reference data as fasta(.gz). Required for SAM input without @SQ headers and useful/required for writing CRAM output (see help).</help> 195 <help>Reference data as fasta(.gz). Required for SAM input without @SQ headers and useful/required for writing CRAM output (see help).</help>
193 <option value="no" selected="True">No, see help (-output-fmt-option no_ref)</option> 196 <option value="no" selected="True">No, see help (-output-fmt-option no_ref)</option>
194 <option value="history">Use a genome/index from the history (-t/-T)</option> 197 <option value="history">Use a genome/index from the history (-t/-T)</option>
195 <option value="cached">Use a built-in genome (-t/-T)</option> 198 <option value="cached">Use a built-in genome (-t/-T)</option>
196 </param> 199 </param>
197 <when value="no"/> 200 <when value="no"/>
198 <when value="history"> 201 <when value="history">
199 <param name="ref" format="fasta,fasta.gz" optional="true" type="data" label="Reference file"/> 202 <param name="ref" format="fasta,fasta.gz" optional="true" type="data" label="Reference file"/>
200 </when> 203 </when>
396 Inputs of type SAM, BAM, and CRAM are accepted and can be converted to each of these formats (alternatively alignment counts can be computed) by selecting the appropriate "Output type". 399 Inputs of type SAM, BAM, and CRAM are accepted and can be converted to each of these formats (alternatively alignment counts can be computed) by selecting the appropriate "Output type".
397 400
398 .. class:: infomark 401 .. class:: infomark
399 402
400 samtools view allows to specify a reference sequence. This is required for SAM input with missing @SQ headers (which include sequence names, length, md5, etc) and useful (and sometimes necessary) for CRAM input and output. In the following the use of reference sequence in the CRAM format is detailed. 403 samtools view allows to specify a reference sequence. This is required for SAM input with missing @SQ headers (which include sequence names, length, md5, etc) and useful (and sometimes necessary) for CRAM input and output. In the following the use of reference sequence in the CRAM format is detailed.
401 CRAM is (primarily) a reference-based compressed format, i.e. only differences between the stored sequences and the reference are stored. As a consequence the reference that was used to generate the alignemnts is always needed in order to interpret the alignments (a checksum stored in the CRAM file is used to verify that the only the correct sequence can be used), i.e. the CRAM file on its own is not useful per default. This allows for a more space efficient storage compared to BAM. But it is also possible to use CRAM without a reference with the disadvantage that the reference is stored explicitely (as in SAM and BAM). 404 CRAM is (intended as a primarily) a reference-based compressed format, i.e. only differences between the stored sequences and the reference are stored. As a consequence the reference that was used to generate the alignemnts is always needed in order to interpret the alignments (a checksum stored in the CRAM file is used to verify that the only the correct sequence can be used), i.e. the CRAM file on its own is not useful per default. This allows for a more space efficient storage compared to BAM.
402 405 But it is also possible to use CRAM without a reference with the disadvantage that the reference is stored explicitely (as in SAM and BAM).
403 The Galaxy tool allows both possibilities using the "reference data" option: 406
404 407 The Galaxy tool **currently generates only CRAM without reference sequence**.
405 - the default ("no reference") 408
406 - reference data can be chosen from history or built in genomes can be used 409 For reference based CRAM input the correct refernce sequence needs to be specified.
407
408 The reference data required for reading/writing reference based CRAM.
409 410
410 **Filtering alignments** 411 **Filtering alignments**
411 412
412 samtools view allows to filter alignements based on various criteria, i.e. the output will contain only alignemnts matching all criteria (an additional output containing the remaining alignments can be created additionally, see "Output alignments not passing the filter" in "output options"): e.g. by regions (see below), alignment quality (see below), and tags or flags set in the alignments. 413 samtools view allows to filter alignements based on various criteria, i.e. the output will contain only alignemnts matching all criteria (an additional output containing the remaining alignments can be created additionally, see "Output alignments not passing the filter" in "output options"): e.g. by regions (see below), alignment quality (see below), and tags or flags set in the alignments.
413 414