Mercurial > repos > devteam > data_manager_hisat_index_builder
comparison data_manager/hisat_index_builder.xml @ 0:ba11fef120cd draft default tip
planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/data_managers/data_manager_hisat_index_builder commit 5a7365750648c26206f05ac7956936c243c2b980
| author | devteam |
|---|---|
| date | Sat, 13 Jun 2015 08:27:38 -0400 |
| parents | |
| children |
comparison
equal
deleted
inserted
replaced
| -1:000000000000 | 0:ba11fef120cd |
|---|---|
| 1 <tool id="hisat_index_builder_data_manager" name="HISAT index" tool_type="manage_data" version="1.0.0"> | |
| 2 <description>builder</description> | |
| 3 <requirements> | |
| 4 <requirement type="package" version="0.1.6">hisat</requirement> | |
| 5 </requirements> | |
| 6 <stdio> | |
| 7 <exit_code range=":-1" /> | |
| 8 <exit_code range="1:" /> | |
| 9 </stdio> | |
| 10 <command interpreter="python">hisat_index_builder.py "${out_file}" --fasta_filename "${all_fasta_source.fields.path}" --fasta_dbkey "${all_fasta_source.fields.dbkey}" --fasta_description "${all_fasta_source.fields.name}" --data_table_name "hisat_indexes"</command> | |
| 11 <inputs> | |
| 12 <param label="Source FASTA Sequence" name="all_fasta_source" type="select"> | |
| 13 <options from_data_table="all_fasta" /> | |
| 14 </param> | |
| 15 <param label="Name of sequence" name="sequence_name" type="text" value="" /> | |
| 16 <param label="ID for sequence" name="sequence_id" type="text" value="" /> | |
| 17 </inputs> | |
| 18 <outputs> | |
| 19 <data format="data_manager_json" name="out_file" /> | |
| 20 </outputs> | |
| 21 <help> | |
| 22 <![CDATA[ | |
| 23 .. class:: infomark | |
| 24 | |
| 25 **Notice:** If you leave name, description, or id blank, it will be generated automatically. | |
| 26 | |
| 27 What is HISAT? | |
| 28 -------------- | |
| 29 | |
| 30 `HISAT <http://ccb.jhu.edu/software/hisat>`__ is a fast and sensitive | |
| 31 spliced alignment program. As part of HISAT, we have developed a new | |
| 32 indexing scheme based on the Burrows-Wheeler transform | |
| 33 (`BWT <http://en.wikipedia.org/wiki/Burrows-Wheeler_transform>`__) and | |
| 34 the `FM index <http://en.wikipedia.org/wiki/FM-index>`__, called | |
| 35 hierarchical indexing, that employs two types of indexes: (1) one global | |
| 36 FM index representing the whole genome, and (2) many separate local FM | |
| 37 indexes for small regions collectively covering the genome. Our | |
| 38 hierarchical index for the human genome (about 3 billion bp) includes | |
| 39 ~48,000 local FM indexes, each representing a genomic region of | |
| 40 ~64,000bp. As the basis for non-gapped alignment, the FM index is | |
| 41 extremely fast with a low memory footprint, as demonstrated by | |
| 42 `Bowtie <http://bowtie-bio.sf.net>`__. In addition, HISAT provides | |
| 43 several alignment strategies specifically designed for mapping different | |
| 44 types of RNA-seq reads. All these together, HISAT enables extremely fast | |
| 45 and sensitive alignment of reads, in particular those spanning two exons | |
| 46 or more. As a result, HISAT is much faster >50 times than | |
| 47 `TopHat2 <http://ccb.jhu.edu/software/tophat>`__ with better alignment | |
| 48 quality. Although it uses a large number of indexes, the memory | |
| 49 requirement of HISAT is still modest, approximately 4.3 GB for human. | |
| 50 HISAT uses the `Bowtie2 <http://bowtie-bio.sf.net/bowtie2>`__ | |
| 51 implementation to handle most of the operations on the FM index. In | |
| 52 addition to spliced alignment, HISAT handles reads involving indels and | |
| 53 supports a paired-end alignment mode. Multiple processors can be used | |
| 54 simultaneously to achieve greater alignment speed. HISAT outputs | |
| 55 alignments in `SAM <http://samtools.sourceforge.net/SAM1.pdf>`__ format, | |
| 56 enabling interoperation with a large number of other tools (e.g. | |
| 57 `SAMtools <http://samtools.sourceforge.net>`__, | |
| 58 `GATK <http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit>`__) | |
| 59 that use SAM. HISAT is distributed under the `GPLv3 | |
| 60 license <http://www.gnu.org/licenses/gpl-3.0.html>`__, and it runs on | |
| 61 the command line under Linux, Mac OS X and Windows. | |
| 62 ]]> | |
| 63 </help> | |
| 64 <citations> | |
| 65 <citation type="doi">10.1038/nmeth.3317</citation> | |
| 66 </citations> | |
| 67 </tool> |
