Mercurial > repos > rmarenco > multi_fasta_glimmer_hmm
comparison readme.md @ 0:0ddb5ee32ff6 draft default tip
planemo upload for repository https://github.com/remimarenco/multi_fasta_glimmerhmm.git commit 28bd73b26b50165eded1d9ba995979acdf005ad1-dirty
| author | rmarenco |
|---|---|
| date | Thu, 18 Aug 2016 18:50:00 -0400 |
| parents | |
| children |
comparison
equal
deleted
inserted
replaced
| -1:000000000000 | 0:0ddb5ee32ff6 |
|---|---|
| 1 Galaxy wrapper for GlimmerHMM | |
| 2 ===================================== | |
| 3 | |
| 4 This wrapper has been rewritten by Rémi Marenco in 2016 to fix issues, improve it and add the multi_fasta handling. It has been originally written by Björn Gruening. | |
| 5 | |
| 6 This is a wrapper for the command line tool of GlimmerHMM. | |
| 7 https://ccb.jhu.edu/software/glimmerhmm/ | |
| 8 | |
| 9 GlimmerHMM is a gene finder based on a Generalized Hidden Markov Model (GHMM). Although the gene finder conforms to the overall mathematical framework of a GHMM, | |
| 10 additionally it incorporates splice site models adapted from the GeneSplicer program and a decision tree adapted from GlimmerM. It also utilizes | |
| 11 Interpolated Markov Models for the coding and noncoding models. | |
| 12 Currently, GlimmerHMM's GHMM structure includes introns of each phase, intergenic regions, and four types of exons (initial, internal, final, and single). | |
| 13 | |
| 14 Majoros, W.H., Pertea, M., and Salzberg, S.L. TigrScan and GlimmerHMM: two open-source ab initio eukaryotic gene-finders Bioinformatics 20 2878-2879. | |
| 15 Pertea, M. and S. L. Salzberg (2002). "Computational gene finding in plants." Plant Molecular Biology 48(1-2): 39-48. | |
| 16 The Arabidopsis Genome Initiative, (2000) "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana", Nature. Dec 14; 408(6814):796-815. | |
| 17 Pertea, M., S. L. Salzberg, et al. (2000). "Finding genes in Plasmodium falciparum." Nature 404(6773): 34; discussion 34-5. | |
| 18 Salzberg, S. L., M. Pertea, et al. (1999). "Interpolated Markov models for eukaryotic gene finding." Genomics 59(1): 24-31. | |
| 19 | |
| 20 | |
| 21 Installation | |
| 22 ============ | |
| 23 | |
| 24 To install Glimmer3, please download GlimmerHMM from | |
| 25 | |
| 26 ftp://ccb.jhu.edu/pub/software/glimmerhmm | |
| 27 | |
| 28 and follow the installation instructions. | |
| 29 To extract the glimmerHMM predicted genes, the GFF Parser from Brad Chapman (ttp://github.com/chapmanb/bcbb/tree/master/gff) was used and is included. | |
| 30 | |
| 31 To install the wrapper copy the glimmerHMM folder in the galaxy tools | |
| 32 folder and modify the $GALAXY_ROOT/config/tool_conf.xml file to make the tool available to Galaxy. | |
| 33 For example: | |
| 34 | |
| 35 ```xml | |
| 36 <tool file="gene_prediction/tools/glimmerHMM/glimmerhmm_predict.xml" /> | |
| 37 <tool file="gene_prediction/tools/glimmerHMM/glimmerhmm_to_sequence.xml" /> | |
| 38 ``` | |
| 39 | |
| 40 You also need to use a trained organism by adding them as reference data in Galaxy: | |
| 41 | |
| 42 1. Add the *glimmer_hmm_trained_dir* data table to `tool_data_table_conf.xml` in `$GALAXY_ROOT/config/`: | |
| 43 | |
| 44 ```xml | |
| 45 <!-- glimmer_hmm trained_dir --> | |
| 46 <table name="glimmer_hmm_trained_dir" comment_char="#"> | |
| 47 <columns>value, name, path</columns> | |
| 48 <file path="tool-data/glimmer_hmm.loc" /> | |
| 49 </table> | |
| 50 ``` | |
| 51 | |
| 52 2. Add the `glimmer_hmm.loc` file referencing your trained organism, in `tool-data`. | |
| 53 You have a sample [`glimmer_hmm.loc.sample`] available in the repository to help you configuring it properly | |
| 54 3. Add your data in the chosen folder at step 2. You can get them from the GlimmerHMM tar, `$GLIMMERHMM/trained_dir` | |
| 55 | |
| 56 History | |
| 57 ======= | |
| 58 | |
| 59 - v3.0 - Add the Multi Fasta support | |
| 60 - v2.0 - Update by Rémi Marenco to make it work without having to modify the wrapper + add ability to select the species | |
| 61 - v0.1 - Initial public release | |
| 62 | |
| 63 | |
| 64 Wrapper Licence (MIT/BSD style) | |
| 65 =============================== | |
| 66 | |
| 67 Permission to use, copy, modify, and distribute this software and its | |
| 68 documentation with or without modifications and for any purpose and | |
| 69 without fee is hereby granted, provided that any copyright notices | |
| 70 appear in all copies and that both those copyright notices and this | |
| 71 permission notice appear in supporting documentation, and that the | |
| 72 names of the contributors or copyright holders not be used in | |
| 73 advertising or publicity pertaining to distribution of the software | |
| 74 without specific prior permission. | |
| 75 | |
| 76 THE CONTRIBUTORS AND COPYRIGHT HOLDERS OF THIS SOFTWARE DISCLAIM ALL | |
| 77 WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED | |
| 78 WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL THE | |
| 79 CONTRIBUTORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY SPECIAL, INDIRECT | |
| 80 OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS | |
| 81 OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE | |
| 82 OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE | |
| 83 OR PERFORMANCE OF THIS SOFTWARE. |
