diff tools/blast2go/README.rst @ 8:e23b621eb7bb draft

Uploaded v0.0.9, embed citation, updated README
author peterjc
date Thu, 26 Mar 2015 11:15:22 -0400
parents
children 887adf823bc0
line wrap: on
line diff
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/tools/blast2go/README.rst	Thu Mar 26 11:15:22 2015 -0400
@@ -0,0 +1,220 @@
+Galaxy wrapper for Blast2GO for pipelines, b2g4pipe
+===================================================
+
+This wrapper is copyright 2011-2014 by Peter Cock, The James Hutton Institute
+(formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
+See the licence text below (MIT licence).
+
+This is a wrapper for the command line Java tool b2g4pipe v2.5, Blast2GO for
+pipelines, currently a free to use download available at:
+http://www.blast2go.com/data/blast2go/b2g4pipe_v2.5.zip
+
+Note that this has been superceded by a non-free "Blast2GO Command Line (CLI)":
+http://www.blast2go.com/blast2gocli/
+
+This wrapper is freely available from the Galaxy Tool Shed at:
+http://toolshed.g2.bx.psu.edu/view/peterjc/blast2go
+
+
+References
+==========
+
+Peter Cock, Bjoern Gruening, Konrad Paszkiewicz and Leighton Pritchard (2013).
+Galaxy tools and workflows for sequence analysis with applications
+in molecular plant pathology. PeerJ 1:e167
+http://dx.doi.org/10.7717/peerj.167
+
+S. Geotz et al. (2008).
+High-throughput functional annotation and data mining with the Blast2GO suite.
+Nucleic Acids Res. 36(10):3420-3435.
+http://dx.doi.org/10.1093/nar/gkn176
+
+A. Conesa and S. Geotz (2008).
+Blast2GO: A Comprehensive Suite for Functional Analysis in Plant Genomics.
+International Journal of Plant Genomics. 619832.
+http://dx.doi.org/10.1155/2008/619832
+
+A. Conesa et al. (2005).
+Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research.
+Bioinformatics 21:3674-3676.
+http://dx.doi.org/10.1093/bioinformatics/bti610
+
+See also http://www.blast2go.com/
+
+
+Automated Installation
+======================
+
+Installation via the Galaxy Tool Shed should take care of the Galaxy side of
+things, including the dependency on 'blast_datatypes' which defines the
+'blastxml' file format. However, you will also probably need to configure
+the Blast2GO property file(s), for example if you have a local Blast2GO
+database (which we recommend for speed).
+
+
+Manual Installation
+===================
+
+The main dependency is b2g4pipe which must be installed manually. Also we
+strongly recommend installing a local Blast2GO database as well (see the
+intructions below about the ``blast2go.loc`` file). At the time of writing,
+the last free version is b2g4pipe v2.5 which is available here:
+
+* http://www.blast2go.com/data/blast2go/b2g4pipe_v2.5.zip
+
+You can change the path by setting the B2G4PIPE environement variable to
+the desired folder, but by default the script looks for the JAR file here::
+
+    /opt/b2g4pipe_v2.5/blast2go.jar
+
+To install the wrapper manually, first install 'blast_datatypes', then
+copy or move the following files under the Galaxy tools folder, e.g. in a
+tools/blast2go/ folder:
+
+* blast2go.xml (the Galaxy tool definition)
+* blast2go.py (the Python wrapper script)
+* massage_xml_for_blast2go.py (Python XML reformatting script)
+* README.rst (this file)
+
+For a manual installation of the wrapper you will also need to modify the
+tools_conf.xml file to tell Galaxy to offer the tool. We suggest putting
+it next to the NCBI BLAST+ wrappers. Just add the line::
+
+  <tool file="blast2go/blast2go.xml" />
+
+If you wish to run the unit tests, also add this to tools_conf.xml.sample
+and move/copy the test-data files under Galaxy's test-data folder. Then::
+
+    $ ./run_functional_tests.sh -id blast2go
+
+
+Configuration
+=============
+
+As part of setting up b2g4pipe you will need to setup one or more Blast2GO
+property files which tell the tool which database to use etc. The example
+``b2gPipe.properties`` provided with b2g4pipe is now out of date. The current
+server IP address and database name may given on the Blast2GO website, or
+can be found by running the latest GUI version via Java web-start, and
+looking under the tools/options menu. These property files can be anywhere
+accessable to the Galaxy Unix user, we put them with the JAR file etc.
+
+You must tell Galaxy about these Blast2GO property files so that they can
+be offered to the user. Copy file ``blast2go.loc.sample`` to
+``tool-data/blast2go.loc`` under the Galaxy folder and edit this to match
+your installation. This must be plain text, tab separated, with three columns:
+
+1. ID for the setup, e.g. ``Spain_2012_August``
+2. Description for the setup, e.g. ``Database in Spain (August 2012)``
+3. Properties filename for the setup, e.g. ``Spain_2012_August.properties``
+   relative to the main JAR file, or with a full path
+   e.g. ``/opt/b2g4pipe/Spain_2012_August.properties``
+
+Avoid including "Blast2GO" in the description (column 2) as this text will be
+included in the automatically assigned output dataset name. The ``blast2go.loc``
+file allows you to customise the database setup. If for example you have a local
+Blast2GO server running (which we recommend for speed), and you want this to be
+the default setting, include it as the first line in your ``blast2go.loc`` file.
+
+Consult the Blast2GO documentation for details about the property files and
+setting up a local MySQL Blast2GO database. e.g.
+https://www.blast2go.com/b2gsupport/resources/35-localb2gdb
+
+
+
+History
+=======
+
+======= ======================================================================
+Version Changes
+------- ----------------------------------------------------------------------
+v0.0.1  - Initial public release
+v0.0.2  - Documentation clarifications, e.g. concatenated BLAST XML is allowed.
+        - Fixed error handler in wrapper script (for when b2g4pipe fails).
+        - Reformats the XML to use old NCBI-style concatenated BLAST XML since
+          b2g4pipe crashes with heap space error on with large files using
+          current NCBI output.
+v0.0.3  - Include sample loc file, ``tool-data/blast2go.loc.sample``
+v0.0.4  - Include ``repository_dependencies.xml`` file for ``blastxml`` format
+          (previously included in the core Galaxy installation)
+v0.0.5  - Quote arguments in case of spaces in filenames (internal change)
+        - Last release supporting b2g4pipe v2.3.5
+v0.0.6  - Support for b2g4pipe v2.5 instead of v2.3.5
+
+          - Now invoked with a class path and es.blast2go.prog.B2GAnnotPipe
+            rather then simply calling the jar file
+          - Now uses the switch ``-annot`` instead of ``-a`` (this change
+            breaks support for b2g4pipe v2.3.5 unfortunately)
+
+        - Catch a few error messages and treat them explicitly as errors.
+v0.0.7  - Update output description in XML file (b2g4pipe v2.3.5 included
+          the sequence description, b2g4pipe v2.5 omits this).
+v0.0.8  - Automated installation via the Galaxy Tool Shed.
+        - Added unit test.
+        - Explain how to load the tabular file into the Blast2GO GUI.
+        - Link to Tool Shed added to help text and this documentation.
+        - Switch to standard MIT licence.
+        - Use reStructuredText for this README file.
+        - Updated citation information (Cock et al. 2013).
+        - Development moved to GitHub, https://github.com/peterjc/galaxy_blast
+        - Split out ``massage_xml_for_blast2go.py`` as a standalone file.
+v0.0.9  - Update README file now that BioBam are selling the latest version
+          of the Blast2GO command line tool. For now b2g4pipe v2.5 is still
+          available as a free download.
+        - Tool definition now embeds citation information.
+======= ======================================================================
+
+
+Developers
+==========
+
+This script and related tools were originally developed on the 'tools' branch
+of the following BitBucket Mercurial repository:
+https://bitbucket.org/peterjc/galaxy-central/
+
+As of September 2013, development is continuing on a dedicated GitHub repository:
+https://github.com/peterjc/galaxy_blast
+
+For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball I use
+the following command from the Galaxy root folder::
+
+    $ tar -czf blast2go.tar.gz tools/blast2go/README.rst tools/blast2go/blast2go.xml tools/blast2go/blast2go.py tools/blast2go/massage_xml_for_blast2go.py tools/blast2go/repository_dependencies.xml tools/blast2go/tool_dependencies.xml tool-data/blast2go.loc.sample test-data/blastp_sample.xml test-data/blastp_sample.blast2go.tabular
+
+Check this worked::
+
+    $ tar -tzf blast2go.tar.gz
+    tools/blast2go/README.rst
+    tools/blast2go/blast2go.xml
+    tools/blast2go/blast2go.py
+    tools/blast2go/massage_xml_for_blast2go.py
+    tools/blast2go/repository_dependencies.xml
+    tools/blast2go/tool_dependencies.xml
+    tool-data/blast2go.loc.sample
+    test-data/blastp_sample.xml
+    test-data/blastp_sample.blast2go.tabular
+
+
+Licence (MIT)
+=============
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in
+all copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
+THE SOFTWARE.
+
+
+NOTE: This is the licence for the Galaxy Wrapper only. Blast2GO and
+associated data files are available and licenced separately.