Mercurial > repos > pjbriggs > rnachipintegrator
diff README.rst @ 0:d9c1f2133124 draft
Uploaded initial version 0.4.4.
author | pjbriggs |
---|---|
date | Tue, 30 Jun 2015 06:44:06 -0400 |
parents | |
children | 5f69a2c1b9c9 |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/README.rst Tue Jun 30 06:44:06 2015 -0400 @@ -0,0 +1,142 @@ +RnaChipIntegrator: integrated analysis of gene expression and ChIP data +======================================================================= + +Galaxy tool wrappers for running the RnaChipIntegrator program +(http://fls-bioinformatics-core.github.com/RnaChipIntegrator/) for integrated +analyses of gene expression and ChIP data. + +There are two tools available that are built around RnaChipIntegrator: + +- General RnaChipIntegrator tool that allows any peaks to be analysed against + any gene list +- A "cannonical gene" variant which allows ChIP peaks to be analysed against a + list of cannonical genes for different genomes + +Automated installation +====================== + +Installation via the Galaxy Tool Shed will take of installing the tool wrapper +and the RnaChipIntegrator programs, installing the .loc files, and setting the +appropriate environment variables. + +Manual Installation +=================== + +There are three files to install: + +- ``rnachipintegrator_wrapper.xml`` (the Galaxy tool definition for general usage) +- ``rnachipintegrator_canonical_genes.xml`` (tool definition for the "canonical + gene" variant) +- ``rnachipintegrator_wrapper.sh`` (the shell script wrapper) + +The suggested location is in a ``tools/rnachipintegrator/`` folder. You will then +need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the tool +by adding the lines: + + <tool file="rnachipintegrator/rnachipintegrator_wrapper.xml" /> + <tool file="rnachipintegrator/rnachipintegrator_canonical_genes.xml" /> + +You will also need to install the RnaChipIntegrator program: + +- http://fls-bioinformatics-core.github.com/RnaChipIntegrator/ + +In addition for the cannonical gene and histone modification variants, it's +necessary to copy the .loc.sample files to .loc Galaxy's ``tool-data`` folder: + +- **Cannonical genes**: its necessary to manually acquire cannonical gene + list files from UCSC and then add appropriate references in the + ``rnachipintegrator_canonical_genes.loc`` file. + +If you want to run the functional tests, copy the sample test files under +``test-data`` to Galaxy's ``test-data/`` directory. Then: + + ./run_tests.sh -id fls_rnachipintegrator_wrapper + + +How to get canonical gene data +============================== + +Getting canonical gene info from UCSC is covered in the screencast at +http://blog.openhelix.eu/?p=6097 + +In summary: + +- **UCSC genome browser:** if multiple versions of each gene are visible (e.g. + if you view 'human Feb 2009 GRCh37/hg19' positions chr17:41,100,000-41,300,000) + then: + + - Scroll down to the Gene & Gene Prediction section + - Click on the UCSC genes link + - Uncheck the tickbox next to splice variants and resubmit to view without + duplicates. + +- **UCSC table browser:** to obtain the canonical set of genes from the table + browser: + + - Select knownCanonical from the table menu + - Select selected fields from primary & related tables from the output format menu + - Click on Get output + - Select the fields of interest (e.g. 'chrom', 'chromStart', 'chromEnd' from the + 'knownCanonical' table plus 'geneSymbol' from the kgXref table.) + - Click on Get output to get the data for download + +(Hint: to also get strand direction information i.e. +/-, also allow selection +from the refGene table, and select the strand field.) + + +Note on Excel output files and Galaxy +===================================== + +RnaChipIntegrator produces an Excel spreadsheet as one of its outputs, +however Galaxy is not currently set up by default to handle these. + +To enable Excel output file handling in Galaxy, edit the ``datatypes_conf.xml`` +file and add: + + <datatype extension="xls" type="galaxy.datatypes.data:Data" mimetype="application/vnd.ms-excel" /> + +You'll also need to remove the existing datatype with extension "xls". + +Restarting Galaxy should mean that the browser correctly handles Excel outputs +from RnaChipIntegrator. + +History +======= + +========== ====================================================================== +Version Changes +---------- ---------------------------------------------------------------------- +0.4.4-0 - Initial version pushed to toolshed +========== ====================================================================== + + +Developers +========== + +This tool is developed on the following GitHub repository: +https://github.com/fls-bioinformatics-core/galaxy-tools/tree/master/rnachipintegrator + +For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball I use +the ``package_rnachipintegrator_wrapper.sh`` script. + + +Licence (MIT) +============= + +Permission is hereby granted, free of charge, to any person obtaining a copy +of this software and associated documentation files (the "Software"), to deal +in the Software without restriction, including without limitation the rights +to use, copy, modify, merge, publish, distribute, sublicense, and/or sell +copies of the Software, and to permit persons to whom the Software is +furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in +all copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, +OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN +THE SOFTWARE.