0
|
1 Galaxy tool reporting sequence composition
|
|
2 ==========================================
|
|
3
|
2
|
4 This tool is copyright 2014-2017 by Peter Cock, The James Hutton Institute
|
0
|
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
|
|
6 See the licence text below (MIT licence).
|
|
7
|
|
8 This tool is a short Python script (using Biopython library functions) to
|
|
9 loop over given sequence files (in a range of formats including FASTA, FASTQ,
|
|
10 and SFF), and report the count of each letter (i.e. amino acids or bases).
|
|
11
|
|
12 This can be useful for sanity checking assemblies (e.g. proportion of N
|
|
13 bases) or looking at differences in base composition.
|
|
14
|
|
15 This tool is available from the Galaxy Tool Shed at:
|
|
16
|
|
17 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_composition
|
|
18
|
|
19
|
|
20 Automated Installation
|
|
21 ======================
|
|
22
|
|
23 This should be straightforward using the Galaxy Tool Shed, which should be
|
|
24 able to automatically install the dependency on Biopython, and then install
|
|
25 this tool and run its unit tests.
|
|
26
|
|
27
|
|
28 Manual Installation
|
|
29 ===================
|
|
30
|
|
31 There are just two files to install to use this tool from within Galaxy:
|
|
32
|
|
33 * ``seq_composition.py`` (the Python script)
|
|
34 * ``seq_composition.xml`` (the Galaxy tool definition)
|
|
35
|
|
36 The suggested location is in a dedicated ``tools/seq_composition`` folder.
|
|
37
|
|
38 You will also need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the
|
|
39 tool. One suggested location is in the filters section. Simply add the line::
|
|
40
|
|
41 <tool file="seq_composition/seq_composition.xml" />
|
|
42
|
|
43 You will also need to install Biopython 1.62 or later.
|
|
44
|
|
45 If you wish to run the unit tests, also move/copy the ``test-data/`` files
|
|
46 under Galaxy's ``test-data/`` folder. Then::
|
|
47
|
|
48 ./run_tests.sh -id seq_composition
|
|
49
|
|
50 That's it.
|
|
51
|
|
52
|
|
53 History
|
|
54 =======
|
|
55
|
|
56 ======= ======================================================================
|
|
57 Version Changes
|
|
58 ------- ----------------------------------------------------------------------
|
|
59 v0.0.1 - Initial version.
|
|
60 - Tool definition now embeds citation information.
|
1
|
61 v0.0.2 - Reorder XML elements (internal change only).
|
|
62 - Planemo for Tool Shed upload (``.shed.yml``, internal change only).
|
2
|
63 v0.0.3 - Python style updates (internal change only).
|
0
|
64 ======= ======================================================================
|
|
65
|
|
66
|
|
67 Developers
|
|
68 ==========
|
|
69
|
|
70 This script and related tools are being developed on this GitHub repository:
|
|
71 https://github.com/peterjc/pico_galaxy/tree/master/tools/seq_composition
|
|
72
|
1
|
73
|
|
74 For pushing a release to the test or main "Galaxy Tool Shed", use the following
|
|
75 Planemo commands (which requires you have set your Tool Shed access details in
|
|
76 ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::
|
0
|
77
|
2
|
78 $ planemo shed_update -t testtoolshed --check_diff ~/repositories/pico_galaxy/tools/seq_composition/
|
1
|
79 ...
|
|
80
|
|
81 or::
|
0
|
82
|
2
|
83 $ planemo shed_update -t toolshed --check_diff ~/repositories/pico_galaxy/tools/seq_composition/
|
1
|
84 ...
|
0
|
85
|
1
|
86 To just build and check the tar ball, use::
|
0
|
87
|
1
|
88 $ planemo shed_upload --tar_only ~/repositories/pico_galaxy/tools/seq_composition/
|
|
89 ...
|
|
90 $ tar -tzf shed_upload.tar.gz
|
|
91 test-data/MID4_GLZRM4E04_rnd30_frclip.sff
|
|
92 test-data/MID4_GLZRM4E04_rnd30_frclip.seq_composition.tabular
|
|
93 test-data/ecoli.fastq
|
|
94 test-data/ecoli.seq_composition.tabular
|
|
95 test-data/four_human_proteins.fasta
|
|
96 test-data/four_human_proteins.seq_composition.tabular
|
0
|
97 tools/seq_composition/README.rst
|
|
98 tools/seq_composition/seq_composition.py
|
|
99 tools/seq_composition/seq_composition.xml
|
|
100 tools/seq_composition/tool_dependencies.xml
|
|
101
|
|
102
|
|
103 Licence (MIT)
|
|
104 =============
|
|
105
|
|
106 Permission is hereby granted, free of charge, to any person obtaining a copy
|
|
107 of this software and associated documentation files (the "Software"), to deal
|
|
108 in the Software without restriction, including without limitation the rights
|
|
109 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
|
110 copies of the Software, and to permit persons to whom the Software is
|
|
111 furnished to do so, subject to the following conditions:
|
|
112
|
|
113 The above copyright notice and this permission notice shall be included in
|
|
114 all copies or substantial portions of the Software.
|
|
115
|
|
116 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
|
117 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
118 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
|
119 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
120 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
|
121 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
|
|
122 THE SOFTWARE.
|