Galaxy tool to rename FASTA, QUAL, FASTQ or SFF sequences
=========================================================

This tool is copyright 2011-2017 by Peter Cock, The James Hutton Institute
(formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
See the licence text below.

This tool is a short Python script (using Biopython library functions) to rename
sequences from a FASTA, QUAL, FASTQ, or SFF file based on an ID mapping given as
two columns of a tabular file. The output order follows that of the sequence file,
and if there are duplicates in the input sequence file, there will be duplicates
in the output sequence file.

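As a rough illustration of the renaming behaviour described above, here is a
minimal standard-library sketch that handles FASTA text only. It is not the
tool's own code (which uses Biopython). The function names ``load_mapping`` and
``rename_fasta`` are hypothetical, and leaving unmapped identifiers unchanged is
an assumption of this sketch:

```python
def load_mapping(tabular_text, old_col=0, new_col=1):
    """Read old-ID -> new-ID pairs from two columns of tab-separated text."""
    mapping = {}
    for line in tabular_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        mapping[fields[old_col].strip()] = fields[new_col].strip()
    return mapping


def rename_fasta(fasta_text, mapping):
    """Rename FASTA records, keeping the input order and any duplicates."""
    out = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            # The identifier is the first word after ">"; the rest is description.
            parts = line[1:].split(None, 1)
            old_id = parts[0] if parts else ""
            rest = parts[1] if len(parts) > 1 else ""
            # Assumption of this sketch: IDs missing from the mapping are kept.
            new_id = mapping.get(old_id, old_id)
            line = ">" + new_id + (" " + rest if rest else "")
        out.append(line)
    return "\n".join(out) + "\n"


# Illustrative data only: "seq1" appears twice and "seq3" has no mapping.
mapping = load_mapping("seq1\talpha\nseq2\tbeta\n")
fasta = ">seq1 first\nACGT\n>seq3\nGG\n>seq1 first\nACGT\n"
renamed = rename_fasta(fasta, mapping)
print(renamed, end="")
```

Both copies of ``seq1`` are renamed and stay in their original positions, which
mirrors the ordering and duplicate handling described above.
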
This tool is available from the Galaxy Tool Shed:

* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_length

See also the sister tools to filter or select sequence files according to IDs
from column(s) of a tabular file:

* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_select_by_id


Automated Installation
======================

This should be straightforward using the Galaxy Tool Shed, which should be
able to automatically install the dependency on Biopython, and then install
this tool and run its unit tests.


Manual Installation
===================

There are just two files to install to use this tool from within Galaxy:

* ``seq_length.py`` (the Python script)
* ``seq_length.xml`` (the Galaxy tool definition)

The suggested location is in a dedicated ``tools/seq_length`` folder.

You will also need to modify the ``tool_conf.xml`` file to tell Galaxy to offer the
tool. One suggested location is in the filters section. Simply add the line::

    <tool file="seq_length/seq_length.xml" />

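If you would rather script that change than edit the file by hand, the
following is a minimal sketch using Python's standard ``xml.etree.ElementTree``
module. The helper name ``add_tool_entry``, the section id ``filter`` and the
cut-down example configuration are all assumptions made for illustration, so
check the section ids in your own configuration first:

```python
import xml.etree.ElementTree as ET


def add_tool_entry(tool_conf_xml, section_id, tool_file):
    """Return tool_conf XML text with a <tool> entry added to one section."""
    root = ET.fromstring(tool_conf_xml)
    for section in root.iter("section"):
        if section.get("id") == section_id:
            # Do nothing if the tool is already listed in this section.
            listed = any(t.get("file") == tool_file for t in section.findall("tool"))
            if not listed:
                ET.SubElement(section, "tool", {"file": tool_file})
            return ET.tostring(root, encoding="unicode")
    raise ValueError("No section with id %r found" % section_id)


# Hypothetical, cut-down configuration used only for illustration.
example_conf = (
    '<toolbox><section id="filter" name="Filter and Sort"></section></toolbox>'
)
updated = add_tool_entry(example_conf, "filter", "seq_length/seq_length.xml")
print(updated)
```

Note that ``ElementTree`` does not preserve comments or the original layout of
the file, so for a heavily commented configuration a manual edit may be the
safer choice.
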
If you wish to run the unit tests, also move/copy the ``test-data/`` files
under Galaxy's ``test-data/`` folder. Then::

    $ ./run_tests.sh -id seq_length

You will also need to install Biopython 1.54 or later. That's it.


History
=======

======= ======================================================================
Version Changes
------- ----------------------------------------------------------------------
v0.0.1  - Initial version.
v0.0.2  - Faster for FASTA and FASTQ.
        - Fixed typo.
v0.0.3  - Improved command line usage (outside of Galaxy).
        - More tests (now covers SFF as well).
        - Fix requesting SFF format.
======= ======================================================================


Developers
==========

Development is here:

https://github.com/peterjc/pico_galaxy/tree/master/tools/seq_length

For pushing a release to the test or main "Galaxy Tool Shed", use the following
Planemo commands (which require that you have set your Tool Shed access details in
``~/.planemo.yml`` and that you have access rights on the Tool Shed)::

    $ planemo shed_update -t testtoolshed --check_diff tools/seq_length/
    ...

or::

    $ planemo shed_update -t toolshed --check_diff tools/seq_length/
    ...

To just build and check the tarball, use::

    $ planemo shed_upload --tar_only tools/seq_length/
    ...
    $ tar -tzf shed_upload.tar.gz
    test-data/SRR639755_sample_strict.fastq
    test-data/SRR639755_sample_strict.length.tabular
    test-data/four_human_proteins.fasta
    test-data/four_human_proteins.length.tabular
    tools/seq_length/README.rst
    tools/seq_length/seq_length.py
    tools/seq_length/seq_length.xml
    tools/seq_length/tool_dependencies.xml

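The tarball listing can also be checked from a script rather than by eye. The
sketch below uses Python's standard ``tarfile`` module; the helper names
``missing_members`` and ``build_demo_tarball`` are hypothetical, and the demo
archive is built in memory purely so that the example is self-contained:

```python
import io
import tarfile


def missing_members(tar_bytes, expected):
    """Return the expected file names that are absent from a .tar.gz archive."""
    with tarfile.open(fileobj=io.BytesIO(tar_bytes), mode="r:gz") as archive:
        present = set(archive.getnames())
    return sorted(set(expected) - present)


def build_demo_tarball(names):
    """Build a small in-memory .tar.gz of empty files, for demonstration only."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        for name in names:
            archive.addfile(tarfile.TarInfo(name))
    return buffer.getvalue()


expected = [
    "tools/seq_length/README.rst",
    "tools/seq_length/seq_length.py",
    "tools/seq_length/seq_length.xml",
    "tools/seq_length/tool_dependencies.xml",
]
# The demo archive deliberately leaves out the last expected file.
demo = build_demo_tarball(expected[:3])
missing = missing_members(demo, expected)
print(missing)
```

For a real check, read the bytes of ``shed_upload.tar.gz`` from disk and pass
them to ``missing_members`` together with the file names you expect to see.
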

Licence (MIT)
=============

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in
all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
THE SOFTWARE.