annotate tools/seq_length/README.rst @ 2:6f29bb9960ac draft

v0.0.3 - Fixed SFF; more tests
author peterjc
date Mon, 14 May 2018 12:09:50 -0400
parents 458f987918a6
children fcdf11fb34de
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
1 Galaxy tool to rename FASTA, QUAL, FASTQ or SFF sequences
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
2 =========================================================
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
3
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
4 This tool is copyright 2011-2017 by Peter Cock, The James Hutton Institute
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
6 See the licence text below.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
7
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
8 This tool is a short Python script (using Biopython library functions) to rename
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
9 sequences from a FASTA, QUAL, FASTQ, or SFF file based on an ID mapping gives as
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
10 two columns of a tabular file. The output order follows that of the sequence file,
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
11 and if there are duplicates in the input sequence file, there will be duplicates
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
12 in the output sequence file.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
13
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
14 This tool is available from the Galaxy Tool Shed,
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
15
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
16 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_length
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
17
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
18 See also the sister tools to filter or select sequence files according to IDs
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
19 from column(s) of tabular file:
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
20
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
21 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
22 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_select_by_id
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
23
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
24
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
25 Automated Installation
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
26 ======================
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
27
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
28 This should be straightforward using the Galaxy Tool Shed, which should be
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
29 able to automatically install the dependency on Biopython, and then install
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
30 this tool and run its unit tests.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
31
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
32
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
33 Manual Installation
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
34 ===================
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
35
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
36 There are just two files to install to use this tool from within Galaxy:
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
37
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
38 * ``seq_length.py`` (the Python script)
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
39 * ``seq_length.xml`` (the Galaxy tool definition)
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
40
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
41 The suggested location is in a dedicated ``tools/seq_length`` folder.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
42
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
43 You will also need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
44 tool. One suggested location is in the filters section. Simply add the line::
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
45
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
46 <tool file="seq_length/seq_length.xml" />
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
47
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
48 If you wish to run the unit tests, also move/copy the ``test-data/`` files
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
49 under Galaxy's ``test-data/`` folder. Then::
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
50
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
51 $ ./run_tests.sh -id seq_length
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
52
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
53 You will also need to install Biopython 1.54 or later. That's it.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
54
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
55
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
56 History
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
57 =======
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
58
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
59 ======= ======================================================================
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
60 Version Changes
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
61 ------- ----------------------------------------------------------------------
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
62 v0.0.1 - Initial version.
1
458f987918a6 Faster FASTA and FASTQ, v0.0.2
peterjc
parents: 0
diff changeset
63 v0.0.2 - Faster for FASTA and FASTQ.
458f987918a6 Faster FASTA and FASTQ, v0.0.2
peterjc
parents: 0
diff changeset
64 - Fixed typo.
2
6f29bb9960ac v0.0.3 - Fixed SFF; more tests
peterjc
parents: 1
diff changeset
65 v0.0.3 - Improved command line usage (outside of Galaxy).
6f29bb9960ac v0.0.3 - Fixed SFF; more tests
peterjc
parents: 1
diff changeset
66 - More tests (now covers SFF as well).
6f29bb9960ac v0.0.3 - Fixed SFF; more tests
peterjc
parents: 1
diff changeset
67 - Fix requesting SFF format.
0
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
68 ======= ======================================================================
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
69
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
70
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
71 Developers
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
72 ==========
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
73
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
74 Development is here:
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
75
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
76 https://github.com/peterjc/pico_galaxy/tree/master/tools/seq_length
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
77
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
78 For pushing a release to the test or main "Galaxy Tool Shed", use the following
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
79 Planemo commands (which requires you have set your Tool Shed access details in
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
80 ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
81
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
82 $ planemo shed_update -t testtoolshed --check_diff tools/seq_length/
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
83 ...
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
84
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
85 or::
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
86
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
87 $ planemo shed_update -t toolshed --check_diff tools/seq_length/
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
88 ...
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
89
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
90 To just build and check the tar ball, use::
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
91
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
92 $ planemo shed_upload --tar_only tools/seq_length/
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
93 ...
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
94 $ tar -tzf shed_upload.tar.gz
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
95 test-data/SRR639755_sample_strict.fastq
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
96 test-data/SRR639755_sample_strict.length.tabular
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
97 test-data/four_human_proteins.fasta
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
98 test-data/four_human_proteins.length.tabular
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
99 tools/seq_length/README.rst
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
100 tools/seq_length/seq_length.py
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
101 tools/seq_length/seq_length.xml
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
102 tools/seq_length/tool_dependencies.xml
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
103
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
104
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
105 Licence (MIT)
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
106 =============
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
107
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
108 Permission is hereby granted, free of charge, to any person obtaining a copy
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
109 of this software and associated documentation files (the "Software"), to deal
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
110 in the Software without restriction, including without limitation the rights
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
111 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
112 copies of the Software, and to permit persons to whom the Software is
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
113 furnished to do so, subject to the following conditions:
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
114
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
115 The above copyright notice and this permission notice shall be included in
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
116 all copies or substantial portions of the Software.
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
117
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
118 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
119 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
120 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
121 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
122 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
123 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
c323e29a8248 Initial release v0.0.1
peterjc
parents:
diff changeset
124 THE SOFTWARE.