annotate tools/seq_select_by_id/README.rst @ 5:1a83f5ab9e95 draft

Uploaded v0.0.8, corrected automated dependency definition
author peterjc
date Thu, 21 Nov 2013 04:54:59 -0500
parents 6842c0c7bc70
children 91f55ee8fea5
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
1 Galaxy tool to select FASTA, QUAL, FASTQ or SFF sequences by ID
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
2 ===============================================================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
3
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
4 This tool is copyright 2011-2013 by Peter Cock, The James Hutton Institute
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
6 See the licence text below.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
7
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
8 This tool is a short Python script (using Biopython library functions) to extract
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
9 sequences from a FASTA, QUAL, FASTQ, or SFF file based on the list of IDs given
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
10 by a column of a tabular file. The output order follows that of the tabular file,
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
11 and if there are duplicates in the tabular file, there will be duplicates in the
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
12 output sequence file.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
13
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
14 This tool is available from the Galaxy Tool Shed at:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
15
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
16 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_select_by_id
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
17
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
18 See also the sister tools to filter sequence files according to IDs from column(s)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
19 of a tabular file (where the output order follows the sequence file, and any
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
20 duplicate IDs are ignored) and rename sequences:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
21
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
22 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
23 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_rename
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
24
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
25
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
26 Automated Installation
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
27 ======================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
28
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
29 This should be straightforward using the Galaxy Tool Shed, which should be
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
30 able to automatically install the dependency on Biopython, and then install
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
31 this tool and run its unit tests.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
32
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
33
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
34 Manual Installation
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
35 ===================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
36
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
37 There are just two files to install to use this tool from within Galaxy:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
38
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
39 * seq_select_by_id.py (the Python script)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
40 * seq_select_by_id.xml (the Galaxy tool definition)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
41
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
42 The suggested location is a dedicated tools/seq_select_by_id folder.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
43
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
44 You will also need to modify the tools_conf.xml file to tell Galaxy to offer the
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
45 tool. One suggested location is in the filters section. Simply add the line::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
46
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
47 <tool file="seq_select_by_id/seq_select_by_id.xml" />
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
48
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
49 If you wish to run the unit tests, also add this to tools_conf.xml.sample
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
50 and move/copy the test-data files under Galaxy's test-data folder. Then::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
51
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
52 $ ./run_functional_tests.sh -id seq_select_by_id
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
53
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
54 You will also need to install Biopython 1.54 or later. That's it.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
55
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
56
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
57 History
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
58 =======
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
59
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
60 ======= ======================================================================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
61 Version Changes
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
62 ------- ----------------------------------------------------------------------
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
63 v0.0.1 - Initial version.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
64 v0.0.3 - Ignore blank lines in input.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
65 v0.0.4 - Record script version when run from Galaxy.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
66 - Basic unit test included.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
67 v0.0.5 - Check for errors using Python script's return code.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
68 v0.0.6 - Link to Tool Shed added to help text and this documentation.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
69 - Automatic installation of Biopython dependency.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
70 - Use reStructuredText for this README file.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
71 - Adopt standard MIT License.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
72 v0.0.7 - Updated citation information (Cock et al. 2013).
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
73 - Fixed Biopython dependency setup.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
74 - Development moved to GitHub, https://github.com/peterjc/pico_galaxy
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
75 - Renamed folder and adopted README.rst naming.
5
1a83f5ab9e95 Uploaded v0.0.8, corrected automated dependency definition
peterjc
parents: 4
diff changeset
76 v0.0.8 - Corrected automated dependency definition
4
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
77 ======= ======================================================================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
78
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
79
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
80 Developers
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
81 ==========
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
82
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
83 This script and related tools were initially developed on the following hg branch:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
84 http://bitbucket.org/peterjc/galaxy-central/src/tools
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
85
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
86 Development has now moved to a dedicated GitHub repository:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
87 https://github.com/peterjc/pico_galaxy/tree/master/tools
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
88
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
89 For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball use
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
90 the following command from the Galaxy root folder::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
91
5
1a83f5ab9e95 Uploaded v0.0.8, corrected automated dependency definition
peterjc
parents: 4
diff changeset
92 $ tar -czf seq_select_by_id.tar.gz tools/seq_select_by_id/README.rst tools/seq_select_by_id/seq_select_by_id.* tools/seq_select_by_id/tool_dependencies.xml test-data/k12_ten_proteins.fasta test-data/k12_hypothetical.fasta test-data/k12_hypothetical.tabular
4
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
93
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
94 Check this worked::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
95
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
96 $ tar -tzf seq_select_by_id.tar.gz
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
97 tools/seq_select_by_id/README.rst
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
98 tools/seq_select_by_id/seq_select_by_id.py
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
99 tools/seq_select_by_id/seq_select_by_id.xml
5
1a83f5ab9e95 Uploaded v0.0.8, corrected automated dependency definition
peterjc
parents: 4
diff changeset
100 tools/seq_select_by_id/tool_dependencies.xml
4
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
101 test-data/k12_ten_proteins.fasta
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
102 test-data/k12_hypothetical.fasta
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
103 test-data/k12_hypothetical.tabular
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
104
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
105
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
106 Licence (MIT)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
107 =============
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
108
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
109 Permission is hereby granted, free of charge, to any person obtaining a copy
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
110 of this software and associated documentation files (the "Software"), to deal
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
111 in the Software without restriction, including without limitation the rights
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
112 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
113 copies of the Software, and to permit persons to whom the Software is
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
114 furnished to do so, subject to the following conditions:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
115
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
116 The above copyright notice and this permission notice shall be included in
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
117 all copies or substantial portions of the Software.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
118
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
119 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
120 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
121 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
122 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
123 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
124 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
125 THE SOFTWARE.