annotate tools/seq_select_by_id/README.rst @ 4:6842c0c7bc70 draft

Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
author peterjc
date Mon, 28 Oct 2013 05:21:45 -0400
parents
children 1a83f5ab9e95
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
1 Galaxy tool to select FASTA, QUAL, FASTQ or SFF sequences by ID
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
2 ===============================================================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
3
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
4 This tool is copyright 2011-2013 by Peter Cock, The James Hutton Institute
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
6 See the licence text below.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
7
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
8 This tool is a short Python script (using Biopython library functions) to extract
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
9 sequences from a FASTA, QUAL, FASTQ, or SFF file based on the list of IDs given
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
10 by a column of a tabular file. The output order follows that of the tabular file,
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
11 and if there are duplicates in the tabular file, there will be duplicates in the
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
12 output sequence file.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
13
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
14 This tool is available from the Galaxy Tool Shed at:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
15
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
16 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_select_by_id
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
17
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
18 See also the sister tools to filter sequence files according to IDs from column(s)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
19 of a tabular file (where the output order follows the sequence file, and any
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
20 duplicate IDs are ignored) and rename sequences:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
21
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
22 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
23 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_rename
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
24
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
25
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
26 Automated Installation
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
27 ======================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
28
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
29 This should be straightforward using the Galaxy Tool Shed, which should be
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
30 able to automatically install the dependency on Biopython, and then install
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
31 this tool and run its unit tests.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
32
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
33
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
34 Manual Installation
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
35 ===================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
36
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
37 There are just two files to install to use this tool from within Galaxy:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
38
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
39 * seq_select_by_id.py (the Python script)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
40 * seq_select_by_id.xml (the Galaxy tool definition)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
41
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
42 The suggested location is a dedicated tools/seq_select_by_id folder.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
43
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
44 You will also need to modify the tools_conf.xml file to tell Galaxy to offer the
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
45 tool. One suggested location is in the filters section. Simply add the line::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
46
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
47 <tool file="seq_select_by_id/seq_select_by_id.xml" />
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
48
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
49 If you wish to run the unit tests, also add this to tools_conf.xml.sample
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
50 and move/copy the test-data files under Galaxy's test-data folder. Then::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
51
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
52 $ ./run_functional_tests.sh -id seq_select_by_id
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
53
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
54 You will also need to install Biopython 1.54 or later. That's it.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
55
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
56
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
57 History
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
58 =======
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
59
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
60 ======= ======================================================================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
61 Version Changes
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
62 ------- ----------------------------------------------------------------------
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
63 v0.0.1 - Initial version.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
64 v0.0.3 - Ignore blank lines in input.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
65 v0.0.4 - Record script version when run from Galaxy.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
66 - Basic unit test included.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
67 v0.0.5 - Check for errors using Python script's return code.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
68 v0.0.6 - Link to Tool Shed added to help text and this documentation.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
69 - Automatic installation of Biopython dependency.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
70 - Use reStructuredText for this README file.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
71 - Adopt standard MIT License.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
72 v0.0.7 - Updated citation information (Cock et al. 2013).
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
73 - Fixed Biopython dependency setup.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
74 - Development moved to GitHub, https://github.com/peterjc/pico_galaxy
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
75 - Renamed folder and adopted README.rst naming.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
76 ======= ======================================================================
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
77
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
78
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
79 Developers
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
80 ==========
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
81
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
82 This script and related tools were initially developed on the following hg branch:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
83 http://bitbucket.org/peterjc/galaxy-central/src/tools
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
84
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
85 Development has now moved to a dedicated GitHub repository:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
86 https://github.com/peterjc/pico_galaxy/tree/master/tools
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
87
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
88 For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball use
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
89 the following command from the Galaxy root folder::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
90
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
91 $ tar -czf seq_select_by_id.tar.gz tools/seq_select_by_id/README.rst tools/seq_select_by_id/seq_select_by_id.* tools/seq_select_by_id/repository_dependencies.xml test-data/k12_ten_proteins.fasta test-data/k12_hypothetical.fasta test-data/k12_hypothetical.tabular
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
92
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
93 Check this worked::
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
94
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
95 $ tar -tzf seq_select_by_id.tar.gz
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
96 tools/seq_select_by_id/README.rst
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
97 tools/seq_select_by_id/seq_select_by_id.py
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
98 tools/seq_select_by_id/seq_select_by_id.xml
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
99 tools/seq_select_by_id/repository_dependencies.xml
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
100 test-data/k12_ten_proteins.fasta
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
101 test-data/k12_hypothetical.fasta
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
102 test-data/k12_hypothetical.tabular
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
103
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
104
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
105 Licence (MIT)
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
106 =============
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
107
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
108 Permission is hereby granted, free of charge, to any person obtaining a copy
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
109 of this software and associated documentation files (the "Software"), to deal
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
110 in the Software without restriction, including without limitation the rights
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
111 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
112 copies of the Software, and to permit persons to whom the Software is
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
113 furnished to do so, subject to the following conditions:
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
114
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
115 The above copyright notice and this permission notice shall be included in
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
116 all copies or substantial portions of the Software.
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
117
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
118 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
119 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
120 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
121 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
122 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
123 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
6842c0c7bc70 Uploaded v0.0.7, depend on Biopython 1.62, tabs to spaces in XML
peterjc
parents:
diff changeset
124 THE SOFTWARE.