annotate tools/seq_filter_by_mapping/README.rst @ 5:1d6c149ca211 draft

Bump Biopython dependency
author peterjc
date Thu, 30 Nov 2023 09:50:23 +0000
parents f82868a026ea
children

Galaxy tool to filter FASTA, FASTQ or SFF sequences by SAM/BAM mapping
======================================================================

This tool is copyright 2014-2023 by Peter Cock, The James Hutton Institute
(formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
See the licence text below.

This tool is a short Python script (using Biopython library functions) which
divides a FASTA, FASTQ, or SFF file in two: those sequences which map, and
those which do not, according to the given SAM/BAM file(s).

Example uses include mapping of FASTQ reads against a known contaminant
in order to remove reads prior to a de novo assembly.
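
The split described above can be illustrated with a minimal sketch. This is not the tool's actual code: it assumes plain-text SAM and FASTA input, uses only the Python standard library rather than Biopython, treats a read as mapped when SAM FLAG bit 0x4 is unset, and ignores paired-read name suffixes. The function names are hypothetical.

```python
import io


def mapped_read_ids(sam_handle):
    """Return the set of read names with at least one mapped alignment."""
    mapped = set()
    for line in sam_handle:
        if not line.strip() or line.startswith("@"):
            continue  # skip blank lines and SAM header lines
        fields = line.rstrip("\n").split("\t")
        qname, flag = fields[0], int(fields[1])
        if not flag & 0x4:  # FLAG bit 0x4 set means "segment unmapped"
            mapped.add(qname)
    return mapped


def read_fasta(handle):
    """Yield (identifier, record_text) pairs from a FASTA handle."""
    name, lines = None, []
    for line in handle:
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(lines)
            words = line[1:].split()
            # The identifier is the first word after ">"
            name, lines = (words[0] if words else ""), [line]
        elif name is not None:
            lines.append(line)
    if name is not None:
        yield name, "".join(lines)


def split_by_mapping(fasta_handle, sam_handle):
    """Return (mapped_text, unmapped_text) for the FASTA records."""
    mapped = mapped_read_ids(sam_handle)
    positive, negative = [], []
    for name, text in read_fasta(fasta_handle):
        (positive if name in mapped else negative).append(text)
    return "".join(positive), "".join(negative)


# Tiny made-up inputs: read1 is mapped, read2 is flagged unmapped,
# and read3 does not appear in the SAM data at all.
sam = io.StringIO(
    "@HD\tVN:1.0\n"
    "read1\t0\tchrM\t1\t60\t4M\t*\t0\t0\tACGT\t*\n"
    "read2\t4\t*\t0\t0\t*\t*\t0\t0\tTTTT\t*\n"
)
fasta = io.StringIO(">read1\nACGT\n>read2\nTTTT\n>read3\nGGGG\n")
mapped_text, unmapped_text = split_by_mapping(fasta, sam)
print(mapped_text, end="")
print(unmapped_text, end="")
```

In this sketch a read absent from the SAM data is counted as not mapping, which suits the contaminant-removal use case where only reads that hit the contaminant should be set aside.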

This tool is available from the Galaxy Tool Shed at:

* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_mapping

See also related tools:

* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_select_by_id
* http://toolshed.g2.bx.psu.edu/view/peterjc/seq_rename


Automated Installation
======================

Installation via the Galaxy Tool Shed should be straightforward: it should
automatically install the Biopython and samtools dependencies, then install
this tool and run its unit tests.


Manual Installation
===================

There are just two files to install to use this tool from within Galaxy:

* ``seq_filter_by_mapping.py`` (the Python script)
* ``seq_filter_by_mapping.xml`` (the Galaxy tool definition)

The suggested location is a dedicated ``tools/seq_filter_by_mapping/`` folder.

You will also need to modify the ``tool_conf.xml`` file to tell Galaxy to offer the
tool. One suggested location is in the filters section. Simply add the line::

    <tool file="seq_filter_by_mapping/seq_filter_by_mapping.xml" />

If you wish to run the unit tests, also move/copy the ``test-data/`` files
under Galaxy's ``test-data/`` folder. Then::

    $ ./run_tests.sh -id seq_filter_by_mapping

You will also need to install Biopython 1.54 or later. That's it.
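
The tool configuration edit described above could also be scripted. The following is only a sketch under stated assumptions: the helper ``add_tool_entry`` is hypothetical, the section ``id`` of ``filter`` and the example toolbox are made up for illustration, and your own configuration file may be laid out differently. Note that ``xml.etree.ElementTree`` discards XML comments when parsing, so keep a backup before rewriting a real file.

```python
import xml.etree.ElementTree as ET

TOOL_FILE = "seq_filter_by_mapping/seq_filter_by_mapping.xml"


def add_tool_entry(conf_xml, section_id, tool_file=TOOL_FILE):
    """Return conf_xml with a <tool file="..."/> entry in the given section."""
    root = ET.fromstring(conf_xml)
    section = root.find("./section[@id='%s']" % section_id)
    if section is None:
        raise ValueError("No <section id=%r> found" % section_id)
    # Avoid adding a duplicate if the tool is already registered
    already = any(t.get("file") == tool_file for t in section.findall("tool"))
    if not already:
        ET.SubElement(section, "tool", {"file": tool_file})
    return ET.tostring(root, encoding="unicode")


# Made-up toolbox fragment; the section id and existing tool are assumptions.
example = """<toolbox>
  <section id="filter" name="Filter and Sort">
    <tool file="filters/sorter.xml" />
  </section>
</toolbox>"""

updated = add_tool_entry(example, "filter")
print(updated)
```

Calling the helper a second time on its own output leaves the configuration unchanged, since the entry is only added when it is missing.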


History
=======

======= ======================================================================
Version Changes
------- ----------------------------------------------------------------------
v0.0.1  - Initial version.
v0.0.2  - Fixed some error messages.
v0.0.3  - Report counts for FASTQ as done for FASTA and SFF files.
v0.0.4  - Use the ``format_source=...`` tag.
        - Reorder XML elements (internal change only).
        - Planemo for Tool Shed upload (``.shed.yml``, internal change only).
v0.0.5  - Python script cleanups (internal change only).
        - Depends on Biopython 1.67 via legacy Tool Shed package or bioconda.
        - Use ``<command detect_errors="aggressive">`` (internal change only).
        - Single quote command line arguments (internal change only).
v0.0.6  - Python 3 compatible print function.
v0.0.7  - Script works on Python 2 and 3 (fixed input file mode).
v0.0.8  - Bumped Biopython dependency version for Python 3 fixes.
======= ======================================================================


Developers
==========

Development is on this GitHub repository:
https://github.com/peterjc/pico_galaxy/tree/master/tools/seq_filter_by_mapping

Much of the code was copied from my older tool:
https://github.com/peterjc/pico_galaxy/tree/master/tools/seq_filter_by_id

For pushing a release to the test or main "Galaxy Tool Shed", use the following
Planemo commands (which require that you have set your Tool Shed access details
in ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::

    $ planemo shed_update -t testtoolshed --check_diff tools/seq_filter_by_mapping/
    ...

or::

    $ planemo shed_update -t toolshed --check_diff tools/seq_filter_by_mapping/
    ...

To just build and check the tarball, use::

    $ planemo shed_upload --tar_only tools/seq_filter_by_mapping/
    ...
    $ tar -tzf shed_upload.tar.gz
    test-data/SRR639755_mito_pairs.fastq.gz
    test-data/SRR639755_sample_by_coord.sam
    test-data/SRR639755_sample_lax.fastq
    test-data/SRR639755_sample_strict.fastq
    tools/seq_filter_by_mapping/README.rst
    tools/seq_filter_by_mapping/seq_filter_by_mapping.py
    tools/seq_filter_by_mapping/seq_filter_by_mapping.xml
    tools/seq_filter_by_mapping/tool_dependencies.xml
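
The manual check of the tarball listing above could be automated along these lines. This is a sketch rather than part of the release process: the helper names are hypothetical, only the four files under ``tools/seq_filter_by_mapping/`` from the listing are checked, and the demonstration builds a small in-memory tarball of placeholder files instead of reading a real ``shed_upload.tar.gz``.

```python
import io
import tarfile

# Tool files taken from the listing above (test data omitted for brevity)
EXPECTED = {
    "tools/seq_filter_by_mapping/README.rst",
    "tools/seq_filter_by_mapping/seq_filter_by_mapping.py",
    "tools/seq_filter_by_mapping/seq_filter_by_mapping.xml",
    "tools/seq_filter_by_mapping/tool_dependencies.xml",
}


def missing_members(tar_file_obj, expected=EXPECTED):
    """Return the expected names absent from a gzip-compressed tarball."""
    with tarfile.open(fileobj=tar_file_obj, mode="r:gz") as tar:
        names = set(tar.getnames())
    return sorted(set(expected) - names)


def build_demo_tarball(names):
    """Build an in-memory .tar.gz holding placeholder files (demo only)."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as tar:
        for name in names:
            data = b"placeholder\n"
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    buffer.seek(0)
    return buffer


# Deliberately leave out the last expected file to show a failing check
demo = build_demo_tarball(sorted(EXPECTED)[:3])
print(missing_members(demo))
```

For a real upload, the same function could be given a handle from ``open("shed_upload.tar.gz", "rb")``; an empty list would then mean every expected tool file is present.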


Licence (MIT)
=============

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in
all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
THE SOFTWARE.