annotate tools/fastq_paired_unpaired/README.rst @ 10:4851620acfda draft

Bump Biopython dependency
author peterjc
date Thu, 30 Nov 2023 09:50:55 +0000
parents 8cbc866b72ce
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
1 Galaxy tool to divide FASTQ files into paired and unpaired reads
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
2 ================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
3
10
4851620acfda Bump Biopython dependency
peterjc
parents: 8
diff changeset
4 This tool is copyright 2010-2023 by Peter Cock, The James Hutton Institute
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
6 See the licence text below (MIT licence).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
7
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
8 This tool is a short Python script which divides a FASTQ file into paired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
9 reads, and single or orphan reads. You can have separate files for the
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
10 forward/reverse reads, or have them interleaved in a single file.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
11
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
12 Note that the FASTQ variant is unimportant (Sanger, Solexa, Illumina, or even
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
13 Color Space should all work equally well).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
14
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
15 This tool is available from the Galaxy Tool Shed at:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
16 http://toolshed.g2.bx.psu.edu/view/peterjc/fastq_paired_unpaired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
17
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
18
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
19 Automated Installation
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
20 ======================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
21
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
22 This should be straightforward, Galaxy should automatically download and install
5
b38bbcbd458d v0.1.2 belatedly declare Biopython dependency
peterjc
parents: 4
diff changeset
23 the tool from the Galaxy Tool Shed, and Biopython, and allow you to run the unit
b38bbcbd458d v0.1.2 belatedly declare Biopython dependency
peterjc
parents: 4
diff changeset
24 tests.
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
25
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
26
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
27 Manual Installation
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
28 ===================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
29
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
30 There are just two files to install:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
31
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
32 * ``fastq_paired_unpaired.py`` (the Python script)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
33 * ``fastq_paired_unpaired.xml`` (the Galaxy tool definition)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
34
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
35 The suggested location is in the Galaxy folder tools/fastq next to other FASTQ
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
36 tools provided with Galaxy.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
37
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
38 You will also need to modify the ``tools_conf.xml`` file to tell Galaxy to offer
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
39 the tool. One suggested location is next to the ``fastq_filter.xml`` entry, or use
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
40 a dedicated folder like ``tools/fastq_paired_unpaired``. Then simply add the line::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
41
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
42 <tool file="fastq_paired_unpaired/fastq_paired_unpaired.xml" />
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
43
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
44 That's it.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
45
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
46
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
47 History
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
48 =======
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
49
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
50 ======= ======================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
51 Version Changes
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
52 ------- ----------------------------------------------------------------------
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
53 v0.0.1 - Initial version, using Biopython
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
54 v0.0.2 - Help text; cope with multiple pairs per template
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
55 v0.0.3 - Galaxy XML wrappers added
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
56 v0.0.4 - Use Galaxy library to handle FASTQ files (avoid Biopython dependency)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
57 v0.0.5 - Handle Illumina 1.8 style pair names
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
58 v0.0.6 - Record script version when run from Galaxy
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
59 - Added unit test (FASTQ file using Sanger naming)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
60 v0.0.7 - Link to Tool Shed added to help text and this documentation.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
61 v0.0.8 - Use reStructuredText for this README file.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
62 - Adopt standard MIT License.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
63 - Updated citation information (Cock et al. 2013).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
64 - Development moved to GitHub.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
65 v0.0.9 - Renamed folder and adopted README.rst naming.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
66 - Removed some unused code in the Python script.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
67 v0.1.0 - Switch to using Biopython (easier to use script outside of Galaxy).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
68 - Leaves FASTQ plus lines blank (smaller output files).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
69 - Tool definition now embeds citation information.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
70 v0.1.1 - Reorder XML elements (internal change only).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
71 - Use ``format_source=...`` tag.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
72 - Planemo for Tool Shed upload (``.shed.yml``, internal change only).
5
b38bbcbd458d v0.1.2 belatedly declare Biopython dependency
peterjc
parents: 4
diff changeset
73 v0.1.2 - Belatedly declare Biopython dependency via Tool Shed.
6
f396701fbf32 v0.1.3 Depends on Biopython 1.67 via Tool Shed package or bioconda.
peterjc
parents: 5
diff changeset
74 v0.1.3 - Minor internal changes to Python script for error reporting & style.
f396701fbf32 v0.1.3 Depends on Biopython 1.67 via Tool Shed package or bioconda.
peterjc
parents: 5
diff changeset
75 - Updated to point at Biopython 1.67 (latest version in Tool Shed).
f396701fbf32 v0.1.3 Depends on Biopython 1.67 via Tool Shed package or bioconda.
peterjc
parents: 5
diff changeset
76 - Explicit dependency on ``galaxy_sequence_utils``.
7
2709a0f065c9 v0.1.4 Internal changes to command line handling
peterjc
parents: 6
diff changeset
77 v0.1.4 - Use ``<command detect_errors="aggressive">`` (internal change only).
2709a0f065c9 v0.1.4 Internal changes to command line handling
peterjc
parents: 6
diff changeset
78 - Single quote command line arguments (internal change only).
10
4851620acfda Bump Biopython dependency
peterjc
parents: 8
diff changeset
79 v0.1.5 - Bump Biopython dependency version for Python 3 fixes.
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
80 ======= ======================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
81
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
82
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
83 Developers
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
84 ==========
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
85
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
86 This script and other tools for filtering FASTA, FASTQ and SFF files were
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
87 initially developed on the following hg branch:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
88 http://bitbucket.org/peterjc/galaxy-central/src/fasta_filter
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
89
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
90 Development has now moved to a dedicated GitHub repository:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
91 https://github.com/peterjc/pico_galaxy/tree/master/tools/fastq_paired_unpaired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
92
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
93 For pushing a release to the test or main "Galaxy Tool Shed", use the following
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
94 Planemo commands (which requires you have set your Tool Shed access details in
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
95 ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
96
6
f396701fbf32 v0.1.3 Depends on Biopython 1.67 via Tool Shed package or bioconda.
peterjc
parents: 5
diff changeset
97 $ planemo shed_update -t testtoolshed --check_diff tools/fastq_paired_unpaired/
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
98 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
99
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
100 or::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
101
6
f396701fbf32 v0.1.3 Depends on Biopython 1.67 via Tool Shed package or bioconda.
peterjc
parents: 5
diff changeset
102 $ planemo shed_update -t toolshed --check_diff tools/fastq_paired_unpaired/
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
103 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
104
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
105 To just build and check the tar ball, use::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
106
6
f396701fbf32 v0.1.3 Depends on Biopython 1.67 via Tool Shed package or bioconda.
peterjc
parents: 5
diff changeset
107 $ planemo shed_upload --tar_only tools/fastq_paired_unpaired/
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
108 ...
8
8cbc866b72ce "Update all the pico_galaxy tools on main Tool Shed"
peterjc
parents: 7
diff changeset
109 $ tar -tzf shed_upload.tar.gz
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
110 test-data/sanger-pairs-forward.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
111 test-data/sanger-pairs-interleaved.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
112 test-data/sanger-pairs-mixed.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
113 test-data/sanger-pairs-reverse.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
114 test-data/sanger-pairs-singles.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
115 tools/fastq_paired_unpaired/README.rst
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
116 tools/fastq_paired_unpaired/fastq_paired_unpaired.py
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
117 tools/fastq_paired_unpaired/fastq_paired_unpaired.xml
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
118
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
119
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
120 Licence (MIT)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
121 =============
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
122
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
123 Permission is hereby granted, free of charge, to any person obtaining a copy
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
124 of this software and associated documentation files (the "Software"), to deal
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
125 in the Software without restriction, including without limitation the rights
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
126 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
127 copies of the Software, and to permit persons to whom the Software is
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
128 furnished to do so, subject to the following conditions:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
129
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
130 The above copyright notice and this permission notice shall be included in
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
131 all copies or substantial portions of the Software.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
132
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
133 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
134 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
135 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
136 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
137 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
138 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
139 THE SOFTWARE.