annotate tools/fastq_paired_unpaired/README.rst @ 5:b38bbcbd458d draft

v0.1.2 belatedly declare Biopython dependency
author peterjc
date Wed, 05 Aug 2015 11:17:49 -0400
parents 09f9f0e29e47
children f396701fbf32
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
1 Galaxy tool to divide FASTQ files into paired and unpaired reads
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
2 ================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
3
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
4 This tool is copyright 2010-2015 by Peter Cock, The James Hutton Institute
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
6 See the licence text below (MIT licence).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
7
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
8 This tool is a short Python script which divides a FASTQ file into paired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
9 reads, and single or orphan reads. You can have separate files for the
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
10 forward/reverse reads, or have them interleaved in a single file.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
11
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
12 Note that the FASTQ variant is unimportant (Sanger, Solexa, Illumina, or even
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
13 Color Space should all work equally well).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
14
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
15 This tool is available from the Galaxy Tool Shed at:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
16 http://toolshed.g2.bx.psu.edu/view/peterjc/fastq_paired_unpaired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
17
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
18
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
19 Automated Installation
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
20 ======================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
21
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
22 This should be straightforward, Galaxy should automatically download and install
5
b38bbcbd458d v0.1.2 belatedly declare Biopython dependency
peterjc
parents: 4
diff changeset
23 the tool from the Galaxy Tool Shed, and Biopython, and allow you to run the unit
b38bbcbd458d v0.1.2 belatedly declare Biopython dependency
peterjc
parents: 4
diff changeset
24 tests.
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
25
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
26
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
27 Manual Installation
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
28 ===================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
29
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
30 There are just two files to install:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
31
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
32 * ``fastq_paired_unpaired.py`` (the Python script)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
33 * ``fastq_paired_unpaired.xml`` (the Galaxy tool definition)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
34
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
35 The suggested location is in the Galaxy folder tools/fastq next to other FASTQ
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
36 tools provided with Galaxy.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
37
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
38 You will also need to modify the ``tools_conf.xml`` file to tell Galaxy to offer
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
39 the tool. One suggested location is next to the ``fastq_filter.xml`` entry, or use
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
40 a dedicated folder like ``tools/fastq_paired_unpaired``. Then simply add the line::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
41
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
42 <tool file="fastq_paired_unpaired/fastq_paired_unpaired.xml" />
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
43
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
44 That's it.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
45
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
46
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
47 History
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
48 =======
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
49
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
50 ======= ======================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
51 Version Changes
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
52 ------- ----------------------------------------------------------------------
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
53 v0.0.1 - Initial version, using Biopython
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
54 v0.0.2 - Help text; cope with multiple pairs per template
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
55 v0.0.3 - Galaxy XML wrappers added
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
56 v0.0.4 - Use Galaxy library to handle FASTQ files (avoid Biopython dependency)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
57 v0.0.5 - Handle Illumina 1.8 style pair names
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
58 v0.0.6 - Record script version when run from Galaxy
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
59 - Added unit test (FASTQ file using Sanger naming)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
60 v0.0.7 - Link to Tool Shed added to help text and this documentation.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
61 v0.0.8 - Use reStructuredText for this README file.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
62 - Adopt standard MIT License.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
63 - Updated citation information (Cock et al. 2013).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
64 - Development moved to GitHub.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
65 v0.0.9 - Renamed folder and adopted README.rst naming.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
66 - Removed some unused code in the Python script.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
67 v0.1.0 - Switch to using Biopython (easier to use script outside of Galaxy).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
68 - Leaves FASTQ plus lines blank (smaller output files).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
69 - Tool definition now embeds citation information.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
70 v0.1.1 - Reorder XML elements (internal change only).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
71 - Use ``format_source=...`` tag.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
72 - Planemo for Tool Shed upload (``.shed.yml``, internal change only).
5
b38bbcbd458d v0.1.2 belatedly declare Biopython dependency
peterjc
parents: 4
diff changeset
73 v0.1.2 - Belatedly declare Biopython dependency via Tool Shed.
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
74 ======= ======================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
75
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
76
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
77 Developers
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
78 ==========
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
79
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
80 This script and other tools for filtering FASTA, FASTQ and SFF files were
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
81 initially developed on the following hg branch:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
82 http://bitbucket.org/peterjc/galaxy-central/src/fasta_filter
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
83
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
84 Development has now moved to a dedicated GitHub repository:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
85 https://github.com/peterjc/pico_galaxy/tree/master/tools/fastq_paired_unpaired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
86
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
87 For pushing a release to the test or main "Galaxy Tool Shed", use the following
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
88 Planemo commands (which requires you have set your Tool Shed access details in
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
89 ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
90
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
91 $ planemo shed_update --shed_target testtoolshed --check_diff ~/repositories/pico_galaxy/tools/fastq_paired_unpaired/
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
92 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
93
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
94 or::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
95
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
96 $ planemo shed_update --shed_target toolshed --check_diff ~/repositories/pico_galaxy/tools/fastq_paired_unpaired/
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
97 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
98
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
99 To just build and check the tar ball, use::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
100
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
101 $ planemo shed_upload --tar_only ~/repositories/pico_galaxy/tools/fastq_paired_unpaired/
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
102 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
103 $ tar -tzf shed_upload.tar.gz
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
104 test-data/sanger-pairs-forward.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
105 test-data/sanger-pairs-interleaved.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
106 test-data/sanger-pairs-mixed.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
107 test-data/sanger-pairs-reverse.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
108 test-data/sanger-pairs-singles.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
109 tools/fastq_paired_unpaired/README.rst
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
110 tools/fastq_paired_unpaired/fastq_paired_unpaired.py
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
111 tools/fastq_paired_unpaired/fastq_paired_unpaired.xml
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
112
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
113
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
114 Licence (MIT)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
115 =============
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
116
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
117 Permission is hereby granted, free of charge, to any person obtaining a copy
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
118 of this software and associated documentation files (the "Software"), to deal
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
119 in the Software without restriction, including without limitation the rights
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
120 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
121 copies of the Software, and to permit persons to whom the Software is
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
122 furnished to do so, subject to the following conditions:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
123
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
124 The above copyright notice and this permission notice shall be included in
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
125 all copies or substantial portions of the Software.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
126
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
127 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
128 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
129 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
130 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
131 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
132 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
133 THE SOFTWARE.