annotate tools/fastq_paired_unpaired/README.rst @ 4:09f9f0e29e47 draft

v0.0.6 use format_source; v0.0.5 error handling & citation
author peterjc
date Wed, 05 Aug 2015 11:06:38 -0400
parents
children b38bbcbd458d
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
1 Galaxy tool to divide FASTQ files into paired and unpaired reads
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
2 ================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
3
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
4 This tool is copyright 2010-2015 by Peter Cock, The James Hutton Institute
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
6 See the licence text below (MIT licence).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
7
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
8 This tool is a short Python script which divides a FASTQ file into paired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
9 reads, and single or orphan reads. You can have separate files for the
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
10 forward/reverse reads, or have them interleaved in a single file.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
11
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
12 Note that the FASTQ variant is unimportant (Sanger, Solexa, Illumina, or even
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
13 Color Space should all work equally well).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
14
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
15 This tool is available from the Galaxy Tool Shed at:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
16 http://toolshed.g2.bx.psu.edu/view/peterjc/fastq_paired_unpaired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
17
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
18
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
19 Automated Installation
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
20 ======================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
21
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
22 This should be straightforward, Galaxy should automatically download and install
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
23 the tool from the Galaxy Tool Shed, and run the unit tests
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
24
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
25
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
26 Manual Installation
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
27 ===================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
28
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
29 There are just two files to install:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
30
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
31 * ``fastq_paired_unpaired.py`` (the Python script)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
32 * ``fastq_paired_unpaired.xml`` (the Galaxy tool definition)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
33
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
34 The suggested location is in the Galaxy folder tools/fastq next to other FASTQ
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
35 tools provided with Galaxy.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
36
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
37 You will also need to modify the ``tools_conf.xml`` file to tell Galaxy to offer
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
38 the tool. One suggested location is next to the ``fastq_filter.xml`` entry, or use
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
39 a dedicated folder like ``tools/fastq_paired_unpaired``. Then simply add the line::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
40
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
41 <tool file="fastq_paired_unpaired/fastq_paired_unpaired.xml" />
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
42
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
43 That's it.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
44
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
45
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
46 History
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
47 =======
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
48
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
49 ======= ======================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
50 Version Changes
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
51 ------- ----------------------------------------------------------------------
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
52 v0.0.1 - Initial version, using Biopython
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
53 v0.0.2 - Help text; cope with multiple pairs per template
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
54 v0.0.3 - Galaxy XML wrappers added
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
55 v0.0.4 - Use Galaxy library to handle FASTQ files (avoid Biopython dependency)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
56 v0.0.5 - Handle Illumina 1.8 style pair names
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
57 v0.0.6 - Record script version when run from Galaxy
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
58 - Added unit test (FASTQ file using Sanger naming)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
59 v0.0.7 - Link to Tool Shed added to help text and this documentation.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
60 v0.0.8 - Use reStructuredText for this README file.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
61 - Adopt standard MIT License.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
62 - Updated citation information (Cock et al. 2013).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
63 - Development moved to GitHub.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
64 v0.0.9 - Renamed folder and adopted README.rst naming.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
65 - Removed some unused code in the Python script.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
66 v0.1.0 - Switch to using Biopython (easier to use script outside of Galaxy).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
67 - Leaves FASTQ plus lines blank (smaller output files).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
68 - Tool definition now embeds citation information.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
69 v0.1.1 - Reorder XML elements (internal change only).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
70 - Use ``format_source=...`` tag.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
71 - Planemo for Tool Shed upload (``.shed.yml``, internal change only).
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
72 ======= ======================================================================
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
73
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
74
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
75 Developers
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
76 ==========
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
77
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
78 This script and other tools for filtering FASTA, FASTQ and SFF files were
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
79 initially developed on the following hg branch:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
80 http://bitbucket.org/peterjc/galaxy-central/src/fasta_filter
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
81
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
82 Development has now moved to a dedicated GitHub repository:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
83 https://github.com/peterjc/pico_galaxy/tree/master/tools/fastq_paired_unpaired
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
84
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
85 For pushing a release to the test or main "Galaxy Tool Shed", use the following
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
86 Planemo commands (which requires you have set your Tool Shed access details in
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
87 ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
88
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
89 $ planemo shed_update --shed_target testtoolshed --check_diff ~/repositories/pico_galaxy/tools/fastq_paired_unpaired/
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
90 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
91
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
92 or::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
93
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
94 $ planemo shed_update --shed_target toolshed --check_diff ~/repositories/pico_galaxy/tools/fastq_paired_unpaired/
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
95 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
96
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
97 To just build and check the tar ball, use::
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
98
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
99 $ planemo shed_upload --tar_only ~/repositories/pico_galaxy/tools/fastq_paired_unpaired/
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
100 ...
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
101 $ tar -tzf shed_upload.tar.gz
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
102 test-data/sanger-pairs-forward.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
103 test-data/sanger-pairs-interleaved.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
104 test-data/sanger-pairs-mixed.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
105 test-data/sanger-pairs-reverse.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
106 test-data/sanger-pairs-singles.fastq
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
107 tools/fastq_paired_unpaired/README.rst
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
108 tools/fastq_paired_unpaired/fastq_paired_unpaired.py
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
109 tools/fastq_paired_unpaired/fastq_paired_unpaired.xml
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
110
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
111
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
112 Licence (MIT)
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
113 =============
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
114
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
115 Permission is hereby granted, free of charge, to any person obtaining a copy
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
116 of this software and associated documentation files (the "Software"), to deal
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
117 in the Software without restriction, including without limitation the rights
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
118 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
119 copies of the Software, and to permit persons to whom the Software is
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
120 furnished to do so, subject to the following conditions:
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
121
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
122 The above copyright notice and this permission notice shall be included in
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
123 all copies or substantial portions of the Software.
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
124
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
125 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
126 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
127 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
128 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
129 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
130 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
09f9f0e29e47 v0.0.6 use format_source; v0.0.5 error handling & citation
peterjc
parents:
diff changeset
131 THE SOFTWARE.