Mercurial > repos > peterjc > seq_filter_by_id
annotate tools/filters/seq_filter_by_id.txt @ 2:abdd608c869b draft
Uploaded v0.0.5, checked script return code for errors X copes with a FASTA entry missing an ID.
author | peterjc |
---|---|
date | Wed, 24 Apr 2013 11:34:12 -0400 |
parents | 262f08104540 |
children |
rev | line source |
---|---|
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
1 Galaxy tool to filter FASTA, FASTQ or SFF sequences by ID |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
2 ========================================================= |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
3 |
1
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
4 This tool is copyright 2010-2013 by Peter Cock, The James Hutton Institute |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved. |
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
6 See the licence text below. |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
7 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
8 This tool is a short Python script (using both the Galaxy and Biopython library |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
9 functions) which divides a FASTA, FASTQ, or SFF file in two, those sequences with |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
10 or without an ID present in the specified column(s) of a tabular file. Example uses |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
11 include filtering based on search results from a tool like NCBI BLAST before |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
12 assembly. |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
13 |
1
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
14 |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
15 Installation |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
16 ============ |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
17 |
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
18 There are just two files to install: |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
19 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
20 * seq_filter_by_id.py (the Python script) |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
21 * seq_filter_by_id.xml (the Galaxy tool definition) |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
22 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
23 The suggested location is in the Galaxy folder tools/filters next to the tool |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
24 for calling sff_extract.py for converting SFF to FASTQ or FASTA + QUAL. |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
25 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
26 You will also need to modify the tools_conf.xml file to tell Galaxy to offer the |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
27 tool. One suggested location is in the filters section. Simply add the line: |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
28 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
29 <tool file="filters/sff_filter_by_id.xml" /> |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
30 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
31 You will also need to install Biopython 1.54 or later. That's it. |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
32 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
33 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
34 History |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
35 ======= |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
36 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
37 v0.0.1 - Initial version, combining three separate scripts for each file format. |
1
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
38 v0.0.4 - Record script version when run from Galaxy. |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
39 - Faster FASTA code which preserves the original line wrapping. |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
40 - Basic unit test included. |
2
abdd608c869b
Uploaded v0.0.5, checked script return code for errors X copes with a FASTA entry missing an ID.
peterjc
parents:
1
diff
changeset
|
41 v0.0.5 - Check for errors using Python script's return code. |
abdd608c869b
Uploaded v0.0.5, checked script return code for errors X copes with a FASTA entry missing an ID.
peterjc
parents:
1
diff
changeset
|
42 - Cope with malformed FASTA entries without an identifier. |
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
43 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
44 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
45 Developers |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
46 ========== |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
47 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
48 This script and related tools are being developed on the following hg branch: |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
49 http://bitbucket.org/peterjc/galaxy-central/src/tools |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
50 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
51 This incorporates the previously used hg branch: |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
52 http://bitbucket.org/peterjc/galaxy-central/src/fasta_filter |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
53 |
1
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
54 For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball use |
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
55 the following command from the Galaxy root folder: |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
56 |
1
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
57 $ tar -czf seq_filter_by_id.tar.gz tools/filters/seq_filter_by_id.* test-data/k12_ten_proteins.fasta test-data/k12_hypothetical.fasta test-data/k12_hypothetical.tabular |
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
58 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
59 Check this worked: |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
60 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
61 $ tar -tzf seq_filter_by_id.tar.gz |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
62 filter/seq_filter_by_id.py |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
63 filter/seq_filter_by_id.txt |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
64 filter/seq_filter_by_id.xml |
1
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
65 test-data/k12_ten_proteins.fasta |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
66 test-data/k12_hypothetical.fasta |
262f08104540
Uploaded v0.0.4 which includes a unit test and is faster at filtering FASTA files with large records (e.g. whole chromosomes)
peterjc
parents:
0
diff
changeset
|
67 test-data/k12_hypothetical.tabular |
0
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
68 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
69 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
70 Licence (MIT/BSD style) |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
71 ======================= |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
72 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
73 Permission to use, copy, modify, and distribute this software and its |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
74 documentation with or without modifications and for any purpose and |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
75 without fee is hereby granted, provided that any copyright notices |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
76 appear in all copies and that both those copyright notices and this |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
77 permission notice appear in supporting documentation, and that the |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
78 names of the contributors or copyright holders not be used in |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
79 advertising or publicity pertaining to distribution of the software |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
80 without specific prior permission. |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
81 |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
82 THE CONTRIBUTORS AND COPYRIGHT HOLDERS OF THIS SOFTWARE DISCLAIM ALL |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
83 WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
84 WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL THE |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
85 CONTRIBUTORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY SPECIAL, INDIRECT |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
86 OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
87 OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
88 OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE |
5844f6a450ed
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
89 OR PERFORMANCE OF THIS SOFTWARE. |