annotate README.rst @ 4:5e66e9fa2d3f draft default tip

Uploaded revision to update the citation
author peterjc
date Fri, 25 Oct 2013 10:24:35 -0400
parents 72f03c2102ee
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
1 This is package is a Galaxy workflow for the identification of candidate
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
2 secreted proteins from a given protein FASTA file.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
3
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
4 It runs SignalP v3.0 (Bendtsen et al. 2004) and selects only proteins with a
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
5 strong predicted signal peptide, and then runs TMHMM v2.0 (Krogh et al. 2001)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
6 on those, and selects only proteins without a predicted trans-membrane helix.
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
7 This workflow was used in Kikuchi et al. (2011), and is a simplification of
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
8 the candidate effector protocol described in Jones et al. (2009).
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
9
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
10 See http://www.galaxyproject.org for information about the Galaxy Project.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
11
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
12
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
13 Availability
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
14 ============
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
15
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
16 This workflow is available to download and/or install from the main
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
17 Galaxy Tool Shed:
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
18
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
19 http://toolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
20
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
21 Test releases (which should not normally be used) are on the Test Tool Shed:
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
22
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
23 http://testtoolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
24
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
25 Development is being done on github here:
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
26
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
27 https://github.com/peterjc/pico_galaxy/tree/master/workflows/secreted_protein_workflow
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
28
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
29
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
30 Sample Data
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
31 ===========
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
32
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
33 This workflow was developed and run on several nematode species. For example,
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
34 try the protein set for *Bursaphelenchus xylophilus* (Kikuchi et al. 2011):
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
35
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
36 ftp://ftp.sanger.ac.uk/pub/pathogens/Bursaphelenchus/xylophilus/Assembly-v1.2/BUX.v1.2.genedb.protein.fa.gz
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
37
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
38 You can upload this directly into Galaxy via this URL. Galaxy will handle
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
39 removing the gzip compression to give you the FASTA protein file which has
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
40 18,074 sequences. The expected result (selecting organism type Eukaryote)
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
41 is a FASTA protein file of 2,297 predicted secreted protein sequences.
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
42
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
43
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
44 Citation
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
45 ========
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
46
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
47 If you use this workflow directly, or a derivative of it, in work leading
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
48 to a scientific publication, please cite:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
49
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
50 Cock, P.J.A. and Pritchard, L. (2014). Galaxy as a platform for identifying
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
51 candidate pathogen effectors. Chapter 1 in "Plant-Pathogen Interactions:
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
52 Methods and Protocols (Second Edition)"; P. Birch, J. Jones, and J.I. Bos, eds.
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
53 Methods in Molecular Biology. Humana Press, Springer. ISBN 978-1-62703-985-7.
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
54 http://www.springer.com/life+sciences/plant+sciences/book/978-1-62703-985-7
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
55
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
56 Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013).
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
57 Galaxy tools and workflows for sequence analysis with applications
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
58 in molecular plant pathology. PeerJ 1:e167
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
59 http://dx.doi.org/10.7717/peerj.167
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
60
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
61 Bendtsen, J.D., Nielsen, H., von Heijne, G., Brunak, S. (2004)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
62 Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–95.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
63 http://dx.doi.org/10.1016/j.jmb.2004.05.028
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
64
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
65 Krogh, A., Larsson, B., von Heijne, G., Sonnhammer, E. (2001)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
66 Predicting transmembrane protein topology with a hidden Markov model:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
67 application to complete genomes. J Mol Biol 305: 567- 580.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
68 http://dx.doi.org/10.1006/jmbi.2000.4315
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
69
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
70
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
71 Additional References
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
72 =====================
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
73
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
74 Kikuchi, T., Cotton, J.A., Dalzell, J.J., Hasegawa. K., et al. (2011)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
75 Genomic insights into the origin of parasitism in the emerging plant
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
76 pathogen *Bursaphelenchus xylophilus*. PLoS Pathog 7: e1002219.
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
77 http://dx.doi.org/10.1371/journal.ppat.1002219
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
78
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
79 Jones, J.T., Kumar, A., Pylypenko, L.A., Thirugnanasambandam, A., et al. (2009)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
80 Identification and functional characterization of effectors in expressed
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
81 sequence tags from various life cycle stages of the potato cyst nematode
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
82 *Globodera pallida*. Mol Plant Pathol 10: 815–28.
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
83 http://dx.doi.org/10.1111/j.1364-3703.2009.00585.x
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
84
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
85
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
86 Dependencies
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
87 ============
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
88
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
89 These dependencies should be resolved automatically via the Galaxy Tool Shed:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
90
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
91 * http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
92 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
93
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
94 However, at the time of writing those Galaxy tools have their own
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
95 dependencies required for this workflow which require manual
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
96 installation (SignalP v3.0 and TMHMM v2.0).
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
97
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
98
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
99 History
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
100 =======
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
101
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
102 ======= ======================================================================
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
103 Version Changes
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
104 ------- ----------------------------------------------------------------------
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
105 v0.0.1 - Initial release to Tool Shed (May, 2013)
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
106 - Expanded README file to include example data
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
107 v0.0.2 - Updated versions of the tools used, inclulding core Galaxy Filter
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
108 tool to avoid warning about new ``header_lines`` parameter.
3
72f03c2102ee Uploaded v0.0.2b, adding link to Tool Shed in the workflow annotation (so that end users can find the README file).
peterjc
parents: 2
diff changeset
109 - Added link to Tool Shed in the workflow annotation explaining there
72f03c2102ee Uploaded v0.0.2b, adding link to Tool Shed in the workflow annotation (so that end users can find the README file).
peterjc
parents: 2
diff changeset
110 is a README file with sample data, and a requested citation.
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
111 ======= ======================================================================
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
112
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
113
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
114 Developers
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
115 ==========
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
116
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
117 This workflow is under source code control here:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
118
4
5e66e9fa2d3f Uploaded revision to update the citation
peterjc
parents: 3
diff changeset
119 https://github.com/peterjc/pico_galaxy/tree/master/workflows/secreted_protein_workflow
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
120
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
121 To prepare the tar-ball for uploading to the Tool Shed, I use this:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
122
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
123 $ tar -cf secreted_protein_workflow.tar.gz README.rst repository_dependencies.xml secreted_protein_workflow.ga
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
124
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
125 Check this,
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
126
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
127 $ tar -tzf secreted_protein_workflow.tar.gz
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
128 README.rst
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
129 repository_dependencies.xml
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
130 secreted_protein_workflow.ga