1
|
1 This is package is a Galaxy workflow for the identification of candidate
|
|
2 secreted proteins from a given protein FASTA file.
|
|
3
|
|
4 It runs SignalP v3.0 (Bendtsen et al. 2004) and selects only proteins with a
|
|
5 strong predicted signal peptide, and then runs TMHMM v2.0 (Krogh et al. 2001)
|
|
6 on those, and selects only proteins without a predicted trans-membrane helix.
|
|
7 This workflow was used in Kikuchi et al (2001), and is a simplification of
|
|
8 the candidate effector protocol described in Jones et al (2009).
|
|
9
|
|
10 See http://www.galaxyproject.org for information about the Galaxy Project.
|
|
11
|
|
12
|
|
13 Citation
|
|
14 ========
|
|
15
|
|
16 If you use this workflow directly, or a derivative of it, in work leading
|
|
17 to a scientific publication, please cite:
|
|
18
|
|
19 Cock, P.J.A. and Pritchard, L. 2013. Galaxy as a platform for identifying
|
|
20 candidate pathogen effectors. Chapter 1 in "Plant-Pathogen Interactions:
|
|
21 Methods and Protocols (Second Edition)"; Methods in Molecular Biology.
|
|
22 Humana Press, Springer. In press.
|
|
23
|
|
24 Also consider citing:
|
|
25
|
|
26 Bendtsen, J.D., Nielsen, H., von Heijne, G., Brunak, S. (2004)
|
|
27 Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–95.
|
|
28 http://dx.doi.org/10.1016/j.jmb.2004.05.028
|
|
29
|
|
30 Krogh, A., Larsson, B., von Heijne, G., Sonnhammer, E. (2001)
|
|
31 Predicting transmembrane protein topology with a hidden Markov model:
|
|
32 application to complete genomes. J Mol Biol 305: 567- 580.
|
|
33 http://dx.doi.org/10.1006/jmbi.2000.4315
|
|
34
|
|
35
|
|
36 Additional References
|
|
37 =====================
|
|
38
|
|
39 Kikuchi, T., Cotton, J.A., Dalzell, J.J., Hasegawa. K., et al. (2011)
|
|
40 Genomic insights into the origin of parasitism in the emerging plant
|
|
41 pathogen Bursaphelenchus xylophilus. PLoS Pathog 7: e1002219.
|
|
42 http://dx.doi.org/10.1371/journal.ppat.1002219
|
|
43
|
|
44 Jones, J.T., Kumar, A., Pylypenko, L.A., Thirugnanasambandam, A., et al. (2009)
|
|
45 Identification and functional characterization of effectors in expressed
|
|
46 sequence tags from various life cycle stages of the potato cyst nematode
|
|
47 Globodera pallida. Mol Plant Pathol 10: 815–28.
|
|
48 http://dx.doi.org/10.1111/j.1364-3703.2009.00585.x
|
|
49
|
|
50
|
|
51 Availability
|
|
52 ============
|
|
53
|
|
54 This workflow is available to download and/or install from the main
|
|
55 Galaxy Tool Shed:
|
|
56
|
|
57 http://toolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
|
|
58
|
|
59 Test releases (which should not normally be used) are on the Test Tool Shed:
|
|
60
|
|
61 http://testtoolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
|
|
62
|
|
63 Development is being done on github here:
|
|
64
|
|
65 https://github.com/peterjc/picobio/tree/master/galaxy_workflows/secreted_protein_workflow
|
|
66
|
|
67
|
|
68 Dependencies
|
|
69 ============
|
|
70
|
|
71 These dependencies should be resolved automatically via the Galaxy Tool Shed:
|
|
72
|
|
73 * http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp
|
|
74 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
|
|
75
|
|
76 However, at the time of writing those Galaxy tools have their own
|
|
77 dependencies required for this workflow which require manual
|
|
78 installation (SignalP v3.0 and TMHMM v2.0).
|
|
79
|
|
80
|
|
81 Developers
|
|
82 ==========
|
|
83
|
|
84 This workflow is under source code control here:
|
|
85
|
|
86 https://github.com/peterjc/picobio/tree/master/galaxy_workflows/secreted_protein_workflow
|
|
87
|
|
88 To prepare the tar-ball for uploading to the Tool Shed, I use this:
|
|
89
|
|
90 $ tar -cf secreted_protein_workflow.tar.gz README.rst repository_dependencies.xml secreted_protein_workflow.ga
|
|
91
|
|
92 Check this,
|
|
93
|
|
94 $ tar -tzf secreted_protein_workflow.tar.gz
|
|
95 README.rst
|
|
96 repository_dependencies.xml
|
|
97 secreted_protein_workflow.ga
|