annotate README.rst @ 3:72f03c2102ee draft

Uploaded v0.0.2b, adding link to Tool Shed in the workflow annotation (so that end users can find the README file).
author peterjc
date Wed, 21 Aug 2013 12:31:19 -0400
parents 3a0c0d1c388f
children 5e66e9fa2d3f
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
1 This is package is a Galaxy workflow for the identification of candidate
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
2 secreted proteins from a given protein FASTA file.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
3
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
4 It runs SignalP v3.0 (Bendtsen et al. 2004) and selects only proteins with a
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
5 strong predicted signal peptide, and then runs TMHMM v2.0 (Krogh et al. 2001)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
6 on those, and selects only proteins without a predicted trans-membrane helix.
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
7 This workflow was used in Kikuchi et al. (2011), and is a simplification of
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
8 the candidate effector protocol described in Jones et al. (2009).
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
9
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
10 See http://www.galaxyproject.org for information about the Galaxy Project.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
11
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
12
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
13 Sample Data
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
14 ===========
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
15
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
16 This workflow was developed and run on several nematode species. For example,
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
17 try the protein set for Bursaphelenchus xylophilus (Kikuchi et al. 2011):
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
18
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
19 ftp://ftp.sanger.ac.uk/pub/pathogens/Bursaphelenchus/xylophilus/Assembly-v1.2/BUX.v1.2.genedb.protein.fa.gz
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
20
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
21 You can upload this directly into Galaxy via this URL. Galaxy will handle
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
22 removing the gzip compression to give you the FASTA protein file which has
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
23 18,074 sequences. The expected result (selecting organism type Eukaryote)
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
24 is a FASTA protein file of 2,297 predicted secreted protein sequences.
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
25
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
26
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
27 Citation
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
28 ========
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
29
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
30 If you use this workflow directly, or a derivative of it, in work leading
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
31 to a scientific publication, please cite:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
32
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
33 Cock, P.J.A. and Pritchard, L. 2013. Galaxy as a platform for identifying
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
34 candidate pathogen effectors. Chapter 1 in "Plant-Pathogen Interactions:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
35 Methods and Protocols (Second Edition)"; Methods in Molecular Biology.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
36 Humana Press, Springer. In press.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
37
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
38 Also consider citing:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
39
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
40 Bendtsen, J.D., Nielsen, H., von Heijne, G., Brunak, S. (2004)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
41 Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–95.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
42 http://dx.doi.org/10.1016/j.jmb.2004.05.028
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
43
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
44 Krogh, A., Larsson, B., von Heijne, G., Sonnhammer, E. (2001)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
45 Predicting transmembrane protein topology with a hidden Markov model:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
46 application to complete genomes. J Mol Biol 305: 567- 580.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
47 http://dx.doi.org/10.1006/jmbi.2000.4315
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
48
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
49
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
50 Additional References
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
51 =====================
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
52
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
53 Kikuchi, T., Cotton, J.A., Dalzell, J.J., Hasegawa. K., et al. (2011)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
54 Genomic insights into the origin of parasitism in the emerging plant
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
55 pathogen Bursaphelenchus xylophilus. PLoS Pathog 7: e1002219.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
56 http://dx.doi.org/10.1371/journal.ppat.1002219
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
57
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
58 Jones, J.T., Kumar, A., Pylypenko, L.A., Thirugnanasambandam, A., et al. (2009)
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
59 Identification and functional characterization of effectors in expressed
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
60 sequence tags from various life cycle stages of the potato cyst nematode
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
61 Globodera pallida. Mol Plant Pathol 10: 815–28.
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
62 http://dx.doi.org/10.1111/j.1364-3703.2009.00585.x
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
63
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
64
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
65 Availability
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
66 ============
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
67
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
68 This workflow is available to download and/or install from the main
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
69 Galaxy Tool Shed:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
70
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
71 http://toolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
72
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
73 Test releases (which should not normally be used) are on the Test Tool Shed:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
74
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
75 http://testtoolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
76
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
77 Development is being done on github here:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
78
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
79 https://github.com/peterjc/picobio/tree/master/galaxy_workflows/secreted_protein_workflow
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
80
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
81
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
82 Dependencies
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
83 ============
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
84
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
85 These dependencies should be resolved automatically via the Galaxy Tool Shed:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
86
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
87 * http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
88 * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
89
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
90 However, at the time of writing those Galaxy tools have their own
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
91 dependencies required for this workflow which require manual
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
92 installation (SignalP v3.0 and TMHMM v2.0).
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
93
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
94
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
95 History
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
96 =======
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
97
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
98 ======= ======================================================================
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
99 Version Changes
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
100 ------- ----------------------------------------------------------------------
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
101 v0.0.1 - Initial release to Tool Shed (May, 2013)
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
102 - Expanded README file to include example data
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
103 v0.0.2 - Updated versions of the tools used, inclulding core Galaxy Filter
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
104 tool to avoid warning about new ``header_lines`` parameter.
3
72f03c2102ee Uploaded v0.0.2b, adding link to Tool Shed in the workflow annotation (so that end users can find the README file).
peterjc
parents: 2
diff changeset
105 - Added link to Tool Shed in the workflow annotation explaining there
72f03c2102ee Uploaded v0.0.2b, adding link to Tool Shed in the workflow annotation (so that end users can find the README file).
peterjc
parents: 2
diff changeset
106 is a README file with sample data, and a requested citation.
2
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
107 ======= ======================================================================
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
108
3a0c0d1c388f Uploaded v0.0.2, updated tool versions to solve warning from core Galaxy filter tool being updated. No functional changes.
peterjc
parents: 1
diff changeset
109
1
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
110 Developers
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
111 ==========
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
112
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
113 This workflow is under source code control here:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
114
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
115 https://github.com/peterjc/picobio/tree/master/galaxy_workflows/secreted_protein_workflow
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
116
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
117 To prepare the tar-ball for uploading to the Tool Shed, I use this:
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
118
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
119 $ tar -cf secreted_protein_workflow.tar.gz README.rst repository_dependencies.xml secreted_protein_workflow.ga
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
120
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
121 Check this,
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
122
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
123 $ tar -tzf secreted_protein_workflow.tar.gz
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
124 README.rst
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
125 repository_dependencies.xml
606da4e1d925 README file with clearer citation instructions.
peterjc
parents:
diff changeset
126 secreted_protein_workflow.ga