Mercurial > repos > ktnyt > gembassy
comparison GEMBASSY-1.0.3/doc/html/ggcsi.html @ 2:8947fca5f715 draft default tip
Uploaded
author | ktnyt |
---|---|
date | Fri, 26 Jun 2015 05:21:44 -0400 |
parents | 84a17b3fad1f |
children |
comparison
equal
deleted
inserted
replaced
1:84a17b3fad1f | 2:8947fca5f715 |
---|---|
1 <!--START OF HEADER - DON'T ALTER --> | |
2 | |
3 <HTML> | |
4 <HEAD> | |
5 <TITLE> EMBOSS: ggcsi </TITLE> | |
6 </HEAD> | |
7 <BODY BGCOLOR="#FFFFFF" text="#000000"> | |
8 | |
9 | |
10 | |
11 <table align=center border=0 cellspacing=0 cellpadding=0> | |
12 <tr><td valign=top> | |
13 <A HREF="/" ONMOUSEOVER="self.status='Go to the EMBOSS home page';return true"><img border=0 src="http://soap.g-language.org/gembassy/emboss_explorer/manual/emboss_icon.jpg" alt="" width=150 height=48></a> | |
14 </td> | |
15 <td align=left valign=middle> | |
16 <b><font size="+6"> | |
17 ggcsi | |
18 </font></b> | |
19 </td></tr> | |
20 </table> | |
21 <br> | |
22 <p> | |
23 | |
24 | |
25 <!--END OF HEADER--> | |
26 | |
27 | |
28 | |
29 | |
30 | |
31 | |
32 <H2> Function </H2> | |
33 GC Skew Index: an index for strand-specific mutational bias | |
34 <!-- | |
35 DON'T WRITE ANYTHING HERE. | |
36 IT IS DONE FOR YOU. | |
37 --> | |
38 | |
39 | |
40 | |
41 | |
42 <H2>Description</H2> | |
43 <p> | |
44 ggcsi calculates the GC Skew Index (GCSI) of the given circular bacterial<br /> | |
45 genome. GCSI quantifies the degree of GC Skew. In other words, this index<br /> | |
46 represents the degree of strand-specific mutational bias in bacterial<br /> | |
47 genomes, caused by replicational selection. <br /> | |
48 GCSI is calculated by the following formula:<br /> | |
49 <br /> | |
50 GCSI = sqrt((SA/6000) * (dist/600))<br /> | |
51 <br /> | |
52 where SA is the spectral amplitude of Fourier power spectrum at 1Hz,<br /> | |
53 and dist is the normalized Euclidean distance between the vertices of <br /> | |
54 cumulative GC skew.<br /> | |
55 <br /> | |
56 GCSI ranges from 0 (no observable skew) to 1 (strong skew), and Archaeal<br /> | |
57 genomes that have multiple replication origins and therefore have no<br /> | |
58 observable skew mostly have GCSI below 0.05. Escherichia coli genome has<br /> | |
59 values around 0.10.<br /> | |
60 <br /> | |
61 Version 1 of GCSI required fixed number of windows (4096), but the new GCSI<br /> | |
62 version 2 (also known as generalized GCSI: gGCSI) is invariant of the number<br /> | |
63 of windows. GCSI version 1 is calculated as an arithmetic mean (as opposed<br /> | |
64 to the geometric mean of gGCSI) of SR (spectral ratio, the signal-to-noise<br /> | |
65 ratio of 1Hz power spectrum) and dist.<br /> | |
66 <br /> | |
67 G-language SOAP service is provided by the<br /> | |
68 Institute for Advanced Biosciences, Keio University.<br /> | |
69 The original web service is located at the following URL:<br /> | |
70 <br /> | |
71 http://www.g-language.org/wiki/soap<br /> | |
72 <br /> | |
73 WSDL(RPC/Encoded) file is located at:<br /> | |
74 <br /> | |
75 http://soap.g-language.org/g-language.wsdl<br /> | |
76 <br /> | |
77 Documentation on G-language Genome Analysis Environment methods are<br /> | |
78 provided at the Document Center<br /> | |
79 <br /> | |
80 http://ws.g-language.org/gdoc/<br /> | |
81 <br /> | |
82 | |
83 </p> | |
84 | |
85 <H2>Usage</H2> | |
86 | |
87 Here is a sample session with ggcsi | |
88 | |
89 <table width="90%"><tr><td bgcolor="#CCFFFF"><pre> | |
90 | |
91 % ggcsi refseqn:NC_000913 | |
92 GC Skew Index: an index for strand-specific mutational bias | |
93 Program compseq output file [nc_000913.ggcsi]: | |
94 | |
95 </pre></td></tr></table> | |
96 | |
97 Go to the <a href="#input">input files</a> for this example<br> | |
98 Go to the <a href="#output">output files</a> for this example<br><br> | |
99 | |
100 <h2>Command line arguments</h2> | |
101 | |
102 <table border cellspacing=0 cellpadding=3 bgcolor="#ccccff"> | |
103 <tr bgcolor="#FFFFCC"> | |
104 <th align="left">Qualifier</th> | |
105 <th align="left">Type</th> | |
106 <th align="left">Description</th> | |
107 <th align="left">Allowed values</th> | |
108 <th align="left">Default</th> | |
109 </tr> | |
110 | |
111 <tr bgcolor="#FFFFCC"> | |
112 <th align="left" colspan=5>Standard (Mandatory) qualifiers</th> | |
113 </tr> | |
114 | |
115 <tr bgcolor="#FFFFCC"> | |
116 <td>[-sequence]<br>(Parameter 1)</td> | |
117 <td>seqall</td> | |
118 <td>Nucleotide sequence(s) filename and optional format, or reference (input USA)</td> | |
119 <td>Readable sequence(s)</td> | |
120 <td><b>Required</b></td> | |
121 </tr> | |
122 | |
123 <tr bgcolor="#FFFFCC"> | |
124 <td>[-outfile]<br>(Parameter 2)</td> | |
125 <td>outfile</td> | |
126 <td>Program compseq output file</td> | |
127 <td>Output file</td> | |
128 <td><i><*></i>.ggcsi</td> | |
129 </tr> | |
130 | |
131 <tr bgcolor="#FFFFCC"> | |
132 <th align="left" colspan=5>Additional (Optional) qualifiers</th> | |
133 </tr> | |
134 | |
135 <tr> | |
136 <td colspan=5>(none)</td> | |
137 </tr> | |
138 | |
139 <tr bgcolor="#FFFFCC"> | |
140 <th align="left" colspan=5>Advanced (Unprompted) qualifiers</th> | |
141 </tr> | |
142 | |
143 <tr bgcolor="#FFFFCC"> | |
144 <td>-gcsi</td> | |
145 <td>selection</td> | |
146 <td>GCSI version to use</td> | |
147 <td>Choose from selection list of values</td> | |
148 <td>2</td> | |
149 </tr> | |
150 | |
151 <tr bgcolor="#FFFFCC"> | |
152 <td>-window</td> | |
153 <td>integer</td> | |
154 <td>Number of windows. Must be a power of 2</td> | |
155 <td>Any integer value</td> | |
156 <td>4096</td> | |
157 </tr> | |
158 | |
159 <tr bgcolor="#FFFFCC"> | |
160 <td>-purine</td> | |
161 <td>boolean</td> | |
162 <td>Use purine skew for calculation</td> | |
163 <td>Boolean value Yes/No</td> | |
164 <td>No</td> | |
165 </tr> | |
166 | |
167 <tr bgcolor="#FFFFCC"> | |
168 <td>-keto</td> | |
169 <td>boolean</td> | |
170 <td>Use keto skew for calculation</td> | |
171 <td>Boolean value Yes/No</td> | |
172 <td>No</td> | |
173 </tr> | |
174 | |
175 <tr bgcolor="#FFFFCC"> | |
176 <td>-at</td> | |
177 <td>boolean</td> | |
178 <td>Use AT skew for calculation</td> | |
179 <td>Boolean value Yes/No</td> | |
180 <td>No</td> | |
181 </tr> | |
182 | |
183 <tr bgcolor="#FFFFCC"> | |
184 <td>-pval</td> | |
185 <td>boolean</td> | |
186 <td>Calculate p-value when GCSI version 2 is selected</td> | |
187 <td>Boolean value Yes/No</td> | |
188 <td>No</td> | |
189 </tr> | |
190 | |
191 <tr bgcolor="#FFFFCC"> | |
192 <td>-[no]accid</td> | |
193 <td>boolean</td> | |
194 <td>Include to use sequence accession ID as query</td> | |
195 <td>Boolean value Yes/No</td> | |
196 <td>Yes</td> | |
197 </tr> | |
198 | |
199 </table> | |
200 | |
201 | |
202 <h2>Input file format</h2> | |
203 | |
204 <p id="input"> | |
205 The database definitions for following commands are available at<br /> | |
206 http://soap.g-language.org/kbws/embossrc<br /> | |
207 <br /> | |
208 ggcsi reads one or more nucleotide sequences.<br /> | |
209 <br /> | |
210 | |
211 </p> | |
212 | |
213 <h2 id="output">Output file format</h2> | |
214 | |
215 <p> | |
216 The output from ggcsi is to a plain text file.<br /> | |
217 <br /> | |
218 File: nc_000913.ggcsi<br /> | |
219 <br /> | |
220 <table width="90%"><tr><td bgcolor="#CCFFCC"> | |
221 Sequence: NC_000913 GCSI: 0.0966615833014818 SA: 487.218569030757 DIST: 69.037726<br /> | |
222 </td></tr></table> | |
223 | |
224 </p> | |
225 | |
226 <h2>Data files</h2> | |
227 | |
228 <p> | |
229 None. | |
230 </p> | |
231 | |
232 <h2>Notes</h2> | |
233 | |
234 <p> | |
235 None. | |
236 </p> | |
237 | |
238 <h2>References</h2> | |
239 | |
240 <pre> | |
241 Arakawa, K., Mori, K., Ikeda, K., Matsuzaki, T., Konayashi, Y., and | |
242 Tomita, M. (2003) G-language Genome Analysis Environment: A Workbench | |
243 for Nucleotide Sequence Data Mining, Bioinformatics, 19, 305-306. | |
244 | |
245 Arakawa, K. and Tomita, M. (2006) G-language System as a Platform for | |
246 large-scale analysis of high-throughput omics data, J. Pest Sci., | |
247 31, 7. | |
248 | |
249 Arakawa, K., Kido, N., Oshita, K., Tomita, M. (2010) G-language Genome | |
250 Analysis Environment with REST and SOAP Web Service Interfaces, | |
251 Nucleic Acids Res., 38, W700-W705. | |
252 | |
253 </pre> | |
254 | |
255 <h2>Warnings</h2> | |
256 | |
257 <p> | |
258 None. | |
259 </p> | |
260 | |
261 <h2>Diagnostic Error Messages</h2> | |
262 | |
263 <p> | |
264 None. | |
265 </p> | |
266 | |
267 <h2>Exit status</h2> | |
268 | |
269 <p> | |
270 It always exits with a status of 0. | |
271 </p> | |
272 | |
273 <h2>Known bugs</h2> | |
274 | |
275 <p> | |
276 None. | |
277 </p> | |
278 | |
279 <h2>See also</h2> | |
280 | |
281 <table border cellpadding=4 bgcolor="#FFFFF0"><tr><th>Program name</th> | |
282 <th>Description</th></tr> | |
283 | |
284 <tr> | |
285 <td><a href="gb1.html">gb1</a></td> | |
286 <td>Calculate strand bias of bacterial genome using B1 index</td> | |
287 </tr><tr> | |
288 <td><a href="gb2.html">gb2</a></td> | |
289 <td>Calculate strand bias of bacterial genome using B2 index</td> | |
290 </tr><tr> | |
291 <td><a href="gdeltagcskew.html">gdeltagcskew</a></td> | |
292 <td>Calculate strand bias of bacterial genome using delta GC skew</td> | |
293 </tr><tr> | |
294 <td><a href="gldabias.html">gldabias</a></td> | |
295 <td>Calculate strand bias of bacterial genome using linear discriminant analysis (LDA)</td> | |
296 </tr> | |
297 | |
298 </table> | |
299 | |
300 <h2>Author(s)</h2> | |
301 | |
302 <pre> | |
303 Hidetoshi Itaya (celery@g-language.org) | |
304 Institute for Advanced Biosciences, Keio University | |
305 252-0882 Japan | |
306 | |
307 Kazuharu Arakawa (gaou@sfc.keio.ac.jp) | |
308 Institute for Advanced Biosciences, Keio University | |
309 252-0882 Japan</pre> | |
310 | |
311 <h2>History</h2> | |
312 | |
313 2012 - Written by Hidetoshi Itaya | |
314 | |
315 <h2>Target users</h2> | |
316 | |
317 This program is intended to be used by everyone and everything, from | |
318 naive users to embedded scrips. | |
319 | |
320 <h2>Comments</h2> | |
321 | |
322 None. | |
323 | |
324 </BODY> | |
325 </HTML> |