comparison GEMBASSY-1.0.3/doc/text/gbui.txt @ 0:8300eb051bea draft

Initial upload
author ktnyt
date Fri, 26 Jun 2015 05:19:29 -0400
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:8300eb051bea
1 gbui
2 Function
3
4 Calculates base usage indices for protein-coding sequences
5
6 Description
7
8 gbui calculates base usage indices of protein-coding sequences (excluding
9 start and stop codons) for each gene. Indices calculated are as follows,
10 acgt: Total bumber of bases (A+T+G+C)
11 ryr: Purine/Pyrimidine ratio (A+G)/(T+C)
12 gcc: G+C content (G+C)/(A+T+G+C)
13 Hgc: entropy of G+C content (G+C)/(A+T+G+C)
14 gcs: GC skew (C-G)/(C+G)
15 ats: AT skew (A-T)/(A+T)
16
17 G-language SOAP service is provided by the
18 Institute for Advanced Biosciences, Keio University.
19 The original web service is located at the following URL:
20
21 http://www.g-language.org/wiki/soap
22
23 WSDL(RPC/Encoded) file is located at:
24
25 http://soap.g-language.org/g-language.wsdl
26
27 Documentation on G-language Genome Analysis Environment methods are
28 provided at the Document Center
29
30 http://ws.g-language.org/gdoc/
31
32 Usage
33
34 Here is a sample session with gbui
35
36 % gbui refseqn:NC_000913
37 Calculates base usage indices for protein-coding sequences
38 Program compseq output file [nc_000913.gbui]:
39
40 Go to the input files for this example
41 Go to the output files for this example
42
43 Command line arguments
44
45 Standard (Mandatory) qualifiers:
46 [-sequence] seqall Nucleotide sequence(s) filename and optional
47 format, or reference (input USA)
48 [-outfile] outfile [*.gbui] Program compseq output file
49
50 Additional (Optional) qualifiers: (none)
51 Advanced (Unprompted) qualifiers:
52 -translate boolean [N] Include when translating using standard
53 codon table
54 -position menu [all] Codon position (Values: all (Assess
55 overall base usage of the gene); 1 (Assess
56 base usage at 1st position of codons); 2
57 (Assess base usage at 2nd position of
58 codons); 3 (Assess base usage at 3rd
59 position of codons))
60 -delkey string [[^ACDEFGHIKLMNPQRSTVWYacgtU]] Regular
61 expression to delete key (Any string)
62 -[no]accid boolean [Y] Include to use sequence accession ID as
63 query
64
65 Associated qualifiers:
66
67 "-sequence" associated qualifiers
68 -sbegin1 integer Start of each sequence to be used
69 -send1 integer End of each sequence to be used
70 -sreverse1 boolean Reverse (if DNA)
71 -sask1 boolean Ask for begin/end/reverse
72 -snucleotide1 boolean Sequence is nucleotide
73 -sprotein1 boolean Sequence is protein
74 -slower1 boolean Make lower case
75 -supper1 boolean Make upper case
76 -scircular1 boolean Sequence is circular
77 -sformat1 string Input sequence format
78 -iquery1 string Input query fields or ID list
79 -ioffset1 integer Input start position offset
80 -sdbname1 string Database name
81 -sid1 string Entryname
82 -ufo1 string UFO features
83 -fformat1 string Features format
84 -fopenfile1 string Features file name
85
86 "-outfile" associated qualifiers
87 -odirectory2 string Output directory
88
89 General qualifiers:
90 -auto boolean Turn off prompts
91 -stdout boolean Write first file to standard output
92 -filter boolean Read first file from standard input, write
93 first file to standard output
94 -options boolean Prompt for standard and additional values
95 -debug boolean Write debug output to program.dbg
96 -verbose boolean Report some/full command line options
97 -help boolean Report command line options and exit. More
98 information on associated and general
99 qualifiers can be found with -help -verbose
100 -warning boolean Report warnings
101 -error boolean Report errors
102 -fatal boolean Report fatal errors
103 -die boolean Report dying program messages
104 -version boolean Report version number and exit
105
106 Input file format
107
108 The database definitions for following commands are available at
109 http://soap.g-language.org/kbws/embossrc
110
111 gbui reads one or more nucleotide sequences.
112
113 Output file format
114
115 The output from gbui is to a plain text file.
116
117 File: nc_000913.gbui
118
119 Sequence: NC_000913
120 acgt,ryr,gcc,Hgc,gcs,ats,gene
121 60,0.9355,0.5333,0.9968,+0.3750,+0.3571,thrL
122 2457,1.0206,0.5311,0.9972,-0.0575,-0.0434,thrA
123 927,1.0973,0.5653,0.9877,-0.1183,-0.0471,thrB
124 1281,1.0795,0.5301,0.9974,-0.0692,+0.0033,thrC
125 291,1.0638,0.5430,0.9947,+0.0506,+0.1278,yaaX
126 771,1.0615,0.4994,1.0000,-0.0182,+0.0415,yaaA
127 1425,0.8363,0.5347,0.9965,-0.0315,-0.2278,yaaJ
128 948,1.1303,0.5222,0.9986,-0.0263,+0.0993,talB
129
130 [Part of this file has been deleted for brevity]
131
132 507,1.0444,0.5661,0.9874,-0.1080,-0.0909,yjjX
133 642,1.1472,0.5654,0.9876,-0.0909,+0.0394,ytjC
134 864,0.9636,0.5347,0.9965,+0.0087,-0.0299,rob
135 468,1.1273,0.4936,0.9999,-0.1169,+0.0042,creA
136 684,1.0118,0.5556,0.9911,-0.0579,-0.0592,creB
137 1419,1.0655,0.5398,0.9954,-0.1018,-0.0505,creC
138 1347,1.0660,0.4974,1.0000,-0.1433,-0.0783,creD
139 711,1.0850,0.5134,0.9995,-0.0082,+0.0751,arcA
140 135,0.8493,0.4370,0.9885,+0.2203,+0.0263,yjjY
141 681,1.1415,0.5007,1.0000,-0.0792,+0.0529,yjtD
142
143
144 Data files
145
146 None.
147
148 Notes
149
150 None.
151
152 References
153
154 Arakawa, K., Mori, K., Ikeda, K., Matsuzaki, T., Konayashi, Y., and
155 Tomita, M. (2003) G-language Genome Analysis Environment: A Workbench
156 for Nucleotide Sequence Data Mining, Bioinformatics, 19, 305-306.
157
158 Arakawa, K. and Tomita, M. (2006) G-language System as a Platform for
159 large-scale analysis of high-throughput omics data, J. Pest Sci.,
160 31, 7.
161
162 Arakawa, K., Kido, N., Oshita, K., Tomita, M. (2010) G-language Genome
163 Analysis Environment with REST and SOAP Web Service Interfaces,
164 Nucleic Acids Res., 38, W700-W705.
165
166 Warnings
167
168 None.
169
170 Diagnostic Error Messages
171
172 None.
173
174 Exit status
175
176 It always exits with a status of 0.
177
178 Known bugs
179
180 None.
181
182 See also
183
184 gdinuc Calculates dinucleotide usage
185
186 Author(s)
187
188 Hidetoshi Itaya (celery@g-language.org)
189 Institute for Advanced Biosciences, Keio University
190 252-0882 Japan
191
192 Kazuharu Arakawa (gaou@sfc.keio.ac.jp)
193 Institute for Advanced Biosciences, Keio University
194 252-0882 Japan
195
196 History
197
198 2012 - Written by Hidetoshi Itaya
199 2013 - Fixed by Hidetoshi Itaya
200
201 Target users
202
203 This program is intended to be used by everyone and everything, from
204 naive users to embedded scripts.
205
206 Comments
207
208 None.
209