Mercurial > repos > ktnyt > gembassy
comparison GEMBASSY-1.0.3/doc/text/gsvalue.txt @ 0:8300eb051bea draft
Initial upload
author | ktnyt |
---|---|
date | Fri, 26 Jun 2015 05:19:29 -0400 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:8300eb051bea |
---|---|
1 gsvalue | |
2 Function | |
3 | |
4 Calculate the strength of selected codon usage bias (S) | |
5 | |
6 Description | |
7 | |
8 gsvalue calculates the strength of selected codon usage bias (S), also | |
9 known as Sharp's S index. Using four codon pairs that are recognized by the | |
10 same tRNA anticodon, namely, Phe(UUC and UUU), Ile(AUC and AUU), Tyr(UAC and | |
11 UAU), and Asn(AAC and AAU), since the former in each of the pairs has | |
12 stronger Watson-Crick pairing, selection towards the former codon can be | |
13 observed for highly expressed genes. S index is therefore the weighted | |
14 average of such bias, giving an over-all value for a genome, indicating its | |
15 strength of selected codon usage bias. See Sharp et al. (2005) for details. | |
16 Sharp originally defined 40 genes as the highly expressed gene group, with | |
17 tufA, tsf, fusA, rplA-rplF, rplI-rplT, rpsB-rpsT. Since the identificaiton | |
18 of these genes is not convenient for computational automation, by default, | |
19 this method uses ribosomal proteins as the highly expressed gene group, | |
20 as used by Viera-silva and Rocha (2010). | |
21 However, Sharp's gene group can be optionally used with -sharp option. | |
22 With this option, all of the 40 genes must be named accordingly in the given | |
23 genome file. | |
24 | |
25 G-language SOAP service is provided by the | |
26 Institute for Advanced Biosciences, Keio University. | |
27 The original web service is located at the following URL: | |
28 | |
29 http://www.g-language.org/wiki/soap | |
30 | |
31 WSDL(RPC/Encoded) file is located at: | |
32 | |
33 http://soap.g-language.org/g-language.wsdl | |
34 | |
35 Documentation on G-language Genome Analysis Environment methods are | |
36 provided at the Document Center | |
37 | |
38 http://ws.g-language.org/gdoc/ | |
39 | |
40 Usage | |
41 | |
42 Here is a sample session with gsvalue | |
43 | |
44 % gsvalue refseqn:NC_000913 | |
45 Calculate the strength of selected codon usage bias (S) | |
46 Codon usage output file [nc_000913.gsvalue]: | |
47 | |
48 Go to the input files for this example | |
49 Go to the output files for this example | |
50 | |
51 Command line arguments | |
52 | |
53 Standard (Mandatory) qualifiers: | |
54 [-sequence] seqall Nucleotide sequence(s) filename and optional | |
55 format, or reference (input USA) | |
56 [-outfile] outfile [*.gsvalue] Codon usage output file | |
57 | |
58 Additional (Optional) qualifiers: (none) | |
59 Advanced (Unprompted) qualifiers: | |
60 -sharp boolean [N] Include to use the 40 genes used by | |
61 Sharp instead of ribosomal proteins | |
62 -[no]accid boolean [Y] Include to use sequence accession ID as | |
63 query | |
64 | |
65 Associated qualifiers: | |
66 | |
67 "-sequence" associated qualifiers | |
68 -sbegin1 integer Start of each sequence to be used | |
69 -send1 integer End of each sequence to be used | |
70 -sreverse1 boolean Reverse (if DNA) | |
71 -sask1 boolean Ask for begin/end/reverse | |
72 -snucleotide1 boolean Sequence is nucleotide | |
73 -sprotein1 boolean Sequence is protein | |
74 -slower1 boolean Make lower case | |
75 -supper1 boolean Make upper case | |
76 -scircular1 boolean Sequence is circular | |
77 -sformat1 string Input sequence format | |
78 -iquery1 string Input query fields or ID list | |
79 -ioffset1 integer Input start position offset | |
80 -sdbname1 string Database name | |
81 -sid1 string Entryname | |
82 -ufo1 string UFO features | |
83 -fformat1 string Features format | |
84 -fopenfile1 string Features file name | |
85 | |
86 "-outfile" associated qualifiers | |
87 -odirectory2 string Output directory | |
88 | |
89 General qualifiers: | |
90 -auto boolean Turn off prompts | |
91 -stdout boolean Write first file to standard output | |
92 -filter boolean Read first file from standard input, write | |
93 first file to standard output | |
94 -options boolean Prompt for standard and additional values | |
95 -debug boolean Write debug output to program.dbg | |
96 -verbose boolean Report some/full command line options | |
97 -help boolean Report command line options and exit. More | |
98 information on associated and general | |
99 qualifiers can be found with -help -verbose | |
100 -warning boolean Report warnings | |
101 -error boolean Report errors | |
102 -fatal boolean Report fatal errors | |
103 -die boolean Report dying program messages | |
104 -version boolean Report version number and exit | |
105 | |
106 Input file format | |
107 | |
108 The database definitions for following commands are available at | |
109 http://soap.g-language.org/kbws/embossrc | |
110 | |
111 gsvalue reads one or more nucleotide sequences. | |
112 | |
113 Output file format | |
114 | |
115 The output from gsvalue is to a plain text file. | |
116 | |
117 File: nc_000913.gsvalue | |
118 | |
119 Sequence: NC_000913 S-value: 1.23467100598485 | |
120 | |
121 | |
122 Data files | |
123 | |
124 None. | |
125 | |
126 Notes | |
127 | |
128 None. | |
129 | |
130 References | |
131 | |
132 Sharp PM et al. (2005) "Variation in the strength of selected codon usage | |
133 bias among bacteria", Nucleic Acids Research, 33(4):1141-1153 | |
134 | |
135 Vieira-Silva S and Rocha EPC (2010) "The systemic imprint of growth and its | |
136 uses in ecological (meta)genomics", PLoS Genetics, 6(1):e1000808 | |
137 | |
138 Arakawa, K., Mori, K., Ikeda, K., Matsuzaki, T., Konayashi, Y., and | |
139 Tomita, M. (2003) G-language Genome Analysis Environment: A Workbench | |
140 for Nucleotide Sequence Data Mining, Bioinformatics, 19, 305-306. | |
141 | |
142 Arakawa, K. and Tomita, M. (2006) G-language System as a Platform for | |
143 large-scale analysis of high-throughput omics data, J. Pest Sci., | |
144 31, 7. | |
145 | |
146 Arakawa, K., Kido, N., Oshita, K., Tomita, M. (2010) G-language Genome | |
147 Analysis Environment with REST and SOAP Web Service Interfaces, | |
148 Nucleic Acids Res., 38, W700-W705. | |
149 | |
150 Warnings | |
151 | |
152 None. | |
153 | |
154 Diagnostic Error Messages | |
155 | |
156 None. | |
157 | |
158 Exit status | |
159 | |
160 It always exits with a status of 0. | |
161 | |
162 Known bugs | |
163 | |
164 None. | |
165 | |
166 See also | |
167 | |
168 gcbi Calculates the codon bias index (CBI) | |
169 gdelta_enc Calculate the codon usage bias related to translation optimization (delta ENC) | |
170 gicdi Calculates the intrinsic codon deviation index (ICDI) | |
171 | |
172 Author(s) | |
173 | |
174 Hidetoshi Itaya (celery@g-language.org) | |
175 Institute for Advanced Biosciences, Keio University | |
176 252-0882 Japan | |
177 | |
178 Kazuharu Arakawa (gaou@sfc.keio.ac.jp) | |
179 Institute for Advanced Biosciences, Keio University | |
180 252-0882 Japan | |
181 | |
182 History | |
183 | |
184 2012 - Written by Hidetoshi Itaya | |
185 2013 - Fixed by Hidetoshi Itaya | |
186 | |
187 Target users | |
188 | |
189 This program is intended to be used by everyone and everything, from | |
190 naive users to embedded scripts. | |
191 | |
192 Comments | |
193 | |
194 None. | |
195 |