DNA Research Advance Access originally published online on December 13, 2006
DNA Research 2006 13(6):245-254; doi:10.1093/dnares/dsl014
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Exploration and Grading of Possible Genes from 183 Bacterial Strains by a Common Protocol to Identification of New Genes: Gene Trek in Prokaryote Space (GTPS)
1 Center for Information Biology, DNA Data Bank of Japan, National Institute of Genetics, The Graduate University for Advanced Studies (SOKENDAI) Mishima, Shizuoka 411-8540, Japan
2 Institute for Bioinformatics Research and Development (BIRD), Japan Science and Technology Corporation (JST) Chiyoda-ku, Tokyo 102-8666, Japan
3 Faculty of Pharmaceutical Science, Tokyo University of Science, Noda Chiba 278-8510, Japan
4 Advanced Center for Computing and Communication, RIKEN, Wako Saitama 351-0198, Japan
A large number of complete microorganism genomes has been sequenced and submitted to the public database and then incorporated into our complete genome database, Genome Information Broker (GIB, http://gib.genes.nig.ac.jp/). However, when comparative genomics is carried out, researchers must be aware that there are protein-coding genes not confirmed by homology or motif search and that reliable protein-coding genes are missing. Therefore, we developed a protocol (Gene Trek in Prokaryote Space, GTPS) for finding possible protein-coding genes in bacterial genomes. GTPS assigns a degree of reliability to predicted protein-coding genes. We first systematically applied the protocol to the complete genomes of all 123 bacterial species and strains that were publicly available as of July 2003, and then to those of 183 species and strains available as of September 2004. We found a number of incorrect genes and several new ones in the genome data in question. We also found a way to estimate the total number of orthologous genes in the bacterial world.
Key words: comparative genome; gene prediction; annotation; gene grading; bacterial genome
*To whom correspondence should be addressed. Tel. +81-55-981-6895. Fax +81-55-981-6896, E-mail: hsugawar{at}genes.nig.ac.jp
Communicated by Dr Katsumi Isono
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
E. Kaminuma, J. Mashima, Y. Kodama, T. Gojobori, O. Ogasawara, K. Okubo, T. Takagi, and Y. Nakamura DDBJ launches a new archive database with analytical tools for next-generation sequence data Nucleic Acids Res., January 1, 2010; 38(suppl_1): D33 - D38. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Abe, S. Kanaya, H. Uehara, and T. Ikemura A Novel Bioinformatics Strategy for Function Prediction of Poorly-Characterized Protein Genes Obtained from Metagenome Analyses DNA Res, October 3, 2009; (2009) dsp018v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Sugawara, O. Ogasawara, K. Okubo, T. Gojobori, and Y. Tateno DDBJ with new system and face Nucleic Acids Res., January 11, 2008; 36(suppl_1): D22 - D24. [Abstract] [Full Text] [PDF] |
||||

