DNA Research Advance Access originally published online on January 10, 2006
DNA Research 2005 12(5):281-290; doi:10.1093/dnares/dsi015
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Novel Phylogenetic Studies of Genomic Sequence Fragments Derived from Uncultured Microbe Mixtures in Environmental and Clinical Samples
1Center for Information Biology, National Institute of Genetics, The Graduate University for Advanced Studies (Sokendai) Mishima, Shizuoka 411-8540, Japan
2Department of Bio-System Engineering, Faculty of Engineering, Yamagata University Yonezawa, Yamagata 992-8510, Japan
3Department of Bioinformatics and Genomes, Graduate School of Information Science, Nara Institute of Science and Technology Takayama, Ikoma, Nara 630-0101, Japan
4The Graduate University for Advanced Studies (Sokendai), Hayama Center for Advanced Research Hayama-cho, Kanagawa 240-0193, Japan
A self-organizing map (SOM) was developed as a novel bioinformatics strategy for phylogenetic classification of sequence fragments obtained from pooled genome samples of uncultured microbes in environmental and clinical samples. This phylogenetic classification was possible without either orthologous sequence sets or sequence alignments. We first constructed SOMs for tetranucleotide frequencies in 210 000 5 kb sequence fragments obtained from 1502 prokaryotes for which at least 10 kb of genomic sequence has been deposited in public DNA databases. The sequences could be classified primarily according to phylogenetic groups without information regarding the species. We used the SOM method to classify sequence fragments derived from environmental samples of the Sargasso Sea and of an acidophilic biofilm growing in acid mine drainage. Phylogenetic diversity of the environmental sequences was effectively visualized on a single map. Sequences that were derived from a single genome but cloned independently could be reassociated in silico. G + C% has been used for a long period as a fundamental parameter for phylogenetic classification of microbes, but the G + C% is apparently too simple a parameter to differentiate a wide variety of known species. Oligonucleotide frequency can be used to distinguish the species because oligonucleotide frequencies vary significantly among their genomes.
Key words: self-organizing map; environmental samples; metagenome; phylogenetic classification; SOM
*To whom correspondence should be addressed. Tel. +81-55-981-6835, Fax. +81-55-981-6896, Email: takaabe{at}genes.nig.ac.jp
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Martin, N. N. Diaz, J. Ontrup, and T. W. Nattkemper Hyperbolic SOM-based clustering of DNA fragment features for taxonomic visualization and classification Bioinformatics, July 15, 2008; 24(14): 1568 - 1574. [Abstract] [PDF] |
||||
![]() |
B. E. Dutilh, Y. He, M. L. Hekkelman, and M. A. Huynen Signature, a web server for taxonomic characterization of sequence samples using signature genes Nucleic Acids Res., July 1, 2008; 36(suppl_2): W470 - W474. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kosaka, S. Kato, T. Shimoyama, S. Ishii, T. Abe, and K. Watanabe The genome of Pelotomaculum thermopropionicum reveals niche-associated evolution in anaerobic microbiota Genome Res., March 1, 2008; 18(3): 442 - 448. [Abstract] [Full Text] [PDF] |
||||


