Skip Navigation


DNA Research Advance Access originally published online on February 22, 2006
DNA Research 2006 13(1):3-14; doi:10.1093/dnares/dsi026
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary data
Right arrowOA All Versions of this Article:
13/1/3    most recent
dsi026v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (11)
Right arrow Request Permissions
Google Scholar
Right arrow Articles by Ogura, Y.
Right arrow Articles by Hayashi, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ogura, Y.
Right arrow Articles by Hayashi, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Kazusa DNA Research Institute
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@oxfordjournals.org

Complexity of the Genomic Diversity in Enterohemorrhagic Escherichia coli O157 Revealed by the Combinational Use of the O157 Sakai OligoDNA Microarray and the Whole Genome PCR scanning

Yoshitoshi Ogura1,2, Ken Kurokawa3, Tadasuke Ooka2, Kousuke Tashiro4, Toru Tobe5, Makoto Ohnishi6, Keisuke Nakayama2, Takuya Morimoto1, Jun Terajima6, Haruo Watanabe6, Satoru Kuhara4 and Tetsuya Hayashi1,2,*

1 Division of Bioenvironmental Science, Frontier Science Research Center, University of Miyazaki Miyazaki, Japan
2 Division of Microbiology, Department of Infectious Diseases, Faculty of Medicine, University of Miyazaki Miyazaki, Japan
3 Laboratory of Comparative Genomics, Graduate School of Information Science, Nara Institute of Science and Technology Nara, Japan
4 Laboratory of Molecular Gene Technics, Department of Genetic Resources Technology, Faculty of Agriculture, Kyushu University Fukuoka, Japan
5 Division of Applied Bacteriology, Graduate School of Medicine, Osaka University Osaka, Japan
6 Department of Bacteriology, National Institute for Infectious Diseases Tokyo, Japan

Received 29 November 2005; revised 21 December 2005


    Abstract
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
Escherichia coli O157, an etiological agent of hemorrhagic colitis and hemolytic uremic syndrome, is one of the leading worldwide public health threats. Genome sequencing of two O157 strains have revealed that the chromosome is comprised of a 4.1 Mb backbone shared by K-12 and a total of 1.4 Mb O157-specific sequences. Most of the large O157-specific sequences are prophages and prophage-like elements, which have carried many virulence genes into the O157 genome. This suggests that bacteriophages have played the key roles in the emergence of O157. The Whole Genome PCR Scanning (WGPScanning) analysis of O157 strains, on the other hand, revealed a high level of genomic diversity in O157. Variation of prophages has also been suggested as a major factor generating such diversity. In this study, we analyzed the gene content of O157 strains, by an oligoDNA microarray, using the same set of strains as examined by the WGPScanning method. Although most of the strains were typical O157 : H7, they differed remarkably in gene composition, particularly in those on prophages, and we identified more than 400 ‘variably absent or present’ genes which included virulence-related genes. This confirms the role of prophages in generating the genomic diversity, and raises a possibility that some level of variation in potential virulence is present among O157 strains. Fine comparison of the two datasets obtained by microarray and WGPScanning provided much further details on the O157 genome diversity than illustrated by each method alone, indicating the usefulness of this combinational approach in the genomic comparison of closely related strains.

Key words: E. coli O157; genomic diversity; microarray; whole genome PCR scanning


    1. Introduction
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
Escherichia coli is a constituent of normal microflora in intestinal tracts, but certain types of E. coli strains are associated with diseases in human and animals.1Go,2Go Among these pathogenic E. coli strains, enterohemorrhagic E. coli O157 not only causes large outbreaks of hemorrhagic colitis but also numerous small outbreaks and sporadic cases, and is regarded as one of the major worldwide public health concerns.3Go The genome sequence determination of an O157 strain, RIMD 0509952 (referred to as O157 Sakai) and the genomic comparison with a nonpathogenic strain, K-12 MG1655,4Go,5Go have revealed that the O157 chromosome is comprised of a 4.1 Mb backbone common with K-12 and a total of 1.4 Mb strain-specific sequences. O157 Sakai-specific sequences (referred to as S-loops) are inserted at various sites on the chromosome backbone, and encode more than 1600 of O157 Sakai-specific genes. Of importance is that most of the large S-loops are prophages and prophage-like elements. We have identified 18 prophages (Sp1-18) and 6 prophage-like elements (SpLE1-6). They comprise about two thirds of the O157 Sakai-specific sequences and have carried many virulence-related genes including Shiga toxin (Stx) genes (stx1 and stx2) into O157. This suggested that acquisition of these bacteriophages have played the key roles in the emergence of O157.5Go,6Go Similar findings were also obtained by the genome sequencing analysis of another O157 strain, EDL933.7Go

More recently, we analyzed, based on the genome sequence of O157 Sakai, the whole genome structures of eight O157 strains that displayed diverse XbaI-digestion patterns by a systematic PCR analysis called Whole Genome PCR Scanning (WGPScanning).8Go In this analysis, we amplified the whole genome of each strain using 560 pairs of PCR primers, and searched for genomic segments with any structural differences by comparing each PCR product with that from O157 Sakai. This analysis provided the first genome-wide view on the genomic diversity of O157, and revealed that there is an unexpectedly high degree of diversity among O157 strains. In particular, the variation in prophages and prophage-related elements were remarkable, implying that their variability is the major factor generating such structural diversity. However, additional information on the gene level is required.

In this study, we constructed an oligoDNA microarray according to the genome sequence of O157 Sakai. We first validated the accuracy of the comparative genomic hybridization (CGH) analysis using this oligoarray by a test experiment with K-12 and O157 Sakai, and then performed a gene composition analysis of O157 using the same set of strains examined by the WGPScanning. We further made a fine comparison of the microarray data with WGPScanning data to understand the genomic diversity of O157 more in detail.


    2. Materials and Methods
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
2.1. Bacterial strains, growth condition and DNA extraction
Eight O157 strains examined in this study were 980938 (referred to as #2), 980706 (#3), 990281 (#4), 980551 (#5), 990570 (#6), 981456 (#7), 982243 (#8) and 981795 (#9). They were human isolates obtained in Japan in 1998 as described previously.8Go All were negative for Sorbitol fermentation (SOR) and ß-glucuronidase activity (GUD) whose activities were examined after 24 h incubation at 37°C on SIB II agar plates (Kyokuto Pharmaceutical Industrial, Tokyo, Japan) and on ES Colimark agar plate (Eiken Kizai, Tokyo, Japan), respectively. As for strain #2, however, a weak positive reaction for ß-glucuronidase was detected after 40 h incubation. O157 Sakai (RIMD 0509952) was used as the reference in all hybridization experiments. This strain was isolated in a large outbreak in Sakai city, Japan, which occurred in 1996, and has been sequenced by our group.5Go The strain is available at the American Type Culture Collection (ATCC BAA-460). Escherichia coli K-12 MG1655 was kindly provided by Dr H. Mori (Nara Institute of Science and Technology). Cells were grown to the stationary phase at 37°C in Luria–Bertani medium. Genomic DNA was purified using the Genomic-tip 100/G and Genomic DNA buffer set (Qiagen) according to the manufacturer's instruction.

2.2. Design of the O157 Sakai microarray
Oligonucleotide probes were prepared for all the protein-coding genes on the O157 Sakai genome (5447 genes in total). Principles for the probe design were 60 mer in length and two probes for each gene. However, O157 Sakai contains 542 repeated genes, most of which are derived from IS elements or 13 lambda-like prophages sharing various length of DNA segments with almost identical sequences. As for these repeated genes, we prepared a single probe for each repeated gene family. Such probes totaled to 151 probes, each having two or more targets of ≥90% sequence identity on the O157 Sakai genome (referred to as ‘multiple hit probes’). As for the singleton genes (4905 genes), we were able to design two different 60 mer probes for 4173 genes, but only one 60 mer probe for 725 genes and one 44 or 45 mer probes for 7 genes. In total, 9620 probes were synthesized (Sigma-Aldrich Japan, Tokyo, Japan), and each probe was spotted onto poly-L-Lysine-coated glass slides (SD10011; Matsunami Glass Ind., Osaka, Japan) using SPBIO (Hitachi Software Engineering, Tokyo, Japan). As the negative controls, we included 20 oligonucleotides prepared from 10 yeast genes. Probe sequences are shown in Supplementary Table 1 available at www.dnaresearch.oxfordjournals.org.

2.3. DNA labeling and hybridization
Genomic DNAs from test strains and the reference (O157 Sakai) were fluorescently labeled with Cy3 and Cy5, respectively, according to the following protocol. First, 3 µg of genomic DNA was labeled with aminoallyl-modified dUTP (Sigma) using the Bioprime DNA Labeling System (Invitrogen). DNAs were not sheared or digested by restriction enzymes prior to the labeling. The aminoallyl-labeled DNA was purified by Microcon YM-30 (Millipore), dried in a speed-vac and resuspended in 10 µl of 50 mM NaHCO3. After adding 10 µl of dimethyl sulfoxide solution containing Cy3 or Cy5 monofunctional reactive dye (Amersham), the sample was incubated at room temperature in the dark for 1 h to allow the dye to couple with DNA. The fluorescently labeled DNA was finally purified by the Qiaquick PCR purification kit (Qiagen) into 30 µl elution buffer provided by the manufacturer.

The Cy5-labeled and the Cy3-labeled DNA preparations (15 µl for each) were mixed with a 90 µl hybridization solution containing 6.4x SSC, 0.64% SDS and 1.3 mg/ml yeast tRNA. After incubation at 96°C for 2 min, the denatured sample was applied to a microarray slide and incubated on an ArrayBooster hybridization apparatus (Advalytix AG, Brunnthal, Germany) at 50°C for 16 h. The slide was then washed twice in 2x SSC/0.2% SDS at room temperature for 10 min, twice in 0.2x SSC/0.2% SDS at 50°C for 5 min and twice in 0.2x SSC at room temperature for 10 min. Finally, the slide was briefly rinsed with ethanol, dried by centrifugation and scanned with a FLA-8000 scanner (Fuji Photofilm, Tokyo, Japan). The obtained data were analyzed by the ArrayVision 8.0 software (Imaging Research). Each strain was examined twice with the labeled DNAs prepared independently for each hybridyzation.

2.4. Data analysis
In our experimental condition, signal intensities from the negative controls were almost the same as that of the local background (LBG) in both cannels. Thus, spots with reference signal intensities lower than the LBG plus 5 SD or with some spotting abnormalities were categorized as ‘low quality’ (LQ). Signal intensities of other spots were corrected by subtracting the LBG, and were subjected to the ‘presence (P) or absence (A)’ determination using the array-based genotyping software GACK capable of dynamically generating cutoffs for present/absent (conserved/divergent) gene analysis in each array hybridization based on the signal-ratio distribution of un-normalized data sets.9Go Since we performed two independent experiments for each strain, some probes yielded inconsistent results; P/A, P/LQ or A/LQ. In these cases, the probe was categorized as ‘uncertain’ (U), P or A, respectively.

Finally, the presence or absence of each gene was determined according to the data from each spot. Because most genes were represented by two different probes on our microarray, inconsistent results could also be obtained from the two probes. In such cases, the gene was categorized as P when the judgments of two probes were P/U or P/LQ, as A when they were A/U or A/LQ, and as U when they were P/A or U/LQ. Processed data sets were displayed in the genomic order using the TREEVIEW program.10Go Raw signal intensities from each spot and processed data are presented in Supplementary Tables 1 and 2 available at www.dnares.oxfordjournals.org, respectively.

The presence or absence of 37 genes, which were always categorized into LQ on the current version of O157 Sakai microarray, were determined by PCR. The PCR amplification was performed using the Ex Taq PCR kit (TAKARA Bio, Japan) and 10 ng of template DNA with 30 cycles of 20 s at 98°C/30 s at 60°C/30 (or 120) s at 72°C. The list of genes analyzed by PCR and the primer sequences are shown in Supplementary Table 3 available at www.dnares.oxfordjournals.org.


    3. Results
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
3.1. Evaluation of the sensitivity and specificity of CGH analysis using the O157 Sakai oligoDNA microarray
The array we constructed covered all of the 5447 protein-coding genes on the O157 Sakai genome; 3729 on the chromosome backbone, 1632 on S-loops, and 86 on pO157 and pOSAK1 plasmids.11Go Before applying the microarray to the CGH analysis of O157 strains, we evaluated its sensitivity and specificity using K-12 MG1655 as a test strain.

We first examined the sequence identities of each probe to the K-12 genome sequence by the FASTA program, and classified the probes into two groups; those with ≥90% identity into ‘Conserved in K-12’ (CK) probes and others into ‘Specific to Sakai’ (SS) probes. Each gene was then classified into CK, ‘Partially Conserved in K-12’ (PCK; genes represented by a CK probe and an SS probe), or SS genes according to the probe category (Supplementary Table 1). Next, the data of the microarray analysis were compared with those from the in silico analysis. In this comparison, repeated gene families, represented by ‘multiple hit probes’ having two or more targets of ≥90% sequence identity on the O157 Sakai genome, were analyzed separately from the singleton genes.

As for the singleton genes, 96.9% of CK genes were correctly judged as ‘present’, and 97.8% of SS genes as ‘absent’ (Table 1). Most of the K-12 genes that gave incorrect results (false negative, false positive and uncertain) contained slightly divergent target sequences with 3–6 base mismatches with CK probes or weak homologies to the SS probes (70–90% identity). On the other hand, the presence/absence determination of the repeated gene families conserved in K-12 was somewhat problematic. While all families with the copy number ratio of >0.5 were judged as ‘present’, many families with the copy number ratios of ≤0.5 were judged as ‘absent’ or ‘uncertain’. This indicates that the repeated genes judged as ‘absent’ by the microarray analysis include those actually absent and those with reduced copy numbers. Thus, in the following CGH analyses of O157 strains, we decided to categorize such repeated genes into ‘uncertain due to the multiple target sequences in O157 Sakai’ [U(M)] and to consider only the repeated genes judged as ‘present’ as having the same or similar (or higher) copy numbers in the test strain.


View this table:
[in this window]
[in a new window]
 
Table 1. Summary of the CGH analysis of K-12 MG1655 using the O157 Sakai oligoDNA microarray

 
3.2. Overview of the gene content analysis of O157 strains
Using the O157 Sakai microarray, we analyzed the gene content of eight O157 strains that were previously analyzed by the WGPScanning method.8Go The data are summarized in Fig. 1 and Tables 2 and 3. As shown in Table 2, all the singleton genes identified in O157 Sakai were shared by at least one strain, but as many as 431 genes displayed variable distributions (or highly divergent sequences). Most of these ‘variably absent or present genes’ (referred to as VAP genes) belonged to the SS genes; 389 of the 1153 SS singleton genes were variably present. As shown in Fig. 1, these genes appeared very frequently on prophages and prophage-like elements; among the 640 singleton genes on these genetic elements, 350 (54.7%) displayed variable distributions. In sharp contrast, CK singleton genes exhibited a high level of conservation; only 1% (37 genes) exhibited variable distributions (Table 2). Among these, 20 genes were again located on prophage regions shared by O157 and K-12. In particular, a region of Sp10 (corresponding to a part of the Rac prophage in K-12) contained 11 VAP genes. These included recE and recT genes (ECs1933 and ECs1934) involved in the RecA-independent recombination and double-strand break repair.12Go


Figure 1
Figure 1
View larger version (85K):
[in this window]
[in a new window]
 
Figure 1. The summary and data comparison of the microarray analysis and the WGPScanning analysis of the same set of O157 strains. Results from the gene composition analysis of eight O157 strains (strains #2–#9) using the O157 Sakai microarray are shown in the upper parts, and those from the genome structure analysis using the WGPScanning method in the lower parts. In the three rows above the microarray data, genes conserved and partially conserved in K-12 are indicated in green and pink, respectively (the first row), genes on prophages, prophage-like elements and plasmids in red (the second row), and repeated genes represented by multiple hit probes in black (the third row). Genes judged as present and absent in the microarray analysis are indicated in blue and yellow, respectively. Singleton and repeated genes that were classified as ‘uncertain’ are indicated in pink and gray, respectively. Results from the WGPScaning analysis are presented as follows. The segments of the same sizes as those from O157 Sakai are indicated in gray, and those with larger (≥2 kb) and smaller (<2 kb) size reductions in yellow and light yellow, respectively. The segments with larger (≥2 kb) and smaller (<2 kb) size increments are indicated in blue and light blue, respectively, and those that were not amplified in red. Prophages or prophage-like elements of O157 Sakai that were deleted (or translocated or not integrated) in some test strains are left blank, and their alternative integration sites so far identified are also indicated. In this figure, each segment is not drawn to scale but to the gene position in the data presentation of the microarray analysis. The data presented here are the most update version, thus several modifications have been made from the original version published previously.8Go

 

View this table:
[in this window]
[in a new window]
 
Table 2. Conservation of O157 Sakai genes in the eight O157 strains

 

View this table:
[in this window]
[in a new window]
 
Table 3. Summary of the CGH analysis of each O157 strain

 
Among the eight strains, strain #2 lacked the largest number of genes (at least 307 genes), and the number of missing singleton genes reduced in the following order; #7, #6, #4, #8, #3, #9 and #5 (Table 3). This pattern well correlated with the level of structural diversification observed in the WGPScanning analysis (Fig. 1, and also refer to Ref. 23Go). Most of the repeated gene families identified in O157 Sakai were also conserved (present in the same or similar copy number) in strains #3, #5, #8 and #9. These data suggest that the four strains are more closely related to O157 Sakai in terms of gene content and genome structure.

3.3. O157 Sakai-specific genes with variable distributions in O157
Classification of the SS genes according to their functions revealed that genes belonging to specific categories contained more VAP genes compared to other categories (Table 4). The abundance of VAP genes in ‘transcription’, ‘replication, recombination, and repair’, ‘genes with unknown or uncharacterized functions’ and ‘unclassified genes’ was remarkable. This appears to be a reflection of the fact that most VAP genes are on prophages or prophage-like elements, which contain many genes for transcriptional regulation, replication, repair and phage-specific functions as well as a number of uncharacterized genes.


View this table:
[in this window]
[in a new window]
 
Table 4. Categorization and conservation of SS genes in the eight O157 strains

 
A notable finding was that 16 virulence-related genes exhibited variable distributions (Table 5). They included genes for Stx1, an HmwA-like protein, a TraT-homologue, a HecB-like protein. HmwA is known to be involved in adhesion in Haemophilus influenzae,13Go TraT in serum resistance in enteric bacteria14Go and HecB in hemolysin activation in Neisseria meningitides.15Go One strain lacked the efa-1 gene which is involved in the adherence of non-O157 EHEC to cultured epithelial cells16Go and in the inhibition of lymphocyte proliferation and proinflammatory cytokine synthesis in EPEC (called lifA in EPEC).17Go In O157, the efa-1 gene is split into two genes (ECs3860 and 3861), but it has been reported that disruption of ECs3860 (efa-1') results in reduced expression and secretion of LEE (the locus of enterocyte effacement)-encoded proteins and in reduced adherence to cultured cells.18Go The pchD and pchE genes encoding PerC-like regulators were also missing in four strains and one strain, respectively. Since the other pch genes (pchA, pchB and pchC) have been shown to regulate the LEE gene expression,19Go pchD and pchE may be involved in the regulation of virulence-related genes as well.


View this table:
[in this window]
[in a new window]
 
Table 5. Virulence-related genes that were variably present among the eight O157 strains

 
More importantly, several genes encoding type III secretion system (TTSS) effectors exhibited variable distributions. Recently, several proteins encoded on the non-LEE loci have been identified as effectors secreted by the LEE-encoded TTSS (non-LEE encoded effectors) in EPEC, EHEC and Citrobactor rodentium.20Go–25Go O157 Sakai contains at least 19 non-LEE encoded effectors or their homologues. Among these, nleA (also named espI, ECs1812), an nleF-homologue (ECs1815), an nleG-homologue (ECs1828) and an nleH-homologue (ECs1814) were variably present among the eight O157 strains. Functions of these effectors have not been well elucidated, but it has been shown that NleA/EspI injected into the host cell localizes to the Golgi apparatus.24Go

Many virulence-related genes were also encoded on the pO157 plasmid, but all were conserved except for katP (pO157_76). Of interest was that 18 out of the 83 genes on pO157 were missing in Strain #2 (Fig. 1). Since the pO157 plasmid of #2 is almost the same in size as those from other strains (data not shown), several parts of the plasmid have probably been replaced by different or highly divergent sequences. pOSAK1, a small cryptic plasmid, was present only in two strains.

3.4. Comparison of two datasets obtained by microarray and WGPScanning analyses
The CGH analysis using microarray provides the information on the gene composition but not their genomic positions, thus translocation events specific to the test strain cannot be detected. It also provides no information on strain-specific insertions. Conversely, WGPScanning detects possible loci where such genetic events have taken place while the presence or absence of each gene cannot be determined. Therefore, in order to know more details on the genomic diversity of O157, we compared the two datasets, one from the present microarray analysis and the other from the previous WGPScanning8Go (Fig. 1).

In the segments that exhibited no size variation in the WGPScanning analysis (indicated in gray in Fig. 1), no deletion of genes was detected by microarray except for only five segments. This confirmed that these regions possess the same genomic structures as those in Sakai. Among the five segments where deletions of a gene(s) were observed, four were derived from lambda-like prophage regions, implying that some parts of these prophages have been replaced by DNA fragments with same sizes but with different or highly divergent sequences.

By the WGPScanning analysis, a total of 35 segments with larger size reduction (≥2 kb smaller than that of O157 Sakai) were identified from the 8 O157 strains (indicated in yellow in Fig. 1). In most cases (29 segments), deleted genes were identified. The regions with gene deletion, however, were often associated with repeated genes derived from IS elements or lambda-like prophages, and thus precise boundaries of the deletion events were not defined in many cases. In six segments, no apparent gene deletion was detected, indicating that several genes on these segments have probably translocated to other genomic loci.

Among the 60 segments with smaller size reduction (<2 kb), which were indicated in light yellow in Fig. 1, we detected deletion of a gene(s) only in seven segments by the microarray analysis. Most of the segments with undetectable deletions (50 out of 53) contained IS elements and/or parts of lambda-like prophages, indicating that small deletions involving these elements frequently occur on O157 genomes. In the remaining three segments, small deletions in the regions that were not represented by probes or some translocations have probably taken place.

Besides the above-mentioned genomic segments that apparently contain deletions, 20 loci have been identified by the WGPScanning analysis, where prophages or prophage-like elements are integrated in Sakai but deleted (or not integrated) in some other strains (indicated by blank in Fig. 1). These loci were classified into three groups according to the patterns of gene conservation observed in the microarray analysis. The first group included the Sp7 region of strain #2, Sp13 (P2-like phage) of strains #3 and #9, and Sp18 (Mu-like phage) of strains #2, #3, #5, #6, #7, #8 and #9. In these regions, all or nearly all genes were absent. We concluded that these genetic elements are completely missing in these strains. The second group included the Sp18 region of strain #4 and SpLE1 of strain #6. In both cases, most genes on the elements were conserved, indicating that a prophage or prophage-like element almost identical to Sp18 or SpLE1 is present in other locus. We have already identified alternative integration sites for each element (Fig. 1). The third group included the Sp5 region of strains #2, #3, #4, #6, #7 and #8, Sp1/Sp2 of strain #8 and Sp7 of strain #3. In these regions, genes were partially conserved, exhibiting mosaic conservation patterns. This indicates that these strains contain, in some other loci, prophages which are significantly diverged but with some structural similarity to Sp5, Sp1/Sp2 or Sp7. Of particular interest was Sp5, the Stx2 phage of O157 Sakai. The Sp5 region was one of the regions with the highest level of variation, and we have already identified the alternative integration sites for the Stx2 phages in these six strains (Fig. 1). These Stx2 phages are thus different from Sp5 not only in the integration site but also in the genomic structure.

A total of 83 segments yielded PCR products larger than those of Sakai in the WGPScanning analysis as shown in Fig. 1, where the segments with larger increments (≥2 kb) are indicated in blue, and those with smaller increments (<2 kb) in light blue. These segments must contain some insertions. In addition, as many as 120 segments were not amplified by PCR (indicated in red in Fig. 1), in which large insertions or some other types of large genomic rearrangements likely have taken place. In these cases, if all genes on a segment were conserved, the segment most likely contains a simple insertion. By comparing the two datasets, 66 such segments were identified. Among these, we have identified seven new phage integration sites including those for Stx1 and Stx2 phages (Fig. 1). In other cases, more complicated genetic events must have occurred.

3.5. Unusual signal reductions around the IS insertion sites
During the data comparison of microarray and WGPScanning analysis, we noticed that somewhat unusual signal reductions occurred in the ETT2 region. In strains #2, #4 and #7, the region yielded larger PCR products, but a block of genes were judged as absent in the microarray analysis. Sequencing analyses of these regions, however, indicated that all genes were completely conserved but one copy of ISEc8 was newly inserted in each strain. A closer inspection of the microarray data revealed that, although the ISEc8 insertion sites were different between the strains, the genes judged as absent were all located in the 2–4 kb regions surrounding each ISEc8, and that their signal ratios were just slightly lower than the cutoff value (Fig. 2). This phenomenon was specific to the ETT2 region, and observed even after the labeled DNA was fragmented by sonication. Although we do not know the mechanism underlying this phenomenon, some caution is required to interpret the microarray data when a block of genes exhibit a moderate level of signal reduction.


Figure 2
View larger version (15K):
[in this window]
[in a new window]
 
Figure 2. Unusual signal reduction associated with insertions of ISEc8. Distribution of signal ratios (relative to O157 Sakai) from the probes representing the ETT2 regions of strains #2, #4, #7 and #8 is shown. Genes in the ETT2 region (ECs3703–3737) are depicted by black arrows, and genes on ISEc8 newly inserted in strains #2, #4 and #7 by gray arrows. Positions of each gene and probe are drawn to the O157 Sakai genome sequence. Horizontal broken lines indicate cutoff lines for the presence or absence determination in each strain. We have confirmed that all the probe sequences exhibiting unusual signal reduction are identical to those of Sakai. As seen in strain #8, no signal reduction was observed in other strains without ISEc8-insertion.

 

    4. Discussion
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
Presence of an unexpectedly high degree of structural diversity in O157 genomes has been revealed by our previous analysis using the WGPScanning method.8Go The results also suggested that the major factor generating such diversity is the variation of prophages and prophage-like elements. However, it remained to be elucidated how the genomic diversity detected by WGPScanning affects the gene content of each strain. In order to know the variation of their gene composition, we constructed an oligoDNA microarray according to the genome sequence of O157 Sakai,10Go and performed the CGH analysis of the same set of O157 strains as examined by the WGPScanning.

By this analysis, we identified 431 VAP genes that were variably present in the 8 O157 strains, and found that most of them were on S-loops, particularly on the prophage regions. This indicates that a remarkable level of variation in the gene repertoire exist among the O157 strains, and confirms the role of bacteriophages as the key players in the genome diversification of O157. Very recently, Wick et al.26Go have reported the results of CGH analyses of several types of O157 strains and its close relatives, including typical O157:H7 strains (GUD, SOR), atypical O157:H7 strains (GUD+, SOR), O157:H German clones (GUD+, SOR+) and O55:H7 strains (the closest relatives of O157:H727Go). They used a multigenome array containing 50 mer oligonucleotide probes that target 6176 ORFs on the chromosomes of K-12, O157 EDL933 and O157 Sakai. By this analysis, they identified the Sakai and K-12 genes that were gained or lost during the emergence of O157:H7 from its O55:H7-like ancestor. Many of these genes were parts of prophages and prophage-like elements, suggesting complex histories of these elements during the evolution of O15726 [GenBank] Go. What we should emphasize here is, however, that all O157 strains but strain #2 examined in our present study belonged to the typical O157 group. Even if the 89 genes that were absent only in strain #2 were excluded, 350 genes still exhibited variable distribution. This indicates that there is a remarkable variation in gene content even among typical O157 strains.

Identification of these VAP genes is of medical importance in that they could be used as valuable genetic markers to discriminate O157 strains. For example, a mini-DNA chip featuring these genes would become a useful molecular tool for epidemiological studies of O157. Of more importance may be that the VAP genes include a considerable number of virulence-related genes. Although the variation of stx1 and stx2 genes among O157 strains is known, additional 14 potentially virulence-related genes were found to display variable distributions. They include four non-LEE TTSS effectors, indicating that, although the LEE locus encoding the TTS machinery is highly conserved, the repertoire of its effectors differs significantly among O157 strains. Some metabolic genes, such as those for urease production (ECs1321–1326) and iron acquisition (ECs1693–1699), which may be indirectly involved in the pathogenesis of O157, were also found to be variably present. These data raise a possibility that some variation in the potential virulence and infectivity exists among O157 strains, and we need to analyze more O157 strains from various sources, including environmental isolates.

The phylogenetic position of strain #2 in the O157 lineage is somewhat obscure. But, since weak ß-glucuronidase activity was detected after long cultivation, the strain may belong to the GUD+ O157:H7 group. The largest number of VAP genes was detected in this strain, and it shared many, if not all, features that Wick et al.26Go described as being specific to GUD+ strains. For example, several gene blocks, such as ECs4134–4139, ECs1611–1619 and ECs3860–3861, were deleted (or highly divergent) only in this strain. The gene content on the pO157 plasmid of strain #2 also differed significantly from other strains. Unfortunately Wick et al.26Go did not examine the pO157 genes, but the difference in pO157 may be another genomic feature that discriminates the GUD+ group from typical O157 : H7 strains.

In the present study, we examined the same set of O157 strains as that were analyzed previously by the WGPScanning method. This provided a good opportunity to compare the data obtained by the two methods (Fig. 1). Although the microarray is now widely used as a major tool in comparative genomics, a fine comparison of the two datasets rather highlighted the weakness of microarray and emphasized the usefulness of a combinational approach with WGPScanning. Microarray actually provided valuable genome-wide information on the genome composition in each test strain, but almost no information on strain-specific insertions and translocations. In the case of O157, particular cautions are required in interpreting the microarray data because of the presence of same or very similar prophages as well as many IS elements. Even with such difficulties, all simple insertions as well as many translocations and replacements have been identified by comparing the two datasets. This approach also identified many small deletions that could not be detected by microarray alone. IS elements were frequently associated with such small deletions, suggesting that IS elements may be another key player to diversify O157 genomes. The combinational use of microarray and WGPScanning overcame the weakness of each method alone, and provided us more details and thus a much complex view on the genomic diversity of O157 strains.

The variation of prophages among O157 strains was not fully illustrated even with this combinational approach. But, as most strikingly shown for the Stx2-transducing phages, the data clearly indicated that prophages of O157 exhibit an extremely high level of strain-to-strain variations. These variations appeared to have been generated not only by simple insertion, deletion or translocation of prophages but also by more complex genetic events involving local recombination and deletion of prophage segments. One such example has been documented by Shaikh and Tarr28Go in the prophages integrated into the yehV locus (corresponding to Sp15, the Stx1-transducing phage of O157 Sakai). Of importance again is that most of the strains examined here were typical O157 : H7 strains. This indicates that such dynamic turnover of prophages is still actively taking place, and as we have proposed previously, O157 strains are functioning as a kind of ‘phage factory’, releasing a variety of new recombinant bacteriophages into the environment.6Go This process should be significantly involved in the generation of variable gene contents of O157 strains detected by the CGH analysis.


    Supplementary Data
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
Supplementary Data can be found online at http://dnaresearch.oxfordjournals.org.


    Acknowledgements
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 
This work was supported by Research for the Future Program funded by the Japan Society for the Promotion of Science (JSPS-RFTF00L01411), Grant-in-Aids for Scientific Research on Priority Areas and the 21st Century COE (Centers of Excellence) Program (Life Science) from the Ministry of Education, Culture, Sports, Science and Technology, Japan and a grant from the Yakult Foundation. We thank Dr Taku Ohshima for his valuable advice, Yoshiyuki Maki for his assistance in primer design, Akemi Yoshida and Yumiko Takeshita for their technical assistances, and Yumiko Hayashi for her language assistance.


    Footnotes
 
*To whom correspondence should be addressed. Tel. +81-985-85-0871, Fax. +81-985-85-6475, E-mail: thayash{at}med.miyazaki-u.ac.jp

Communicated by Satoshi Tabata


    References
 Top
 Abstract
 1. Introduction
 2. Materials and Methods
 3. Results
 4. Discussion
 Supplementary Data
 Acknowledgements
 References
 

  1. Johnson, J. R. 2002, Evolution of pathogenic Escherichia coli, San Diego, Calif Academic Press55–77 Escherichia coli: Virulence Mechanisms of a Versatile Pathogen.
  2. Kaper, J. B., Nataro, J. P., Mobley, H. L. 2004, Pathogenic Escherichia coli, Nat. Rev. Microbiol., 2, 123–140.[CrossRef][ISI][Medline]
  3. Mead, P. S. and Griffin, P. M. 1998, Escherichia coli O157:H7, Lancet, 352, 1207–1212.[CrossRef][ISI][Medline]
  4. Blattner, F. R., Plunkett, G. 3rd, Bloch, C. A., et al. 1997, The complete genome sequence of Escherichia coli K-12, Science, 277, 1453–1474.[Abstract/Free Full Text]
  5. Hayashi, T., Makino, K., Ohnishi, M., et al. 2001, Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12, DNA Res., 8, 11–22.[Abstract]
  6. Ohnishi, M., Kurokawa, K., Hayashi, T. 2001, Diversification of Escherichia coli genomes: are bacteriophages the major contributors?, Trends Microbiol., 9, 481–485.[CrossRef][ISI][Medline]
  7. Perna, N.T., Plunkett, G. 3rd, Burland, V., et al. 2001, Genome sequence of enterohaemorrhagic Escherichia coli O157:H7, Nature, 409, 529–533.[CrossRef][Medline]
  8. Ohnishi, M., Terajima, J., Kurokawa, K., et al. 2002, Genomic diversity of enterohemorrhagic Escherichia coli O157 revealed by whole genome PCR scanning, Proc. Natl Acad. Sci. USA, 99, 17043–17048.[Abstract/Free Full Text]
  9. Kim, C. C., Joyce, E. A., Chan, K., Falkow, S. 2002, Improved analytical methods for microarray-based genome-composition analysis, Genome Biol., 3, RESEARCH0065.
  10. Eisen, M. B., Spellman, P. T., Brown, P. O., Botstein, D. 1998, Cluster analysis and display of genome-wide expression patterns, Proc. Natl Acad. Sci. USA, 95, 14863–14868.[Abstract/Free Full Text]
  11. Makino, K., Ishii, K., Yasunaga, T., et al. 1998, Complete nucleotide sequences of 93-kb and 3.3-kb plasmids of an enterohemorrhagic Escherichia coli O157:H7 derived from Sakai outbreak, DNA Res., 5, 1–9.[Abstract]
  12. Kolodner, R., Hall, S. D., Luisi-DeLuca, C. 1994, Homologous pairing proteins encoded by the Escherichia coli recE and recT genes, Mol. Microbiol., 11, 23–30.[ISI][Medline]
  13. Barenkamp, S. J. and Leininger, E. 1992, Cloning, expression, and DNA sequence analysis of genes encoding nontypeable Haemophilus influenzae high-molecular-weight surface-exposed proteins related to filamentous hemagglutinin of Bordetella pertussis, Infect. Immun., 60, 1302–1313.[Abstract/Free Full Text]
  14. Pramoonjago, P., Kaneko, M., Kinoshita, T., et al. 1992, Role of TraT protein, an anticomplementary protein produced in Escherichia coli by R100 factor, in serum resistance, J. Immunol., 148, 827–836.[Abstract]
  15. Tettelin, H., Saunders, N. J., Heidelberg, J., et al. 2000, Complete genome sequence of Neisseria meningitidis serogroup B strain MC58, Science, 287, 1809–1815.[Abstract/Free Full Text]
  16. Nicholls, L., Grant, T. H., Robins-Browne, R. M. 2000, Identification of a novel genetic locus that is required for in vitro adhesion of a clinical isolate of enterohaemorrhagic Escherichia coli to epithelial cells, Mol. Microbiol., 35, 275–288.[CrossRef][ISI][Medline]
  17. Klapproth, J. M., Scaletsky, I. C., McNamara, B. P., et al. 2000, A large toxin from pathogenic Escherichia coli strains that inhibits lymphocyte activation, Infect Immun., 68, 2148–2155.[Abstract/Free Full Text]
  18. Stevens, M. P., Roe, A. J., Vlisidou, I., et al. 2004, Mutation of toxB and a truncated version of the efa-1 gene in Escherichia coli O157:H7 influences the expression and secretion of locus of enterocyte effacement-encoded proteins but not intestinal colonization in calves or sheep, Infect. Immun., 72, 5402–5411.[Abstract/Free Full Text]
  19. Iyoda, S. and Watanabe, H. 2004, Positive effects of multiple pch genes on expression of the locus of enterocyte effacement genes and adherence of enterohaemorrhagic Escherichia coli O157: H7 to HEp-2 cells, Microbiology, 150, 2357–2571.[Abstract/Free Full Text]
  20. Campellone, K. G., Robbins, D., Leong, J. M. 2004, EspFU is a translocated EHEC effector that interacts with Tir and N-WASP and promotes Nck-independent actin assembly, Dev. Cell, 7, 217–228.[CrossRef][ISI][Medline]
  21. Dahan, S., Wiles, S., La Ragione, R. M., et al. 2005, EspJ is a prophage-carried type III effector protein of attaching and effacing pathogens that modulates infection dynamics, Infect. Immun., 73, 679–686.[Abstract/Free Full Text]
  22. Deng, W., Puente, J. L., Gruenheid, S., Li, Y., et al. 2004, Dissecting virulence: systematic and functional analyses of a pathogenicity island, Proc. Natl Acad. Sci. USA, 101, 3597–3602.[Abstract/Free Full Text]
  23. Garmendia, J., Phillips, A. D., Carlier, M. F., et al. 2004, TccP is an enterohaemorrhagic Escherichia coli O157:H7 type III effector protein that couples Tir to the actin-cytoskeleton, Cell Microbiol., 6, 1167–1183.[CrossRef][ISI][Medline]
  24. Gruenheid, S., Sekirov, I., Thomas, N. A., et al. 2004, Identification and characterization of NleA, a non-LEE-encoded type III translocated virulence factor of enterohaemorrhagic Escherichia coli O157:H7, Mol. Microbiol., 51, 1233–1249.[CrossRef][ISI][Medline]
  25. Mundy, R., Petrovska, L., Smollett, K., et al. 2004, Identification of a novel Citrobacter rodentium type III secreted protein, EspI, and roles of this and other secreted proteins in infection, Infect. Immun., 72, 2288–2302.[Abstract/Free Full Text]
  26. Wick, L. M., Qi, W., Lacher, D. W., Whittam, T. S. 2005, Evolution of genomic content in the stepwise emergence of Escherichia coli O157:H7, J. Bacteriol., 187, 1783–1791.[Abstract/Free Full Text]
  27. Feng, P., Lampel, K. A., Karch, H., Whittam, T. S. 1998, Genotypic and phenotypic changes in the emergence of Escherichia coli O157:H7, J. Infect. Dis., 177, 1750–1753.[CrossRef][ISI][Medline]
  28. Shaikh, N. and Tarr, P. I. 2003, Escherichia coli O157:H7 Shiga toxin-encoding bacteriophages: integrations, excisions, truncations, and evolutionary implications, J. Bacteriol., 185, 3596–3605.[Abstract/Free Full Text]

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Infect. Immun.Home page
K. A. Eaton, D. I. Friedman, G. J. Francis, J. S. Tyler, V. B. Young, J. Haeger, G. Abu-Ali, and T. S. Whittam
Pathogenesis of Renal Disease Due to Enterohemorrhagic Escherichia coli in Germ-Free Mice
Infect. Immun., July 1, 2008; 76(7): 3054 - 3063.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
K. Izutsu, K. Kurokawa, K. Tashiro, S. Kuhara, T. Hayashi, T. Honda, and T. Iida
Comparative Genomic Analysis Using Microarray Demonstrates a Strong Correlation between the Presence of the 80-Kilobase Pathogenicity Island and Pathogenicity in Kanagawa Phenomenon-Positive Vibrio parahaemolyticus Strains
Infect. Immun., March 1, 2008; 76(3): 1016 - 1023.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
A. Iguchi, T. Ooka, Y. Ogura, Asadulghani, K. Nakayama, G. Frankel, and T. Hayashi
Genomic comparison of the O-antigen biosynthesis gene clusters of Escherichia coli O55 strains belonging to three distinct lineages
Microbiology, February 1, 2008; 154(2): 559 - 570.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
G. Wu, B. Carter, M. Mafura, E. Liebana, M. J. Woodward, and M. F. Anjum
Genetic Diversity among Escherichia coli O157:H7 Isolates and Identification of Genes Linked to Human Infections
Infect. Immun., February 1, 2008; 76(2): 845 - 856.
[Abstract] [Full Text] [PDF]


Home page
DNA ResHome page
H. Abe, A. Miyahara, T. Oshima, K. Tashiro, Y. Ogura, S. Kuhara, N. Ogasawara, T. Hayashi, and T. Tobe
Global Regulation by Horizontally Transferred Regulators Establishes the Pathogenicity of Escherichia coli
DNA Res, February 1, 2008; 15(1): 25 - 38.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
M. L. Kotewicz, S. A. Jackson, J. E. LeClerc, and T. A. Cebula
Optical maps distinguish individual strains of Escherichia coli O157 : H7
Microbiology, June 1, 2007; 153(6): 1720 - 1733.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
H. Ando, H. Abe, N. Sugimoto, and T. Tobe
Maturation of functional type III secretion machinery by activation of anaerobic respiration in enterohaemorrhagic Escherichia coli
Microbiology, February 1, 2007; 153(2): 464 - 473.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
M. Steele, K. Ziebell, Y. Zhang, A. Benson, P. Konczy, R. Johnson, and V. Gannon
Identification of Escherichia coli O157:H7 Genomic Regions Conserved in Strains with a Genotype Associated with Human Infection
Appl. Envir. Microbiol., January 1, 2007; 73(1): 22 - 31.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
H. Willenbrock, A. Petersen, C. Sekse, K. Kiil, Y. Wasteson, and D. W. Ussery
Design of a Seven-Genome Escherichia coli Microarray for Comparative Genomic Profiling
J. Bacteriol., November 15, 2006; 188(22): 7713 - 7721.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary data
Right arrowOA All Versions of this Article:
13/1/3    most recent
dsi026v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (11)