© The Author 2005. Kazusa DNA Research Institute
Short Communications |
Complete Nucleotide Sequence of the Chloroplast Genome from the Tasmanian Blue Gum, Eucalyptus globulus (Myrtaceae)
Cooperative Research Centre for Sustainable Production Forestry, School of Plant Science, University of Tasmania Private Bag 55, Hobart, Tasmania 7001, Australia
Received 22 November 2004; revised 4 April 2005
| Abstract |
|---|
|
|
|---|
The complete nucleotide sequence of the chloroplast genome of the hardwood species Eucalyptus globulus is presented and compared with chloroplast genomes of tree and non-tree angiosperms and two softwood tree species. The 160 286 bp genome is similar in gene order to that of Nicotiana, with an inverted repeat (IR) (26 393 bp) separated by a large single copy (LSC) region of 89 012 bp and a small single copy region of 18 488 bp. There are 128 genes (112 individual gene species and 16 genes duplicated in the inverted repeat) coding for 30 transfer RNAs, 4 ribosomal RNAs and 78 proteins. One pseudogene (
-infA) and one pseudo-ycf (
-ycf15) were identified. The chloroplast genome of E. globulus is essentially co-linear with that of another hardwood tree species, Populus trichocarpa, except that the latter lacks rps16 and rpl32, and the IR has expanded in Populus to include rps19 (part of the LSC in E. globulus). Since the chloroplast genome of E. globulus is not significantly different from other tree and non-tree angiosperm taxa, a comparison of hardwood and softwood chloroplasts becomes, in essence, a comparison of angiosperm and gymnosperm chloroplasts. When compared with E. globulus, Pinus chloroplasts have a very small IR, two extra tRNAs and four additional photosynthetic genes, lack any functional ndh genes and have a significantly different genome arrangement. There does not appear to be any correlation between plant habit and chloroplast genome composition and arrangement.
Key words: eucalypt; Myrtaceae; chloroplast DNA; pseudogene; gymnosperm
Eucalyptus globulus is one of the most economically important species for hardwood forestry plantations in temperate regions of the world.1
The complete chloroplast genome of E. globulus (GenBank accession no. AY780259) may be represented as a circular chromosome (Fig. 1), although this is likely to be a rare form of the molecule, as most chloroplast DNA is, in fact, linear.3
,4
Comprising 160 286 bp, it ranks among the larger land plant chloroplast genomes. Most land plant plastids sequenced to date have genomes of 116163 kb, and the longest belongs to Oenothera elata (163 935 bp5
). The structure of the E. globulus chloroplast genome is typical of most plastids: a large single copy (LSC) region (89 012 bp) and a small single copy (SSC) region (18 488 bp) are separated by an inverted repeat (IR) (26 393 bp). The relative sizes of the LSC, SSC and IR regions remain reasonably constant across genomes of angiosperms (approximately 55, 12 and 16.5% of the total genome size, respectively), regardless of the overall size of the genome. The relative size of the IR in gymnosperms varies much more. For example, in Ginkgo biloba the IR is 17 kb, but in P. thunbergii it is just 495 bp2
containing trnI-CAU and 83 bp from the 3' end of psbA, but lacking the ribosomal RNA genes that characterize other land plant IRs.
|
The Eucalyptus chloroplast genome has a GC-content of 36.9%, which is comparable with that of other vascular plant plastids (e.g. 36.7% in Populus, 37.8% in Nicotiana, 38.4% in Zea, 39.2% in Oenothera and 38.5% in P. thunbergii). The genome is AT-rich in both the non-coding intergenic regions (67% AT) and the coding regions (62% AT), where there is an AT bias (73% ± 4.5%) in the third base positions of all amino acid codons. This phenomenon is also observed in other plastid genomes. In contrast, the tRNA genes show less of an AT bias (58%), and the rRNA genes have a slight GC bias (55%). The latter is characteristic of rRNA genes in other plants.6
Table 1 lists all genes detected in the chloroplast genome of E. globulus. The start codons for the protein-coding genes were based on alignments with plastomes of other taxa. The start positions are, therefore, hypothetical and can be confirmed only through analysis of gene transcripts. The genome is essentially co-linear with that of the annual angiosperm Nicotiana tabacum (Fig. 2a), with all the same genes except sprA, which is absent from E. globulus. The chloroplast genome of E. globulus is also virtually co-linear with that from another hardwood tree species, P. trichocarpa (Fig. 2b), except for three notable differences: (i) rps16 and flanking intergenic sequences (
1800 bp of LSC) are missing from Populus; (ii) the gene rpl32 and flanking sequences (
1100 bp of SSC) are absent from Populus; and (iii) the IR in Populus has expanded to include rps19, with the extra copy of this gene located close to JLA. As in other angiosperms, the E. globulus plastome has four ribosomal RNA (rRNA) genes and 30 transfer RNA (tRNA) genes (of which seven are located in the IRs) that provide tRNAs for all 20 amino acids (Table 1). There are 78 protein-coding genes, including four conserved open reading frames (ORFs) (ycfs). Approximately 74 protein-coding genes are common to most angiosperm chloroplast genomes, and an additional 5 are present in only some species.7
Of these five, four (accD, ycf1, ycf2 and rpl23) appear to be functional in the plastome of E. globulus, but the fifth, infA, is a pseudogene (
), as in Populus, Nicotiana, Arabidopsis and Oenothera.7
One other pseudogene was detected, that of a hypothetical chloroplast protein,
ycf15. One open reading frame, ORF113, has high homology to regions of ycf68 in rice, maize and Pinus, as well as to hypothetical proteins ORF119 and ORF58 in the trnI intron of Oenothera. A second open reading frame, ORF366, is found in IRB at the junction with the SSC. It is a truncated inverted repeat of ycf1 and is probably non-functional.
|
|
There are three classes of ORFs in plastid DNA: (i) genes of known function; (ii) hypothetical chloroplast reading frames (ycfs) that are highly conserved between species; and (iii) species-specific or rapidly diverging ORFs. Four major ycfs have been partially characterized, but their precise functions are not yet understood. Two highly conserved ycfs, ycf1 and ycf2, have been demonstrated to be essential to cellular function in dicots;8
250 bp (295 bp in E. globulus) introduces premature stop codons. Schmitz-Linneweber et al.11
The psbL gene that codes for a 38 amino acid peptide of photosystem II is highly conserved among many higher plants. This gene is unusual because in Eucalyptus, as well as in some other taxa (e.g. Nicotiana and Spinacia, but not Populus), transcription of the gene does not require any of the standard chloroplast initiation codons [i.e. leucine (TTG, CTG), isoleucine (ATT, ATC, ATA), valine (GTG) or, the most common, methionine (ATG)]. Instead, ACG appears at the beginning of the gene. It has been shown in Nicotiana that a translatable psbL mRNA containing an AUG initiator codon is formed by C to U editing of the ACG codon,14
and it is possible that a similar mechanism exists in Eucalyptus.
In general, the chloroplast genome of E. globulus is not significantly different from most other angiosperms, so a comparison of hardwood and softwood chloroplasts becomes, in essence, a comparison of angiosperm and gymnosperm chloroplasts. Chloroplast DNA sequences are available for two gymnosperms, P. thunbergii (119 707 bp) and P. koraiensis (116 866 bp). Both genomes are significantly smaller than those of most angiosperms sequenced so far. Pairwise comparisons using Harr plots (Fig. 2c) and DOGMA software15
(data not shown) show that the chloroplast DNA sequences of the two pine species are very similar. In contrast, those same analytical techniques indicate that the chloroplast genomes of P. thunbergii and E. globulus are arranged very differently (Fig. 2d). Relative to Eucalyptus, rbcL and its neighboring regions in the LSC region are inverted in the pines, and a large region from the LSC, including psaA and psaB, occurs in the SSC.2
The rRNA genes from rrn16 to trnR-AGC that are in the inverted repeat in angiosperms form a cluster in the middle of the SSC in P. thunbergii.2
In addition to the 30 tRNA genes found in angiosperms, the two pine species have two unusual tRNAs, trnP-GGG and trnR-CCG. The first of these is also found in hornworts16
and ferns17
, and trnR-CCG has been found in moss, although it is not essential for plastid function in moss and may not be a functional gene.18
Angiosperms and pines have the same suite of ribosomal protein genes, except that the pines lack rps16. Pines have an intact infA gene, in contrast to the pseudogene found in Eucalyptus and many other angiosperms (see above). In addition to the 29 genes encoding components of the photosynthetic apparatus in angiosperms, pines have 4 more genes that exist in some lower plants: psaM, chlB, chlL and chlN. The psaM gene (which is duplicated in the LSC of P. thunbergii,2
but not in P. koraiensis) has been found in non-vascular plants, but is absent from ferns and angiosperms, suggesting parallel losses in the latter two groups during their evolution.17
The genes chlB, chlL and chlN may be associated with the ability of pines to synthesize chlorophyll in the dark (as in Chlamydomonas19
). A major difference in the gene content between pines and angiosperms is the complete absence of functional ndh genes from pine chloroplasts.2
It is unclear whether chloroplast ndh genes have been transferred to the nuclear genome of pines, or whether pine chloroplasts lack an NADH dehydrogenase altogether. Eucalyptus and Nicotiana have 21 introns, 5 more than P. thunbergii and P. koraiensis. Of these five, three occur in genes that are absent from pines (rps16, ndhA and ndhB), and two occur in clpP that, in pines, has no introns. The 16 remaining split genes are conserved between pines and angiosperms.2
In conclusion, there does not appear to be any correlation between plant habit and plastome composition and arrangement. Differences between chloroplast genomes of tree and non-tree angiosperm species are slight. In contrast, although angiosperm and gymnosperm chloroplasts share many genes, there are significant differences in genome size, arrangement and gene content.
| Acknowledgements |
|---|
|
|
|---|
The author thanks Peter Wilson and other staff at the Australian Genome Research Facility (AGRF); Natalie Papworth and Alan McFadden (Royal Tasmanian Botanical Garden); Peter Boyer (SouthWind Writing and Publishing Services, Tasmania); Bob Elliott, Adam Smolenski, Natalie Conod, Rebecca Jones, Catherine Phillips, Briony Patterson, Gay McKinnon, Brad Potts and René Vaillancourt (University of Tasmania). This research was funded by the Cooperative Research Centre for Sustainable Production Forestry (CRC-SPF).
| Footnotes |
|---|
*Tel. +61-3-62261828, Fax. +61-3-62262698, E-mail: dorothy.steane{at}utas.edu.au
| References |
|---|
|
|
|---|
- Eldridge, K. G., Davidson, J., Harwood, C., van Wyk, G. 1993, Eucalypt Domestication and Breeding, Oxford Clarendon Press.
- Wakasugi, T., Tsudzuki, J., Ito, S., Nakashima, K., Tsudzuki, T., Sugiura, M. 1994, Loss of all ndh genes as determined by sequencing the entire chloroplast genome of the black pine Pinus thunbergii, Proc. Natl Acad. Sci. USA, 91, 97949798.
[Abstract/Free Full Text] - Oldenburg, D. J. and Bendich, A. J. 2004, Most chloroplast DNA of maize seedlings in linear molecules with defined ends and branched forms, J. Mol. Biol., 335, 953970.[CrossRef][Medline]
- Bendich, A. J. 2004, Circular chloroplast chromosomes: the grand illusion, Plant Cell, 16, 16611666.
[Free Full Text] - Hupfer, H., Swiatek, M., Hornung, S., et al. 2000, Complete nucleotide sequence of the Oenothera elata plastid chromosome, representing plastome I of the five distinguishable Euoenothera plastomes, Mol. Gen. Genet., 263, 581585.[Web of Science][Medline]
- Goremykin, V. V., Hirsch-Ernst, K. I., Wolfl, S., Hellwig, F. H. 2003, Analysis of the Amborella trichopoda chloroplast genome sequence suggests that Amborella is not a basal angiosperm, Mol. Biol. Evol., 20, 14991505.
[Abstract/Free Full Text] - Millen, R. S., Olmstead, R. G., Adams, K. L., et al. 2001, Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus, Plant Cell, 13, 645658.
[Abstract/Free Full Text] - Drescher, A., Ruf, S., Calsa, T., Carrer, H., Bock, R. 2000, The two largest chloroplast genome-encoded open reading frames of higher plants are essential genes, Plant J., 22, 97104.[CrossRef][Web of Science][Medline]
- Boudreau, E., Takahashi, Y., Lemieux, C., Turmel, M., Rochaix, J. D. 1997, The chloroplast ycf3 and ycf4 open reading frames of Chlamydomonas reinhardtii are required for the accumulation of the photosystem I complex, EMBO J., 16, 60956104.[CrossRef][Web of Science][Medline]
- Ruf, S., Kossel, H., Bock, R. 1997, Targeted inactivation of a tobacco intron-containing open reading frame reveals a novel chloroplast-encoded photosystem I-related gene, J. Cell Biol., 139, 95102.
[Abstract/Free Full Text] - Schmitz-Linneweber, C., Maier, R. M., Alcaraz, J. P., Cottet, A., Herrmann, R. G., Mache, R. 2001, The plastid chromosome of spinach (Spinacia oleracea): complete nucleotide sequence and gene organization, Plant Mol. Biol., 45, 307315.[CrossRef][Web of Science][Medline]
- Vera, A. and Sugiura, M. 1994, A novel RNA gene in the tobacco plastid genome: its possible role in the maturation of 16S ribosomal RNA, EMBO J, 13, 22112217.[Web of Science][Medline]
- Sugita, M., Svab, Z., Maliga, P., Sugiura, M. 1997, Targeted deletion of sprA from the tobacco plastid genome indicates that the encoded small RNA is not essential for pre-16S rRNA maturation in plastids, Mol. Gen. Genet., 257, 2327.[CrossRef][Web of Science][Medline]
- Kudla, J., Igloi, G., Metzlaff, M., Hagemann, R., Kossel, H. 1992, RNA editing in tobacco chloroplasts leads to the formation of a translatable psbL mRNA by a C to U substitution within the initiation codon, EMBO J., 11, 10991103.[Web of Science][Medline]
- Wyman, S. K., Jansen, R. K., Boore, J. L. 2004, Automatic annotation of organellar genomes with DOGMA, Bioinformatics, 20, 32523255.
[Abstract/Free Full Text] - Kugita, M., Kaneko, A., Yamamoto, Y., Takeya, Y., Matsumoto, T., Yoshinaga, K. 2003, The complete nucleotide sequence of the hornwort (Anthoceros formosae) chloroplast genome: insight into the earliest land plants, Nucleic Acids Res., 31, 716721.
[Abstract/Free Full Text] - Wolf, P. G., Rowe, C. A., Sinclair, R. B., Hasebe, M. 2003, Complete nucleotide sequence of the chloroplast genome from a leptosporangiate fern, Adiantum capillus-veneris L, DNA Res., 10, 5965.[Abstract]
- Sugiura, C. and Sugita, M. 2004, Plastid transformation reveals that moss tRNA(Arg)-CCG is not essential for plastid function, Plant J., 40, 314321.[CrossRef][Medline]
- Liu, X. Q., Xu, H., Huang, C. Z. 1993, Chloroplast chlB gene is required for light-independent chlorophyll accumulation in Chlamydomonas reinhardtii, Plant Mol. Biol., 23, 297308.[CrossRef][Web of Science][Medline]
- Palmer, J. D. 1986, Methods in Enzymology, New York Academic Press167186.
- Steane, D. A., West, A. K., Potts, B. M., Ovenden, J. R., Reid, J. B. 1991, Restriction fragment length polymorphisms in chloroplast DNA from six species of Eucalyptus, Aust. J. Bot., 39, 399414.
- Doyle, J. J. and Doyle, J. L. 1990, Isolation of plant DNA from fresh tissue, Focus, 12, 1315.
- Ewing, B. and Green, P. 1998, Base-calling of automated sequencer traces using Phred. II. Error probabilities, Genome Res., 8, 186194.
[Abstract/Free Full Text] - Wakasugi, T., Sugita, M., Tsudzuki, T., Sugiura, M. 1998, Updated gene map of tobacco chloroplast DNA, Plant Mol. Biol. Rep., 16, 231241.[CrossRef][Web of Science]
This article has been cited by other articles:
![]() |
S. Tangphatsornruang, D. Sangsrakru, J. Chanprasert, P. Uthaipaisanwong, T. Yoocha, N. Jomchai, and S. Tragoonrung The Chloroplast Genome Sequence of Mungbean (Vigna radiata) Determined by High-throughput Pyrosequencing: Structural Organization and Phylogenetic Relationships DNA Res, February 1, 2010; 17(1): 11 - 22. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Cullis, B. J. Vorster, C. Van Der Vyver, and K. J. Kunert Transfer of genetic material between the chloroplast and nucleus: how is it related to stress in plants? Ann. Bot., February 1, 2009; 103(4): 625 - 633. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ueda, T. Nishikawa, M. Fujimoto, H. Takanashi, S.-i. Arimura, N. Tsutsumi, and K.-i. Kadowaki Substitution of the Gene for Chloroplast RPS16 Was Assisted by Generation of a Dual Targeting Signal Mol. Biol. Evol., August 1, 2008; 25(8): 1566 - 1575. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Greiner, X. Wang, U. Rauwolf, M. V. Silber, K. Mayer, J. Meurer, G. Haberer, and R. G. Herrmann The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. Sequence evaluation and plastome evolution Nucleic Acids Res., April 1, 2008; 36(7): 2366 - 2378. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Barthet and K. W. Hilu Expression of matK: functional and evolutionary implications Am. J. Botany, August 1, 2007; 94(8): 1402 - 1412. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Timme, J. V. Kuehl, J. L. Boore, and R. K. Jansen A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats Am. J. Botany, March 1, 2007; 94(3): 302 - 312. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Chumley, J. D. Palmer, J. P. Mower, H. M. Fourcade, P. J. Calie, J. L. Boore, and R. K. Jansen The Complete Chloroplast Genome Sequence of Pelargonium x hortorum: Organization and Evolution of the Largest and Most Highly Rearranged Chloroplast Genome of Land Plants Mol. Biol. Evol., November 1, 2006; 23(11): 2175 - 2190. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






