DNA Research Advance Access originally published online on February 23, 2006
DNA Research 2005 12(6):403-416; doi:10.1093/dnares/dsi023
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Characterization of a Whole Set of tRNA Molecules in an Aerobic Hyper-thermophilic Crenarchaeon, Aeropyrum pernix K1
1National Institute of Technology and Evaluation 2-49-10 Nishihara, Shibuya, Tokyo 151-0066, Japan
2National Institute of Advanced Industrial Science and Technology (AIST) Higashi 1-1, Tsukuba, Ibaraki 305-8566, Japan
Received 13 August 2004; revised 29 November 2005
| Abstract |
|---|
|
|
|---|
The tRNA molecule has an important role in translation, the function of which is to carry amino acids to the ribosomes. It is known that tRNA is transcribed from tRNA genes, some of which, in Eukarya and Archaea, contain introns. A computational analysis of the complete genome of Aeropyrum pernix K1 predicted the presence of 14 intron-containing tRNA genes. To elucidate whether these introns are actually processed in living cells and what mechanism detects the intron regions, cDNAs for premature and mature forms of the tRNA molecules transcribed from the intron-containing tRNA genes in the model aerobic acidothermophilic crenarchaeon, A. pernix K1 were identified and analyzed. A comparison between the nucleotide sequences of these two types of cDNAs indicated that the intron regions of the tRNA molecules were indeed processed in A. pernix K1 living cells. Some cDNA clones showed that the actual splicing positions were different from those predicted by computational analysis. However, the bulgehelixbulge structure, which has been previously identified in exonintron boundaries of archaeal tRNA genes, was evident in all boundary regions confirmed in this work. These results indicate that the generally described mechanism for tRNA processing in Archaea is utilized for processing the intron region of the tRNA molecules in A. pernix K1.
Key words: tRNA; intron; splicing; bulgehelixbulge structure; genome; Crenarchaeon; Aeropyrum pernix K1
| 1. Introduction |
|---|
|
|
|---|
The intron region, which is processed immediately after transcription, has been identified in protein-coding and RNA-coding genes in many organisms. As shown in Fig. 1, the mechanisms for splicing of the introns already characterized were classified in the group I, II, III, nuclear mRNA and tRNA introns.
|
It is known that introns in tRNA genes, which encode RNA molecules necessary to transfer the amino acids to ribosome, are identified in many organisms, Eukarya, Bacteria and Archaea. The tRNA introns in Bacteria can be classified into group I or II intron, that are processed with GTP or by intron itself without other factor, respectively (Fig. 1A and B). Conversely, the tRNA introns in Eukarya or Archaea lack any identifiable sequence or structure to group I, II, III or mRNA introns as indicated in Fig. 1C.1
Previous analysis indicated that tRNA introns in Eukarya were recognized their mature structure by tRNA endonuclease during the cleavage of the introns.2
In this structure, the tRNA portion of the precursor maintains an L-shaped conformation, stabilized by the interaction between the D and T
C loops. The 3' splice site junction is always single-stranded.3
In vitro and in vivo analyses have shown that the cleavage sites of intron in eukaryotic tRNA molecules are identified by a measurement mechanism that senses the distance from the center of the molecule, possibly the top of the anticodon stem, to the 5' and 3' cleavage sites, as shown in Fig. 2A.4
,5
Because of this mechanism for the processing of intron portion in Eukarya, the intron in eukaryotic tRNA genes is located only at one base 3' from anticodon region.
|
In contrast to eukaryotic mechanism, it is found that the recognition and cleavage of the intron region by the archaeal intron endonuclease does not require the complete mature tRNA structure, but requires a defined structural element at the exonintron boundaries, referred to as the bulgehelixbulge motif shown in Fig. 2B.6
To date, the entire nucleotide sequences for 16 archaeal genomes have been determined. From this genomic data, the tRNA genes, that were interrupted by an intron, were identified in both the Euryarchaeota and the Crenarchaeota. Though the genomes of the Euryarchaeota contain a limited number of interrupted tRNA genes, computational analysis predicted 14 and 24 interrupted tRNA genes in two Crenarchaea, Aeropyrum pernix K1 and Sulfolobus tokodaii strain7, respectively.14
,15
Also noteworthy was the observation that tRNA genes containing an intron longer than 40 nt were predicted in both genomes and that the tRNA genes interrupted by the intron at the other position than the general one, one base 3' from anticodon region, were also predicted in both genomes. This data led to two questions: (i) Are these introns isolated in archaeal tRNA genes actually cleaved out during tRNA processing in living cells? (ii) Are the all intron regions in tRNA genes, including those located at other position than one base 3' from anticodon region, correctly recognized by the tRNAintron splicing machinery in archaeal living cells?
In this study, to determine whether all tRNA molecules predicted were expressed, whether the intron regions predicted were actually processed in archaeal living cells and the mechanism for tRNAintron splicing in Archaea, A. pernix K1 was selected as a target. The reasons why this archaeon was selected are shown in the following: (i) the entire genomic sequence was already determined14
; (ii) 14 tRNA genes were identified as interrupted tRNA genes in this genome; (iii) tRNA genes that contain the long intron or the introns within anticodon loop and anticodon stem were predicted and (iv) this microorganism can be easily cultured in aerobic conditions. The cDNA molecules were synthesized from tRNA molecules of this Crenarchaeon by RTPCR using the specific primers designed from the genomic data.
The plasmid clones containing the cDNA fragments were used for determination of the nucleotide sequences of the mature and premature types of the cDNA molecules. A comparison between these two types of sequences indicated that the introns predicted in tRNA genes of A. pernix K1 were actually processed in living cells and that the bulgehelixbulge motif was required for recognition of exonintron boundaries and cleavage of the intron region in A. pernix K1 living cells.
| 2. Materials and Methods |
|---|
|
|
|---|
2.1. Bacterial strains and vector
The A. pernix strain K1, deposited in the Japan Collection of Microorganisms (JCM number 9820), was cultured using JXT medium.16
2.2. Preparation of total RNA and small-sized RNA
From A. pernix K1 cells, which were grown to late log phase in 40 ml of JXT medium,16
a crude total RNA sample was prepared with Trizol Reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer's instruction. The prepared RNA was dissolved in 10 µl of RNase-free water. To remove the contaminated DNA, the total RNA was incubated with 10 U of the RNase-free DNase I (Worthington biochemical, Lakewood, NJ, USA) in 50 µl of the reaction buffer [50 mM TrisHCl (pH 7.8), 1 mM MgCl2 and 1 mM CaCl2] at 37°C for 30 min. After incubation, purified RNA was treated by an equal volume of phenol chloroform solution and precipitated with ethanol. The recovered RNA molecules were dissolved in 10 µl of RNase-free water and stored at 80°C.
A mixture of small-sized RNA, including the crude tRNA and 5S RNA, was prepared from 40 µg of total RNA by fractionation with the QIAGEN RNA/DNA Kit (QIAGEN, Hilden, Germany) according to the manufacturer's instruction.
2.3. Preparation of the beads covalently binding the sequence-specific DNA molecules
To enrich the less abundant tRNA molecules, the specific tRNA molecules were purified by affinity chromatography. Magnetic beads covalently binding the poly(dT) oligo nucleotide were used as the starting materials. Since the nucleotide sequence of the 3' half of tRNASer(CGA) and tRNASer(UGA) were initially same, two 40mer oligo nucleotides, which contained the 3' half of tRNATrp(CCA) and tRNASer(CGA) or tRNASer(UGA), with a 10 nt long poly(dA) sequences at the 3' end of this oligo DNA were synthesized. The sequences of the synthesized DNA are GACCCGTAGGTCGGGGGTTCAAATCCCCCCGGGCCCACCAAAAAAAAAAA for tRNATrp(CCA), and CGCGAGGGGCGCGCGGGTTCGAATCCCGCCCCCGGCGCCAAAAAAAAAAA for tRNASer(CGA) and tRNASer(UGA).
For construction of the matrix for affinity chromatography, 10 µg of synthesized oligo nucleotides were heated at 94°C for 3 min with 15 µl of the OligotexTM-MAG (Takara, Kyoto, Japan) in 50 µl of the kit's 2x binding buffer. After treatment, the solution was slowly cooled to room temperature and the OligotexTM-MAG beads were washed with 350 µl of the kit's wash solution to remove any non-hybridized oligo DNA. For synthesis of the DNA strands complimentary to the oligo DNA on the beads, the OligotexTM-MAG beads with hybridized oligo DNA were incubated in 80 µl of klenow DNA polymerase buffer [10 mM TrisHCl (pH 7.5), 7 mM MgCl2, 0.1 mM DTT and 0.125 mM each of dATP, dGTP, dCTP and dTTP] at 37°C for 30 min with 12 U of klenow DNA polymerase. After incubation, the OligotexTM-MAG beads in the solution were heated at 94°C for 2 min and quickly chilled. The resulting magnetic beads covalently bind the tRNA-specific DNA strand, and hence were used for affinity concentration of the less-abundant tRNA molecules from small-sized tRNA.
2.4. Purification of the species-specific tRNA molecules
Small-sized RNA (0.5 µg) was incubated at 94°C for 3 min with the OligotexTM-MAG beads prepared as above in 50 µl of 2x binding buffer used in the OligotexTM-MAG kit. After heating, the solution was incubated at 25°C for 10 min. The non-hybridized tRNA molecules were removed by washing twice with 350 µl of the kit's wash buffer. The species-specific tRNA molecules were recovered in 20 µl of RNase-free pure water by heating at 94°C for 2 min and quickly chilled in ice water. Recovered species-specific concentrated tRNA molecules were stored at 80°C and used for the synthesis of species-specific cDNA.
2.5. RTPCR
Synthesis and amplification of cDNA for specific tRNA molecules were carried out by the ThermoscriptTM RTPCR System (Invitrogen, Carlsbad, CA, USA) according to the supplier's instruction with some modification. One microgram of total RNA, 1 µg of small-sized RNA or 0.05 µg of species-specific concentrated RNA and 10 pmol of primer were incubated at 95°C for 5 min in 19 µl of ThermoscriptTM RTPCR buffer [50 mM Trisacetate (pH 8.4), 75 mM potassium acetate, 8 mM magnesium acetate, stabilizer, 5 mM DTT, 40 U RNaseOUT (Invitrogen, Carlsbad, CA, USA) and 1 mM each of dATP, dGTP, dCTP and dTTP] and cooled to the Tm of each primer set used. After cooling to this temperature, 15 U ThermoscriptRT enzyme was added and the reaction mixture was kept at this temperature for 30 min to allow synthesis of the first strand of cDNA.
The subsequent PCR amplification was performed in 50 µl of reaction mixture containing 2 µl of solution from the first reaction, 20 mM TrisHCl (pH 8.4), 50 mM KCl, 1.5 mM MgCl2, 0.2 mM each of dATP, dGTP, dCTP and dTTP, 10 pmol of the appropriate primer set (as detailed in Tables 1 and 2) and 2 U of Platinum Taq DNA polymerase (Invitrogen, Carlsbad, CA, USA). The PCR was performed under standard conditions as specified in the manual accompanying the ThermoscriptTM RTPCR kit. After purification of the amplified fragments by 3% Nusive GTG agarose (FMC, Rockland, MD, USA) gel electrophoresis, the fragments were dissolved in the 10 µl of RNase-free water and stored in 20°C.
|
|
2.6. Cloning of cDNA
For cloning the cDNA fragments amplified, 2 µl of cDNA molecules purified were incubated in 10 µl of reaction solution at 14°C by Original TA Cloning kit (Invitrogen, Carlsbad, CA, USA) with 0.05 µg of pCR2.1 Vector (Invitrogen, Carlsbad, CA, USA). For transformation of the ligated DNA, the DNA was introduced into the E. coli TOP10F' competent cells (Invitrogen, Carlsbad, CA, USA).
2.7. Sequencing
The plasmid DNA was prepared from E. coli cells, which were cultured in 1.2 ml of liquid LuriaBertani broth at 37°C for 18 h, by QIAprep turbo Miniprep KIT (QIAGEN, Hilden, Germany). The sequencing reaction was performed with the BigDye Terminator Cycle Sequencing Kit (ABI, Foster city, CA, USA), according to the supplier manual and the signal was detected using BASE station (MJ research, Waltham, MA, USA).
| 3. Results |
|---|
|
|
|---|
3.1. RNA molecules prepared
As shown in Materials and Methods section, three different qualities of RNA molecules, total RNA, small RNA and species-specific concentrated tRNA molecules, were prepared from freshly cultured cells of A. pernix K1.
Approximately 120 µg of the total RNA molecule was recovered from A. pernix K1 cells cultured in 40 ml culture broth. Approximately 1.8 µg of small RNA was recovered using the QIAGEN RNA/DNA Kit from 40 µg of DNase-treated total RNA. From 0.5 µg of small RNA, all species-specific concentrated tRNA molecules for tRNATrp(CCA), tRNASer(CGA) and tRNASer(UGA) were used for the cDNA synthesis.
3.2. Design of primers
The experiment in this report was designed to utilize the specific primers, each of which possessed the sequence identical to corresponding tRNA molecule. The specific primers were capable of annealing only to the specific sequence of the tRNA molecules. For amplification and cloning of the specific cDNA from tRNA molecules, selection of length and sequence of the primer was thought to be important.
For the design of the primers utilized for the synthesis of the first strand of the cDNA from the 47 species of tRNA molecules assigned from the genomic sequence,14
the 3' end sequences of the tRNA genes were compared with each other. As some identical 3' end sequences were recognized, 21 independent 18mer oligonucleotides were designed and synthesized. As indicated in Table 1 (column A), these synthesized oligo DNAs were referred as cP plus a two-digit number.
To amplify the cDNA molecules synthesized as the first strand of cDNA from the tRNA molecules, the primers, named aP plus a two-digit number, were designed and synthesized according to the 5' end sequence of tRNA genes predicted. As listed in Table 1 (column B), 21 independent 18mer oligonucleotides were synthesized, because of the presence of the identical 5' sequences. PCR with 27 different combinations of cP and aP primers, as listed in Table 2, were planned for amplification of all 47 tRNA molecules present in A. pernix.
3.3. Detection of amplified fragments
During synthesis of the first strand of cDNA, the RNA template and cP-type primer was not chilled below the Tm of the primer used, to minimize the non-specific annealing of primers to the unrelated region or unrelated RNA molecules. After heat treatment, at 95°C for 5 min, of a reaction solution containing both the RNA and cP-type primer, the solution was cooled only to the annealing temperature and this temperature was maintained until addition of reverse transcriptase.
Among 27 RTPCRs with the total RNA, all the amplified cDNA products were detected except for reaction by primer set 8. The cDNA fragments synthesized from the total RNA by 26 primer sets were used for construction of independent libraries by cloning into pCR2.1 vector. Each library constructed was designated as the APcDNAT plus number of primer set.
Since some cDNA fragments which were expected to be involved in the libraries APcDNAT08, 15, 20, 22 and 27 were not recovered, RTPCRs and library construction were performed with the small RNA to obtain the cDNAs not identified from the total RNA. By this reaction, the four libraries, referred as the APcDNAS plus number of primer set, were constructed with products amplified by the primer set 15, 20, 22 and 27, but a fragment was not amplified by the primer set 8. Although the plasmid clones made from mature form of tRNATyr(GUA), tRNASer(GGA) and tRNAPhe(GAA) were identified from the APcDNAS libraries, cDNA fragments for mature forms of tRNATrp(CCA) and tRNASer(CGA) and tRNASer(UGA) were still not obtained from this type of libraries.
To obtain the cDNA fragments synthesized from the mature forms of tRNATrp(CCA) and tRNASer(CGA) and tRNASer(UGA), affinity chromatography was performed to separate and concentrate these three specific tRNA molecules. The cDNA clone for the mature form of tRNATrp(CCA) was identified in the library constructed from species-specific concentrated tRNA molecules. However, cDNA fragments for mature form of tRNASer(CGA) and tRNASer(UGA) were not detected in the libraries constructed from this concentrated RNA.
3.4. Identification of tRNA molecules without introns by sequence determination
The cDNA libraries APcDNAT02, 04, 05, 06, 07, 09, 10 and 12 contained only one uninterrupted cDNA species as shown in Table 2. To confirm these tRNA molecules, five plasmid clones, each of one library, were randomly selected and used for determination of the nucleotide sequence of the insert portion. Sequencing results indicated that these eight tRNA genes actually produced the tRNA molecules identical to the prediction.
Since the cDNA libraries APcDNAT01, 03, 11, 13 and 14 contained the multiple tRNA molecules transcribed from the uninterrupted tRNA genes, 30, 40, 20, 20 and 20 plasmid DNAs used for determination of sequence were prepared from the cDNA libraries APcDNAT01, 03, 11, 13 and 14, respectively. Sequencing results indicated that all 13 uninterrupted tRNA molecules recognized by these five primer sets were actually present in the living A. pernix cells, and that the sequence of the tRNA molecules actually present in the living A. pernix cells were identical to previous prediction of these tRNA genes.
The sequencing results for clones from APcDNAT01 to 07 and from APcDNA09 to 14 indicated that the nucleotide sequences of the 21 tRNA molecules actually produced from the uninterrupted tRNA genes in A. pernix were identical to those predicted in the previous genomic paper.14
Each library APcDNAT16, 20, 21, 23, 24 and 27 contained two or more cDNA clones corresponding to the premature and mature forms of tRNA molecules, because these primer sets were specific to both the intron-containing and uncontaining tRNA genes. Thus, 80, 60, 40, 60, 60 and 40 clones randomly selected were used for determination of the sequence of insert portion of clones in libraries APcDNAT16, 20, 21, 23, 24 and 27, respectively. Sequencing results indicated that among 11 tRNA molecules expected to be included in these libraries, 8 cDNA clones for uninterrupted molecules for tRNAVal(CAC), tRNAVal(UAC), tRNAVal(GAC), tRNASer(GCU), tRNAArg(GCG), tRNAArg(CCU), tRNAGly(CCC) and tRNAGly(UCC), were identified in these APcDNAT type of libraries. The APcDNAS libraries were constructed with cDNA synthesized from the small RNA with primer set 20 and 27, to obtain the remaining cDNAs for three uninterrupted tRNA molecules. Sequencing results showed that the cDNA clones for tRNASer(GGA) and tRNAPhe(GAA) were identified from the APcDNAS libraries.
In the libraries constructed from the total or small RNA, totally 31 cDNA clones corresponding to the tRNA molecules produced from the uninterrupted tRNA genes were identified. Although cDNA clones for tRNASer(UGA) and tRNAThr(UGU-2) were not detected in any libraries, the over all results revealed that previous prediction of the uninterrupted tRNA genes was accurate.
3.5. Confirmation of actual intron position
To determine the actual splicing patterns of tRNA molecules transcribed from each interrupted tRNA gene, sequence comparison of the premature form with mature form of the tRNA molecules transcribed from same interrupted tRNA gene was performed.
As the length of the cDNA synthesized from premature form of tRNA molecules was longer than those from mature form of tRNA molecules, five longer clones, each of one library, APcDNAT15 to 27, were used for the determination of their sequences for the premature form. The cloning and sequencing results indicated that all premature forms of tRNA molecules transcribed from the interrupted tRNA genes were identified from the APcDNAT library except for tRNAThr(CGU), which was still not identified from APcDNAS library. These results are summarized in Table 3.
|
Since the primer sets 15, 17, 18, 19, 25 and 26 were able to recognize only one interrupted tRNA gene, it was thought that each library, APcDNAT15, 17, 18, 19, 25 and 26, contained only two different-sized cDNA clones made from the premature and mature forms of tRNA molecules. Each of the five plasmids with the short and long insert were selected from each library and used for sequencing analyses. All cDNA clones corresponding to the mature form of these interrupted tRNA molecules were successfully identified from these six libraries. The cDNA clone for the mature form of the tRNAThr(CGU) molecule was identified from the APcDNAT library, but that for the premature form of tRNAThr(CGU) was not identified from the APcDNAT or APcDNAS libraries.
As the APcDNAT22 library contained cDNAs corresponding to the two interrupted tRNA genes, tRNATrp(CCA) and tRNAMet(CAU-2), each of the 30 clones for the shorter and longer clones from the library were used for the determination of the sequence. The sequencing results indicated that both tRNAMet(CAU-2) molecules were present in A. pernix, but the evidence for the maturation of tRNATrp(CCA) was not obtained from APcDNAT22 library. Then, the library APcDNAS22 was constructed from the small RNA, but the cDNA clone corresponding to the mature form of tRNATrp(CCA) was not identified in this library.
The libraries APcDNAT16, 20, 21, 23, 24 and 27 contained the multiple cDNAs synthesized with primer sets specific to both the interrupted and uninterrupted tRNA genes. The length of the cDNA fragment was confirmed by PCR analyses of insert portion, and 60100 independent plasmid clones with short insert were used for the determination of sequence. All shorter cDNA clones were detected in these sequenced clones in APcDNAT libraries except for the APcDNAT20 and 27 libraries. Two cDNA libraries, APcDNAS20 and 27, were constructed to obtain these missing cDNA clones in the libraries APcDNAT20 and APcDNAT27. The cDNA clones corresponding to the uninterrupted tRNA molecules, tRNASer(GGA) and tRNAPhe(GAA), and mature form of tRNATyr(GUA) were identified from the APcDNAS libraries. However, two cDNA clones corresponding to the mature form of tRNASer(CGA) and the tRNASer(UGA) molecule were still not identified from the APcDNAS library.
As the cDNAs for two mature forms of the tRNA molecules, tRNATrp(CCA) and tRNASer(CGA), were not identified from the APcDNAS library, the concentration of species-specific tRNA molecules for tRNATrp(CCA) and tRNASer(CGA) were performed as described in Materials and Methods section. In the libraries constructed from the species-specific concentrated tRNA molecules, the mature form of tRNATrp(CCA) was identified, but cDNA for tRNASer(CGA) was not obtained.
Among 14 interrupted tRNA genes, 12 sets of both the premature and mature forms, 1 mature form and 1 premature form of cDNA, were identified as indicated in Table 3. These results indicated that the tRNA molecules, that were actually transcribed from the interrupted tRNA genes predicted, were actually processed in A. pernix K1 living cells. Since the mature form of tRNAThr(CGU) molecule was cloned and sequenced, sequence comparison data of this clone and 12 sets of both forms of tRNA molecules provided the information for actual splicing position.
In the previous genomic paper of A. pernix K1,14
the intron region of tRNATrp(CCA), tRNAThr(UGU-1) and tRNAAsp(GUC) were manually predicted to locate at three bases 5' from the anticodon, within the D-loop and at one base 3' from anticodon with the 121 bp long intron, which was 1.5 times longer than the tRNA molecule itself, respectively. The comparison of cDNA sequences corresponding to the mature and premature forms of these tRNA molecules indicated that the intron region predicted in these tRNA genes were actually processed in A. pernix K1 cells.
However, 36 nt long introns in the tRNALys(CUU) and the tRNALys(UUU) were identified at 8 bases 3' from the anticodon region as shown in bold in Table 3. Also, the 44 nt long intron in the tRNAPro(CGG) was identified at one base 5' from the anticodon region. Totally, five interrupted tRNA genes in A. pernix contained the intron portion at other position than one base 3' from anticodon and other eight tRNA introns, including the 121 bp long intron in tRNAAsp(GUC), were inserted into the position at one base 3' from anticodon, which is common to the eukaryotic tRNA intron.
| 4. Discussion |
|---|
|
|
|---|
Our work was able to show both the actual transcription of the tRNA molecules and the actual processing of tRNAintron portions by cloning a whole set of cDNA for tRNA molecules in A. pernix living cells. This is one of the first cases of identification of a whole set of tRNA molecules present in one organism as cDNA, including the premature and mature forms of tRNA molecules.
Our success in cloning cDNA clones for most of tRNA molecules was mainly due to the improvement of the methods used for synthesis of the first strand of cDNA from RNA molecules. In our work, the solution containing both template RNA and primer was not chilled below the Tm of the primers, which should allow primers to anneal to specific sites of tRNA molecules only. It is noteworthy that the annealing temperature of primers and RNA template is important for synthesis and amplification of specific cDNA from similar molecules such as tRNA.
In this work, four tRNA molecules, tRNAThr(UGU-2), tRNASer(UGA), mature form of tRNASer(CGA) and premature form of tRNAThr(CGU), were not identified as cDNA molecules from any qualities of RNA. The mature form of tRNATyr(GUA), tRNASer(GGA) and tRNAPhe(GAA) were isolated from small RNA molecules. The cDNA for the mature form of tRNATrp(CCA) was successfully cloned from the species-specific tRNA molecules concentrated by the affinity chromatography technique, indicating that affinity chromatography was effective for concentration of the species-specific tRNA molecules. These results revealed that the concentration of one kind of tRNA molecule and usage of small RNA were effective for the cloning of relatively rare tRNA molecules.
The insertion position and length of the introns identified in the tRNALys(CUU) and tRNALys(UUU) genes were identical. And it was shown that these two tRNA genes shared the same sequence except for 2 nt, even though the different length and sequences of the intron portions were confirmed in other tRNA genes. In the genomic data of the two methanogen, Methanocaldococcus jannaschii17
and Methanothermobacter thermautotrophicus,18
the tRNALys(CUU) gene was not present. These facts proposed one hypothesis that the two introns identified in the tRNALys(CUU) and tRNALys(UUU) genes were derived from the same origin. It can be thought that the intron was introduced into the tRNALys(UUU) gene before duplication or separation of the tRNALys(CUU) gene, then the tRNALys(CUU) was duplicated or separated from tRNALys(UUU) recently in A. pernix, accompanied with the transfer of the same intron to the tRNALys (CUU).
According to the tRNAThr(UGU-2), it was thought that this tRNA gene was overestimated by tRNA detection software. The reason for this conclusion is due to the facts that cDNA for tRNAThr(UGU-2) was not amplified even when specific primer set 8 was used, the promoter motif sequence was not identified at upstream of the tRNAThr(UGU-2) gene (shown below), both the premature and mature products from the tRNA gene for threonine with same anticodon, tRNAThr(UGU-1), were identified and the score for this tRNA gene indicated by tRNA detection software was lower than those for other tRNA genes.
The previous analyses proposed two types of mechanisms for the splicing of the intron region in tRNA molecule. One is the eukaryotic type, in which the intron region in the tRNA molecule was recognized by the distance from the top of the anticodon stem to the 5' and 3' cleavage sites as shown in Fig. 2A4
,5
and cleavage of the eukaryotic intron requires the formation of a 3 nt bulge loop at the 3' intronexon boundary site.19
This mechanism is well suited for the fact that all eukaryal tRNA introns are located in one base 3' from anticodon.
The other is the archaeal type of splicing mechanism of tRNA intron. In contrast with the eukaryotic type, studies for archaeal tRNA intron showed that complete mature tRNA structure was not required for digestion of intron by intron endonuclease.6
,7
Cleavage of the archaeal tRNA introns requires a structural motif element at the exonintron boundaries, the bulgehelixbulge motif structure as shown in Fig. 2B.7
Since the tRNA introns in A. pernix were confirmed at six different positions, the distance between the top of the anticodon stem and cleavage sites was widely variable. Hence, we searched the bulgehelixbulge motif structures at the exonintron boundaries confirmed in this work.
For the eight interrupted tRNA genes, of which introns were confirmed to locate at one base 3' from anticodon, the bulgehelixbulge motif structures were detected at the exonintron boundaries as shown in Fig. 3.
|
The same bulgehelixbulge motif structures were searched on the two genes for tRNAThr(UGU-1) and tRNATrp(CCA), their actual processing occurred at the D stem and the anticodon stem which were at the same position as predicted, respectively. The bulgehelixbulge motif structures were clearly recognized at the confirmed exonintron boundaries of these tRNA genes as shown in Fig. 4, even though those introns were identified at the D stem and the anticodon stem.
|
For the three tRNA genes encoding tRNALys(CUU), tRNALys(UUU) and tRNAPro(CGG), of which the actual cleavage site were differed from that predicted by software, we compared the structures at the exonintron boundaries confirmed in this work with those found surround the exonintron boundaries predicted. Figure 5AC indicates the structures of premature forms of tRNA molecules including introns at the position predicted by software and Fig. 5DF indicates those including introns at the confirmed insertion position. Figure 5AC showed that the bulgehelixbulge motif structure was not recognized at the exonintron boundary. However, the bulgehelixbulge motif structure was clearly identified at the exonintron boundary confirmed by this work, as shown in Fig. 5DF.
|
These results indicate that the bulgehelixbulge motif structure plays a most important role for recognition and cleavage of intron region in the tRNA genes in A. pernix. This fact indicates that the intron in the tRNA molecules is cleaved by the mechanism generally used in Archaea. Marck and Grosjean20
In both the rRNA genes of A. pernix K1, the intron sequences were previously identified.21
The structure at the exonintron boundary in these rRNA genes was predicted as the bulgehelixbulge motif structure similar to those recognized at the exonintron boundary in the tRNA molecules. Watanabe et al.22
indicated the evidence that an intron-containing protein-coding gene, APE0989 and APE0990, was present in A. pernix K1. The exonintron boundary sequence of this protein-coding gene, which encodes a subunit of a small nucleolar ribonucleoprotein involved in eukaryotic rRNA, indicated the typical bulgehelixbulge motif structure. The structural similarity of intron boundary sequences in archaeal genes encoding tRNA, rRNA and protein suggests the hypothesis that the introns of archaeal tRNA gene transfered to rRNA genes and protein-coding genes. If so, it can be thought that the intron region in archaeal tRNA gene is the ancestral intron in the protein-coding genes in Eukaryotes.
The tRNAintron endonucleases identified from archaeal genomic data are classified into two types, the one is over 300 amino acid residues and the other is shorter than 190 residues. This difference in size suggests that these two types of endonucleases will exhibit different functions or mechanisms of cleavage of introns in tRNA molecules. From the genome data of A. pernix K1, the 187 residues long APE1646 protein was identified as a protein similar to the tRNAintron endonuclease of Haloferax volcanii reported by Kleman-Leyer et al.23
The tRNA ligase capable of joining exon ends was identified from yeast,24
but not detected in halophiles.7
We searched for this ligase in the genomic information of A. pernix K1. However, an ORF encoding the related enzyme was not detected as similar to the other Archaea. This ligase would be necessary for the joining of two exons cleaved by the tRNA endonuclease. Hence, identification and characterization of this ligase is the next important step for understanding the mechanism of tRNA splicing in Archaea.
The first prediction of tRNA genes in A. pernix K1 was performed by tRNAscan, which was developed by Lowe and Eddy,25
as described in the previous genomic paper. Forty-five tRNA genes were predicted by this software, and two genes, tRNAAsp(GUC) and tRNATrp(CCA), were predicted manually. The intron in tRNAThr(UGU-1) and tRNATrp(CCA) genes were manually predicted within the D stem, regions which were not detected by tRNAscan as intron, and at three bases 5' from anticodon, respectively. The intron location of the other 12 interrupted tRNA genes was predicted at one base 3' from anticodon in the tRNA gene.
According to the 33 uninterrupted tRNA genes, 31 tRNA molecules transcribed were confirmed in this work as same molecules as prediction. This result indicated the reliability of the assignment of the uninterrupted tRNA genes by tRNAscan.
In contrast, the result for confirmation of transcript from the interrupted tRNA genes predicted by tRNAscan was not in the same situation. Among 14 interrupted tRNA genes, the correct position of the intron portion in tRNASer(CGA) was not confirmed by this experiment, because of the failure of obtaining the mature type of its cDNA. The actual location of the intron for the remaining 13 interrupted tRNA genes were confirmed in this work, as both the premature and mature forms of cDNA molecules for 12 interrupted tRNA genes and the mature form of cDNA molecule for the tRNAThr(CGU) were identified. A comparison of the mature sequence with the premature sequence of cDNA molecules indicated that transcripts from seven tRNA genes, tRNAArg(UCU), tRNACys(GCA), tRNAMet(CAU-1), tRNAMet(CAU-2), tRNAPro(GGG), tRNATyr(GUA) and tRNAThr(CGU), were spliced at the exonintron boundary predicted by software. However, the introns in six tRNA genes, shown in bold and underlined in Table 3, were processed at the other positions than those predicted by software. The ratio of correct assignment of the intron position is only 54%. This result shows that tRNAscan is reliable for assignment of the uninterrupted tRNA genes, but not reliable for assignment of the interrupted tRNA genes in A. pernix K1.
The tRNA genes are transcribed by RNA polymerase III, which recognizes the promoter sequence located in the coding part of the tRNA genes in Eukaryotes and that located upstream from the transcription start point in prokaryotic tRNA genes. The two distinct recognition boxes, 11 bp long A box: TRRYNNARYGG and B box: GGTTCGANTCC, were identified in the coding part of the Eukaryotic tRNA genes,26
and similar sequences were also detected upstream of prokaryotic tRNA genes.27
These boxed sequences were searched in the coding and upstream region of the tRNA genes of A. pernix K1, but no identical or related boxed sequence was identified. Instead of the absence of the known boxed sequences, the novel consensus motif sequences were identified in the upstream region of the tRNA genes. The consensus motif sequences identified and the original sequences of the upstream region of tRNA genes are summarized in Table 4.
|
The consensus motif sequence identified consisted of three independent consensus elements, motif A is poly-purines, motif B is RRTA and motif C is SRGG, and each consensus motif was separated by the spacer sequences with different length. The consensus motif sequences were not identified at the upstream region of the tRNAThr(UGU-2) gene, which supported the suggestion that this region was not worked as tRNA gene.
Three tRNA gene clusters, tRNAMet(CAU-1)-tRNAThr(UGU-1), tRNASer(CGA)-tRNAPro(UGG) and tRNACys(GCA)-tRNAGlu(UUC), were predicted to be transcribed as a single transcript. The promoter motif sequences were identified at the upstream region of the gene clusters, but conservation of the promoter motif sequences was not clearly identified at the upstream region of the downstream genes. As the tRNAPro(UGG) gene was located downstream of the tRNASer(CGA) gene with a distance of just 7 nt, the cDNA for tRNAPro(UGG) and premature form of tRNASer(CGA) were obtained from the total RNA. However, the cDNA for the mature form of tRNASer(CGA) was not obtained, suggesting that splicing of the premature form of the tRNASer(CGA) molecule should not have efficiently occurred.
Our study emphasizes that genomic information is a powerful source from which is investigated the analyses of a family of molecules or reactions in one organism. Our results provide the opportunity for future research on the more detailed understanding of a variety of biological events in Archaea. Since the tRNA ligase was not identified from Archaea, it is necessary to identify the tRNA ligase from Archaea, which also provides the information for recognition mechanism of the exon portion and construction mechanism of the mature tRNA molecules. As three tRNA gene clusters were identified in A. pernix, it is also targeted to understand the actual cleavage mechanism of the transcript from the cluster tRNA genes into each independent tRNA molecule in Archaea cell. Since the similar intron portion was identified in the tRNA, rRNA and protein-coding genes in A. pernix, this microorganism is thought to be one of the most convenient materials for understanding the mechanism for evolution of intron region and the machinery utilized in A. pernix for processing the independent but similar intron portion possessed in the different types of genes.
| Acknowledgements |
|---|
|
|
|---|
We thank Y. Hino for technical support of the nucleotide sequencing and T. Tanaka for assistance with the computational analysis of the genomic data of A. pernix K1. This work was supported by the Ministry of Economy, Trade and Industry.
| Footnotes |
|---|
*To whom correspondence should be addressed. Tel: +81-29-861-6040, Fax: +81-29-861-6423, Email: kawarabayasi.yutaka{at}aist.go.jp
| References |
|---|
|
|
|---|
- Westaway, S. K. and Abelson, J. 1995, Splicing of tRNA Precursors, In Söll, D. and RajBhandary, T. (Eds.). tRNA: Structure, Biosynthesis, and Function Washington, DC American Society for Microbiology pp. 7992.
- Szekely, E., Belford, H. G., Greer, C. L. 1988, Intron sequence and structure requirements for tRNA splicing in Saccharomyces cerevisiae, J. Biol. Chem., 263, 1383913847.
[Abstract/Free Full Text] - Swerdlow, H. and Guthrie, C. 1984, Structure of intron-containing tRNA precursors. Analysis of solution conformation using chemical and enzymatic probes, J. Biol. Chem., 259, 51975207.
[Abstract/Free Full Text] - Reyes, V. M. and Abelson, J. 1988, Substrate recognition and splice site determination in yeast tRNA splicing, Cell, 55, 719730.[CrossRef][Web of Science][Medline]
- Mattoccia, E., Baldi, I. M., Gandini-Attardi, D., Ciafre, S., Tocchini-valentini, G. P. 1988, Site selection by the splicing endonuclease of Xenopus Laevis, Cell, 55, 731738.[CrossRef][Web of Science][Medline]
- Thompson, L. D. and Daniels, C. J. 1988, A tRNA(Trp) intron endonuclease from Halobacterium volcanii. Unique substrate recognition properties, J. Biol. Chem., 263, 1795117959.
[Abstract/Free Full Text] - Thompson, L. D. and Daniels, C. J. 1990, Recognition of exonintron boundaries by the Halobacterium volcanii tRNA intron endonuclease, J. Biol. Chem., 265, 1810418111.
[Abstract/Free Full Text] - Daniels, C. J., Gupta, R., Doolittle, W. F. 1985, Transcription and excision of a large intron in the tRNATrp gene of an archaebacterium, Halobacterium volcanii, J. Biol. Chem., 260, 31323134.
[Abstract/Free Full Text] - Datta, P. K., Hawkins, L. K., Gupta, R. 1989, Presence of an intron in elongator methionine-tRNA of Halobacterium volcanii, Can. J. Microbiol., 35, 189194.[Web of Science][Medline]
- Kaine, B. P., Gupta, R., Woese, C. R. 1983, Putative introns in tRNA genes of prokaryotes, Proc. Natl Acad. Sci. USA, 80, 33093312.
[Abstract/Free Full Text] - Kjems, J., Leffers, H., Olesen, T., Garrett, R. A. 1989, A unique tRNA intron in the variable loop of the extreme thermophile Thermofilm pendens and its possible evolutionary implications, J. Biol. Chem., 264, 1783417837.
[Abstract/Free Full Text] - Kjems, J., Jensen, J., Olesen, T., Garrett, R. A. 1989, Comparison of transfer RNA and ribosomal RNA intron splicing in the extreme thermophile and archaebacterium Desulfurococcus mobilis, Can. J. Microbiol., 35, 210214.[Web of Science][Medline]
- Wich, G., Leinfelder, W., Bock, A. 1987, Genes for stable RNA in the extreme thermophile Thermoproteus tenax: introns and transcription signals, EMBO J., 6, 523528.[Web of Science][Medline]
- Kawarabayasi, Y., Hino, Y., Horikawa, H., et al. 1999, Complete genome sequence of an aerobic hyper-thermophilic crenarchaeota, Aeropyrum pernix K1, DNA Res., 6, 8476 147155.
- Kawarabayasi, Y., Hino, Y., Horikawa, H., et al. 2001, Complete genome sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7, DNA Res., 8, 123140.[Abstract]
- Sako, Y., Nomura, N., Uchida, A., et al. 1996, Aeropurum pernix gen. nov, sp. nov., a novel aerobic hyper thermophilic archaeon growing at temperatures up to 100°C, Int. J. Syst. Bacteriol., 46, 10701077.
[Abstract/Free Full Text] - Bult, C. J., White, O., Olsen, G. J., et al. 1996, Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii, Science, 273, 10581073.[Abstract]
- Smith, D. R., Doucette-Stamm, L. A., Deloughery, C., et al. 1997, Complete genome sequence of Methanobacterium thermoautotrophicum deltaH: functional analysis and comparative genomics, J. Bacteriol., 179, 71357155.
[Abstract/Free Full Text] - Baldi, M. I., Mattoccia, E., Bufardeci, E., Fabbri, S., Tocchini-Valentini, G. P. 1992, Participation of the intron in the reaction catalyzed by the Xenopus tRNA splicing endonuclease, Science, 255, 14041408.
[Abstract/Free Full Text] - Marck, C. and Grosjean, H. 2003, Identification of BHB splicing motifs in intron-containing tRNAs from 18 archaea: evolutionary implications, RNA, 9, 15161531.
[Abstract/Free Full Text] - Nomura, N., Sako, Y., Uchida, A. 1998, Molecular characterization and postsplicing fate of three introns within the single rRNA operon of the hyperthermophilic archaeon Aeropyrum pernix K1, J. Bacteriol., 180, 36353643.
[Abstract/Free Full Text] - Watanabe, Y., Yokobori, S., Inaba, T., et al. 2002, Introns in protein-coding genes in Archaea, FEBS Lett., 510, 2730.[CrossRef][Web of Science][Medline]
- Kleman-Leyer, K., Armbruster, D. W., Daniels, C. J. 1997, Properties of H. volcanii tRNA intron endonuclease reveal a relationship between the archaeal and eucaryal tRNA intron processing systems, Cell, 89, 839848.[CrossRef][Web of Science][Medline]
- Phizicky, E. M., Schwartz, R. C., Abelson, J. 1986, Saccharomyces cerevisiae tRNA ligase. Purification of the protein and isolation of the structural gene, J. Biol. Chem., 261, 29782986.
[Abstract/Free Full Text] - Lowe, T. M. and Eddy, S. R. 1997, tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence, Nucleic Acids Res., 25, 955964.
[Abstract/Free Full Text] - Sharp, S. J., Schaak, J., Cooley, L., Burke, D. J., Söll, D. 1985, Structure and transcription of eukaryotic tRNA genes, CRC Crit. Biochem., 19, 107144.
- Nayokova, T. M., Kondrakhin, Y. V., Rogozin, I. B., Voevoda, M. I., Yudin, N. S., Romaschenko, A. G. 2003, Concerted changes in the nucleotide sequences of the intragenic promoter regions of eukaryotic genes for tRNAs of all specificities, J. Mol. Evol., 57, 520532.[CrossRef][Web of Science][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




