DNA Research Advance Access first published online on August 6, 2008
This version published online on September 9, 2008
DNA Research, doi:10.1093/dnares/dsn018
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Collection and Comparative Analysis of 1888 Full-length cDNAs from Wild Rice Oryza rufipogon Griff. W1943
1 College of Life Science and Biotechnology, Shanghai Jiaotong University, Shanghai, PR China
2 National Center for Gene Research and Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 500 Caobao Road, Shanghai 200233, PR China
3 School of Life Sciences, Fudan University, Shanghai, PR China
4 Plant Genome Center, 1-25-2 Kan-nondai, Tsukuba, Ibaraki 305-0856, Japan
Received 29 April 2008; accepted 9 July 2008.
| Abstract |
|---|
|
|
|---|
A huge amount of cDNA and EST resources have been developed for cultivated rice species Oryza sativa; however, only few cDNA resources are available for wild rice species. In this study, we isolated and completely sequenced 1888 putative full-length cDNA (FLcDNA) clones from wild rice Oryza rufipogon Griff. W1943 for comparative analysis between wild and cultivated rice species. Two cDNA libraries were constructed from 3-week-old leaf samples under either normal or cold-treated conditions. Homology searching of these cDNA sequences revealed that >96.8% of the wild rice cDNAs were matched to the cultivated rice O. sativa ssp. japonica cv. Nipponbare genome sequence. However, <22% of them were fully matched to the cv. Nipponbare genome sequence. The comparative analysis showed that O. rufipogon W1943 had greater similarity to O. sativa ssp. japonica than to ssp. indica cultivars. In addition, 17 novel rice cDNAs were identified, and 41 putative tissue-specific expression genes were defined through searching the rice massively parallel signature-sequencing database. In conclusion, these FLcDNA clones are a resource for further function verification and could be broadly utilized in rice biological studies.
Key words: wild rice; Oryza rufipogon; full-length cDNA; transcriptome comparison; tissue-specific expression
| 1. Introduction |
|---|
|
|
|---|
The wild rice species Oryza rufipogon Griff. (AA genome) is the most closely related ancestral species to Asian cultivated rice (O. sativa L.).1
Oryza rufipogon has been classified into perennial and annual ecotypes.12
W1943 is a perennial O. rufipogon. For the first time, a total of 1888 FLcDNAs of O. rufipogon W1943 were generated in the present study; most (>96.8%) were highly homologous with cultivated rice genome sequences. Furthermore, W1943 had greater similarity to ssp. japonica than to ssp. indica. Additionally, 1% of W1943 FLcDNAs was verified as novel rice genes not previously reported. We also discovered 41 putative tissue-specific expressed genes by applying the rice massively parallel signature-sequencing (MPSS) database.13
| 2. Materials and Methods |
|---|
|
|
|---|
2.1 Plant materials and cDNA library construction
Two enriched FLcDNA libraries were constructed from wild rice O. rufipogon Griff. W1943. Seeds were germinated and seedlings were grown in a greenhouse with day/night of 13/11 h and 25/30°C. Three weeks after germination, some seedlings were exposed to 5°C and leaves were separately harvested after 0, 1, 12, 24, 48, 72 and 120 h of cold treatment. We constructed two cDNA libraries from 3-week-old rice leaves grown under normal and cold conditions, respectively. All samples were immediately frozen in liquid nitrogen and stored at –80°C.
We constructed two FLcDNA libraries according to the Cap-Tagging8
and Cap-trapper methods.14
The 5' cap-tagging method utilizes the 5' cap-capture technique through the combined treatments of calf intestinal phosphatase (CIP) and tobacco acid pyrophosphatase (TAP) so that only the FLcDNA was targeted for library construction. The cap-trapper method is based on chemical introduction of a biotin group into the diol residue of the cap structure of mRNA, which is followed by RNase I treatment to select FLcDNA. Total RNA was isolated using the TRIZOL reagents, and mRNAs were purified with the Oligotex mRNA kit (Qiagen). Double-stranded cDNA was digested with EcoRI (1 U) and XhoI (10 U) for 1 h at 37°C, and cDNA fraction of 0.6–2 kb was collected and pooled, with which ligated to the sites of EcoRI and XhoI of vector pBluescript SK+ (Strategene) at 16°C overnight. Then, cDNA was transformed into competent E. coli DH10B cells (Invitrogen) by electroporation. We assessed the library quality by assaying ligations and carrying out 5'-end sequencing; the former procedure determined library titer, and the latter used to evaluate cDNA full-length percentage as well as the proportion of empty vectors.
2.2 DNA sequencing and assembling
DNA sequencing was carried out on ABI3730 sequencers. The clones were sequenced from both ends by the dideoxy chain termination method using BigDye Terminator Cycle sequencing V2.0 Ready Reaction (Applied Biosystems). The Phred base-calling software was used to analyze sequence trace files and generate raw sequences.15
Peaks with Phred quality values of <20 were taken as ambiguous sequences and were presented by a universal placeholder "N". Vector sequences were filtered automatically. Then, all 5'-tagged sequences were selected by a Perl script for clustering, which used the TGICL program.16
These singletons and every representative clone from each contig were selected to be completely sequenced by bidirectional sequencing strategy. All processed sequences were assembled by Phrap software.
Accession numbers for submitted data in the EMBL database CT841557 [GenBank] –CT841684 [GenBank] ; CT841686 [GenBank] –CT841707 [GenBank] ; CT841710 [GenBank] –CT841954 [GenBank] ; CT841956 [GenBank] –CT842008 [GenBank] ; CU405560 [GenBank] –CU405627 [GenBank] ; CU405629 [GenBank] –CU405654 [GenBank] ; CU405656 [GenBank] –CU405706 [GenBank] ; CU405708 [GenBank] –CU405710 [GenBank] ; CU405712 [GenBank] –CU405714 [GenBank] ; CU405716 [GenBank] –CU405717 [GenBank] ; CU405719 [GenBank] –CU405720 [GenBank] ; CU405722 [GenBank] –CU405729 [GenBank] ; CU405731 [GenBank] –CU405880 [GenBank] ; CU405882 [GenBank] –CU405928 [GenBank] ; CU405930 [GenBank] –CU406064 [GenBank] ; CU406066 [GenBank] –CU406249 [GenBank] ; CU406251 [GenBank] –CU406335 [GenBank] ; CU406337 [GenBank] –CU406954 [GenBank] and CU861673 [GenBank] –CU861883.
These W1943 sequences are available from our website (http://202.127.18.228/ricd/dym/ftp.php).
2.3 Comparative analysis of FLcDNA sequences
Similarity searches were performed with BLAST (version 2.2.14) program17
against sequence data as follows: NCBI GenBank nt DB (2007-12), nr DB (2007-12), est-other DB (2007-07), rice japonica genomic sequence (http://rgp.dna.affrc.go.jp/IRGSP/), the Institute for Genomic Research (TIGR) rice cDNA data (release 4.0), TIGR_Oryza_Repeats_v3.1, Knowledge-based Oryza Molecular Biological Encyclopedia japonica cDNA collection (http://cdna01.dna.affrc.go.jp/cDNA, 2006-10-11) and National Center for Gene Research (NCGR, http://www.ncgr.ac.cn/ricd) Rice Indica cDNA Database (RICD). We downloaded all above sequence data and used our 1888 clones as query sequences. The similarity threshold of E-value was lower than 1E–10. We searched InterPro database18
to compare the profiles of proteins encoded in W1943 FLcDNAs. Functional classification of cDNAs was referred to PFAM profiles.19
A similarity-based tool sim420
was used to align W1943 FLcDNA sequence with rice genomic sequence. It was also used to identify and discard redundant gene sequences. Open reading frames (ORFs) of cDNA sequences were determined by using getorf program of EMBOSS package.21
The rice MPSS database13
was used for quantitative expression analysis of these W1943 cDNAs in rice. The expression levels were calculated for rice different tissues or same tissues at different developmental stages by summing all expressed tags in the sense strand. To calculate synonymous divergence (Ks), program ClustalX 1.822
and PAL2NAL (version: V11)23
were applied.
Rfam database24
(http://www.sanger.ac.uk/Software/Rfam/) and miRBase25
(http://microrna.sanger.ac.uk/) data were downloaded for non-protein-coding transcripts analysis. Software mFOLD was applied to predict pre-miRNAs' secondary structure (http://mfold.bioinfo.rpi.e.du/).26
| 3. Results and Discussion |
|---|
|
|
|---|
3.1 Overall description of W1943 FLcDNA sequences
Two full-length enriched cDNA libraries of O. rufipogon W1943 were constructed following the cap-tagging method.8
Up to now, we have successfully obtained 1888 non-redundant W1943 cDNA sequences. Of 1888 cDNA sequences, 1360 sequences matched to NCBI GenBank non-redundant database of proteins (nrDB) (E < 1e–10; >70% identity). Of 1360 sequences, 997 cDNAs could fully cover the protein N-terminal first amino acid sequence. Therefore, we estimated that >70% of the 1832 cDNA sequences were FLcDNAs. It should be pointed out that the efficiency of CIP and TAP treatments played a key role in constructing the FLcDNA library. On the other hand, it was also possible that some of the remaining 30% putative truncated cDNA sequences might be genuine FLcDNAs transcribed from alternative start sites. There are lots of alternative transcription start sites known in mammals.27
,28
3.2 Mapping of the 1888 W1943 FLcDNAs onto cultivated rice O. sativa genomic sequences
The 1888 FLcDNAs from O. rufipogon W1943 were mapped to O. sativa ssp. japonica cv. Nipponbare genomic sequence pseudomolecules (version 4.0) and compared with GenBank nrDB based on BLASTn (E < 1e–10) and BLASTx (E < 1e–10), respectively.5
Of the 1888 FLcDNA sequences, 1831 (97.0%) could be aligned to the japonica genomic sequences at >80% sequence identity over the entire length (Fig. 1). The remaining 57 cDNAs that did not match the ssp. japonica genomic sequences are discussed in the following analysis. Among 1831 W1943 cDNAs, 395 (21.6%) fully matched the ssp. japonica cv. Nipponbare genomic sequences with 100% identity at nucleotide level. However, among 1831 cDNAs, 487 fully matched to corresponding proteins in nrDB with 100% identity. Therefore, 35.8% of W1943 cDNAs had full identity to proteins from nrDB at amino acid level. In spite of relatively low full identity at nucleotide acid level (only 21.6%), it was more conservative at amino acid level (>35.8%) between wild and cultivated rice. It was propitious to protect some key proteins from losing their conserved and vital functions.
|
We also mapped the 1888 W1943 FLcDNAs to the O. sativa ssp. indica cv. 93-11 whole-genome shotgun sequences using BLASTn (E < 1e–10). A total of 1837 (97.2%) W1943 cDNAs could be aligned to the cv. 93-11 genome sequences at >80% sequence identity over the entire length (Fig. 1). Of these, 126 (6.9%) identically matched the cv. 93-11 genome sequences. These results indicated that the sequence of wild rice W1943 had a very high similarity with those of cultivated ssp. japonica (97.0%) and ssp. indica (97.2%) rice; and W1943 had greater similarity to japonica than to indica at nucleotide acid level. Monna et al.29
In the case of 395 W1943 FLcDNAs that were 100% matched to the genomic sequences, we checked the splicing patterns by comparing with all rice ESTs or mRNAs in public databases. The results revealed that 15 W1943 cDNAs had alternative splicing patterns when compared with cultivated rice ESTs or mRNAs (Table 1). These alternative splicing patterns might be specific for W1943. Furthermore, the first introns of two genes (CT841942 [GenBank] and CU406810 [GenBank] ) had a distinct splice site with GC-AG and GT-TG. We concluded that cultivated rice had experienced some mutations including the intron region, and thus some genes were lost over the long evolutionary period. There were four typical alternative splicing patterns of these sequences (Fig. 2).
|
|
It should be pointed out that 10 of 1831 W1943 cDNAs had no hits to previously reported rice ESTs or mRNAs in GenBank database (Table 2). Another seven cDNAs had hits to rice ESTs or mRNAs at the sense–antisense pattern (Table 3). So these cDNA sequences offered novel rice transcripts to public database. As for the 17 W1943 cDNA sequences, they were either wild-rice-specific genes or cultivated rice co-owner genes. If the latter was the case, it may indicate that these genes are expressed at much lower levels in cultivated than in wild rice. Hence, it would be difficult to clone these cDNAs from cultivated rice in spite of a total of
47 000 ssp. japonica and ssp. indica cDNAs available in the current public database (ftp://ftp.ncbi.nih.gov/). We used the rice MPSS database (http://mpss.udel.edu/rice/) to detect the expression level of these 17 putative novel W1943 cDNAs under different conditions.13
|
|
In addition, 57 W1943 cDNAs that could not be aligned to the ssp. japonica cv. Nipponbare genomic sequence were further analyzed. After comparing with other public databases, 14 of them matched the ssp. indica cv. 93-11 genomic sequences, 6 matched to rice ESTs in NCBI est-other database, 4 had similarity to Sorghum bicolor, Triticum aestivum, Manihot esculenta and Spartina alterniflora ESTs, 15 were homologs to Gibberella moniliformis, Gibberella zeae and Magnaporthe grisea, and the remaining 18 had no hits. Table 4 listed 24 W1943 cDNAs' information after excluding 15 possible contamination clones and 18 no any hits clones. Several W1943 cDNAs that did not match to the cv. Nipponbare genomic sequence might be located in the gap of genomic sequence or might be related to wild rice W1943-specific genes.
|
3.3 Comparative analysis with cultivated rice cDNA sequences in public databases
The 1888 W1943 cDNAs were compared with cultivated rice cDNA sequences. The large-scale rice ssp. japonica cv. Nipponbare cDNA sequences have been released to public databases.7
Initially, we identified chromosomal distributions of the three different rice cDNAs along the cv. Nipponbare chromosomal pseudomolecules (Fig. 3). Though there were relatively small quantities of W1943 cDNAs, there were similar trace trends and no visible large bias comparing KOME and NCGR cDNAs. So the 1888 W1943 cDNAs can give clues to the entire W1943 genome.
|
A Perl script known as MISA (http://pgrc.ipk-gatersleben.de/misa/) was used to identify simple sequence repeats (SSRs) in these cDNA sequences. We described all SSR motifs of 1–6 nucleotides in size. The minimum repeat unit was prescribed as follows: 10 repeats for mononucleotides, 6 for di-nucleotides and 5 for all the other motifs such as tri-, tetra-, penta- and hexa-nucleotides. We detected the five highest frequencies of SSR motifs of the overall cDNA sequences, 5'-UTR sequences, ORF sequences and 3'-UTR sequences, respectively (Fig. 4). The highest frequencies of the SSR motifs in the three different rice cDNAs were identical in 5'-UTR, ORF or 3'-UTR regions. First, the motif CCG/CGG has the highest frequencies in 5'-UTR and ORF regions, but the SSR motif A/T has the highest frequency in 3'-UTR region. Second, all kinds of motif types were unevenly distributed in the FLcDNA sequences. The motifs CCG/CGG and A/T were more frequent in the ORF and 3'-UTR regions, respectively, with frequencies >50%. However, in 5'-UTR regions, the most frequent SSR motifs were
28%. In addition, scanning showed that the three most frequent SSR motif-types in ORF regions were all triplets that differed from those in UTR regions. This difference was very important for coding sequence because tri-nucleotide SSR motif-types could effectively prevent amino acid from frame shifting. Furthermore, the five most frequent SSR motifs were all triplets; the only exception was the fourth most frequent SSR type of NCGR, which was A/T (7.19%). In the process of evolution, relative higher frequency of mononucleotide SSR motifs of NCGR ORF was likely to be one key factor that led to divergence of ssp. indica and ssp. japonica. This could partly explain why W1943 was closer to japonica than to indica.
|
We carried out transcripts comparisons between W1943 and the other two cultivated rice subspecies (Fig. 5). A total of 823 W1943 cDNAs were detected according to their homology with both KOME and NCGR (
95% identity and non-redundant hit to KOME and NCGR). We extracted the ORF of each cDNA sequence using the getorf program.21
|
The nucleotides of 194 identical ORF groups were extracted for further calculation of synonymous substitution rates. The results showed that 106 of 194 (54.6%) groups were also completely identical at nucleotide level. So the remaining 88 groups were used to calculate synonymous divergence (Ks) (Fig. 5B). Of 88 groups, 42 groups had no synonymous substitution between W1943 and KOME; 9 groups had no synonymous substitution between W1943 and NCGR; 15 groups had no synonymous substitution between KOME and NCGR and another 22 groups had synonymous substitutions among the three species and subspecies. That is, at nucleotide level, 76.2% of 194 identical ORF groups had no changes in W1943 and cv. Nipponbare, and 59.2% for W1943 and cv. Guangluai 4.
It was reported29
that the rates of polymorphisms in predicted intergenic regions of rice were 0.302 (W1943/Nipponbare), 0.653 (W1943/Guangluai 4) and 0.630 (Nipponbare/Guangluai 4), respectively. These were similar to results in coding sequence regions in the present study. Thus, the hypothesis that O. rufipogon W1943 was closer to ssp. japonica than to ssp. indica was further validated.
3.4 miRNAs identification
After searching against NCBI nrDB using BLASTx, 432 sequences of 1888 W1943 cDNAs found no hits in the database. Of 432 sequences, 71 were predicted as ORFs > 100 amino acid in length, so the remaining 361 were assumed to be putative non-protein-coding transcripts. Searching against Rfam database and miRBase, four cDNAs matched to four miRNA families; the osa-MIR159a, osa-MIR156j, osa-MIR818e and osa-miR446 families, respectively (Table 5). Using the mFOLD program, all four sequences could be predicted to pre-miRNA secondary structure and identified as miRNAs according to folding results.
|
3.5 Expression analysis by searching against the rice MPSS database
We used the rice MPSS database (http://mpss.udel.edu/rice/) to detect the expression level of W1943 cDNAs under different conditions.13
|
In the similar restricted conditions as above, there were seven W1943 cDNAs with distinct expression level in leaves exposed to cold, drought or salinity stresses (Table 7). Of the seven cDNAs, four genes were up-regulated by cold stress, two genes were up-regulated by drought and one gene was up-regulated by salinity. It should be pointed out that gene "CU405946" matched to PFAM protein annotated as "Dehydrin". This protein is produced by plants that experience water-stress.33
|
3.6 Conclusions
In this research, we collected and completely sequenced 1888 putative FLcDNAs of wild rice O. rufipogon Griff. W1943. A total of 17 novel rice cDNAs and 41 putative tissue-specific expression genes were identified. The comparative analysis between wild rice and two cultivated rice subspecies indicated that O. rufipogon W1943 had greater similarity to O. sativa ssp. japonica than to ssp. indica cultivars. It is reported that W1943 is primarily distributed in Dongxiang (26°14'N, 116°36'E) of Jiangxi Province in China.34
| Funding |
|---|
|
|
|---|
This research was supported by the grants from the Ministry of Science and Technology of China (the China Rice Functional Genomics Programs, 2005CB120805 and 2006AA10A102), the Chinese Academy of Sciences (038019315 and KSCX2-YW-N-024) and the Shanghai Municipal Commission of Science and Technology.
| Acknowledgements |
|---|
|
|
|---|
We thank the Plant Genome Center (Tsukuba, Japan) for kindly providing seeds of W1943.
| Footnotes |
|---|
* To whom correspondence should be addressed. Tel. +86 21-64845260. Fax. +86 21-64825775. E-mail: bhan{at}ncgr.ac.cn
The authors wish to apologise for the following errors: {list errors below}.
1. Page 2, The last sixth line (Line 219): "our 1832 clones..." should be corrected as "our 1888 clones...".
2. Page 3, The first seventh line (Line 231): "MPSS database18" should be cited as "MPSS database13".
3. Page 5, The last tenth line (Line 551): "national center for gene research" should be written as "National Center for Gene Research, CAS".
4. Page 4, Table 1. The first column and the fourth clonename "CT841893" should be replaced of "CT841874".
5. page 6, Table 4. The last third line, the second column. The clonename "CT841912" should be replaced of "CU406778". The last second line, the second column. The clonename "CT841912" should be replaced of "CU861677".
| References |
|---|
|
|
|---|
- Wang Z. Y., Second G., Tanksley S. D. Polymorphism and phylogenetic relationships among species in the genus Oryza as determined by analysis of nuclear RFLPs. Theor. Appl. Genet. (1992) 83:565–581.[Web of Science]
- Londo J. P., Chiang Y. C., Hung K. H., Chiang T. Y., Schaal B. A. Phylogeography of Asian wild rice, Oryza rufipogon, reveals multiple independent domestications of cultivated rice, Oryza sativa. Proc. Natl. Acad. Sci. USA (2006) 103:9578–9583.
[Abstract/Free Full Text] - Zhang X., Zhou S., Fu Y., Su Z., Wang X., Sun C. Identification of a drought tolerant introgression line derived from Dongxiang common wild rice (O. rufipogon Griff.). Plant Mol. Biol. (2006) 62:247–259.[CrossRef][Web of Science][Medline]
- Tian F., Zhu Z., Zhang B., et al. Fine mapping of a quantitative trait locus for grain number per panicle from wild rice (Oryza rufipogon Griff.). Theor. Appl. Genet. (2006) 113:619–629.[CrossRef][Web of Science][Medline]
- International Rice Genome Sequencing Project, The map-based sequence of the rice genome. Nature (2005) 436:793–800.[CrossRef][Web of Science][Medline]
- Yu J., Hu S., Wang J., et al. A draft sequence of the rice genome Oryza sativa L. ssp. indica. Science (2002) 296:92–100.
[Abstract/Free Full Text] - The Rice Full-Length cDNA Consortium, Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice. Science (2003) 301:376–379.
[Abstract/Free Full Text] - Liu X., Lu T., Yu S., et al. A collection of 10,096 indica rice full-length cDNAs reveals highly expressed sequence divergence between Oryza sativa indica and japonica subspecies. Plant Mol. Biol. (2007) 65:403–415.[CrossRef][Web of Science][Medline]
- McNally K. L., Bruskiewich R., Mackill D., Buell C. R., Leach J. E., Leung H. Sequencing multiple and diverse rice varieties. Connecting whole-genome variation with phenotypes. Plant Physiol. (2006) 141:26–31.
[Free Full Text] - Satoh K., Doi K., Nagata T., et al. Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray. PLoS ONE (2007) 2:e1235.[CrossRef]
- Cho S. K., Ok S. H., Jeung J. U., et al. Comparative analysis of 5,211 leaf ESTs of wild rice (Oryza minuta). Plant Cell Rep. (2004) 22:839–847.[CrossRef][Web of Science][Medline]
- Morishima H., Sano Y., Oka H. I. Evolutionary studies in cultivated rice and its wild relatives. Oxford Surv. Evol. Biol. (1992) 8:135–184.
- Nakano M., Nobuta K., Vemaraju K., Tej S. S., Skogen J. W., Meyers B. C. Plant MPSS databases: signature-based transcriptional resources for analyses of mRNA and small RNA. Nucleic Acids Res. (2006) 34:731–735.[CrossRef]
- Carninci P., Kvam C., Kitamura A., et al. High-efficiency full-length cDNA cloning by biotinylated CAP trapper. Genomics (1996) 37:327–336.[CrossRef][Web of Science][Medline]
- Ewing B., Green P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. (1998) 8:186–194.
[Abstract/Free Full Text] - Pertea G., Huang X., Liang F., et al. TIGR gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics (2003) 19:651–652.
[Abstract/Free Full Text] - Altschul S. F., Madden T. L., Schaffer A. A., et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search Programs. Nucleic Acids Res. (1997) 25:3389–3402.
[Abstract/Free Full Text] - Apweiler R., Attwood T. K., Bairoch A., et al. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. (2001) 29:37–41.
[Abstract/Free Full Text] - Bateman A., Coin L., Durbin R., et al. The Pfam protein families database. Nucleic Acids Res. (2004) 32:138–141.
- Florea L., Hartzell G., Zhang Z., Rubin G. M., Webb M. A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. (1998) 8:967–974.
[Abstract/Free Full Text] - Rice P., Longden I., Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet. (2000) 16:276–277.[CrossRef][Web of Science][Medline]
- Thompson J. D., Gibson T. J., Plewniak F., Jeanmougin F., Higgins D. G. The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. (1997) 24:4876–4882.
- Suyama M., Torrents D., Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. (2006) 34:W609–W612.
[Abstract/Free Full Text] - Griffiths-Jones S., Moxon S., Marshall M., Khanna A., Eddy S. R., Bateman A. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. (2005) 33:D121–D124.
[Abstract/Free Full Text] - Griffiths-Jones S., Saini H. K., van Dongen S., Enright A. J. miRBase: tools for microRNA genomics. Nucleic Acids Res. (2008) 36:D154–D158.
[Abstract/Free Full Text] - Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. (2003) 31:3406–3415.
[Abstract/Free Full Text] - Tsuritani K., Irie T., Yamashita R., et al. Distinct class of putative "non-conserved" promoters in humans: comparative studies of alternative promoters of human and mouse genes. Genome Res. (2005) 17:1005–1014.[CrossRef]
- Carninci P., Sandelin A., Lenhard B., et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat Genet. (2006) 38:626–635.[CrossRef][Web of Science][Medline]
- Monna L., Ohta R., Masuda H., Koike A., Minobe Y. Genome-wide searching of single-nucleotide polymorphisms among eight distantly and closely related rice cultivars (Oryza sativa L.) and a wild accession (Oryza rufipogon Griff.). DNA Res. (2006) 13:43–51.
[Abstract/Free Full Text] - Cheng C., Motohashi R., Tsuchimoto S., Fukuta Y., Ohtsubo H., Ohtsubo E. Polyphyletic origin of cultivated rice: based on the interspersion pattern of SINEs. Mol. Biol. Evol. (2003) 20:67–75.
[Abstract/Free Full Text] - Reimmann C., Dudler R. Circadian rhythmicity in the expression of a novel light-regulated rice gene. Plant Mol. Biol. (1993) 22:165–170.[CrossRef][Web of Science][Medline]
- Tumer N. E., Clark W. G., Tabor G. J., Hironaka C. M., Fraley R. T., Shah D. M. The genes encoding the small subunit of ribulose-1,5-bisphosphate carboxylase are expressed differentially in petunia leaves. Nucleic Acids Res. (1986) 14:3325–3342.
[Abstract/Free Full Text] - Close T. J., Kortt A. A., Chandler P. M. A cDNA-based comparison of dehydration-induced proteins (dehydrins) in barley and corn. Plant Mol. Biol. (1989) 13:95–108.[CrossRef][Web of Science][Medline]
- Gao L. Z., Hong D. Y., Ge S. Allozyme variation and population genetic structure of common wild rice Oryza rufipogon Griff. in China. Theor. Appl. Genet. (2000) 101:494–502.[CrossRef][Web of Science]
- Wang Z. S., Zhu L. H., Liu Z. Y., Wang X. K. Genetic diversity of natural wild rice populations detected by RFLP markers (in Chinese). Agric Biotechnol. (1996) 4:111–117.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



