Plant Cell Hybrigenics The Protein Interactions Experts
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Related articles in Plant Cell
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Burr, B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Burr, B.
Agricola
Right arrow Articles by Burr, B.
The Plant Cell, Vol. 14, 521-523, March 2002, Copyright © 2002,
American Society of Plant Biologists


IN THIS ISSUE

Mapping and Sequencing the Rice Genome

Benjamin Burr

Department of Biology Brookhaven National Laboratory Upton, NY 11973

burr{at}bnl.gov

In 1997, a group of scientists met in Singapore and agreed to collaborate on sequencing the rice genome. It was agreed at the outset to work on a single cultivar, to share materials, to use a clone-by-clone approach, and to accept the policy of immediate sequence release (Sasaki and Burr, 2000Go). Although the ultimate goal of the International Rice Genome Sequencing Project (IRGSP) is to obtain a finished-quality sequence for the complete genome, the group has adopted an interim milestone of obtaining phase 2 quality for the complete genome by the end of 2002 (http://rgp.dna.affrc.go.jp/rgp/press_releas20011225.htm). Phase 2 quality is defined as sequenced bacterial artificial chromosomes (BACs) or P1 artificial chromosomes (PACs), with few sequencing gaps, whose pieces are ordered and oriented directionally. Currently, an estimated 206 Mb of nonoverlapping assembled sequence of phase 3 (finished-quality) or phase 2 rice genomic sequence is available in public databases, and three chromosomes are nearing completion (http://rgp.dna.affrc.go.jp/cgi-bin/statusdb/seqcollab.pl). Further details of the genome-sequencing methods used by the IRGSP were described by Eckardt (2000)Go.

Having a robust physical map is critical to the success of the project. From the outset, this project had the advantage of having an extremely well-mapped plant genome. About 1450 genetically mapped expressed sequence tags (ESTs) were used previously to anchor a yeast artificial chromosome (YAC)–based physical map that covered 63% of the genome (Saji et al., 2001Go; http://rgp.dna.affrc.go.jp/Publicdata.html). YACs, BACs, and PACs are large clones generally containing inserts of >100 kb, and up to 1.5 Mb in the case of some YACs. The ESTs also have been instrumental in anchoring the BAC- and PAC-based maps.

In this issue of The Plant Cell, Wu et al. (pages 525–535) and Chen et al. (pages 537–545) present two complementary studies that greatly increase the number of mapped ESTs and refine the physical maps to provide 90% coverage of the rice genome. In addition, they provide new estimates of the size of the rice genome and indicate the distribution of genes on the rice chromosomes.


    A COMPREHENSIVE RICE TRANSCRIPT MAP
 TOP
 A COMPREHENSIVE RICE TRANSCRIPT...
 AN INTEGRATED PHYSICAL AND...
 RICE GENOME SIZE
 References
 
In the first article, Wu and colleagues obtained 3' end sequences from >20,000 clones of their rice cDNA libraries. 3' end sequences tend to be the least redundant and thus the most likely to give gene-specific markers. From these, they selected 8440 unique sequences as templates for polymerase chain reaction primers. After screening, they retained 6713 sequences that amplified a single band of the predicted size from both rice genomic DNA and the pooled YAC library. Subsequently, they screened pools from the YAC library to identify the YAC clones containing the EST markers. Most of these markers identified YACs that were part of the physical map, permitting immediate mapping of these markers. Approximately 1500 ESTs identified YACs not placed on the physical map previously. A subset of 431 ESTs were mapped genetically, which allowed the placement of more clones on the YAC-based physical map, increasing coverage from 63 to 80%. Finally, a centromere-specific primer was used to identify YACs covering 11 of the 12 rice centromeres. In the end, an additional 6591 EST markers were placed on the physical map. This high marker density will be important in identifying BACs that fill the remaining gaps in the tiling path and in anchoring unplaced BAC contigs (a contig is a contiguous set of overlapping clones or sequences).

Previous work ties the physical map to the genetic map so that the two can be aligned. Because the genetic distance is not uniform with respect to the physical distance, these two maps show different spacing between the markers. Typically, recombination is reduced around centromeres so that genetic distances tend to be condensed, whereas they are more spread out on the chromosome arms. The work of Wu et al. (2002)Go presents a detailed view of the arrangement of transcribed genes along the length of the physical map. The primary lesson is that gene density varies from chromosome to chromosome and within chromosomes. The three largest chromosomes constitute 31% of the physical map but contain 41% of the EST sites. Within chromosomes, fewer ESTs are found in the vicinity of centromeres, and gene densities generally are highest at the distal ends of the chromosome arms. Similarly skewed gene distributions have been inferred in maize and wheat. Fifty-nine gene-rich regions were identified on the chromosome arms, and the authors estimate that 21% of the rice genome could contain 40% of the genes.


    AN INTEGRATED PHYSICAL AND GENETIC MAP
 TOP
 A COMPREHENSIVE RICE TRANSCRIPT...
 AN INTEGRATED PHYSICAL AND...
 RICE GENOME SIZE
 References
 
BACs and PACs are the primary templates for the clone-by-clone sequencing approach. The article by Chen et al. (2002)Go represents the culmination of several years of work by Rod Wing and his colleagues at the Clemson University Genetics Institute (CUGI) to create a BAC-based physical map of rice that covers 100% of the euchromatin and >90% of the genome. Two BAC libraries were created from HindIII and BamHI restriction enzyme partial digests that together represent 25-fold coverage of the genome. The ends of each clone were sequenced, and ~110,000 end sequences called sequence tag connectors (STCs) were obtained. STCs are used to pick clones flanking sequenced BACs with minimum overlap. The standard approach in building a physical map is to use DNA fingerprinting. In this method, individual BACs are digested to completion and displayed by high-resolution gel electrophoresis with molecular markers so that the fragments can be sized accurately. A collection of 25 or more sized fragments becomes the fingerprint for each BAC clone. FPC (FingerPrinted Contigs) software (Soderlund et al., 2000Go) is used to assemble the fingerprints to find overlapping BACs. These assemblies then must be examined manually to edit the contigs. Additionally, primer probes were developed from end sequences of terminal clones in many of the assemblies to identify overlapping clones or contigs. The contigs were placed on the genetic map by probing with mapped markers from the IRGSP. Carol Soderlund, the author of FPC, has written software that generates DNA fingerprints from sequenced BACs as they appear in GenBank and brings them into the assemblies, further anchoring the assemblies to the physical map.

The Monsanto Company conducted an independent genome sequence project for rice (Barry, 2001Go) in which >3000 BACs were sequenced to a fivefold level of redundancy. Monsanto generously made these sequences available to the IRGSP and to public researchers. As the IRGSP brings the sequence quality of Monsanto BACs to phase 2, the sequences are released to public databases. Brad Barbazuk of Monsanto was able to relate independently fingerprinted and assembled Monsanto BAC contigs to the CUGI assemblies by finding high-quality matches between the CUGI STCs and sequences in the assembled Monsanto BACs. This permitted the integration of the Monsanto BACs into the CUGI physical map.


    RICE GENOME SIZE
 TOP
 A COMPREHENSIVE RICE TRANSCRIPT...
 AN INTEGRATED PHYSICAL AND...
 RICE GENOME SIZE
 References
 
Estimates of the genome size of rice and the physical length of rice chromosomes are important issues for rice genome sequencers, who need to know how much must be sequenced. Arumuganathan and Earle (1991)Go reported that the 2C (twice the gametic) value for Oryza sativa ssp. japonica ranged from 0.86 to 0.91 pg and that the haploid genome size was 430 Mb. Arumuganathan (personal communication) later measured O. japonica cv Nipponbare and found a 2C value of 0.90 ± 0.02 pg. Assuming a mass of 650 D per base pair, this value places the haploid genome of cv Nipponbare at 417 Mb.

Chen et al. (2002)Go estimate that they have covered nearly all of the euchromatic portions of the genome. The BAC contigs are anchored to the genetic map by mapped markers common to the physical and genetic maps, and a genetic distance for each gap between contigs can be measured. Using a local ratio of physical distance to genetic distance, the sizes of the gaps (in base pairs) can be estimated. When the BAC contigs and gaps are totaled, the estimate for the genome size comes to 403 Mb. Chen et al. (2002)Go allow that their physical map does not cover the nucleolar organizer region at the end of chromosome 9. Also, the libraries apparently do not include telomeres. Furthermore, the estimates for gaps in centromeres must be approximate because there is virtually no recombination in these regions.

The current estimate of the length of each chromosome was calculated assuming a genome size of 430 Mb and dividing this figure by each chromosome's fraction of the total genetic distance measured in the Nipponbare x Kasalanth mapping population. Now that sequencing for chromosomes 1, 4, and 10 is nearly complete, we can see that, except for chromosome 1, these are reasonable estimates of the sizes of most of the chromosomes. The size of chromosome 1 is not known accurately because there are still a few gaps, including in the centromere region. The sequencing groups working on chromosomes 1, 4, and 10 estimate sizes of 47, 36, and 24.5 Mb, respectively. The sizes estimated from the integrated physical map are 44.3, 36.6, and 25.7 Mb.

The work of Wu et al. (2002)Go and Chen et al. (2002)Go represents a major contribution to the sequencing of the rice genome in that they provide tools for obtaining a minimal tiling path of minimally overlapping clones to be used for sequencing templates. They further provide a detailed transcript map for rice and confirm the size of the genome to be slightly >400 Mb. Beyond sequencing, the comprehensive EST transcript map and integrated physical and genetic maps provide valuable tools for map-based cloning and gene identification in rice and related monocot species.


    References
 TOP
 A COMPREHENSIVE RICE TRANSCRIPT...
 AN INTEGRATED PHYSICAL AND...
 RICE GENOME SIZE
 References
 
Arumuganathan, K., and Earle, E.D. (1991). Nuclear DNA content of some important plant species. Plant Mol. Biol. Rep. 3, 208–218.

Barry, G. (2001). The use of the Monsanto draft rice genome sequence in research. Plant Physiol. 125, 1164–1165.[Free Full Text]

Chen, M., et al. (2002). An integrated physical and genetic map of the rice genome. Plant Cell 14, 537–545.[Abstract/Free Full Text]

Eckardt, N.A. (2000). Sequencing the rice genome. Plant Cell 12, 2011–2017.[Free Full Text]

Saji, S., Umehara, Y., Antonio, B.A., Yamane, H., Tanoue, H., Baba, T., Aoki, H., Ishige, N., Wu, J., Koike, K., Matsumoto, T., and Sasaki, T. (2001). A physical map with yeast artifical chromosome (YAC) clones covering 63% of the 12 rice chromosomes. Genome 44, 32–37.[Medline]

Sasaki, T., and Burr, B. (2000). International Rice Genome Sequencing Project: The effort to completely sequence the rice genome. Curr. Opin. Plant Biol. 3, 138–141.[CrossRef][Web of Science][Medline]

Soderlund, C., Humphray, S., Dunham, A., and French, L. (2000). Contigs built with fingerprints, markers and FPC V4.7. Genome Res. 10, 1772–1787.[Abstract/Free Full Text]

Wu, J., et al. (2002). A comprehensive rice transcript map containing 6591 ex-pressed sequence tag sites. Plant Cell 14, 525–535.[Abstract/Free Full Text]


Related articles in Plant Cell:

A Comprehensive Rice Transcript Map Containing 6591 Expressed Sequence Tag Sites
Jianzhong Wu, Tomoko Maehara, Takanori Shimokawa, Shinichi Yamamoto, Chizuko Harada, Yuka Takazaki, Nozomi Ono, Yoshiyuki Mukai, Kazuhiro Koike, Jyunshi Yazaki, Fumiko Fujii, Ayahiko Shomura, Tsuyu Ando, Izumi Kono, Kazunori Waki, Kimiko Yamamoto, Masahiro Yano, Takashi Matsumoto, and Takuji Sasaki
Plant Cell 2002 14: 525-535. [Abstract] [Full Text]  

An Integrated Physical and Genetic Map of the Rice Genome
Mingsheng Chen, Gernot Presting, W. Brad Barbazuk, Jose Luis Goicoechea, Barbara Blackmon, Guangchen Fang, Hyeran Kim, David Frisch, Yeisoo Yu, Shouhong Sun, Stephanie Higingbottom, John Phimphilai, Dao Phimphilai, Scheen Thurmond, Brian Gaudette, Ping Li, Jingdong Liu, Jamie Hatfield, Dorrie Main, Kasey Farrar, Caroline Henderson, Laura Barnett, Ravi Costa, Brian Williams, Suzanne Walser, Michael Atkins, Caroline Hall, Muhammad A. Budiman, Jeffery P. Tomkins, Meizhong Luo, Ian Bancroft, Jerome Salse, Farid Regad, Trilochan Mohapatra, Nagendra K. Singh, Akhilesh K. Tyagi, Carol Soderlund, Ralph A. Dean, and Rod A. Wing
Plant Cell 2002 14: 537-545. [Abstract] [Full Text]  



This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Mukhopadhyay, S. Vij, and A. K. Tyagi
Overexpression of a zinc-finger protein gene from rice confers tolerance to cold, dehydration, and salt stress in transgenic tobacco
PNAS, April 20, 2004; 101(16): 6309 - 6314.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. E. Sorrells, M. La Rota, C. E. Bermudez-Kandianis, R. A. Greene, R. Kantety, J. D. Munkvold, Miftahudin, A. Mahmoud, X. Ma, P. J. Gustafson, et al.
Comparative DNA Sequence Analysis of Wheat and Rice Genomes
Genome Res., August 1, 2003; 13(8): 1818 - 1827.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Related articles in Plant Cell
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Burr, B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Burr, B.
Agricola
Right arrow Articles by Burr, B.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ASPB Publications THE PLANT CELL PLANT PHYSIOLOGY
Copyright © 2002 by the American Society of Plant Biologists