|
|
||||||||
|
First published online April 14, 2006; 10.1105/tpc.106.041111 The Plant Cell 18:1152-1165 (2006) © 2006 American Society of Plant Biologists Independent Ancient Polyploidy Events in the Sister Families Brassicaceae and Cleomaceae[W]a Department of Genetics and Evolution, Max Planck Institute for Chemical Ecology, D-07745 Jena, Germany 2 To whom correspondence should be addressed. E-mail eric.schranz{at}duke.edu; fax 919-613-8177.
Recent studies have elucidated the ancient polyploid history of the Arabidopsis thaliana (Brassicaceae) genome. The studies concur that there was at least one polyploidy event occurring some 14.5 to 86 million years ago (Mya), possibly near the divergence of the Brassicaceae from its sister family, Cleomaceae. Using a comparative genomics approach, we asked whether this polyploidy event was unique to members of the Brassicaceae, shared with the Cleomaceae, or an independent polyploidy event in each lineage. We isolated and sequenced three genomic regions from diploid Cleome spinosa (Cleomaceae) that are each homoeologous to a duplicated region shared between At3 and At5, centered on the paralogs of SEPALLATA (SEP) and CONSTANS (CO). Phylogenetic reconstructions and analysis of synonymous substitution rates support the hypothesis that a genomic triplication in Cleome occurred independently of and more recently than the duplication event in the Brassicaceae. There is a strong correlation in the copy number (single versus duplicate) of individual genes, suggesting functionally consistent influences operating on gene copy number in these two independently evolving lineages. However, the amount of gene loss in Cleome is greater than in Arabidopsis. The genome of C. spinosa is only 1.9 times the size of A. thaliana, enabling comparative genome analysis of separate but related polyploidy events.
Polyploidy, or whole-genome duplication, has played a major role in the evolution of many branches of the tree of life by providing exponential and saltational increases in genome size and gene copy number. Polyploidy provides the opportunity for selection to sculpt a variety of new gene functions, traits, and lineages. There are several examples of polyploid animals (Mable, 2004
Considering the prevalence of recent polyploids among flowering plants, it is not surprising that a plethora of ancient polyploid events have also been discovered. Evidence for ancient polyploidy has come mostly from comparative genetic mapping, analysis of specific gene families, or by the identification of duplicated genes in EST collections, for example, Brassica species (Lagercrantz, 1998
Studies of duplicate genes and colinearity in the A. thaliana genome differ in their assessment of the total number of polyploidy events but generally agree on the occurrence of at least one complete ancient genome duplication (Arabidopsis Genome Initiative, 2000
Recent phylogenetic analyses have established that the Brassicaceae is sister to the Cleomaceae, with these two families being sister to the Capparaceae (Hall et al., 2002
There are three hypotheses to describe the -polyploidy event in Arabidopsis relative to the divergence of the sister families Brassicaceae and Cleomaceae (illustrated in Figure 2
). First, the polyploid event could be unique to the Brassicaceae. Second, it could predate the divergence of these families and therefore be shared by the Brassicaceae and Cleomaceae. Finally, there could have been independent ancient polyploidy events in both lineages. For each of these hypotheses, there are specific expectations for genomic organization and relationships among homologous genes. For the Brassicaceae-only hypothesis, there should be a 2:1 ratio of genomic regions and genes in the Brassicaceae compared with the Cleomaceae. For the shared and independent hypotheses, there should be multiple homologous genomic regions and sets of genes in both lineages. However, these two hypotheses can be distinguished by the relationship and evolution among the duplicated copies of each gene (Cannon and Young, 2003
Comparative Cleome BAC Sequencing We focused our comparative genomics analysis on regions in Cleome that were homoeologous to a duplicated region in Arabidopsis shared between At3 and At5 (Figure 3A ). Our target region was contained within one of the largest recently duplicated blocks identified in Arabidopsis; it was the fourth largest block (block 0305000103160) identified by Blanc et al. (2003) 128 kb on At3 and 178 kb on At5 (for a total of 306 kb). There are 36 annotated genes on At3 in this interval and 51 genes on At5. A total of 20 duplicated (paralogous) gene pairs exist between these two regions (Figure 3A).
The screening of our Cleome BAC library with the Cleome SEP and CO probes identified a number of putative positive clones. Screening the clones by restriction digestion and PCR analysis identified three independent homoeologous Cleome genomic regions. BACs from each of the three regions that had nearly complete target regions (identified by PCR) were chosen for shotgun sequencing (Cleome BACs Cs_1, Cs_2, and Cs_3). Sequencing confirmed that Cs_1 and Cs_2 BACs contained homologs to At3g02230/At5g15650 and At3g02560/At5g16130. Cs_3 contained a homolog to At3g02230/At5g15650 but not At3g02560/At5g16130 (the BAC ended at the homolog of At5g16120). However, using PCR, we were able to extend and clone the length of the Cs_3 region to include a homolog to At3g02560/At5g16130. The length of sequence between the flanking genes that define the target region in each BAC was 124, 102, and 143 kb, respectively (for a combined sequence length of 369 kb). TwinScan identified a number of open reading frames (ORFs) on each BAC. BLASTN and BLASTX analyses found that each of the BACs contained a unique subset of genes with homology to those from At3 and At5 or that were duplicated (Figure 3; see Supplemental Table 2 online). For example, the target region on BAC Cs_1 contained 2 At3-specific genes, 6 At5-specific genes, 13 duplicated genes, and 16 BAC-specific ORFs (most of which are clustered in the center of the BAC and contain a number of transposons) (Figure 3B). Overall, the gene density within the target regions was very similar between Arabidopsis (with an average gene density of one gene per 3.4 kb) and Cleome (with an average gene density of one gene per 3.2 kb). The five genomic regions (two in Arabidopsis and three in Cleome) were remarkably colinear across the target region and could easily be aligned (Figure 3C). There were three loci for which there were five homologs, one on each genomic region, including the loci defining the ends of the target region (Figure 3C). The complete list of genes found on these five genomic regions and the homology relationships among them allowed for the reconstruction of the putative ancestral gene order (see Supplemental Table 2 online). In total, the three Cleome BACs contained homologs of 50 of the 66 (76%) unique annotated genes found in Arabidopsis (paralogous genes were counted only once and excluded tRNA, transposons, and tandem duplicates found in Arabidopsis). Of the 17 genes found in Arabidopsis for which no Cleome homologs were identified, many were members of multigene families. Hence, these loci were impractical for additional BAC library screening. Only one of the missing loci was both single copy and highly conserved among many organisms (At5g15920, the structural maintenance of chromosomes family protein [MSS2]). The Cleome homolog of the At5g15920 gene was PCR amplified, cloned, sequenced, and used to reprobe the Cleome BAC library. The putative positive clones were screened and found to correspond to only a single genomic region. Limited shotgun sequencing was done from a pool of four of these BACs. The similarity of these sequences was to the homolog of At5g15920 and then to another part of the A. thaliana genome, suggesting that this locus has undergone a rearrangement in Cleome relative to Arabidopsis.
There were a total of 36 ORFs predicted in the three Cleome BACs that did not have homologs within the target regions of Arabidopsis. Many of these ORFs are short and ultimately may not represent true genes. However, several of these are likely, and potentially novel, genes. For example, Cs3_ORF44 is predicted to code for a 392amino acid peptide whose only similarity is to an unknown peptide in rice (GenBank accession number BAD81800). There are other loci that have homology to Arabidopsis genes but not to genes found in the target interval. Often these are members of large gene families. For example, Cs3_ORF7 has high similarity to members of phosphatidylinositol 3- and 4-kinase family proteins. Cs1_ORF27 is a 639amino acid protein with high similarity to pentatricopeptide family proteins (and the gene At1g31430 in particular). Cs1_ORF25 has similarity to Cyp89-like cytochrome P450 proteins, and Cs1_ORF31 is only 78 amino acids long and has similarity to S-adenosylmethionine decarboxylases. This last ORF may represent a regulatory upstream ORF in the 5' region of the upstream region of the adenosylmethionine decarboxylase Cs1_At5g15950 gene, as has been described in other systems (Hanfrey et al., 2003
In addition, six ORFs on Cs1 had similarity to transposon-like genes (Cs1_ORFs 20 to 24, 26, and 41). When we did a complete BLASTX of our BAC sequences to the conserved-motif amino acid database of the Brassicaceae compiled by Zhang and Wessler (2004) For each of the sequenced Cleome BACs, we obtained some sequence beyond the target interval. Most additional sequence was colinear with the homologous At3 and/or At5 regions, except for the BAC Cs_2, which showed evidence of a rearrangement (with the homolog of At5g15650 next to the homolog of At5g18950; data not shown). This represents the only structural rearrangement detected in our data.
Phylogenetic Reconstructions with Homologous Gene Sets
One of the homologous gene sets examined from our target region was the SEP1 and SEP2 family (Figure 4). Phylogenetic relationships of the SEP genes have recently been established (Zahn et al., 2005 -clone library; Windsor et al., 2006
We recovered the same tree topology for the SEP1 and SEP2 regions as Zahn et al. (2005)
In order to have an independent assessment of gene duplication, we present a representative of a duplicated gene from outside our target region used for phylogenetic reconstruction, the Arg decarboxylase (ADC) family (Figure 5). The ADC gene was duplicated between At2 and At4 (ADC1, At2g16500; ADC2, At4g34710). Sequences of ADC genes were used previously for phylogenetic studies within the Brassicaceae where the gene was found to be duplicated across the entire Brassicaceae, except in the genus Aethionema (Galloway et al., 1998
Finally, we combined the coding sequence data for all duplicated genes in the target regions for Arabidopsis and Cleome for phylogenetic analysis (Figure 6). The At3 and At5 sequences contained data for all 14 duplicated genes. However, each of the three regions in Cleome contained some, but not all, of the duplicates. Hence, the missing genes were scored as missing data. Recent studies have concluded that phylogenetic studies containing even large amounts of missing data can be more robust than many smaller gene and taxa studies (Driskell et al., 2004
Calculation and Analysis of Synonymous Substitution Rates
We also performed an analysis of variance on the Ks values from the comparisons of Arabidopsis-Arabidopsis and Cleome-Cleome and looked for locus-specific variation in synonymous substitution rates (Table 2 ). Levels of synonymous differentiation differed significantly between Arabidopsis-Arabidopsis and Cleome-Cleome (P = 0.0007), but we found no evidence of heterogeneous rates among loci (P = 0.228). Hence, the age of the duplication events in the two lineages, as measured by Ks values, is different.
Patterns of Duplicate Gene Retention and Loss The sequencing of the three replicated genomic regions from C. spinosa allowed us to examine the overall patterns of duplicate gene retention and loss compared with that observed within A. thaliana (Table 3 , Figure 7 ). A 2 test of independence between species and copy number (single versus duplicate) found a highly significant result (P = 0.0079). Retention of duplicate copies in the two species was nonrandom. The copy number of particular loci is correlated in these two independently evolving lineages (Table 3).
We compiled the gene ontology (GO) annotations of the genes that were either duplicated in both lineages or single copy in both lineages. Although genes can have multiple GO annotations and therefore may be represented multiple times, we report the number of gene annotations in each molecular function GO category for single and duplicated copy genes (Figure 7). The most obvious overrepresented category for single-copy genes was hydrolase activity, and the most overrepresented category for duplicated genes was protein binding (Figure 7). Assuming that there were 50 ancestral genes in the target region at the split between Brassicaceae and Cleomaceae (this is a conservative number based on the number of genes shared now between Cleome and Arabidopsis), a genome duplication in the Arabidopsis lineage would have resulted in 100 loci and a genome triplication in Cleome with 150 loci. For these genes in the two Arabidopsis regions, there are currently 30 genes in single copy and 20 in duplicate (20 + 20 = 40) for a total of 70 loci. The overall degree of gene retention is thus 70/100 = 70%. In Cleome, there are 24 single-copy genes, 23 in duplicate (23 + 23 = 46), and 3 in triplicate (3 + 3 + 3 = 9) for a total of 79 loci. Thus, the overall degree of gene retention is only 79/150 = 53%.
Since the ancient polyploid history of A. thaliana was illuminated by genome analyses, there has been much speculation about the timing and phylogenetic position of the genome doubling (Arabidopsis Genome Initiative, 2000 The occurrence of independent ancient polyploidy events in these closely related lineages allows analysis of the relative timing of polyploidization, patterns of gene retention and loss, and genome evolution. Our results highlight the continuous cycles of polyploidy and diploidization that have occurred during angiosperm evolution.
Timing of Ancient Polyploidy Events
Following the suggestion of Maere et al. (2005)
One of the most important conclusions to be drawn from our analysis is that the divergence of the Cleome and Arabidopsis lineages is older than the Arabidopsis lineage polyploidy event. A major goal now will be to determine when the polyploid event occurred during the evolution of the Brassicaceae, particularly in relationship to the divergence of the genus Aethionema from the rest of the family. A previous study (Galloway et al., 1998
Another unresolved question is when the likely whole-genome polyploid event occurred during the evolution of the Cleomaceae. There is a well-supported New World temperate clade containing the genera Cleomella, Oxystylis, Wislizenia, and Isomeris that is sister to the remainder of the family (Hall et al., 2002
Patterns of Gene Retention and Loss
When we examined the GO annotations of the genes found in duplicate in both lineages or single copy in both lineages, we found several biased categories despite our rather small sample of the genomes (only
A comparative analysis of genes within our target region has the potential to elucidate gene evolution and function. The maintenance of duplicate copies of genes in both Arabidopsis and Cleome may indicate that it is beneficial or necessary to do so. For example, we found SEP1 and SEP2 genes to be duplicated in both lineages. This agrees with published reports that MADS box genes, and SEP genes in particular, often have been maintained in duplicate throughout angiosperm evolution (Irish and Litt, 2005
Other genes are duplicated in one lineage and single copy in the other. For example, CO is replicated in Arabidopsis but exists in single copy in Cleome. In Arabidopsis, there is a tandem duplication on At5, giving rise to CO and CONSTANS-LIKE1 (COL1), and their paralog on At3, COL2 (Ledger et al., 2001
Polyploidization in Cleome
In addition to the triplicated genomic region that we sequenced in Cleome, we also amplified replicated copies for four additional loci. These additional loci are found in duplicate in Arabidopsis and are scattered about the genome. The finding of additional Cleome replicated genes of the same approximate age as our triplicated region suggests that there was a whole-genome triplication. Additionally, phylogenetic reconstructions with these loci show similar patterns suggestive of independent polyploidy events in the Arabidopsis and Cleome lineages (e.g., Figure 5). It is possible that replicated regions in Cleome were derived by aneuploidy or segmental duplication. Examples of large-scale segmental duplications have been recently detected in the rice genome (Yu et al., 2005
The likely whole-genome triplication observed in Cleome suggests the ancestral Cleome polyploid was a hexaploid (6x), which subsequently underwent diploidization. For example, it has been hypothesized that diploid Brassica species are derived from a hexaploid ancestor, and now much of the genome exists in triplicate compared with Arabidopsis (Lagercrantz, 1998
Genome Evolution
The patterns of gene loss are important for understanding both the evolution of the genes themselves and the genome in general. For genes present in both genomes, 60% of Arabidopsis genes are now present in single copy compared with 48% in Cleome. Although more genes are found in single copy in A. thaliana, the degree of gene retention is higher (70% versus 53%). The lower degree of gene retention in Cleome is surprising considering it is a much younger polyploidy event. The difference of gene loss between the two lineages is likely attributable to the initial degree of replication. For a gene to become single copy in Cleome, two paralogous sequences must be lost. Most replicated genes in Cleome are duplicated (23 pairs) rather than triplicated (only three loci), meaning one gene copy has been lost. A similar degree of high gene loss in ancient triplicated genomes has been found by analysis of the diploid Brassica genomes (O'Neill and Bancroft, 2000 Of the genes found in the two Arabidopsis regions, we were unable to find homologs to 24% of them in the Cleome regions. Although most of these were members of gene families in Arabidopsis, we did find evidence for the transduction of at least one locus. Similarly, we predicted 36 unique ORFs in the three Cleome regions. Many of these were short and may not represent true genes. However, many of these Cleome ORFs were similar to members of gene families in Arabidopsis but did not have homologs within the target interval. Hence, in both Arabidopsis and Cleome, there appears to be a migration of members of gene families in and out of various genomic locations that affects the microsynteny of the genomes.
In addition to gene loss and movement, chromosomal rearrangements play an important role in the evolution of genomes after polyploidization. The three regions in Cleome and the two regions in Arabidopsis showed a remarkable colinearity across our target region despite the degree of gene loss and movement that has occurred in each lineage subsequent to polyploidization. We detected a single inversion or rearrangement in one of the sequenced BACs, but this lay outside of our target interval. The degree of large-scale rearrangements can be ascertained by comparative mapping between Brassicaceae species (e.g., Koch et al., 2001
The processes of gene loss and chromosomal rearrangements will obviously contribute to the evolution of genome size. Within our target interval, the three Cleome regions are only 1.2 times the size of the two Arabidopsis regions, and the complete genome of Cleome is The use of both comparative genomic and phylogenetic approaches was critical for our elucidation of the independent polyploidy events. A phylogenetic approach alone, for example, using EST sequences, would likely have detected independent polyploid events but probably would have concluded that Cleome was duplicated rather than triplicated. A comparative mapping approach may have been able to detect ancient triploidy, but without the phylogenetic analysis it would have been impossible to say that they were independent polyploidy events. Our genomic approach also allowed for an assessment of the types of genes present in duplicate versus single copy. Finally, the occurrence of independent ancient polyploidy events in closely related lineages provides an interesting opportunity to examine duplicate gene evolution and its contribution to phenotypic diversity.
Plant Materials and DNA Isolation Plants were grown from seeds of Cleome spinosa (ES1046; Spinnenpflanze) purchased from Kiepenkerl. DNA isolations were done using Qiagen genomic tips and by a modified CTAB procedure as previously described (Schranz et al., 2005
Amplification, Cloning, and Sequencing of PCR Products PCR products were obtained with 35 cycles as follows: 94°C, 30 s/60°C, 30 s/72°C, 2 min. Two independent PCRs were performed for each sample, and products were cloned using the TOPO TA cloning kit (Invitrogen Life Technologies). At a minimum, four clones per cloning reaction were sequenced as above on both strands with an ABI3700 capillary sequencer (Applied Biosystems).
BAC Library
Clone Identification and Sequencing The homologous C. spinosa BAC clones were identified by filter hybridization of gridded nylon membranes containing the clones of the BAC library. Gene products for the Cleome homologs of CO/COL2 and SEP1 and SEP2 were amplified, and PCR products were gel-purified using the QIAquick gel extraction kit (Qiagen). One hundred nanograms of each purified PCR product was labeled and hybridized using the ECL Direct nucleic acid labeling and detection system (Amersham Biosciences) according to the manufacturer's protocol. Several positive clones were isolated from each hybridization and amplified in small overnight culture, and DNA minipreparations for each BAC were done using standard alkaline lysis procedures. The minipreparations of BAC DNA were fingerprinted by restriction digest and screened by PCR with primers to many of the putative genes located within the genomic target region. Three distinct BAC regions, each covering the approximate interval from At5g15650 to At5g16130, were grown in 250 mL TB liquid cultures with chloramphenicol selection. BAC DNA isolation was done using the NucleoBond BAC100 kit (Macherey-Nagel) according to the manufacturer's instructions. The BAC DNA was then cleaned of contaminating sequences using plasmid-safe ATP-dependent DNase (Epicentre Biotechnologies). The selected BAC clones were shotgun sequenced by subjecting each to separate partial digestions with Tsp509I and Sau3AI enzymes, size fractionating into 12- to 18-kb fragments, cloning into a pUC19 vector, and end-sequencing the clones with an ABI3700 capillary sequencer. Sequences were assembled and edited with Seqman 5.0 (DNASTAR).
Sequence Annotation
Sequence Alignment and Phylogenetic Analysis
Alignments of homologous sets of loci were done on translated protein sequences using ClustalW (Thompson et al., 1994
Phylogenetic reconstruction was performed by analyzing DNA sequences with maximum-likelihood (ML) methods in the PHYML program (Guindon et al., 2005
In addition, the aligned duplicated gene coding sequences from At3, At5, and the three Cleome BAC regions were each concatenated into one large sequence. Missing genes were treated as gaps. The total aligned sequence was 12,934 bp long. In order to examine the overall relationships of the regions, these five concatenated sequences were used for phylogenetic analysis in the same manner as described above for individual genes. The phylogenies were then examined for branching patterns of Cleome copies relative to Arabidopsis copies for evidence of orthologous versus paralogous relationships (Bowers et al., 2003
Evolutionary Analysis
We built a 2 x 2 contingency table (Table 3) to compare the observed and expected numbers of duplicate versus single-copy genes under the hypothesis of independence between the two lineages and performed a standard GO annotations for A. thaliana genes in our target genomic regions were downloaded from TAIR (www.arabidopsis.org). The GO annotations of genes found in duplicate in both Arabidopsis and Cleome were contrasted to those found in single copy in both (Figure 7). The amount of gene loss in the target region was calculated for both Arabidopsis and Cleome, assuming that there were 50 ancestral genes (the number of loci shared between the two genomes in the target region). Genome duplication in the Arabidopsis lineage would result in 100 loci, and a genome triplication in Cleome with 150 loci. The number of loci for these shared genes now present in the two genomes was used to calculate the degree of gene loss.
Accession Numbers
Supplemental Data
We thank Jocelyn Hall and the three reviewers for helpful comments on the manuscript. We also thank Heiko Vogel, Karl Schmid, Domenica Schnabelrauch, Xiaoyu Zhang, and Aaron Windsor for technical advice and assistance. Finally, we thank Maria Clauss, Chris Pires, Martin Lysak, Amy Bouck, Marcus Koch, David Baum, George Coupland, Laurent Corbesier, Jeff Doyle, and Ihsan Al-Shehbaz for illuminating discussions. The Max Planck Gesellschaft and Duke University provided support for this research.
1 Current address: Department of Biology, PO Box 91000, Duke University, Durham, NC 27708. The authors responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantcell.org) are: M. Eric Schranz (eric.schranz{at}duke.edu) and Thomas Mitchell-Olds (tmo1{at}duke.edu).
[W] Online version contains Web-only data. Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.106.041111. Received January 13, 2006; Revision received February 27, 2006. accepted March 21, 2006.
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. (1990). Basic Local Alignment Search Tool. J. Mol. Biol. 215, 403410.[CrossRef][Web of Science][Medline] APG II (2003). An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG II. Bot. J. Linn. Soc. 141, 399436.[CrossRef] Arabidopsis Genome Initiative (2000). Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796815.[CrossRef][Medline] Blanc, G., Hokamp, K., and Wolfe, K.H. (2003). A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res. 13, 137144. Blanc, G., and Wolfe, K.H. (2004a). Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell 16, 16671678. Blanc, G., and Wolfe, K.H. (2004b). Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution. Plant Cell 16, 16791691. Bowers, J.E., Chapman, B.A., Rong, J.K., and Paterson, A.H. (2003). Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422, 433438.[CrossRef][Medline] Brown, N.J., Parsley, K., and Hibberd, J.M. (2005). The futured C-4 research Maize, Flaveria or Cleome? Trends Plant Sci. 10, 215221.[CrossRef][Web of Science][Medline] Cannon, S.B., and Young, N.D. (2003). OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies. BMC Bioinformatics 4, 35.[CrossRef][Medline] Chapman, B.A., Bowers, J.E., Feltus, F.A., and Paterson, A.H. (2006). Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication. Proc. Natl. Acad. Sci. USA 103, 27302735. Chapman, B.A., Bowers, J.E., Schulze, S.R., and Paterson, A.H. (2004). A comparative phylogenetic approach for dating whole genome duplication events. Bioinformatics 20, 180185. Corbesier, L., and Coupland, G. (2005). Photoperiodic flowering of Arabidopsis: Integrating genetic and physiological approaches to characterization of the floral stimulus. Plant Cell Environ. 28, 5466.[CrossRef] De Bodt, S., Maere, S., and Van de Peer, Y. (2005). Genome duplication and the origin of angiosperms. Trends Ecol. Evol. 20, 591597.[CrossRef][Medline] Devos, K.M. (2005). Updating the crop circle. Curr. Opin. Plant Biol. 8, 155162.[CrossRef][Web of Science][Medline] Driskell, A.C., Ane, C., Burleigh, J.G., McMahon, M.M., O'Meara, B.C., and Sanderson, M.J. (2004). Prospects for building the tree of life from large sequence databases. Science 306, 11721174. Duarte, J.M., Cui, L.Y., Wall, P.K., Zhang, Q., Zhang, X.H., Leebens-Mack, J., Ma, H., Altman, N., and dePamphilis, C.W. (2006). Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis. Mol. Biol. Evol. 23, 469478. Ermolaeva, M.D., Wu, M., Eisen, J.A., and Salzberg, S.L. (2003). The age of the Arabidopsis thaliana genome duplication. Plant Mol. Biol. 51, 859866.[CrossRef][Web of Science][Medline] Felsenstein, J. (1985). Confidence limits on phylogenies: An approach using the bootstrap. Evolution Int. J. Org. Evolution 39, 783791.[CrossRef][Web of Science] Feuillet, C., and Keller, B. (2002). Comparative genomics in the grass family: Molecular characterization of grass genome structure and evolution. Ann. Bot. (Lond.) 89, 310. Force, A., Lynch, M., Pickett, F.B., Amores, A., Yan, Y.L., and Postlethwait, J. (1999). Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151, 15311545. Gallardo, M.H., Bickham, J.W., Honeycutt, R.L., Ojeda, R.A., and Kohler, N. (1999). Discovery of tetraploidy in a mammal. Nature 401, 341.[CrossRef][Medline] Galloway, G.L., Malmberg, R.L., and Price, R.A. (1998). Phylogenetic utility of the nuclear gene arginine decarboxylase: An example from Brassicaceae. Mol. Biol. Evol. 15, 13121320.[Abstract] Gascuel, O. (1997). BIONJ: An improved version of the NJ algorithm based on a simple model of sequence data. Mol. Biol. Evol. 14, 685695.[Abstract] Gaut, B.S., and Doebley, J.F. (1997). DNA sequence evidence for the segmental allotetraploid origin of maize. Proc. Natl. Acad. Sci. USA 94, 68096814. Guindon, S., Lethiec, F., Duroux, P., and Gascuel, O. (2005). PHYML Online A web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res. 33, W557W559. Hall, J.C., Iltis, H.H., and Sytsma, K.J. (2004). Molecular phylogenetics of core brassicales, placement of orphan genera Emblingia, Forchhammeria, Tirania, and character evolution. Syst. Bot. 29, 654669.[CrossRef] Hall, J.C., Sytsma, K.J., and Iltis, H.H. (2002). Phylogeny of Capparaceae and Brassicaceae based on chloroplast sequence data. Am. J. Bot. 89, 18261842. Hanfrey, C., Franceschetti, M., Mayer, M.J., Illingworth, C., Elliott, K., Collier, M., Thompson, B., Perry, B., and Michael, A.J. (2003). Translational regulation of the plant S-adenosylmethionine decarboxylase. Biochem. Soc. Trans. 31, 424427.[CrossRef][Web of Science][Medline] Irish, V.F., and Litt, A. (2005). Flower development and evolution: Gene duplication, diversification and redeployment. Curr. Opin. Genet. Dev. 15, 454460.[CrossRef][Web of Science][Medline] Jaillon, O., et al. (2004). Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature 431, 946957.[CrossRef][Medline] Johnston, J.S., Pepper, A.E., Hall, A.E., Chen, Z.J., Hodnett, G., Drabek, J., Lopez, R., and Price, H.J. (2005). Evolution of genome size in Brassicaceae. Ann. Bot. (Lond.) 95, 229235. Kashkush, K., Feldman, M., and Levy, A.A. (2002). Gene loss, silencing and activation in a newly synthesized wheat allotetraploid. Genetics 160, 16511659. Kellis, M., Birren, B.W., and Lander, E.S. (2004). Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428, 617624.[CrossRef][Medline] Koch, M., Haubold, B., and Mitchell-Olds, T. (2001). Molecular systematics of the Brassicaceae: Evidence from coding plastidic matK and nuclear Chs sequences. Am. J. Bot. 88, 534544. Korf, I., Flicek, P., Duan, D., and Brent, M.R. (2001). Integrating genomic homology into gene structure prediction. Bioinformatics 17, S140S148.[Abstract] Lagercrantz, U. (1998). Comparative mapping between Arabidopsis thaliana and Brassica nigra indicates that Brassica genomes have evolved through extensive genome replication accompanied by chromosome fusions and frequent rearrangements. Genetics 150, 12171228. Le Comber, S.C., and Smith, C. (2004). Polyploidy in fishes: Patterns and processes. Biol. J. Linn. Soc. Lond. 82, 431442.[CrossRef] Ledger, S., Strayer, C., Ashton, F., Kay, S.A., and Putterill, J. (2001). Analysis of the function of two circadian-regulated CONSTANS-LIKE genes. Plant J. 26, 1422. Leitch, I.J., and Bennett, M.D. (2004). Genome downsizing in polyploid plants. Biol. J. Linn. Soc. Lond. 82, 651663.[CrossRef] Lukens, L.N., Quijada, P.A., Udall, J., Pires, J.C., Schranz, M.E., and Osborn, T.C. (2004). Genome redundancy and plasticity within ancient and recent Brassica crop species. Biol. J. Linn. Soc. Lond. 82, 665674.[CrossRef] Lynch, M., and Conery, J.S. (2000). The evolutionary fate and consequences of duplicate genes. Science 290, 11511155. Lysak, M.A., Koch, M.A., Pecinka, A., and Schubert, I. (2005). Chromosome triplication found across the tribe Brassiceae. Genome Res. 15, 516525. Ma, X.F., and Gustafson, J.P. (2005). Genome evolution of allopolyploids: A process of cytological and genetic diploidization. Cytogenet. Genome Res. 109, 236249.[CrossRef][Web of Science][Medline] Mable, B.K. (2004). Why polyploidy is rarer in animals than in plants': myths and mechanisms. Biol. J. Linn. Soc. Lond. 82, 453466. Maere, S., De Bodt, S., Raes, J., Casneuf, T., Van Montagu, M., Kuiper, M., and Van de Peer, Y. (2005). Modeling gene and genome duplications in eukaryotes. Proc. Natl. Acad. Sci. USA 102, 54545459. Malcomber, S.T., and Kellogg, E.A. (2005). SEPALLATA gene diversification: Brave new whorls. Trends Plant Sci. 10, 427435.[CrossRef][Web of Science][Medline] O'Neill, C.M., and Bancroft, I. (2000). Comparative physical mapping of segments of the genome of Brassica oleracea var. alboglabra that are homoeologous to sequenced regions of chromosomes 4 and 5 of Arabidopsis thaliana. Plant J. 23, 233243.[CrossRef][Web of Science][Medline] Osborn, T.C., Pires, J.C., Birchler, J.A., Auger, D.L., Chen, Z.J., Lee, H.S., Comai, L., Madlung, A., Doerge, R.W., Colot, V., and Martienssen, R.A. (2003). Understanding mechanisms of novel gene expression in polyploids. Trends Genet. 19, 141147.[CrossRef][Web of Science][Medline] Panopoulou, G., and Poustka, A.J. (2005). Timing and mechanism of ancient vertebrate genome duplications The adventure of a hypothesis. Trends Genet. 21, 559567.[CrossRef][Web of Science][Medline] Parkin, I.A.P., Gulden, S.M., Sharpe, A.G., Lukens, L., Trick, M., Osborn, T.C., and Lydiate, D.J. (2005). Segmental structure of the Brassica napus genome based on comparative analysis with Arabidopsis thaliana. Genetics 171, 765781. Paterson, A.H., Bowers, J.E., and Chapman, B.A. (2004). Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc. Natl. Acad. Sci. USA 101, 99039908. Pfeil, B.E., Schlueter, J.A., Shoemaker, R.C., and Doyle, J.J. (2005). Placing paleopolyploidy in relation to taxon divergence: A phylogenetic analysis in legumes using 39 gene families. Syst. Biol. 54, 441454. Posada, D., and Crandall, K.A. (1998). MODELTEST: Testing the model of DNA substitution. Bioinformatics 14, 817818. Prince, V.E., and Pickett, F.B. (2002). Splitting pairs: The diverging fates of duplicated genes. Nat. Rev. Genet. 3, 827837.[CrossRef][Web of Science][Medline] Ptacek, M.B., Gerhardt, H.C., and Sage, R.D. (1994). Speciation by polyploidy in treefrogs Multiple origins of the tetraploid, Hyla versicolor. Evolution Int. J. Org. Evolution 48, 898908. Quiros, C.F., Grellet, F., Sadowski, J., Suzuki, T., Li, G., and Wroblewski, T. (2001). Arabidopsis and Brassica comparative genomics: Sequence, structure and gene content in the ABI1-Rps2-Ck1 chromosomal segment and related regions. Genetics 157, 13211330. Rana, D., Boogaart, T., O'Neill, C.M., Hynes, L., Bent, E., Macpherson, L., Park, J.Y., Lim, Y.P., and Bancroft, I. (2004). Conservation of the microstructure of genome segments in Brassica napus and its diploid relatives. Plant J. 40, 725733.[CrossRef][Web of Science][Medline] Rozas, J., Sanchez-DelBarrio, J.C., Messeguer, X., and Rozas, R. (2003). DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19, 24962497. Sanchez-Acebo, L. (2005). A phylogenetic study of the new world Cleome (Brassicaceae, Cleomoideae). Ann. Mo. Bot. Gard. 92, 179201. Schranz, M.E., Dobes, C., Koch, M.A., and Mitchell-Olds, T. (2005). Sexual reproduction, hybridization, apomixis, and polyploidization in the genus Boechera (Brassicaceae). Am. J. Bot. 92, 17971810. Seoighe, C., and Gehring, C. (2004). Genome duplication led to highly selective expansion of the Arabidopsis thaliana proteome. Trends Genet. 20, 461464.[CrossRef][Web of Science][Medline] Simillion, C., Vandepoele, K., Van Montagu, M.C.E., Zabeau, M., and Van de Peer, Y. (2002). The hidden duplication past of Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 99, 1362713632. Song, K.M., Lu, P., Tang, K.L., and Osborn, T.C. (1995). Rapid genome change in synthetic polyploids of Brassica and its implications for polyploid evolution. Proc. Natl. Acad. Sci. USA 92, 77197723. Sterck, L., Rombauts, S., Jansson, S., Sterky, F., Rouze, P., and Van de Peer, Y. (2005). EST data suggest that poplar is an ancient polyploid. New Phytol. 167, 165170.[CrossRef][Web of Science][Medline] Tate, J.A., Soltis, D.E., and Soltis, P.S. (2005). Polyploidy in plants. In The Evolution of the Genome, T.R. Gregory, ed (San Diego, CA: Elsevier), pp. 371426. Thompson, J.D., Higgins, D.G., and Gibson, T.J. (1994). CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 46734680. Van de Peer, Y. (2004). Computational approaches to unveiling ancient genome duplications. Nat. Rev. Genet. 5, 752763.[Web of Science][Medline] Vision, T.J., Brown, D.G., and Tanksley, S.D. (2000). The origins of genomic duplications in Arabidopsis. Science 290, 21142117. Warwick, S.I., and Al-Shehbaz, I.A. (2006). Brassicaceae: Chromosome number index and database on CD-ROM. In Evolution and Phylogeny of the Brassicaceae, M.A. Koch and K. Mummenhoff, eds (Heidelberg, Germany: Springer-Verlag), in press. Wendel, J.F. (2000). Genome evolution in polyploids. Plant Mol. Biol. 42, 225249.[CrossRef][Web of Science][Medline] Windsor, A.J., Schranz, M.E., Formanova, N., Gebauer-Jung, S., Bishop, J.G., Schnabelrauch, D., Kroymann, J., and Mitchell-Olds, T. (2006). Partial shotgun sequencing of the Boechera stricta genome reveals extensive microsynteny and promoter conservation with Arabidopsis. Plant Physiol. 140, 11691182. Wolfe, K.H. (2001). Yesterday's polyploids and the mystery of diploidization. Nat. Rev. Genet. 2, 333341.[CrossRef][Web of Science][Medline] Yang, T.J., Kim, J.S., Lim, K.B., Kwon, S.J., Kim, J.A., Jin, M., Park, J.Y., Lim, M.H., Kim, H.I., Kim, S.H., Lim, Y.P., and Park, B.S. (2005). The Korea Brassica Genome Project: A glimpse of the Brassica genome based on comparative genome analysis with Arabidopsis. Comp. Funct. Genomics 6, 138146.[CrossRef][Medline] Yang, Y.W., Lai, K.N., Tai, P.Y., and Li, W.H. (1999). Rates of nucleotide substitution in angiosperm mitochondrial DNA sequences and dates of divergence between Brassica and other angiosperm lineages. J. Mol. Evol. 48, 597604.[CrossRef][Web of Science][Medline] Yu, J., et al. (2005). The genomes of Oryza sativa: A history of duplications. PLoS Biol. 3, e38.[CrossRef][Medline] Zahn, L.M., King, H.Z., Leebens-Mack, J.H., Kim, S., Soltis, P.S., Landherr, L.L., Soltis, D.E., dePamphilis, C.W., and Ma, H. (2005). The evolution of the SEPALLATA subfamily of MADS-Box genes: A preangiosperm origin with multiple duplications throughout angiosperm history. Genetics 169, 22092223. Zhang, X.Y., and Wessler, S.R. (2004). Genome-wide comparative analysis of the transposable elements in the related species Arabidopsis thaliana and Brassica oleracea. Proc. Natl. Acad. Sci. USA 101, 55895594. Related articles in Plant Cell:
This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | THE PLANT CELL | PLANT PHYSIOLOGY | |
|---|---|---|---|