First published online January 19, 2005; 10.1105/tpc.104.025627 © 2005 American Society of Plant Biologists
Evolution of DNA Sequence Nonhomologies among Maize Inbreds
| ABSTRACT |
|---|
| INTRODUCTION |
|---|
2.3 Mb of orthologous regions from Asian rice subspecies indica and japonica (Feng et al., 2002
The biological implications of a lack of colinearity could be profound. Recombination rates that are strongly elevated within genes and reduced in retrotransposon clusters have been noted before in maize (Dooner, 1986; Fu et al., 2002). Obviously, nonshared sequences are excluded from recombination events. Fu and Dooner (2002) proposed that complementation of nonshared genes could be one of the factors contributing to heterosis, whereas Song and Messing (2003) identified unexpected differences in the expression of shared and nonshared genes in reciprocal hybrids. Therefore, analyzing the extent of genomic noncolinearities may improve our understanding of recombinational properties and heterosis in maize.
Large contiguous maize sequences have revealed a gene-island structure of 10 to 20 kb (three to four genes), interspersed with long stretches of repetitive DNA that makes up a significant portion of the genome (>80%) (Hake and Walbot, 1980; SanMiguel et al., 1996). The large size of the maize genome has been attributed to long terminal repeat (LTR)-retrotransposons, which occupy most of the nongenic space and are also present in many other plant species at high abundance levels (Flavell, 1992; Voytas et al., 1992; SanMiguel et al., 1996; Kumar and Bennetzen, 1999). In maize, LTR-retrotransposons account for >60% of the nuclear genome length (SanMiguel et al., 1998; Meyers et al., 2001; Messing et al., 2004). The retroelements are classified into numerous distinct families (Kumar and Bennetzen, 1999) and show a tendency to form nested insertions (SanMiguel et al., 1996, 1998; Wicker et al., 2003).
An approximate time for insertion of these retroelements can be estimated from the divergence of the LTR-sequences, which would be identical at the time of insertion of a particular copy of the element (SanMiguel et al., 1998, 2002; Ma and Bennetzen, 2004a). This approach can be used to follow the evolutionary history of specific genomic regions and to date the accumulation of nonshared LTR-retrotransposons in individuals of the species.
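The dating principle described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' pipeline: it assumes the two LTRs of one element are already aligned, uses the Kimura two-parameter correction as one common choice of distance, and converts the distance K to a time with T = K / (2r). The function names and the substitution rate `ASSUMED_RATE` are assumptions introduced here; the rate actually used in the article is not stated in this section.

```python
import math

# Assumed substitution rate (per site per year). The section does not state the
# rate the authors used, so this value is only a placeholder for illustration.
ASSUMED_RATE = 1.3e-8


def k2p_distance(ltr5: str, ltr3: str) -> float:
    """Kimura two-parameter distance between two aligned LTR sequences."""
    if len(ltr5) != len(ltr3):
        raise ValueError("LTR sequences must be aligned to the same length")
    purines = {"A", "G"}
    sites = transitions = transversions = 0
    for a, b in zip(ltr5.upper(), ltr3.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases (indels are not counted)
        sites += 1
        if a == b:
            continue
        if (a in purines) == (b in purines):
            transitions += 1  # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites
    q = transversions / sites
    if 1 - 2 * p - q <= 0 or 1 - 2 * q <= 0:
        raise ValueError("sequences too diverged for the K2P correction")
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def insertion_time_myr(distance: float, rate: float = ASSUMED_RATE) -> float:
    """Insertion time in Myr: both LTRs diverge independently, hence 2 * rate."""
    return distance / (2 * rate) / 1e6


if __name__ == "__main__":
    # Hypothetical 100-bp LTR pair differing by two transitions.
    five_prime = "A" * 100
    three_prime = "GG" + "A" * 98
    k = k2p_distance(five_prime, three_prime)
    print(f"K = {k:.4f}, T = {insertion_time_myr(k):.2f} Myr")
```

Because the estimate scales inversely with the rate, any uncertainty in the molecular clock carries over directly to the inferred insertion times.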
To better understand the phenomenon of noncolinearities in the maize genome, including their frequency and biological implications, we analyzed DNA sequences from several allelic genome segments in the maize inbreds Mo17 and B73.
| RESULTS |
|---|
A total of 17 maize BAC clones were sequenced from both inbreds, generating >2.3 Mb of sequence (Table 1). Numerous DNA segments that are not shared by the two inbreds were identified.
Sequence Comparison of Locus9002
Locus9002 yielded the longest sequence for the intraspecific comparison. The sum of the sequences available for the comparison between the inbreds Mo17 and B73 is 634 kb (Table 1). We identified the second highest number of retroelements among the surveyed regions (Table 1; see Supplemental Table 1 online). Six genes (geneA9002 to geneF9002) are shared between the two inbred lines (Figure 1; see Supplemental Table 2 online). Up to 15 additional genes or genic fragments (geneG9002 to geneU9002) could only be found in inbred line B73 (Figure 1; see Supplemental Table 2 online). GeneG9002 and geneH9002 match a predicted and an expressed protein of rice, respectively, whereas geneI9002 is similar to an Arabidopsis thaliana protein kinase family member, geneJ9002 is homologous to a putative rice PRLI-interacting factor, geneK9002 matches another predicted rice gene, geneL9002 is similar to a putative phosphatidylinositol-phosphatidylcholine transfer protein of rice, and geneM9002 is homologous to a putative rice AMP deaminase. The first four and the last three of these genes are clustered and all in the same orientation. The two clusters are separated from each other by the insertion of a complete rire LTR-retrotransposon, including a 5-bp target site duplication. This whole arrangement itself is inserted next to indy, a new type of LTR-retrotransposon (see below). Within this indy retroelement, four additional partial genes are present. All are clustered and in the same orientation: geneN9002 partially matches the 40S ribosomal protein S8 of maize, geneO9002 and geneP9002 have some homology to unknown proteins of rice, and geneQ9002 is homologous to a putative rice cytosolic monodehydroascorbate reductase. Another insertion of three nonshared genes has been found in the close vicinity.
These three genes are also clustered and in the same orientation and show some homology over a part of the sequence to known genes: geneR9002 is similar to a putative rice hairpin inducing protein, geneS9002 is homologous to the rice origin recognition complex subunit 1, and geneT9002 partially matches a maize Lys-ketoglutarate reductase/saccharopine dehydrogenase bifunctional enzyme. Interestingly, transposon 5, which is present only in inbred B73 and shows homology to a DOPA-like transposon, has inserted into the second intron of geneT9002. Nonshared geneU9002 with homology to a rice hypothetical protein is inserted upstream of LTR-retrotransposon jaws (see below).
Both inbred lines share transposon 2, which is located upstream of geneC9002 and shows nucleotide similarity with transposons 1 and 3 (87.7 and 84.6%, respectively) (Figure 1). These two transposons are orientated in the same direction and have inserted into transposon 2 downstream of the shared geneC9002 in Mo17 (Figure 1). Transposon 1 was classified as a CACTA transposon because of the CACTA motif at the end of the inverted repeat. No such motif was found in the inverted repeat of transposons 2 and 3. The inverted repeats of transposons 1 and 3 are less similar than their internal parts, which show 94% homology at the nucleotide level. Furthermore, transposon 4, which is located downstream of geneC9002 and only present in B73, was identified because of homology to an En/Spm-like transposon protein from Arabidopsis (Figure 1).
Sequence Comparison of Locus9008
Locus9008 yielded a slightly smaller sequence for the intraspecific comparison than the other two loci. The sum of the sequence available for comparison between the inbreds Mo17 and B73 is 586 kb (Table 1). Seven genes (geneA9008 to geneG9008) are shared between Mo17 and B73 in this region (Figure 2; see Supplemental Table 2 online). Unlike in locus9002, nongenic regions represent most of the differences here. Two genes, geneH9008 and geneI9008, clustered in the same orientation upstream of the prem1y1 retrotransposon (labeled with k in Figure 2) in inbred B73, are not found in Mo17 (Figure 2). GeneH9008 is similar to the rice MADS31 transcription factor, and geneI9008 shows homology to a putative rice phosphoinositide phosphatase (see Supplemental Table 2 online).
Sequence Comparison of Locus9009
Locus9009 yielded the second largest sequence for the intraspecific comparison. The sum of the sequence available for comparison between the inbreds Mo17 and B73 is 631 kb. This region is relatively gene rich (Table 1). Seventeen annotated genes (geneA9009 to geneQ9009) are shared between the two inbreds (Figure 3; see Supplemental Table 2 online). Most of them have a nearly full-length match to a known or predicted gene.
The 18 complete LTR-retrotransposons with target site duplications at locus9009 were classified into five known families, with ji, opie, and huck being the most common ones (Table 2; see Supplemental Table 1 online). Eight LTR-retrotransposons are shared (Figure 3). The insertion times of five of these shared elements are estimated to be in excess of 1 Myr (see Supplemental Table 3 online). Ten LTR-retrotransposons are found to be nonshared (Figure 3). Eight of them showed insertion time points of 0.55 Myr or less (see Supplemental Table 4 online). The two other retroelements (miltz1 and jiz3, labeled with l and m in Figure 3) are much older and show nested insertion. It is therefore likely that these two elements were not present in the ancestor of Mo17 that contributed that particular region of chromosome 7. The inserted retroelements huckz1 and miltz1 (labeled with i and l in Figure 3) are flanked by nonshared DNA of unknown origin. A block of shared sequence of 105 kb, containing eight genes (geneD9009 to geneK9009), uninterrupted by any allelic noncolinearity, is located in the middle of the analyzed genomic segment (Figure 3). Several inbred-specific retroelements as well as the Mo17-specific genes R9009 to W9009 are in the immediate vicinity.
Sequence Comparison of the adh1 Locus
Seven genes are shared at the adh1 locus, making it relatively gene rich (18.5 kb/gene) (Figure 4, Table 1). The only differences between inbreds are attributable to four complete nonshared LTR-retrotransposons (one in Mo17 and three in B73), of which two are nested (Figure 4). These nonshared elements make up 21% of the analyzed sequence (Figure 5; see Supplemental Table 5 online). Copia-type retrotransposons are the major part of the repetitive fraction (Table 2). All LTR-retrotransposons at the adh1 locus are of recent origin (<1.16 Myr) (see Supplemental Tables 3 and 4 online).
Nonshared Genes Break the Colinearity with Rice
The rice orthologs for all the shared and nonshared genes at loci 9002, 9008, and 9009 were identified in the rice genome by TBLASTN analysis. Rice orthologs of 23 of the 30 (77%) shared maize genes have been assigned to the colinear rice regions (Figure 6; see Supplemental Table 2 online). Exceptions are geneC9008 with no identified rice ortholog and geneD9002 and genes D, F, H, L, and N from locus9009, whose rice orthologs are all located on different rice chromosomes (see Supplemental Table 2 online). The intergenic distance between colinear genes is larger in maize than in rice. Colinear regions in locus9002 and locus9009 are moderately larger in the two maize inbreds than in rice (more than two times the rice sequence), whereas locus9008 is up to six times larger in maize than in rice (Figure 6). Several rice orthologs were identified for most of the nonshared genes, but none of them mapped to the colinear chromosomal region in rice (Figure 6; see Supplemental Table 2 online). Their independent insertion in maize or deletion from rice might explain this interspecific difference.
79 kb of the Mo17 allele present on BAC clone b106.c20. The Mo17 sequence is different from both previously sequenced alleles. It shares sequence similarity with B73 in all regions annotated as genes and also lacks entirely three of the four genes, which were reported as missing in B73 compared with McC (Fu and Dooner, 2002
0.94 ± 0.24 Myr in both inbreds (see Supplemental Table 3 online). The time of insertion of all other nonshared retroelements range from 1.15 ± 0.20 to 0.29 ± 0.07 Myr (see Supplemental Table 4 online).
Sequence divergence data comparing the exons (synonymous substitutions) and introns (all substitutions but no indels) of each shared gene among the three haplotypes reveal a closer relation between B73 and Mo17 only for genes rpl35A, tac6058, and hypro1 (see Supplemental Figure 1 and Table 7 online). In all other genes, McC is either closer to Mo17 (genes bz1, stc, tac7077, and uce2) or to B73 (genes stk and znf). In general, divergence estimates obtained from genes do not agree well with the divergence times we estimated from the retroelement insertions. No rice/maize colinearity was observed for the 13 genes at the maize bz1 locus because the orthologs identified by TBLASTN against the complete genomic sequence of rice are present at various locations on different rice chromosomes (see Supplemental Table 8 online).
Nonshared Sequences Are Gene-Poor and Consist of Clusters of Truncated Genes and Recently Inserted or Incomplete Repetitive Sequences
The sequencing of loci 9002, 9008, 9009, and adh1 resulted in >2.1 Mb of sequence for comparative analysis (Table 1). The sequence shared between the two inbreds at each locus was counted only once in the calculation of the ratios described in Figure 5 and Supplemental Table 5 online. The genic fraction is 9.1% when averaged over the four loci (Figure 5; see Supplemental Table 5 online). The gene density within these four regions ranges from 13 to 56 kb/gene (average of 22 kb/gene) (Table 1), which extrapolates to an estimate of 113,000 genes for the whole maize genome. This high estimate may be explained by a bias toward gene-rich regions because of the selection of segments with a high density of overgo probes, a selection that is necessary to establish allelic relationships. The majority of the sequence space is made up of retroelements or noncharacterized sequences (Figure 5; see Supplemental Table 5 online). Almost half of the total sequence analyzed is nonshared, but there are large differences in the amount of inbred-specific sequences among the loci (Figure 5; see Supplemental Table 5 online). The nonshared fraction makes up only one-fifth of the adh1 locus and one-third of the gene-rich and highly homologous locus9009, but half of locus9008 and more than two-thirds of locus9002. Thus, the sequence composition data confirm the initial high information content fingerprinting (HICF) data.
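The genome-wide extrapolation is simple arithmetic: total genome length divided by the average spacing per gene. The sketch below makes that step explicit. The genome size of 2,500 Mb is an assumption introduced here (it is not stated in this section); it was chosen because it approximately reproduces the quoted estimate. The function name is hypothetical.

```python
ASSUMED_GENOME_SIZE_MB = 2500  # assumption, not given in this section


def extrapolated_gene_count(genome_size_mb: float, kb_per_gene: float) -> float:
    """Number of genes if the whole genome had the sampled gene density."""
    return genome_size_mb * 1000 / kb_per_gene


if __name__ == "__main__":
    # Average density and the two extremes reported for the four loci.
    for density in (22, 13, 56):
        genes = extrapolated_gene_count(ASSUMED_GENOME_SIZE_MB, density)
        print(f"{density} kb/gene -> about {genes:,.0f} genes")
```

Running the same calculation at the two extremes of the observed density range shows how strongly the estimate depends on which loci were sampled, which is consistent with the sampling bias noted in the text.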
The nonshared sequences are, on average, more than sevenfold lower in gene content than the shared sequences (loci 9002, 9008, 9009, and adh1: 2.1% versus 15.8%), but this is locus dependent (Figure 5; see Supplemental Table 5 online). In total, 59 genes have been identified at the four loci of which more than one-third (23) are nonshared (Table 1). Nonshared genes are truncated, and the homology of the translated protein products is limited only to N-terminal, C-terminal, or central portions of protein entries from GenBank (see Supplemental Table 2 online). Furthermore, nonshared gene PCR products were amplified from both inbred lines in high-stringency conditions, suggesting that they also may be present elsewhere in the inbred that lacks the sequences in the particular region being studied. However, it is not possible to ascertain if those are, in fact, the closest homologs without further experimentation.
Interestingly, 26 of the 27 nonshared genes (including also the four nonshared genes at the bz1 locus; Fu and Dooner, 2002) are present in seven clusters of 1.8 to 7.6 kb. Statistical tests of the distribution of distances between shared versus nonshared genes identified a denser gene arrangement for nonshared genes (Kolmogorov-Smirnov test, P = 0.001, D = 0.5093; permutation test, P = 8.8 × 10^-4) (see Supplemental Figure 2 online). Surprisingly, most of the nonshared genes have the same orientation within clusters. Homologous maize ESTs (cutoff score value of 120) and/or expressed maize massively parallel signature sequencing (MPSS) tags (cutoff value of 2 ppm) were identified for 89% of the nonshared compared with 96% of the shared genes (see Supplemental Table 9 online). For three nonshared genes, neither maize ESTs nor expressed maize MPSS tags were identified. ESTs matching nonshared genes showed homology only over a part of the sequence. Therefore, these ESTs more likely represent transcripts from expressed homologs than from the nonshared genes themselves.
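A permutation test of the kind mentioned above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the test statistic (difference in mean intergenic distance), the one-sided direction, the function name, and the distance values are all hypothetical choices made here, since the section does not give the underlying data or the statistic used.

```python
import random


def permutation_test(shared, nonshared, n_perm=10000, seed=0):
    """One-sided permutation P value for 'nonshared distances are smaller'.

    The statistic is mean(shared) - mean(nonshared); group labels are
    reshuffled n_perm times to build the null distribution.
    """
    def mean(values):
        return sum(values) / len(values)

    observed = mean(shared) - mean(nonshared)
    pooled = list(shared) + list(nonshared)
    n_shared = len(shared)
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = mean(pooled[:n_shared]) - mean(pooled[n_shared:])
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Hypothetical distances between neighboring genes, in kb.
    shared_kb = [20.0, 35.0, 50.0, 18.0, 42.0, 60.0, 27.0, 33.0]
    nonshared_kb = [1.2, 0.8, 2.5, 1.9, 0.6, 3.1, 1.1, 2.2]
    print(permutation_test(shared_kb, nonshared_kb, n_perm=2000))
```

The add-one correction means the smallest reportable P value is 1 / (n_perm + 1), so the number of permutations sets the resolution of the test.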
At least 97% of the inbred-specific fraction is composed of repetitive or noncharacterized elements (see Supplemental Table 5 online). We identified 62 nonshared LTR-retrotransposons, including also those from the bz1 locus (Table 2). There are 33 copia-type retroelements, represented by 19 ji, 10 opie, and four prem elements. Twenty-nine elements are of gypsy or unknown type and belong to a variety of families, none of which are present in more than three copies.
No particular repetitive element is unique to the shared fraction, but copia types are more numerous than gypsy ones (17 and 10, respectively), with seven ji and eight opie elements (Table 2). Contingency χ² analysis did not detect any significant difference in the distribution of copia versus gypsy elements among shared and nonshared elements (P = 0.90) nor of the three most abundant families (ji, opie, and huck) (P = 0.43).
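For readers unfamiliar with contingency analysis, the sketch below shows how a Pearson chi-square test on a 2 × 2 table is computed. It is only an illustration of the technique: the counts are taken from the prose, which lumps gypsy with unknown-type elements in the nonshared set, so this is not the table the authors tested and its P value should not be expected to reproduce the reported 0.90. The function name is hypothetical, and no continuity correction is applied.

```python
import math


def chi_square_2x2(table):
    """Pearson chi-square statistic and P value (1 df) for a 2 x 2 table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


if __name__ == "__main__":
    # Rows: nonshared, shared. Columns: copia, other (gypsy or unknown type).
    # Counts quoted in the prose; the grouping of the second column is an
    # assumption made for this illustration.
    counts = [[33, 29], [17, 10]]
    chi2, p = chi_square_2x2(counts)
    print(f"chi2 = {chi2:.3f}, P = {p:.3f}")
```

A P value well above 0.05 means the table gives no evidence that element type and shared status are associated, which is the qualitative conclusion drawn in the text.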
Forty-three of the nonshared LTR-retroelements have inserted within the last 1 Myr, and 27 of these are younger than 0.5 Myr (Figure 8; see Supplemental Table 4 online). The majority (>75%) of the shared LTR-retroelements have inserted within the last 2 Myr (Figure 8; see Supplemental Table 3 online). The distribution of insertion time points of the shared and nonshared LTR-retrotransposons is different (Kolmogorov-Smirnov test, P = 0.027, D = 0.328; nonshared retroelements, mean = 0.91, median = 0.61; shared retroelements, mean = 1.34, median = 1.16). Thus, nonshared retrotransposons are significantly more recent than the shared ones.
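The two-sample Kolmogorov-Smirnov comparison used above can be sketched in plain Python. The statistic D is the largest gap between the two empirical cumulative distributions; the P value below uses a standard asymptotic series approximation, which may differ slightly from whatever software the authors used. Function names are hypothetical, and the insertion times in the usage example are invented for illustration.

```python
import math
from bisect import bisect_right


def ks_statistic(x, y):
    """Maximum distance D between the empirical CDFs of two samples."""
    xs, ys = sorted(x), sorted(y)
    d = 0.0
    for value in xs + ys:
        fx = bisect_right(xs, value) / len(xs)
        fy = bisect_right(ys, value) / len(ys)
        d = max(d, abs(fx - fy))
    return d


def ks_p_value(d, n, m):
    """Asymptotic two-sided P value for D with sample sizes n and m."""
    en = math.sqrt(n * m / (n + m))
    lam = (en + 0.12 + 0.11 / en) * d
    if lam < 0.3:
        return 1.0  # series converges poorly here; the P value is ~1 anyway
    total = sum((-1) ** (k - 1) * math.exp(-2 * k * k * lam * lam)
                for k in range(1, 101))
    return min(max(2 * total, 0.0), 1.0)


if __name__ == "__main__":
    # Hypothetical insertion times in Myr.
    nonshared = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.9, 1.1, 1.5, 3.2]
    shared = [0.8, 1.0, 1.2, 1.3, 1.6, 1.9, 2.4]
    d = ks_statistic(nonshared, shared)
    print(f"D = {d:.3f}, P = {ks_p_value(d, len(nonshared), len(shared)):.3f}")
    # Sanity check against the reported D, assuming 62 nonshared and 27 shared
    # elements (totals quoted in the text; whether all were dated is an assumption).
    print(f"P for D = 0.328: {ks_p_value(0.328, 62, 27):.3f}")
```

Under those assumed sample sizes, the approximation gives a P value close to the one reported for D = 0.328, which suggests the reported figures are internally consistent.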
| DISCUSSION |
|---|
Most of the nonshared sequences consist of LTR-retrotransposons (Table 2) and other mobile elements. The majority of the identified LTR-retrotransposons belong to the ji, opie, and huck types, which have been reported to be the three most abundant retroelements in the maize genome (Meyers et al., 2001).
The differences in LTR-retrotransposon content between lines could have arisen by retrotransposition, leading to insertions, or by recombinational events that would lead to deletions (Devos et al., 2002). Homologous recombination events would mainly produce solo LTRs, whereas nonhomologous events would produce incomplete elements. Homologous unequal and nonhomologous illegitimate recombination events that counteract genome expansion caused by retroelement insertions have been reported in Arabidopsis (Devos et al., 2002), rice (Bennetzen et al., 2005; Ma et al., 2004), and other plant species (SanMiguel et al., 1996; Shirasu et al., 2000; Wicker et al., 2003). The differences that we observed between lines usually encompass entire elements and carry target site duplications and thus appear to be attributable to insertions rather than to deletions. Deletions caused by recombinational events should be more likely in older elements than in younger ones, as has been observed in Arabidopsis and rice (Devos et al., 2002; Vitte and Panaud, 2003; Ma and Bennetzen, 2004b; Ma et al., 2004). We observed only one product of a putative homologous recombination event, which affected an older element (jix3 at locus9002).
Most variation in plant genome size is caused by differences in the amounts of repetitive DNA (SanMiguel et al., 1996; Tikhonov et al., 1999; Vicient et al., 2001; Wicker et al., 2001; Ma et al., 2004). Seventy-four percent of sequence differences between sorghum (Sorghum bicolor) and maize are estimated to be a result of the accumulation of retrotransposons since their divergence (Tikhonov et al., 1999). The large majority of the nonshared sequence in maize is also either repetitive or consists of uncharacterized nongenic sequences. Thus, repetitive sequences also accumulate within individuals of the same species, generating large nonshared sequence differences. The amount of nonhomology between some maize alleles is similar to that reported between maize and other grass species (Tikhonov et al., 1999; Bennetzen and Ma, 2003).
Our analysis of MITEs suggests that they preferentially insert into repetitive sequences. Other data indicate that MITEs are common both in the repetitive sequences (Cheng and Lin, 2004) and in the noncoding regions of grass genes (Bureau and Wessler, 1992, 1994a, 1994b; Bureau et al., 1996). In 12 cases, MITEs are present in both LTRs of a retroelement, which means that the MITE was present before the replication of the host element. A single case of a MITE insertion into only one LTR of a nonshared retroelement of recent origin suggests more recent MITE activity in maize, as was observed in rice (Jiang et al., 2003; Kikuchi et al., 2003; Nakazaki et al., 2003). Because only two nonshared MITEs (M1 and M2) are individual insertion events, and not part of a larger nonshared segment, their direct contribution to the evolution of the nonshared fraction is small.
Nonshared Genes in Maize Alleles
A total of 23 putative genes was identified in the nonshared fraction at loci 9002, 9008, 9009, and adh1 (Table 1; see Supplemental Table 2 online). In contrast with the z1C-1 locus (Song and Messing, 2003), where half of the nonshared sequence is genic because of segmental duplications affecting the number of zein gene copies in each haplotype, we did not find any gene duplication associated with the nonhomologous sequences, but rather a much lower gene density in nonshared segments than in shared ones. The loci we analyzed are therefore closer to the bz1 locus mode of evolution, where four genes make up just 5% of the nonshared sequence (Fu and Dooner, 2002).
In most cases, the nonshared genes are clustered and oriented in the same direction within the clusters. The clustering of nonshared, relative to shared, genes is highly statistically significant. They are all truncated, and their homology to known genes or ESTs is only over a part of the sequence, suggesting that they may be pseudogenes. Such a pattern was observed earlier in maize (Meyers et al., 2001; Ramakrishna et al., 2002) and in an intraspecific comparison in rice (Feng et al., 2002; Han and Xue, 2003) and in other plants (Parniske et al., 1997; Noel et al., 1999; Holub, 2001), where genes were postulated to have arisen from multiple illegitimate and complex break repair events or from retroelements or nonfunctional hypothetical genes. These observations together suggest that clustering may be common among nonshared genes in maize.
It has been postulated that novel genes arise after a gene or genome duplication (Lewis, 1951). As a result of an ancient tetraploidization event (Gaut and Doebley, 1997; Langham et al., 2004), the maize genome contains duplicated chromosomal segments with colinear gene arrangements (Gaut, 2001; Ilic et al., 2003; Lai et al., 2004). Some of the differences between inbred lines may be attributable to the loss of genes in homeologous regions in one inbred lineage that were retained in the other (Ilic et al., 2003; Lai et al., 2004). A detailed analysis of larger homeologous segments could clarify this hypothesis. Fu and Dooner (2002) showed that the genes not shared by McC and B73 are present elsewhere in the maize genome and suggested that they may have arisen by deletion. However, no deletion mechanism, such as intrachromosomal recombination between the 5' and 3' LTRs of neighboring LTR-retrotransposons or unequal crossing over between related retrotransposon sequences, was described. The available evidence does not support a deletion hypothesis, even for the genes at the bz1 locus. Preliminary PCR results suggest that all genes that are not shared between Mo17 and B73 at the investigated loci are polymorphic and are present elsewhere in the maize genome. Assuming that rice is a representation of the ancestral condition, the consistent lack of colinearity of the nonshared genes and colinearity of the shared ones is best explained by insertion events that occurred in maize after its divergence from rice. This is also supported by the finding that the nonshared genes are incomplete because two successive deletion events would otherwise need to be invoked: the first one partially deleting the gene and the second one erasing the remains of it in one lineage.
Because nonshared genes preserve a normal intron-exon structure, it is unlikely that they are integrated processed pseudogenes. Insertions of multiple nonshared genes can be explained by the activity of retroelements or transposons. This kind of gene trafficking across the genome is well documented in vertebrates and to some extent in plants (Talbert and Chandler, 1988; Bureau et al., 1994; Jin and Bennetzen, 1994; Palmgren, 1994; Martinez-Izquierdo et al., 1997; Le et al., 2000; Pickeral et al., 2000; Elrouby and Bureau, 2001). Interestingly, the four nonshared genes N, O, P, and Q at locus9002 are located within LTR-retrotransposon huckx39002. It is unknown if these four nonshared genes were already part of the retroelement before its insertion or if they have inserted later. The recently uncovered large number of Pack-MULEs, carrying fragments of single or multiple cellular genes, represents a new mechanism for the evolution of genes in higher plants (Jiang et al., 2004). Our data do not indicate the involvement of Pack-MULEs in the insertion of nonshared genes in maize because no Mutator-like sequences were found flanking the nonshared gene clusters.
Nonshared genes are incomplete, and it is unknown if they encompass a promoter, but their expression could still be induced and modulated by promoter elements from neighboring repetitive elements (Kumar and Bennetzen, 1999; Speek, 2001; Vicient et al., 2001; Dunn et al., 2003; Kashkush et al., 2003; Schramke and Allshire, 2003). Although nonshared genes showed homology to many maize ESTs, the similarity was always restricted to a short segment of the EST sequence, implying that the EST was derived from a different, functional, and presumably full-length copy of the gene. The identified maize MPSS tags did not help to distinguish between the expression of nonshared genes and their homologs. The hypothesis that adjacent genic insertions may give rise to novel gene products (Lander et al., 2001; Jiang et al., 2004) could not be confirmed because no EST derived from a transcript across clustered nonshared genes was present in any database.
Analysis of the Three Alleles at the bz1 Locus
Three allelic sequences around the bz1 locus are available: B73, McC (Fu and Dooner, 2002), and Mo17, reported here. As suggested earlier on the basis of DNA gel blot evidence, the three alleles are distinct (Fu and Dooner, 2002). Inferences on the relationship among the three sequences are complicated by the fact that different segments within the same region may evolve differently. The shared genic regions may be differently affected by recombination than nonshared intergenic regions, even assuming mutational rate homogeneity throughout the region. The Mo17 and B73 haplotypes share the grande LTR-retrotransposon and lack the same four genes (cdl1, hypro2, hypro3, and rlk), but they differ by the presence of an internal portion of a Hopscotch element, shared between Mo17 and McC. The divergence of the sequences specific to each of the three haplotypes might be inferred from the LTRs of retrotransposons and from the sequence diversity between alleles of the nine shared genes. Divergence estimates obtained from genes do not agree well with those from the retroelement insertions. This may be explained by differences in recombination rates between retrotransposons and genic sequences. Randomization of genic sequences by recombination could be involved, if recombination within the region is largely restricted to shared genes and intergenic regions. The dating of the retrotransposon insertions relies on a fast molecular clock (Ma and Bennetzen, 2004a). Although the molecular clock used to date the insertion of LTR-retrotransposons is at least twofold faster than synonymous base substitutions within grass genes (Gaut et al., 1996; SanMiguel et al., 1998), the actual molecular drift rate may be even higher (Ma and Bennetzen, 2004a). The nonshared sequences are common in the vicinity of bz1 and are expected to be associated with a reduced recombination rate in inverse proportion to their allelic frequency. By contrast, the recombination rate of shared genes may not be suppressed. For example, bz1 has the highest intragenic recombination rate of any maize gene measured to date (Dooner, 1986). A high recombination rate has also been reported for the genes on the distal side of bz1, whereas it is reduced in retrotransposon clusters, even if they are shared (Fu et al., 2002).
Fu and Dooner (2002) suggested on the basis of hybridization data with rlk-, tac7077-, and bz1-specific probes that the rlk gene is also present at the bz1 locus in Mo17. However, we were unable to identify the rlk gene on the corresponding genomic sequence of Mo17.
Origin of Allelic Nonhomologies
The frequent persistence of major allelic nonhomologies in maize indicates that new allelic variants either are of recent origin or are constantly created, or that balancing selection leads to the maintenance of variants, which would otherwise be fixed or eliminated (Aguilar et al., 2004), or that effective population size (Ne) is sufficiently large. The persistence time of a newly inserted sequence within an allele may be predicted by coalescent theory and is proportional to 2N, where N is the effective population size (Nordborg, 2001). Persistence times on the order of several million years are to be expected in maize (Eyre-Walker et al., 1998; Remington et al., 2001; Vigouroux et al., 2002).
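The scaling with 2N can be made concrete with a short calculation. This sketch assumes a proportionality constant of 1 and one generation per year, and the effective population sizes are hypothetical values chosen only to show the order of magnitude; none of these numbers come from the article or its cited sources. The function name is likewise hypothetical.

```python
def persistence_time_years(effective_population_size: float,
                           generation_time_years: float = 1.0) -> float:
    """Time scale of 2N generations, converted to years.

    Assumes the proportionality constant is 1; the true constant depends on
    which persistence quantity is being modeled.
    """
    return 2 * effective_population_size * generation_time_years


if __name__ == "__main__":
    # Hypothetical effective population sizes, for illustration only.
    for ne in (1e5, 5e5, 1e6):
        myr = persistence_time_years(ne) / 1e6
        print(f"Ne = {ne:.0e} -> about {myr:.1f} Myr")
```

Under these assumptions, persistence times of several million years would require an effective population size on the order of a million, which illustrates why a large Ne is listed among the possible explanations.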
By contrast, in genomes lacking major allelic nonhomologies, retroelement insertions have either occurred in the distant past or are occurring at low frequency so that they are very unlikely to be polymorphic in the population at any given time. Alternative hypotheses, such as hybridization between subspecies or populations that have been subject to a long period of reproductive isolation, would also explain the presence of allelic nonhomologies. Recent phylogenetic data, which postulate a single maize domestication, provide only modest evidence for an increase in diversity by postdomestication gene flow from teosinte into maize (Matsuoka et al., 2002). Therefore, introgressions fail to explain the large amount of observed nonhomologies between the two maize inbreds.
Our data are consistent with the hypothesis of an expanding maize genome, primarily because of the large accumulation of LTR-retrotransposons, which is counteracted by a low frequency of predicted homologous recombination events between LTRs (SanMiguel et al., 1996, 1998; Meyers et al., 2001; Bennetzen, 2002; Messing et al., 2004), as well as with an interpopulation hybridization origin hypothesis, where the present allelic composition could arise from a cross between ancestors that have evolved separately for quite some time (during which time retrotransposon amplification occurred independently in the two lineages) before the cross. The nonshared LTR-retroelements contribute to maize genome expansion, and these elements are of more recent origin than the shared fraction. A few older (>3 Myr) nonshared LTR-retroelements may have originated from a deletion in one of the lineages. Insertion age distribution differences between the shared and nonshared retroelement sets were observed even though only two maize inbreds were sampled. Whereas the nonshared set is unambiguously identified even with two inbreds, the shared set may comprise elements that are absent in other maize inbreds. The observation of a statistically significant difference in the insertion age distribution, despite this uncertainty in assignment to the shared class, indicates that the B73 and Mo17 genomes represent well the elements that are either universally shared (fixed) or close to fixation.
In contrast with barley (Hordeum vulgare) retroelement BARE-1 (Vicient et al., 2001) and rice retroelement Tos17 (Yamazaki et al., 2001; Miyao et al., 2003), no actively transposing maize LTR-retroelements have been described. Retroelement-derived transcripts in maize correspond to low copy types and not to the high copy ones, such as ji, opie, and huck (Meyers et al., 2001). Because the nonshared repetitive fraction consists mainly of high copy number LTR-retroelements, these elements might have transposed more efficiently in the not too distant past.
Biological Implications of the Intraspecific Noncolinearity
Complementation of haplotypes carrying different nonshared genes could contribute to the phenomenon of heterosis (Fu and Dooner, 2002). Although nonshared genic sequences appear to be nonfunctional, they could act through mechanisms similar to transgene cosuppression, siRNA-mediated gene silencing of homologous sequences (Hamilton and Baulcombe, 1999; Hannon, 2002), or interactions with functional proteins forming multimers and causing distinct phenotypic effects (Tsuchisaka and Theologis, 2004).
We advance an alternative hypothesis for the role of nonshared sequences in heterosis, focusing on the differences in the repetitive fraction rather than in the genes. In several instances, conserved and active genes in the two inbreds are flanked by different DNA, for example, by nonconserved retrotransposons inserted nearby (geneB9002, geneC9002, geneD9002, and geneF9002, Figure 1; geneC9008, geneD9008, geneE9008, and geneF9009, Figure 2; geneB9009, geneC9009, geneD9009, geneL9009, and geneO9009, Figure 3). Such retroelements usually are inactive but can be induced by various stresses (Kuff and Lueders, 1988; Pouteau et al., 1991; Hirochika et al., 1996) and may affect the expression of neighboring genes by producing single, chimeric, or antisense transcripts or by acting as enhancers (Medstrand et al., 2001; Speek, 2001; Whitelaw and Martin, 2001; Llave et al., 2002; Nigumann et al., 2002; Dunn et al., 2003; Kashkush et al., 2003; Schramke and Allshire, 2003). It is therefore likely that different repetitive sequence environments affect tissue specificity or temporal regulation of expression. Such differences have been proposed to be the cause of heterotic complementation (Birchler et al., 2003; Song and Messing, 2003) and are comparable to allelic interactions proposed by the overdominance theory explaining hybrid vigor (Crow, 1948; Song and Messing, 2003). Furthermore, shared gene expression might also be altered by a different chromatin state related to the presence of nonshared repetitive sequences nearby (Mette et al., 2002; Plasterk, 2002; Dawe, 2003; Schramke and Allshire, 2003).
Noncolinear regions of the genome cannot engage in homologous recombination, except in a cross with an identical allele. Therefore, the distribution of crossover points will show strong dependence on the specific combination of alleles. Although shared retrotransposon clusters have a reduced recombination rate (Fu et al., 2002), nonshared retrotransposons may contribute significantly to the low recombination rate of the retrotransposon fraction in maize (Yao et al., 2002). Thus, a desired combination of alleles may not be achievable in certain crosses. Genetic-to-physical distance ratios will show extreme local differences between crosses, when examined in sufficient detail. Nonshared sequences will also affect map-based cloning projects, where nonshared genes cannot be cloned from certain BAC libraries.
The effective population size of sequences represented in a fraction of individuals of a population will be different from the value for a genome segment represented in all individuals. Different segments of the genome will therefore behave as if they belonged to different populations, with respect to rate of decay of linkage disequilibrium and other population-dependent parameters. Even if only a fraction of the nonshared genes have measurable biological effects, the implications for maize genetics, breeding, and genome sequencing are enormous.
| METHODS |
|---|
The Mo17 BAC clone b106.c20, which is allelic to the previously sequenced bz1 alleles from maize inbreds B73 and McC (AF391808 and AF448416) (Fu and Dooner, 2002), was identified by hybridizing filters of the Mo17 BAC library with a bz1-specific probe (E. Ananiev, personal communication).
Similarly, the Mo17 BAC clone b161.k19, which is allelic to the published adh1 locus from B73 (AF123535) (Tikhonov et al., 1999), was identified and sequenced as described (Jung et al., 2004).
Sequencing and Assembling of Maize BAC Clones
All BAC clones were sequenced by the shotgun strategy as described (Tarchini et al., 2000). The sequence reads were assembled using Phred and Phrap software (http://www.phrap.org/) (Green, 1996), and the assemblies were viewed and edited in Consed (http://www.phrap.org/consed/consed.html). Vector sequences and bacterial contaminants were masked, and clone-mate information was used to make assessments regarding the validity of the assemblies with the assistance of the program exgap (http://www.genome.ou.edu/informatics.html).
PCR primers were designed to walk across the sequence gaps by extracting the nonrepetitive ends of the relevant contig sequences and importing them together into the Primer 3.0 program (Rozen and Skaletsky, 2000). The following conditions were used in the selection of primers: the smallest allowable product size, primer size of
18 bases, annealing temperature of 55°C, ideal GC of 50%, no more than three consecutive identical nucleotides, and a two-base GC clamp. T3 (5'-AATTAACCCTCACTAAAGGG-3') and T7 (5'-GTAATACGACTCACTATAGGGC-3') tags were added to the 5' ends of the forward and reverse primers, respectively, to facilitate direct sequencing of the PCR products. PCR was performed using a Perkin-Elmer 9700 thermocycler under the following conditions: 95°C for 10 min; 10 cycles of 95°C for 1 min, 55°C for 1 min, and 72°C for 1 min; 35 cycles of 95°C for 30 s and 68°C for 1 min; 92°C for 7 min; and then a constant temperature of 4°C. The 25-µL PCR reaction mix consisted of 2 µL of BAC culture diluted 1:1 with 50% glycerol, 10 mM of each primer, 5% DMSO, 12.5 µL of Hot Star Taq Master Mix (Qiagen, Valencia, CA), and sterile water. PCR products (4 µL) were analyzed via agarose gel electrophoresis. PCR products were prepared for sequencing using exonuclease-I and shrimp alkaline phosphatase (USB, Cleveland, OH) and sequenced directly from both the T3 and T7 primers using an ABI 3700 sequencer (PE-Applied Biosystems) and the BigDye Terminator v3.0 cycle sequencing kit (PE-Applied Biosystems).
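The primer constraints and sequencing tags listed above can be restated as a short check. This is a sketch only: the primers in the study were selected with Primer 3.0, and the code below does not reproduce that program or call any of its interfaces. The function names are hypothetical, the reading of "primer size of 18 bases" as a minimum length is an assumption, and the two-base GC clamp is assumed to mean that the last two bases at the 3' end are G or C. The 50% GC value is treated as a target to report rather than as a hard filter.

```python
# Tag sequences as given in the text (5' to 3').
T3_TAG = "AATTAACCCTCACTAAAGGG"
T7_TAG = "GTAATACGACTCACTATAGGGC"


def gc_fraction(seq):
    """Fraction of G and C bases in a primer."""
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def longest_run(seq):
    """Length of the longest stretch of one repeated nucleotide."""
    best = run = 0
    previous = None
    for base in seq.upper():
        run = run + 1 if base == previous else 1
        best = max(best, run)
        previous = base
    return best


def passes_filters(primer, min_len=18, max_run=3, clamp=2):
    """Check length, homopolymer run, and 3' GC clamp.

    min_len=18 assumes the stated size is a lower bound (assumption).
    """
    primer = primer.upper()
    has_clamp = all(base in "GC" for base in primer[-clamp:])
    return len(primer) >= min_len and longest_run(primer) <= max_run and has_clamp


def tag_primers(forward, reverse):
    """Prepend T3 to the forward primer and T7 to the reverse primer (5' ends)."""
    return T3_TAG + forward, T7_TAG + reverse


if __name__ == "__main__":
    candidate = "ACGTACGTACGTACGTGC"  # hypothetical primer, not from the study
    print(passes_filters(candidate), round(gc_fraction(candidate), 2))
    print(tag_primers(candidate, candidate)[0])
```

The filters are applied to the locus-specific part of each primer before tagging, because the T3 and T7 tags are fixed sequences that are not subject to the design constraints.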
Subcontigs robustly connected by clone mates were merged manually where the sequencing failed. Merged sequences were further confirmed by PCR on genomic DNA.
Annotation and Comparative Sequence Analysis
A maize-trained version of the program FGENESH (Softberry, Mount Kisco, NY) and RepeatMasker (A.F.A. Smit and P. Green, http://ftp.genome.washington.edu/RM/RepeatMasker.html), the latter with release 4.0 of The Institute for Genomic Research maize repeat database (http://www.tigr.org/tdb/tgi/maize/repeat_db.shtml), were used for gene prediction and masking of repetitive elements, respectively. Gene annotation was based on BLAST (E < 10⁻⁷) and BLAT (minimal sequence identity of 80%) analysis against the GenBank and the DuPont maize EST databases, respectively. Predicted genes that still showed homology to any repetitive element from these databases were added to the repetitive fraction of the sequences. MITEs were identified using the program FINDMITE (Tu, 2001) with the following parameter settings: target site duplication, TA, TAA, TAC, TGA, TTA, or TCA; length of tandem inverted repeat, 11 bp; number of mismatches, 1; minimum distance, 30 bp; maximum distance, 400 bp; filtering A/T and C/G strings, AT/TA repeats, and terminal inverted repeats composed of >85% of two bases. Programs BESTFIT, GAP, PILEUP, and ASSEMBLE of the GCG Wisconsin package version 10.3 and the program Dotter (Sonnhammer and Durbin, 1995) were used for sequence comparison. Divergence times (DT) for the LTR-retrotransposons were estimated using k = K/(2 × DT), that is, DT = K/(2k), where k is the proposed mutation rate of 1.3 × 10⁻⁸ substitutions per site per year (Ma and Bennetzen, 2004a), and K is the estimated number of substitutions per site between sequences using the Kimura two-parameter method (Kimura, 1980). Phylogenetic and molecular evolutionary analyses were conducted using MEGA version 2.1 (Kumar et al., 2001). Statistical analysis was performed using Kolmogorov-Smirnov test statistics (http://faculty.vassar.edu/lowry/webtext.html) and permutation methods (Mielke and Berry, 2001).
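The divergence time calculation can be illustrated with a minimal sketch. It assumes that the two sequences compared are an aligned pair, such as the two LTRs of one element, and it uses the standard Kimura two-parameter formula K = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitional and transversional differences. The function names are hypothetical; the analyses in the study were run in MEGA, not with this code.

```python
import math

# Proposed rate in substitutions per site per year (Ma and Bennetzen, 2004a).
MUTATION_RATE = 1.3e-8

TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def kimura_2p_distance(seq1, seq2):
    """Kimura two-parameter distance K between two aligned sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to equal length")
    compared = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        if (a, b) in TRANSITIONS:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    argument = (1 - 2 * p - q) * math.sqrt(max(1 - 2 * q, 0.0))
    if argument <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(argument)


def insertion_age_years(k_distance, rate=MUTATION_RATE):
    """Divergence time DT = K / (2k): both copies accumulate substitutions."""
    return k_distance / (2 * rate)


if __name__ == "__main__":
    # Hypothetical aligned pair with one transition in 100 sites.
    ltr_5prime = "A" * 100
    ltr_3prime = "G" + "A" * 99
    k = kimura_2p_distance(ltr_5prime, ltr_3prime)
    print(f"K = {k:.5f}, DT = {insertion_age_years(k):.0f} years")
```

The factor of 2 in the denominator reflects that substitutions accumulate independently in both copies after they diverge, so the pairwise distance grows at twice the per-lineage rate.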
Sequence data from this article have been deposited with the EMBL/GenBank data libraries under accession numbers AY664413 (B73_locus9002), AY664417 (Mo17_locus9002), AY664414 (B73_locus9008), AY664418 (Mo17_locus9008), AY664415 (B73_locus9009), AY664419 (Mo17_locus9009), AY664416 (b103.c20), and AY691949 (b161.k19).
| Acknowledgments |
|---|
| Footnotes |
|---|
Online version contains Web-only data.
Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.104.025627.
Received July 2, 2004; accepted November 17, 2004.
| REFERENCES |
|---|
Arabidopsis Genome Initiative (2000). Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815.
Bennetzen, J.L. (2002). Mechanisms and rates of genome expansion and contraction in flowering plants. Genetica 115, 29–36.
Bennetzen, J.L., and Ma, J. (2003). The genetic colinearity of rice and other cereals on the basis of genomic sequence analysis. Curr. Opin. Plant Biol. 6, 128–133.
Bennetzen, J.L., Ma, J., and Devos, K.M. (2005). Mechanisms of recent genome size variation in flowering plants. Ann. Bot. 95, 127–132.
Birchler, J.A., Auger, D.L., and Riddle, N.C. (2003). In search of the molecular basis of heterosis. Plant Cell 15, 2236–2239.
Bureau, T.E., Ronald, P.C., and Wessler, S.R. (1996). A computer-based systematic survey reveals the predominance of small inverted-repeat elements in wild-type rice genes. Proc. Natl. Acad. Sci. USA 93, 8524–8529.
Bureau, T.E., and Wessler, S.R. (1992). Tourist: A large family of small inverted repeat elements frequently associated with maize genes. Plant Cell 4, 1283–1294.
Bureau, T.E., and Wessler, S.R. (1994a). Mobile inverted-repeat elements of the Tourist family are associated with the genes of many cereal grasses. Proc. Natl. Acad. Sci. USA 91, 1411–1415.
Bureau, T.E., and Wessler, S.R. (1994b). Stowaway: A new family of inverted repeat elements associated with the genes of both monocotyledonous and dicotyledonous plants. Plant Cell 6, 907–916.
Bureau, T.E., White, S.E., and Wessler, S.R. (1994). Transduction of a cellular gene by a plant retroelement. Cell 77, 479–480.
Cheng, Y.M., and Lin, B.Y. (2004). Molecular organization of large fragments in the maize B chromosome: Indication of a novel repeat. Genetics 166, 1947–1961.
Crow, J.F. (1948). Alternative hypotheses of hybrid vigor. Genetics 33, 477–487.
Dawe, R.K. (2003). RNA interference, transposons, and the centromere. Plant Cell 15, 297–301.
Devos, K.M., Brown, J.K.M., and Bennetzen, J.L. (2002). Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res. 12, 1075–1079.
Dooner, H.K. (1986). Genetic fine structure of the bronze locus in maize. Genetics 113, 1021–1036.
Dunn, C.A., Medstrand, P., and Mager, D.L. (2003). An endogenous retroviral long terminal repeat is the dominant promoter for human β1,3-galactosyltransferase 5 in the colon. Proc. Natl. Acad. Sci. USA 100, 12841–12846.
Elrouby, N., and Bureau, T.E. (2001). A novel hybrid open reading frame formed by multiple cellular gene transductions by a plant long terminal repeat retroelement. J. Biol. Chem. 276, 41963–41968.
Eyre-Walker, A., Gaut, R.L., Hilton, H., Feldman, D.L., and Gaut, B.S. (1998). Investigation of the bottleneck leading to the domestication of maize. Proc. Natl. Acad. Sci. USA 95, 4441–4446.
Feng, Q., et al. (2002). Sequence and analysis of rice chromosome 4. Nature 420, 316–320.
Fengler, K.A., Faller, M.L., Meyers, B.C., Dolan, M., Tingey, S.V., and Morgante, M. (2000). Construction of a contig-based physical map of corn using fluorescent fingerprint technology. Plant & Animal Genome VIII Conference, Jan. 9–12, 2000 (San Diego, CA), http://www.intl-pag/8/abstracts/pag8265.html.
Flavell, A.J. (1992). Ty1-copia group retrotransposons and the evolution of retroelements in the eukaryotes. Genetica 86, 203–214.
Fu, H., and Dooner, H.K. (2002). Intraspecific violation of genetic colinearity and its implications in maize. Proc. Natl. Acad. Sci. USA 99, 9573–9578.
Fu, H., Zheng, Z., and Dooner, H.K. (2002). Recombination rates between adjacent genic and retrotransposon regions in maize vary by 2 orders of magnitude. Proc. Natl. Acad. Sci. USA 99, 1082–1087.
Gale, M.D., and Devos, K.M. (1998). Comparative genetics in the grasses. Proc. Natl. Acad. Sci. USA 95, 1971–1974.
Gardiner, J., et al. (2004). Anchoring 9,371 maize expressed sequence tagged unigenes to the bacterial artificial chromosome contig map by two-dimensional overgo hybridization. Plant Physiol. 134, 1317–1326.
Gaut, B.S. (2001). Patterns of chromosomal duplication in maize and their implications for comparative maps of the grasses. Genome Res. 11, 55–66.
Gaut, B.S., and Doebley, J.F. (1997). DNA sequence evidence for the segmental allotetraploid origin of maize. Proc. Natl. Acad. Sci. USA 94, 6809–6814.
Gaut, B.S., Morton, B.R., McCaig, B.M., and Clegg, M.T. (1996). Substitution rate comparisons between grasses and palms: Synonymous rate differences at the nuclear gene Adh1 parallel rate differences at the plastid gene rbcL. Proc. Natl. Acad. Sci. USA 93, 10274–10279.
Goff, S.A., et al. (2002). A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296, 92–100.
Green, P. (1996). Towards completely automated sequence assembly. DOE Human Genome Program Contractor-Grantee Workshop V, Jan. 28–Feb. 1, 1996 (Santa Fe, NM), http://www.ornl.gov/sci/techresources/Human_Genome/publicat/96santa/informat/green.html.
Hake, S., and Walbot, V. (1980). The genome of Zea mays, its organization and homology to related grasses. Chromosoma 79, 251–270.
Hamilton, A.J., and Baulcombe, D.C. (1999). A species of small antisense RNA in posttranscriptional gene silencing in plants. Science 286, 950–952.
Han, B., and Xue, Y. (2003). Genome-wide intraspecific DNA-sequence variations in rice. Curr. Opin. Plant Biol. 6, 134–138.
Hannon, G.J. (2002). RNA interference. Nature 418, 244–251.
Hirochika, H., Sugimoto, K., Otsuki, Y., Tsugawa, H., and Kanda, M. (1996). Retrotransposons of rice involved in mutations induced by tissue culture. Proc. Natl. Acad. Sci. USA 93, 7783–7788.
Holub, E.B. (2001). The arms race is ancient history in Arabidopsis, the wildflower. Nat. Rev. Genet. 2, 516–527.
Ilic, K., SanMiguel, P.J., and Bennetzen, J.L. (2003). A complex history of rearrangement in an orthologous region of the maize, sorghum, and rice genomes. Proc. Natl. Acad. Sci. USA 100, 12265–12270.
Jiang, N., Bao, Z., Zhang, X., Eddy, S.R., and Wessler, S.R. (2004). Pack-MULE transposable elements mediate gene evolution in plants. Nature 431, 569–573.
Jiang, N., Bao, Z., Zhang, X., Hirochika, H., Eddy, S.R., McCouch, S.R., and Wessler, S.R. (2003). An active DNA transposon family in rice. Nature 421, 163–167.
Jin, Y.K., and Bennetzen, J.L. (1994). Integration and nonrandom mutation of a plasma membrane proton ATPase gene fragment within the Bs1 retroelement of maize. Plant Cell 6, 1177–1186.
Jung, M., Ching, A., Bhattramakki, D., Dolan, M., Tingey, S., Morgante, M., and Rafalski, A. (2004). Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm. Theor. Appl. Genet. 109, 681–689.
Kashkush, K., Feldman, M., and Levy, A.A. (2003). Transcriptional activation of retrotransposons alters the expression of adjacent genes in wheat. Nat. Genet. 33, 102–106.
Keller, B., and Feuillet, C. (2000). Colinearity and gene density in grass genomes. Trends Plant Sci. 5, 246–251.
Kikuchi, K., Terauchi, K., Wada, M., and Hirano, H.Y. (2003). The plant MITE mPing is mobilized in anther culture. Nature 421, 167–170.
Kimura, M. (1980). A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16, 111–120.
Kuff, E.L., and Lueders, K.K. (1988). The intracisternal A-particle gene family: Structure and functional aspects. Adv. Cancer Res. 51, 183–276.
Kumar, A., and Bennetzen, J.L. (1999). Plant retrotransposons. Annu. Rev. Genet. 33, 479–532.
Kumar, S., Tamura, K., Jakobsen, I.B., and Nei, M. (2001). MEGA2: Molecular evolutionary genetics analysis software. Bioinformatics 17, 1244–1245.
Lai, J., Ma, J., Swigonova, Z., Ramakrishna, W., Linton, E., Llaca, V., Tanyolac, B., Park, Y.J., Jeong, O.Y., Bennetzen, J.L., and Messing, J. (2004). Gene loss and movement in the maize genome. Genome Res. 14, 1924–1931.
Lander, E.S., et al. (2001). Initial sequencing and analysis of the human genome. Nature 409, 860–921.
Langham, R.J., Walsh, J., Dunn, M., Ko, C., Goff, S.A., and Freeling, M. (2004). Genomic duplication, fractionation and the origin of regulatory novelty. Genetics 166, 935–945.
Le, Q.H., Wright, S., Yu, Z., and Bureau, T. (2000). Transposon diversity in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 97, 7376–7381.
Lewis, E.B. (1951). Pseudoallelism and gene evolution. Cold Spring Harb. Symp. Quant. Biol. 16, 159–174.
Llave, C., Kasschau, K.D., Rector, M.A., and Carrington, J.C. (2002). Endogenous and silencing-associated small RNAs in plants. Plant Cell 14, 1605–1619.
Ma, J., and Bennetzen, J.L. (2004a). Rapid recent growth and divergence of rice nuclear genomes. Proc. Natl. Acad. Sci. USA.
Ma, J., and Bennetzen, J.L. (2004b). Rapid recent growth and divergence of rice nuclear genomes. Proc. Natl. Acad. Sci. USA 101, 12404–12410.
Ma, J., Devos, K.M., and Bennetzen, J.L. (2004). Analyses of LTR-retrotransposon structures reveal recent and rapid genomic DNA loss in rice. Genome Res. 14, 860–869.
Martinez-Izquierdo, J.A., Garcia-Martinez, J., and Vicient, C.M. (1997). What makes Grande1 retrotransposon different? Genetica 100, 15–28.
Matsuoka, Y., Vigouroux, Y., Goodman, M.M., Sanchez, G.J., Buckler, E., and Doebley, J. (2002). A single domestication for maize shown by multilocus microsatellite genotyping. Proc. Natl. Acad. Sci. USA 99, 6080–6084.
Medstrand, P., Landry, J.R., and Mager, D.L. (2001). Long terminal repeats are used as alternative promoters for the endothelin B receptor and apolipoprotein CI genes in humans. J. Biol. Chem. 276, 1896–1903.
Messing, J., Bharti, A.K., Karlowski, W.M., Gundlach, H., Kim, H.R., Yu, Y., Wei, F., Fuks, G., Soderlund, C.A., Mayer, K.F., and Wing, R.A. (2004). Sequence composition and genome organization of maize. Proc. Natl. Acad. Sci. USA 101, 14349–14354.
Mette, M.F., van der Winden, J., Matzke, M., and Matzke, A.J.M. (2002). Short RNAs can identify new candidate transposable element families in Arabidopsis. Plant Physiol. 130, 6–9.
Meyers, B.C., Scalabrin, S., and Morgante, M. (2004). Mapping and sequencing complex genomes: Let's get physical! Nat. Rev. Genet. 5, 578–588.
Meyers, B.C., Tingey, S.V., and Morgante, M. (2001). Abundance, distribution and transcriptional activity of repetitive elements in the maize genome. Genome Res. 11, 1660–1676.
Mielke, P.W., and Berry, K.J. (2001). Permutation Methods: A Distance Function Approach. (New York: Springer-Verlag).
Miyao, A., Tanaka, K., Murata, K., Sawaki, H., Takeda, S., Abe, K., Shinozuka, Y., Onosato, K., and Hirochika, H. (2003). Target site specificity of the Tos17 retrotransposon shows a preference for insertion within genes and against insertion in retrotransposon-rich regions of the genome. Plant Cell 15, 1771–1780.
Nakazaki, T., Okumoto, Y., Horibata, A., Yamahira, S., Teraishi, M., Nishida, H., Inoue, H., and Tanisaka, T. (2003). Mobilization of a transposon in the rice genome. Nature 421, 170–172.
Nigumann, P., Redik, K., Matlik, K., and Speek, M. (2002). Many human genes are transcribed from the antisense promoter of L1 retrotransposon. Genomics 79, 628–634.
Noel, L., Moores, T.L., van Der Biezen, E.A., Parniske, M., Daniels, M.J., Parker, J.E., and Jones, J.D. (1999). Pronounced intraspecific haplotype divergence at the RPP5 complex disease resistance locus of Arabidopsis. Plant Cell 11, 2099–2112.
Nordborg, M. (2001). Coalescent theory. In Handbook of Statistical Genetics, D.J. Balding, M. Bishop, and C. Cannings, eds (Chichester, UK: John Wiley and Sons), pp. 179–212.
Palmgren, M.G. (1994). Capturing of host DNA by a plant retroelement: Bs1 encodes plasma membrane H(+)-ATPase domains. Plant Mol. Biol. 25, 137–140.
Parniske, M., Hammond-Kosack, K.E., Golstein, C., Thomas, C.M., Jones, D.A., Harrison, K., Wulff, B.B., and Jones, J.D. (1997). Novel disease resistance specificities result from sequence exchange between tandemly repeated genes at the Cf-4/9 locus of tomato. Cell 91, 821–832.
Pickeral, O.K., Makalowski, W., Boguski, M.S., and Boeke, J.D. (2000). Frequent human genomic DNA transduction driven by LINE-1 retrotransposition. Genome Res. 10, 411–415.
Plasterk, R.H.A. (2002). RNA silencing: The genome's immune system. Science 296, 1263–1265.
Pouteau, S., Huttner, E., Grandbastien, M.A., and Caboche, M. (1991). Specific expression of the tobacco Tnt1 retrotransposon in protoplasts. EMBO J. 10, 1911–1918.
Rafalski, A. (2002). Applications of single nucleotide polymorphisms in crop genetics. Curr. Opin. Plant Biol. 5, 94–100.
Ramakrishna, W., Emberton, J., Ogden, M., SanMiguel, P., and Bennetzen, J.L. (2002). Structural analysis of the maize rp1 complex reveals numerous sites and unexpected mechanisms of local rearrangement. Plant Cell 14, 3213–3223.
Remington, D.L., Thornsberry, J.M., Matsuoka, Y., Wilson, L.M., Whitt, S.R., Doebley, J., Kresovich, S., Goodman, M.M., and Buckler IV, E.S. (2001). Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 98, 11479–11484.
Rozen, S., and Skaletsky, H.J. (2000). Primer3 on the WWW for general users and for biologist programmers. In Bioinformatics Methods and Protocols: Methods in Molecular Biology, S. Misener and S.A. Krawetz, eds (Totowa, NJ: Humana Press), pp. 365–386.
SanMiguel, P., Gaut, B.S., Tikhonov, A., Nakajima, Y., and Bennetzen, J.L. (1998). The paleontology of intergene retrotransposons of maize. Nat. Genet. 20, 43–45.
SanMiguel, P., Tikhonov, A., Jin, Y.K., Motchoulskaia, N., Zakharov, D., Melake-Berhan, A., Springer, P.S., Edwards, K.J., Lee, M., Avramova, Z., and Bennetzen, J.L. (1996). Nested retrotransposons in the intergenic regions of the maize genome. Science 274, 765–768.
SanMiguel, P.J., Ramakrishna, W., Bennetzen, J.L., Busso, C., and Dubcovsky, J. (2002). Transposable elements, genes and recombination in a 215-kb contig from wheat chromosome 5Am. Funct. Integr. Genomics 2, 70–80.
Schramke, V., and Allshire, R. (2003). Hairpin RNAs and retrotransposon LTRs affect RNAi and chromatin-based gene silencing. Science 301, 1069–1074.
Shirasu, K., Schulman, A.H., Lahaye, T., and Schulze-Lefert, P. (2000). A contiguous 66-kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res. 10, 908–915.
Soderlund, C., Humphray, S., Dunham, A., and French, L. (2000). Contigs built with fingerprints, markers, and FPC V4.7. Genome Res. 10, 1772–1787.
Song, R., and Messing, J. (2003). Gene expression of a gene family in maize based on noncollinear haplotypes. Proc. Natl. Acad. Sci. USA 100, 9055–9060.
Sonnhammer, E.L., and Durbin, R. (1995). A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene 167, 1–10.
Speek, M. (2001). Antisense promoter of human L1 retrotransposon drives transcription of adjacent cellular genes. Mol. Cell. Biol. 21, 1973–1985.
Talbert, L.E., and Chandler, V.L. (1988). Characterization of a highly conserved sequence related to mutator transposable elements in maize. Mol. Biol. Evol. 5, 519–529.
Tarchini, R., Biddle, P., Wineland, R., Tingey, S., and Rafalski, A. (2000). The complete sequence of 340 kb of DNA around the rice Adh1-adh2 region reveals interrupted colinearity with maize chromosome 4. Plant Cell 12, 381–391.
Tikhonov, A.P., SanMiguel, P.J., Nakajima, Y., Gorenstein, N.M., Bennetzen, J.L., and Avramova, Z. (1999). Colinearity and its exceptions in orthologous adh regions of maize and sorghum. Proc. Natl. Acad. Sci. USA 96, 7409–7414.
Tsuchisaka, A., and Theologis, A. (2004). Heterodimeric interactions among the 1-amino-cyclopropane-1-carboxylate synthase polypeptides encoded by the Arabidopsis gene family. Proc. Natl. Acad. Sci. USA 101, 2275–2280.
Tu, Z. (2001). Eight novel families of miniature inverted repeat transposable elements in the African malaria mosquito, Anopheles gambiae. Proc. Natl. Acad. Sci. USA 98, 1699–1704.
Vicient, C.M., Jaaskelainen, M.J., Kalendar, R., and Schulman, A.H. (2001). Active retrotransposons are a common feature of grass genomes. Plant Physiol. 125, 1283–1292.
Vigouroux, Y., Jaqueth, J.S., Matsuoka, Y., Smith, O.S., Beavis, W.D., Smith, J.S., and Doebley, J. (2002). Rate and pattern of mutation at microsatellite loci in maize. Mol. Biol. Evol. 19, 1251–1260.
Vitte, C., and Panaud, O. (2003). Formation of solo-LTRs through unequal homologous recombination counterbalances amplifications of LTR retrotransposons in rice Oryza sativa L. Mol. Biol. Evol. 20, 528–540.
Voytas, D., Cummings, M., Konieczny, A., Ausubel, F., and Rodermel, S. (1992). Copia-like retrotransposons are ubiquitous among plants. Proc. Natl. Acad. Sci. USA 89, 7124–7128.
Whitelaw, E., and Martin, D.I. (2001). Retrotransposons as epigenetic mediators of phenotypic variation in mammals. Nat. Genet. 27, 361–365.
Wicker, T., Stein, N., Albar, L., Feuillet, C., Schlagenhauf, E., and Keller, B. (2001). Analysis of a contiguous 211 kb sequence in diploid wheat (Triticum monococcum L.) reveals multiple mechanisms of genome evolution. Plant J. 26, 307–316.
Wicker, T., Yahiaoui, N., Guyot, R., Schlagenhauf, E., Liu, Z.D., Dubcovsky, J., and Keller, B. (2003). Rapid genome divergence at orthologous low molecular weight glutenin loci of the A and A(m) genomes of wheat. Plant Cell 15, 1186–1197.
Wolfe, K.H.M., Gouy, M., Yang, Y.W., Sharp, P.M., and Li, W.-H. (1989). Date of the monocot-dicot divergence estimated from chloroplast DNA sequence data. Proc. Natl. Acad. Sci. USA 86, 6201–6205.
Yamazaki, M., Tsugawa, H., Miyao, A., Yano, M., Wu, J., Yamamoto, S., Matsumoto, T., Sasaki, T., and Hirochika, H. (2001). The rice retrotransposon Tos17 prefers low-copy-number sequences as integration targets. Mol. Genet. Genomics 265, 336–344.
Yao, H., Zhou, Q., Li, J., Smith, H., Yandeau, M., Nikolau, B.J., and Schnable, P.S. (2002). Molecular characterization of meiotic recombination across the 140-kb multigenic a1-sh2 interval of maize. Proc. Natl. Acad. Sci. USA 99, 6157–6162.
Yu, J., et al. (2002). A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296, 79–92.
||||
![]() |
R. Bruggmann, A. K. Bharti, H. Gundlach, J. Lai, S. Young, A. C. Pontaroli, F. Wei, G. Haberer, G. Fuks, C. Du, et al. Uneven chromosome contraction and expansion in the maize genome Genome Res., October 1, 2006; 16(10): 1241 - 1251. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Stupar and N. M. Springer Cis-transcriptional Variation in Maize Inbred Lines B73 and Mo17 Leads to Additive Expression Patterns in the F1 Hybrid Genetics, August 1, 2006; 173(4): 2199 - 2210. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Lamb and J. A. Birchler Retroelement Genome Painting: Cytological Visualization of Retroelement Expansions in the Genera Zea and Tripsacum Genetics, June 1, 2006; 173(2): 1007 - 1021. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gonzalo, T. J. Vyn, J. B. Holland, and L. M. McIntyre Mapping Density Response in Maize: A Direct Approach for Testing Genotype and Treatment Interactions Genetics, May 1, 2006; 173(1): 331 - 348. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. K. Anderson, A. Lai, S. M. Stack, C. Rizzon, and B. S. Gaut Uneven distribution of expressed sequence tag loci on maize pachytene chromosomes Genome Res., January 1, 2006; 16(1): 115 - 122. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Devos, J. Ma, A. C. Pontaroli, L. H. Pratt, and J. L. Bennetzen Analysis and mapping of randomly chosen bacterial artificial chromosome clones from hexaploid bread wheat PNAS, December 27, 2005; 102(52): 19243 - 19248. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. H. Paterson, M. Freeling, and T. Sasaki Grains of knowledge: Genomics of model cereals Genome Res., December 1, 2005; 15(12): 1643 - 1650. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. J. Conrad and T. P. Brutnell Ac-Immobilized, a Stable Source of Activator Transposase That Mediates Sporophytic and Gametophytic Excision of Dissociation Elements in Maize Genetics, December 1, 2005; 171(4): 1999 - 2012. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-S. Kim, M. N. Islam-Faridi, P. E. Klein, D. M. Stelly, H. J. Price, R. R. Klein, and J. E. Mullet Comprehensive Molecular Cytogenetic Analysis of Sorghum Genome Architecture: Distribution of Euchromatin, Heterochromatin, Genes and Recombination in Comparison to Rice Genetics, December 1, 2005; 171(4): 1963 - 1976. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Haberer, S. Young, A. K. Bharti, H. Gundlach, C. Raymond, G. Fuks, E. Butler, R. A. Wing, S. Rounsley, B. Birren, et al. Structure and Architecture of the Maize Genome Plant Physiology, December 1, 2005; 139(4): 1612 - 1624. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Delmer Inaugural Article: Agriculture in the developing world: Connecting innovations in plant research to downstream applications PNAS, November 1, 2005; 102(44): 15739 - 15746. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Yao and P. S. Schnable Cis-effects on Meiotic Recombination Across Distinct a1-sh2 Intervals in a Common Zea Genetic Background Genetics, August 1, 2005; 170(4): 1929 - 1944. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Li, D. Rotter, S. A. Bonos, W. A. Meyer, and F. C. Belanger Identification of a Gene in the Process of Being Lost from the Genus Agrostis Plant Physiology, August 1, 2005; 138(4): 2386 - 2395. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Lal and L. C. Hannah Helitrons contribute to the lack of gene colinearity observed in modern maize inbreds PNAS, July 19, 2005; 102(29): 9993 - 9994. [Full Text] [PDF] |
||||
![]() |
J. Lai, Y. Li, J. Messing, and H. K. Dooner From the Cover: Gene movement by Helitron transposons contributes to the haplotype variability of maize PNAS, June 21, 2005; 102(25): 9068 - 9073. [Abstract] [Full Text] [PDF] |
||||