- © 1999 American Society of Plant Physiologists
Abstract
The TRANSPARENT TESTA GLABRA1 (TTG1) locus regulates several developmental and biochemical pathways in Arabidopsis, including the formation of hairs on leaves, stems, and roots, and the production of seed mucilage and anthocyanin pigments. The TTG1 locus has been isolated by positional cloning, and its identity was confirmed by complementation of a ttg1 mutant. The locus encodes a protein of 341 amino acid residues with four WD40 repeats. The protein is similar to AN11, a regulator of anthocyanin biosynthesis in petunia, and more distantly related to those of the β subunits of heterotrimeric G proteins, which suggests a role for TTG1 in signal transduction to downstream transcription factors. The 1.5-kb TTG1 transcript is present in all major organs of Arabidopsis. Sequence analysis of six mutant alleles has identified base changes producing truncations or single amino acid changes in the TTG1 protein.
INTRODUCTION
The TRANSPARENT TESTA GLABRA1 (TTG1) locus controls many apparently unrelated characters of Arabidopsis (catalogued by Koornneef, 1981), several of which appear to be confined to the epidermal cell layer of different tissues. ttg1 mutants have a glabrous phenotype, possessing none of the leaf or stem hairs (trichomes) that normally are derived from the meristematic L1 cell layer. Purple anthocyanin pigments are absent from the ttg1 seed coat, causing the transparent testa phenotype in which the yellow cotyledons are visible through the testa. In wild-type plants, anthocyanins are present in the hypocotyl of seedlings and in the stem and leaves of plants as they age, and they are inducible by many forms of stress, including high light, poor nutrients, or water stress. ttg1 mutants completely lack anthocyanins in the epidermis and in subepidermal layers of leaves and stems. Mucilage normally found in the cell wall of the seed coat is absent in ttg1 mutants. Seeds of ttg1 plants do not require drying and cold treatments to germinate and therefore exhibit an altered seed dormancy when compared with ecotypes, such as Landsberg erecta (Ler; Koornneef, 1981; Léon-Kloosterziel et al., 1994). This characteristic of ttg1 mutants may be linked to an altered seed coat structure. The TTG1 gene appears to have the opposite effect on root hair formation when compared with its effect on leaf hair initiation. In Arabidopsis, root hairs extend from root epidermal cells only in files of cells that contact two underlying cortical cells, whereas in ttg1 mutants, extra root hairs occur in the atrichoblast cell files (Galway et al., 1994). Under laboratory growth conditions, mutations at the ttg1 locus do not greatly affect the viability of the plants.
In ttg1 mutants, the anthocyanin biosynthetic pathway is blocked at the dihydroflavonol-4-reductase (DFR) step, because DFR-encoding transcripts have not been detected in these mutants (Shirley et al., 1995). By contrast, transcripts of the chalcone synthase and chalcone isomerase genes are unaffected. The point of regulation of the pathway by TTG1 was confirmed by the cloning and analysis of the expression of the Arabidopsis gene encoding flavanone 3-hydroxylase, which catalyzes the enzymatic step between chalcone isomerase and DFR (Pelletier and Shirley, 1996).
This feature of ttg1 mutants resembles the effect of the Delila mutation of snapdragon (Martin et al., 1991). Delila mutants also lack transcripts encoding DFR and subsequent enzymes in the anthocyanin pathway. In maize, a group of closely related genes, including Booster, R, and Lc, regulates the entire anthocyanin biosynthetic pathway from chalcone synthase onward. Delila and R both encode MYC-like transcription factors (Ludwig et al., 1989; Goodrich et al., 1992). R has been shown to activate directly the transcription of several genes encoding anthocyanin biosynthetic enzymes in conjunction with a MYB transcription factor encoded by the C1 gene in maize (Goff et al., 1992; Lesnick and Chandler, 1998).
The cauliflower mosaic virus 35S promoter was used to express the maize R gene in Arabidopsis plants. The phenotypes of these plants complemented all of the mutant phenotypes of ttg1 plants, suggesting that TTG1 might encode an Arabidopsis R homolog or might regulate an R-like gene (Lloyd et al., 1992). Transformation of wild-type Arabidopsis plants with the cauliflower mosaic virus 35S–R construct resulted in more anthocyanin in the plant and increased numbers of trichomes on leaves and stems. However, although transgenic tobacco plants expressing the maize R gene had flowers with elevated anthocyanin levels, no change in trichome development was observed (Lloyd et al., 1992).
Many genetic loci involved in trichome differentiation and development in Arabidopsis have been described (Hülskamp et al., 1994; Marks, 1997). TTG1 and GLABROUS1 (GL1) play a key role in the initiation of trichomes; mutations at either locus lead to hairless leaves and stems (Koornneef, 1981; Koornneef et al., 1982). The GL1 gene encodes a MYB transcription factor (Oppenheimer et al., 1991). Double mutant analysis has suggested that GL1 and TTG1 may affect the same point in the pathway leading to trichome formation (Larkin et al., 1994), although gl1 mutants do not show the same pleiotropic effects as ttg1 mutants. Another locus that affects both trichome development and pigment levels has recently been identified and isolated. This locus has been designated TTG2 (C.S. Johnson and D.R. Smyth, personal communication).
Mutations at another GLABRA locus, GL2, have some features in common with ttg1, although gl2 mutants have a normal anthocyanin content and have rudimentary trichomes, suggesting that GL2 is not required to determine the fate of these cells (Koornneef, 1981). gl2 mutants have an increased number of ectopic root hairs, although gl2 atrichoblast cells resemble those of the wild type more than they do those of the ttg1 mutant, indicating a role later in the development of the root epidermal cells (Masucci et al., 1996). The effects on the seed coat and mucilage are similar to those of the ttg1 mutations (Koornneef, 1981). The GL2 locus encodes another transcription factor, a homeodomain protein with a leucine zipper domain (Rerie et al., 1994). By using in situ hybridization, studies showed that the GL2 gene is expressed in developing trichomes (Rerie et al., 1994) and in the atrichoblast cell files of roots (Masucci et al., 1996). GL2 transcripts are greatly reduced in ttg1 mutants, suggesting that the TTG1 gene product is required for normal levels of GL2 expression (Di Cristina et al., 1996). However, TTG1 does not influence the pattern of GL2 expression, which occurs at very low levels in specific tissues in the absence of a functional TTG1 gene (Hung et al., 1998; Szymanski et al., 1998).
Another MYB-related gene, CAPRICE (CPC), may have an effect opposite to that of TTG1 in promoting root hair development (Wada et al., 1997). Plants overexpressing CPC have extra root hairs and a reduced number of leaf trichomes. The data from studies of GL1, CPC, and R suggest that the MYB and MYC transcription factors regulate trichome, root hair, and anthocyanin production and that TTG1 may control directly or indirectly the activation of one or more transcription factors.
The TTG1 locus has been mapped by Koornneef et al. (1983) to chromosome 5 between MS1 and GA3. We used this information to isolate the TTG1 locus by using map-based techniques. Restriction fragment length polymorphisms (RFLPs) were used to map crossovers on both sides of the ttg1-1 allele followed by complementation of the mutant phenotype by the introduction of genomic fragments containing the TTG1 locus. The TTG1 locus encodes a WD40 repeat protein. This protein may regulate signal transduction pathways governing leaf hair formation and anthocyanin biosynthesis.
RESULTS
Molecular Mapping of ttg1-1
Two sets of recombinants between ttg1-1 and flanking genetic markers were used for fine mapping of the TTG1 region of chromosome 5. A total of 140 recombinants between the ttg1-1 and MS1 loci were analyzed. Between ttg1 and GA3, 299 recombinants were selected covering a 4-centimorgan (cM) interval. Figure 1A shows a large-scale map of the TTG1 region between MS1 and GA3 that was derived from RFLP analysis of the recombinants on both sides of the ttg1-1 mutation by using a variety of probes. Figure 1B shows a higher resolution map of the region covered by yeast artificial chromosome (YAC) EG20H2 (Grill and Somerville, 1991). RFLPs visible upon probing with the g4556 cosmid (Nam et al., 1989) could be separated from ttg1-1 only in a single recombinant called Dennis (on the MS1 side of ttg1-1; Figure 1B), suggesting that g4556 was very close to the mutation in ttg1-1. Figure 2A shows a DNA gel blot of EcoRI-digested genomic DNA of several recombinants probed with g4556. The pattern of hybridizing bands in Dennis is similar to that of the Wassilewskija (Ws) parent.
The cosmid g4556 and YAC EG20H2, which hybridized with cosmid g4556, were used to isolate overlapping fragments of Ler genomic DNA in λ vectors. The fine-scale map (Figure 1B) shows the order of DNA fragments and the positions of several recombination events close to TTG1-1, including the crossover in Dennis. It was not possible to map nearby crossover events on the proximal side of ttg1-1 due to the lack of RFLPs between the ecotypes used (Ler and Rschew [RLD1]). Moreover, part of this region from YAC EG20H2 could not be subcloned into bacterial vectors and was not represented by clones from three genomic libraries (including a cosmid library) screened with the YAC DNA.
The fine mapping of the TTG1 locus was aided by analysis of the ttg1-13 mutation, which was generated by fast-neutron bombardment from an RLD1 line carrying the dis2 mutation. The results of a DNA gel blot (Figure 2B) of mutant and parental DNA digested with EcoRV and EcoRI and probed with λ8 (Figure 1B) suggest that a deletion of this region could account for the phenotype of the ttg1-13 mutant, indicating that λ8 lies close to or contains the TTG1 locus.
Complementation of ttg1 Mutations Using Genomic Fragments
The entire genomic fragments from overlapping λ clones 1.1A, 8, and X6 (marked in Figure 1B) were inserted into the plasmid binary vector pBINNOT (a derivative of pBIN19 [Bevan, 1984] with a unique NotI site in the polylinker) by using NotI restriction sites in the λ vector; the plasmids were called pB1.1A, pB8, and pBX6. These constructs were used to transform Arabidopsis (the ttg1-1 mutant in the Ler background) by root coculture, with Agrobacterium carrying the plasmids (Valvekens et al., 1988). One plant transformed with pB8, derived from λ8, had trichomes and produced anthocyanin, but it failed to set seed. By contrast, several plants transformed with pB1.1A or pBX6 did not have trichomes. These results suggest that the TTG1 locus might be contained in the genomic insert in pB8.
Three deletions of pB8 were made by using restriction sites within the genomic sequence and the polylinker of the vector. These deletion constructs (shown in Figure 1C) were used to transform Arabidopsis plants, carrying the ttg1-9 mutation in a Columbia background, via vacuum infiltration (Bechtold et al., 1993), with Agrobacterium carrying the plasmids. From the pB8DE3 construct, 60 of 61 kanamycin-resistant transformants produced trichomes. All 19 transformants from the deletion construct pB8DX1 and 36 transformants from pB8DE2 had no or very few trichomes, a phenotype similar to that of plants carrying the ttg1-9 allele. All transformants from pB8DE3 bearing normal trichomes also exhibited wild-type characteristics of brown seed, seed mucilage, purple coloring of the plant, and normal root hair numbers, indicating that the other phenotypes of the ttg1 mutant had also been complemented. The DNA region specific to pB8DE3, and therefore the region that was presumed to contain the TTG1 locus, is as shown in Figure 1C.
Maps of the TTG1 Region.
(A) Large-scale map from MS1 to GA3 indicating the position of the most important markers used to determine the position of the TTG1 locus. Boxes show end probes generated from YACs. Cosmid, λ, and plasmid probes for RFLPs were made by labeling whole constructs.
(B) Fine map of the TTG1 region showing the estimated position of TTG1 (arrowhead) in relation to the DNA constructs, including YACs EG20H2 and CIC8B11, cosmid g4556, and λ clones (obtained from library screens). The positions of four crossovers close to TTG1 are indicated by X's, and Dennis (the closest recombinant to ttg1-1) is labeled.
(C) Restriction map of the insert from λ8 showing EcoRI (E) and XbaI (X) sites used to generate the deletion constructs indicated below. Solid lines represent the DNA remaining in the constructs; dotted lines indicate the deleted regions. Sequencing showed the position and direction of the TTG1 transcript (arrow). All maps are drawn in the same orientation.
Analysis of the TTG1 Region
Sequence analysis of 5777 bp from pB8DE3 (Figure 1C), using Genmark (Borodovsky and McIninch, 1993) and NetPlantGene (Hebsgaard et al., 1996), revealed a single gene. Both computer programs rely on the very high A+T content of Arabidopsis intergenic regions to predict coding regions. This gene was shown to represent a single-copy sequence by comparing DNA gel blot data with cloned genomic restriction fragments (data not shown).
Genomic DNA Gel Blots Used to Locate the TTG1 Locus.
(A) Genomic DNA (1.5 μg) from recombinants from the MS1 side of TTG1 digested with EcoRI and probed with g4556. The three differences in band mobility between the parental lines Ler and Ws are indicated by arrows. Rachael (R), Simon (S), Dennis (D), Oscar (O), Elaine (E), and Melissa (M) are the six recombinant lines that were used.
(B) Genomic DNA (1 μg) from RLD1–dis2 (R-d), ttg1-13 (13), and RLD1 (R) was digested with EcoRV or EcoRI and probed with λ8. The 4.1-kb EcoRI fragment containing the TTG1 locus is indicated.
The predicted gene has very high similarity to an Arabidopsis expressed sequence tag (EST) (GenBank accession numbers F20055 and F20056) that, by sequence analysis, starts at base 467 and corresponds to the 3′ end of the gene shown in Figure 3. Comparison of the genomic and EST sequences showed an intron of 700 bp, located one base 3′ of the predicted termination codon. Other introns were not identified in the gene by using computer analysis or by comparison to the EST. The predicted protein sequence of 341 amino acid residues shows sequence similarity to a large and diverse group of proteins with WD40 repeat motifs (see Discussion). The TTG1 protein has four WD40 repeats indicated in Figure 3. There is a proline-rich region in which eight of 26 amino acid residues are proline. The amino acid sequence most similar to that of TTG1 in the databases is the product of the An11 gene of petunia (de Vetten et al., 1997), which contains 79.5% identical and 87.5% similar amino acid residues. An alignment of TTG1 with the AN11 sequence is shown in Figure 4 together with the sequence of the β subunit of a heterotrimeric G protein.
To identify the 5′ end of the transcript and to confirm the likely translation initiation codon, we conducted a primer extension experiment with an end-labeled oligonucleotide designed from the complementary strand of the TTG1 gene. The start of transcription was located 110 bp 5′ of the predicted start of translation and is indicated as the beginning of the cDNA sequence (Figure 3). There are no other ATG codons in the 5′ untranslated region. A possible TATA box is located 22 bp 5′ of the start of transcription (Figure 3). The poly(A) addition site is evident in the sequence of the EST clone and is located at position 1907 (Figure 3). The transcript is predicted to be 1317 nucleotides plus a poly(A) tail.
The expression of the gene in different organs of Arabidopsis was examined by using an RNA gel blot hybridized with a TTG1 probe (Figure 5). The transcript was estimated to be 1.5 kb and was present in all organs examined, including roots, leaves, stems, meristems, flowers, and flower buds. TTG1 appears to be highly expressed in floral buds, in which there are only a few trichomes on the sepals and no anthocyanin in the flower petals or stamen tissue.
Identification and Analysis of ttg1 Alleles
Several different ttg1 alleles had been identified previously (Koornneef, 1981). Six new ttg1 alleles were identified in the Kranz collection (Table 1). Initially, the trichome, anthocyanin, and seed mucilage characteristics were compared with those of the ttg1-1 mutants and were shown to be similar to this mutant. Lines were then crossed to ttg1-1 to determine whether they carried mutations at the same locus. In the F1 generation, all plants lacked trichomes and anthocyanin, and there was no segregation of the mutant phenotype in the F2 generation. These new alleles have been designated ttg1-15 to ttg1-20 (Table 1).
Sequence of the TTG1 Locus.
The cDNA sequence is shown in capital letters; untranscribed regions and the intron are in lowercase letters. The first capitalized base shown (at position −109) is at the start of transcription, although nucleotide and amino acid numbering is from the start of translation. The protein sequence is given below the nucleotide sequence (GenBank accession number AJ133743). The two amino acids (normally WD) at the end of each repeat are underlined. The asterisk denotes the termination codon. The putative tata box is underlined.
The DNA sequence of a number of ttg1 mutant alleles and their parental ecotypes was determined to identify the mutations in the TTG1 locus. Polymerase chain reaction (PCR) was conducted with genomic DNA, and the PCR products were sequenced directly with the primers used to generate them. Sequence comparison of four ecotypes (Ler, Ws, Columbia, and RLD1) showed no nucleotide changes in the coding region, although there were several differences within the intron and in the promoter region (data not shown).
Mutations were identified in all ttg1 alleles examined, and these are represented in Figure 6A and summarized in Table 1. The mutation in ttg1-1 is a single C-to-T base transition that introduces a termination codon in place of the codon for glutamine 317. The resulting protein product is truncated, lacking the C-terminal 25 amino acid residues. Termination codons in place of codons for tryptophan 183 in ttg1-19 and serine 310 in ttg1-15, ttg1-16, ttg1-17, and ttg1-18 would also lead to truncated translation products. There are two mutations in ttg1-20: the same nonsense mutation as in ttg1-15 and an additional C-to-G base transversion leading to serine 30 being changed to cysteine (Table 1). The only mutation that resulted from a single amino acid substitution was found in ttg1-9, in which a C-to-T base transition leads to a change from serine 282 to phenylalanine.
RNA gel blot analysis of total RNA extracted from flower buds of 6-week-old wild-type and mutant plants (Figure 6B) shows that all but one of the mutants described above contain transcripts of the TTG1 gene. Because ttg1-15, ttg1-16, ttg1-17, and ttg1-18 all have the same mutation, ttg1-17 was selected to represent these lines. The lane containing RNA from the deletion mutant ttg1-13 shows no hybridizing band, confirming that the 1.5-kb transcript is a product of the TTG1 gene.
DISCUSSION
The TTG1 Locus Encodes a WD40 Repeat Protein
The isolation and analysis of the TTG1 locus have revealed that TTG1 is a WD40 repeat protein similar to An11 from petunia (de Vetten et al., 1997) and that it is not related to MYC transcription factors, as suggested by the results of Lloyd et al. (1992), which showed that constitutive expression of the maize R gene in ttg1 mutants was able to complement the phenotype of ttg1 mutants. The identification of TTG1 as a WD40 repeat protein suggests that TTG1 regulates MYC transcription factors or pathways in which MYC factors are involved.
Alignment of Amino Acid Sequences of TTG1, AN11, and the β Subunit of Heterotrimeric G Protein.
The structural features of G-β in the complexed form (betac) with G-γ and in a form without G-γ (betau) are represented at individual residue positions. Because some of the side chains of G-β are not well defined in electron density, they are modeled as alanine residues in the crystal structure (Sondek et al., 1996). The residue numbering of TTG1 is shown at the top of the alignment. A key to the structural features in the JOY format (Mizuguchi et al., 1988) is as follows: uppercase letters, the solvent is inaccessible; lowercase letters, the solvent is accessible; italic letters, positive ϕ; tilde above letters, hydrogen bond to side chain; boldface letters, hydrogen bond to main chain amide; underlined letters, hydrogen bond to main chain carbonyl. Dashes indicate gaps inserted to aid alignment.
Structure of WD40 Repeat Proteins
Proteins containing WD40 repeats are involved in a number of different types of regulatory roles, such as signaling (e.g., β subunits of heterotrimeric G proteins), cell cycle regulation (e.g., CDC20 and CDC4), transcriptional repression (e.g., yeast TUP1 and Drosophila extra sex combs), vesicular trafficking (e.g., SEC13), and RNA processing (reviewed in Neer et al., 1994). The G-β subunits, which contain seven repeats of the WD40 motif, are the best characterized of the WD40 repeat proteins. The crystal structure of the G-βγ complex shows that the G-β subunit adopts a seven-bladed β propeller fold (Sondek et al., 1996). Each blade is composed of four β strands, and an N-terminal amphipathic α helix interacts closely with the G-γ subunit, which is required for correct folding and function of the G-β subunit. The structure of the G-βγ complex is not significantly altered in the heterotrimeric G-αβγ protein (Lambright et al., 1996).
A typical WD repeat motif is characterized by GH and WD doublets, which are separated by a characteristic pattern of hydrophobic and hydrophilic residues. The sequence of TTG1 has only four clearcut WD repeat motifs, whereas G-β has seven repeats. However, TTG1 may have seven structural repeats; several proteins that display no characteristic WD repeats or have no amino acid sequence similarity to G-β have been observed to have a sevenfold symmetric β-propeller shape (Murzin, 1992). For example, the crystal structure of the terminal domain of the clathrin heavy chain shows a seven-bladed β propeller closely resembling G-β but with no sequence similarity to WD40 proteins (ter Haar et al., 1998). Murzin (1992) also showed that no strong sequence constraints are necessary for proper β-sheet twist and packing to result in a sevenfold symmetric structure.
An alignment of structural elements of G-β and TTG1 (Figure 4) aided in modeling the TTG1 protein. A three-dimensional model of TTG1 (Figure 7A) was generated on the basis of the crystal structure of the G-β subunit by using the coordinates of the G-β subunit crystal structure of the G-βγ complex and using the suite of programs called COMPOSER (Blundell et al., 1987; Srinivasan and Blundell, 1993). Whereas the gross structure of TTG1 is likely to resemble that of the G-β subunit, there will be some differences. The N-terminal helix of TTG1, if present, will probably differ in its orientation compared with the equivalent helix in the G-β structure, which interacts with the helix of the G-γ subunit. Deletion of a single residue in the middle of the putative helix in AN11 compared with TTG1 (Figure 4) supports the suggestion that the structure and orientation of this helical region could radically differ from that of the G-β subunit. The first two blades do not match the WD40 consensus sequence well but could still fold to resemble propeller blades. However, it is possible that the proline-rich region present at the beginning of the propeller fold could result in a disruption in the first two blades. These features, together with deviations in the periodicity of the GH and WD repeat motifs, suggest that the regularity of the repeats in the TTG1 structure is lower than that of the G-β subunit structure. The amino acid residues in the G-β subunit identified as interacting with G-α and G-γ subunits are not conserved in TTG1, suggesting that TTG1 does not interact with partners that are very similar to Gα and Gγ.
TTG1 Is Expressed in All Major Organs of Arabidopsis.
RNA gel blot analysis of total RNA (10 μg) of different organs of 4-week-old Arabidopsis ecotype Columbia plants probed with the coding region of the TTG1 gene and with a β-tubulin (β-tub) probe.
Mutations Identified in ttg1 Mutant Alleles
There are several sequences in the databases encoding putative proteins that show higher similarity to TTG1 than to the G-β subunit from either plants or animals. An Arabidopsis genomic sequence, located on chromosome 3 and called atan11b (de Vetten et al., 1997), could encode a protein showing 72% similarity to TTG1. An Arabidopsis EST called atan11a encodes a protein also with 72% similarity to TTG1, although the putative products of atan11a and atan11b are much more similar to each other (93% similarity). Two nematode genes arrayed in tandem (in cosmid F53C11) may encode proteins with greater similarity to TTG1 than proteins encoded by any locus from yeast, although a yeast gene (S61017) encodes a protein that is 60% similar to TTG1. Because TTG1 regulates pathways that involve cell differentiation in Arabidopsis, this type of function also may be required for the TTG1 homologs in other multicellular organisms. In yeast, the TTG1 homolog may be involved in regulating metabolic pathways, as it is in Arabidopsis.
Genes encoding several WD40 proteins, including putative G-β subunits, the constitutive photomorphogenic COP1 protein, and retinoblastoma binding–like proteins, have been isolated from plants (reviewed in Ma, 1994; Ach et al., 1997). The COP1 protein from Arabidopsis contains three domains and acts as part of a protein complex negatively regulating gene transcription. The WD40 region of COP1 has been shown to be essential for the function of this protein (McNellis et al., 1994) as a negative regulator of photomorphogenesis. The interaction with other proteins in the complex is probably through the WD40 repeat domain. The PRL1 gene of Arabidopsis encodes a WD40 repeat protein involved in the repression of glucose-responsive genes (Németh et al., 1998). A mutation at this locus has pleiotropic effects, including short roots and ectopic root hairs, and altered responses to sucrose, plant growth factors, and temperature. However, sequence comparisons between TTG1 and other plant proteins such as COP1 and PRL1 show that TTG1 is more similar to G-β subunits than to other classes of WD proteins.
Analysis of ttg1 Mutations.
(A) Scheme of the coding region of the TTG1 gene with the position of mutations (X) indicated. The wild-type (wt) codon is given above together with the amino acid (aa). The base changes in the mutants are in boldface, and the amino acid sequence changes are shown below. The ttg1 allele numbers are shown below. The stop codon in ttg1-17 is also present in ttg1-15, ttg1-16, ttg1-18, and ttg1-20.
(B) RNA gel blot of total RNA (10 μg) isolated from flower buds of mutant alleles probed with the coding region of the TTG1 gene and the β-tubulin (β-tub) gene. Parental ecotypes Ws (W), Columbia (C), Ler (L), and RLD1 (R) and mutant alleles with identified mutations are shown.
Model of the TTG1 Structure Based on the Crystal Structure of the Gβ Subunit.
(A) Ribbon representation of the TTG1 protein. The α helices, β strands, and loops are shown in blue, green, and yellow, respectively. The positions of the mutations at residue 183 in ttg1-19, residue 310 in ttg1-15, ttg1-16, ttg1-17, ttg1-18, and ttg1-20, and residue 317 in ttg1-1 that all lead to truncations are shown in red, whereas serine-282, which is changed to phenylalanine in ttg1-9, is indicated in pink. The side chains of these four amino acid residues are represented in the ball-and-stick model in red. The N and C termini are labeled. This diagram was produced using the program MOLSCRIPT (Kraulis, 1991).
(B) The electrostatic potential surface of the model of TTG1 seen from the opposite side diagrammed in (A) and showing the acidic patch at the center. The acidic and basic clusters are shown in red and blue, respectively. The image was produced using the program GRASP (Nicholls et al., 1991).
TTG1 and AN11 May Regulate Different Transcription Factors
TTG1 is most similar to the petunia AN11 protein encoded by the An11 locus, which was isolated by transposon tagging (de Vetten et al., 1997). AN11 controls anthocyanin pigmentation only in flowers; mutations at this locus do not result in changes to trichomes or to anthocyanins in the rest of the plant. AN11 controls anthocyanin biosynthesis in flowers, most probably by regulating AN2, a MYB transcription factor similar to C1 from maize. AN2 can complement the an11 mutation in a transient assay in mutant flowers (de Vetten et al., 1997). In petunia, the transcription factors AN2 and JAF13 or AN1, MYC transcription factors similar to R from maize, are thought to act together in a manner similar to C1 and R to regulate positively the anthocyanin biosynthetic pathway (reviewed in Mol et al., 1996). The regulation of a MYB transcription factor by AN11 in petunia appears to be contrary to the evidence that TTG1 might regulate a MYC transcription factor, based on the expression in Arabidopsis of the maize R gene (Lloyd et al.,1992). One possible explanation for these apparently conflicting results is that TTG1 regulates the MYB–MYC complex (rather than one component), which then activates transcription. Expression of the heterologous maize R gene in Arabidopsis might bypass the normal requirement for activation of endogenous transcription factors by TTG1. GL1, the MYB transcription factor of Arabidopsis involved in trichome initiation, does not play a role in other pathways in which TTG1 is involved, whereas one MYC factor may function in many pathways. This suggests that other MYB factors might act in conjunction with the R-like MYC to regulate the anthocyanin biosynthesis and root hair initiation pathways.
Dissimilarity between TTG1 and AN11 in the N-terminal region (Figure 4) might possibly result in the regulation of different components, with TTG1 regulating MYC and AN11 regulating MYB factors. However, this seems unlikely because functional assays using yeast and human homologs of AN11 (de Vetten et al., 1997) suggest that TTG1 could replace AN11 in petunia and that AN11 might be capable of replacing TTG1 function in Arabidopsis. The identification of two WD40 repeat proteins, which regulate anthocyanin biosynthesis and, in the case of the TTG1 protein, several other pathways, suggests that this class of protein may be involved in regulating biosynthetic pathways in other organisms.
From studies of other WD40 proteins, it seems probable that the TTG1 protein does not act directly as a transcription factor but binds to other proteins to promote the initiation of trichomes in leaves and stems. Figure 7B shows a model of the reverse side of TTG1, with a large acidic patch (shown in red) that could be involved in interactions with other proteins. The TTG1 protein might possibly act as part of a DNA binding complex to regulate transcription. However, the TTG1 sequence contains no recognizable nuclear localization signal, although, due to the difficulty in recognizing such signals, a cryptic site might be present. It is also possible that another protein is required to form a complex for nuclear import, as is the case with APETALA3 and PISTILLATA from Arabidopsis (McGonigle et al., 1996). Alternatively, the TTG1 protein may be located only in the cytoplasm, and it may act as part of a signal transduction pathway to downstream transcription factors. Preliminary investigations using β-glucuronidase–TTG1 fusion proteins (data not shown) suggest that TTG1 may be located in the cytoplasm. This is in agreement with the results of a study showing the location of AN11 (de Vetten et al., 1997). In this study, gel blot analysis with proteins of cell fractions showed that AN11 was primarily located in the cytoplasm. Currently, we are generating TTG1–green fluorescent protein fusions to study subcellular localization in vivo under different conditions, including stress, in which activation of the anthocyanin pathway can be controlled.
Cloning of the TTG1 Locus
The positional cloning of the TTG1 locus has provided information about the order of and distances between a number of RFLP markers that is complementary to the data given in the physical maps of the region (Schmidt et al., 1997; Thorlby et al., 1997). Within a 10-cM region, 439 recombinants were analyzed, and the line Dennis contained the only crossover close to the TTG1 locus. This suggests that recombination rates are reduced close to this gene. Recombination frequencies are known to vary along chromosomes in many species (reviewed in Lichten and Goldman, 1995).
A large number of mutant alleles at the TTG1 locus have been identified over the years. These alleles originally suggested that all of the phenotypes of the mutant were associated with one locus (Koornneef, 1981). We have shown that single base changes within the coding region of the TTG1 gene are correlated with mutant phenotypes, verifying Koornneef's postulate. The mutations mainly result from single base changes near the 3′ end of the coding region, indicating that the C-terminal end of the protein is important for proper functioning of the protein. The ttg1-1 mutation, which has been used most extensively in experiments pertaining to the TTG1 locus (e.g., Galway et al., 1994), results in C-terminal truncation of 25 amino acid residues. Plants carrying the homozygous ttg1-1 mutation have a severe phenotype, with very few trichomes on leaves and no anthocyanin in the testa or plant, indicating that this C-terminal region is vital for the structure and function of TTG1.
The ttg1-9 mutation is the result of a single base change resulting in the replacement of a serine residue at position 282 with phenylalanine. Serine is conserved at this position in AN11 and in the proteins encoded by atan11b and the yeast gene and is replaced by threonine in the proteins encoded by atan11a and the two nematode genes, suggesting that a hydroxyl group at this position is important for the structure or function of TTG1 and related proteins. The position of this amino acid residue is marked on the ribbon model (Figure 7A). This residue is positioned by the modeling on the surface of the outer edge of the protein and may be involved in interacting with other proteins as well as maintaining the tertiary structure of TTG1.
The high degree of conservation between TTG1, AN11, and other proteins from nematodes, yeast, and humans (de Vetten et al., 1997) and the array of effects of the ttg1 mutations together suggest that this family of WD40 repeat proteins plays important and varied roles in many organisms.
METHODS
Plant Strains and Growth Conditions
Arabidopsis thaliana ecotypes Landsberg erecta (Ler), Columbia, Rschew (RLD1), and Wassilewskija (Ws) were supplied by the Nottingham (UK) Arabidopsis Stock Centre (NASC) together with the ttg1-1 mutant. The Ler line carrying ms1 and ttg1-1 was obtained from C. Somerville (Carnegie Institute of Washington, Stanford, CA). The line containing ttg1-1, ga3, and ch5 in the Ler background was a gift from M. Koornneef (University of Wageningen, Wageningen, The Netherlands). Mutants ttg1-9 and ttg1-13 were obtained from J. Larkin (Louisiana State University, Baton Rouge, LA) and D. Oppenheimer (University of Alabama, Tuscaloosa, AL), respectively. Possible ttg1 alleles were selected by their description (glabrous and yellow seed) from lines in the Kranz collection held at the NASC.
For transformation, five to 10 plants were grown in a peat-based soil mixture of Levington (Ipswich, UK) M3 potting compost–vermiculite–perlite (3:2:1 [v]) in 4-inch pots in a greenhouse at 22°C with supplementary lighting. The soil was covered with muslin, and the seeds, which were mixed with sand, were sprinkled on top. Plantlets were thinned to ⩽10 per pot at 2 to 3 weeks and allowed to grow until flowering bolts were ~10 cm tall. Bolts were cut back once, and new bolts were allowed to emerge for several days before infiltration. For crosses and seed production, Arabidopsis was grown in the Arabicon system (Beta Tech, Ghent, Belgium) with 16 hr of light (100 μmol of photons m−2 sec−1) and 8 hr of dark in a growth chamber at 22°C. Plants for DNA extraction were grown in trays with 12 hr of light (100 μmol of photons m−2 sec−1) followed by 12 hr of dark in a growth chamber at 22°C to delay the onset of flowering and obtain more leaf material. All plants received a weekly feed of macronutrients in the form of Tomorite (Levington).
To obtain root material for RNA gel blot analysis, plants were grown for 21 days on 1.2% phytagel (Sigma) containing Murashige and Skoog salts (2.16 g L−1; Sigma) with the plates in a nearly vertical position in a growth room as above with 16 hr of light per day.
Generation of Recombinant Lines
Lines that were generated in a cross between Ler carrying ttg1 and ms1 and the Ws ecotypes and covering a 6-centimorgan (cM) region were selected and analyzed. Lines resulting from recombination events on the proximal side of ttg1-1 covering a 4-cM interval between ttg1-1 and GA3 were selected from a cross of Ler, carrying the mutations ttg1-1, ga3, and ch5, and the RLD1 ecotype.
Restriction Fragment Length Polymorphism Analysis
DNA was isolated from 4-week-old plant material as described by Dellaporta et al. (1983) followed by CsCl banding to remove RNA and polysaccharides (Walker et al., 1997). The purity of the DNA enabled the detection of small mobility differences of restriction fragments on DNA gel blots. Restriction enzyme digestion of DNA, gel electrophoresis, blotting, and hybridization techniques were as described in Walker et al. (1997).
A variety of probes detecting restriction fragment length polymorphisms (RFLPs) between these pairs of parental lines were used, including cosmid and bacteriophage λ constructs already mapped to this region by Nam et al. (1989) and Chang et al. (1988). End probes from yeast artificial chromosomes (YACs) (Ward and Jen, 1990) that had been mapped between ms1 and ga3 (Chapple et al., 1996; Schmidt et al., 1997; Thorlby et al., 1997) were generated by plasmid end rescue and inverse polymerase chain reaction (PCR) to map the location of ttg1. Probe AF3, a cDNA mapped to this region in other experiments (Chow et al., 1998), and CR12, a genomic fragment in a λ replacement vector (A.R. Walker, unpublished data), were also used.
Probes were labeled using random primer extension with α-32P-dATP, as discussed in Walker et al. (1997). Yeast chromosome preparations (prepared essentially as given in Carle and Olson, 1985) were electrophoresed in 1% low-melting-point agarose (Seaplaque; Flowgen, Lichfield, UK), using a Beckman (Palo Alto, CA) Geneline II pulsed-field electrophoresis system to facilitate recovery of whole YACs for subcloning or labeling for probes used in library screens. YAC end probes were generated by plasmid end rescue and inverse PCR (Bancroft et al., 1992).
Library Screening
A genomic library in λ DashII (Stratagene, La Jolla, CA) from ecotype Ler (Boyce et al., 1994) distributed by the EEC-BRIDGE Arabidopsis DNA Stock Centre (Cologne, Germany) was screened with YAC EG20H2 and cosmid g4556 by using standard techniques (Sambrook et al., 1989). The isolated λ clones were ordered by restriction pattern and hybridization to fragments of DNA from the region, and the genomic inserts of some were ligated into the NotI site of the pBINNOT vector for transformation of plants.
Construction of pBINNOT
The pBIN19 vector (Bevan, 1984) was modified to contain a NotI site in the polylinker. To remove an unwanted NotI site, pBIN19 was digested with NotI, treated with the Klenow fragment of DNA polymerase I (Boehringer Mannheim) and deoxynucleotide triphosphates to fill in the site, ligated in a large volume, and digested again with NotI. Escherichia coli XL1-Blue (Stratagene) was transformed with the ligation product using electroporation (Dower et al., 1988). Plasmid DNA was isolated from kanamycin-resistant colonies and digested with NotI to check that the original NotI site had been removed. The vector was digested with XbaI and Asp718I, and two annealed oligonucleotides (N1, 5′-GTACCGCGGCCGCAT-3′; and N2, 5′-CTAGATGCGGCCGCG-3′) containing a NotI site were ligated into the vector to reconstitute the XbaI and Asp718I sites in the polylinker. The ligated DNA was digested with BamHI to remove parental molecules, and pBINNOT was purified after transformation of E. coli XL1-Blue.
Sequence Analysis
Sequencing of TTG1 was performed using an ABI 377 sequencer (Perkin-Elmer, Warrington, UK) from subclones of the construct pB8 in pUC18 and pBluescript SK+ (Stratagene). The sequence was completed with two primers designed from a previously obtained sequence. Sequence analysis was conducted at the Human Genome Mapping Project Resource Centre (Hinxton, UK), using the software package of the Genetics Computer Group (Madison, WI); BLAST (Altschul et al., 1990) and FASTA (Pearson and Lipman, 1988) searches were conducted to find similar sequences.
PCR products from the region of the TTG1 gene were generated with primers designed to give overlapping fragments of ~700 bp by using genomic DNA from different ttg1 mutants and their parental ecotypes as the template and a mixture (10:1) of BioTaq (Bioline, London, UK) and Pfu (Stratagene) polymerases. The PCR products were separated by electrophoresis on a 1.2% agarose gel in 1 × TAE buffer (Sambrook et al., 1989). The relevant bands were excised, and the DNA was purified using Qiaex II gel kit (Qiagen, Chatsworth, CA) and then sequenced using the PCR primers.
RNA Analysis
Total RNA from different organs of 4-week-old soil-grown Columbia plants was extracted from 200 mg of tissue using Tripure (Boehringer Mannheim). Roots were gathered from plants grown on vertical agar plates, and RNA was extracted using the same technique. Floral buds of soil-grown plants were used for the mutant analysis. For RNA gel blots, 10 μg of total RNA was fractionated by electrophoresis on a formaldehyde–1.2% agarose gel and blotted onto Genescreen Plus (Du Pont), according to the manufacturer's instructions. TTG1 probes were prepared as described by Walker et al. (1997), and hybridization was performed at 42°C in 50% formamide for 16 hr. Blots were washed in 0.1 × SSC (1 × SSC is 0.15 M NaCl and 0.015 M sodium citrate) and 2% SDS at 65°C for 15 min. After exposure to film, blots were stripped by boiling in 0.1 × SSC and 2% SDS, and hybridized with a β-tubulin sequence (Z25960; 3′ untranslated region of the TUB5 gene) as a loading control.
Primer extension was conducted as given in Sambrook et al. (1989). An oligonucleotide primer designed to the complementary strand of the TTG1 sequence starting at base 54 shown in Figure 3 (5′-GTATGTGACGGCGGTTTCCGATCTGG-3′; 400 ng) was end labeled using 50 μCi γ-32P-ATP (3000 Ci mmol−1) (Amersham) and polynucleotide kinase (Boehringer Mannheim). The primer (32 ng) was annealed to 20 μg of total RNA extracted from Columbia flower buds and was extended using Moloney murine leukemia virus reverse transcriptase (Promega). RNA from ttg1-13 plants was used as a negative control.
Plant Transformation
Agrobacterium tumefaciens Agl1 (Lazo et al., 1991) was transformed with constructs in the pBINNOT vector by electroporation (Shen and Forde, 1989). Root transformation of ttg1-1 material with constructs pB1.1A, pB8, and pBX6 was conducted essentially as described by Valvekens et al. (1988). By using vacuum infiltration (Bechtold et al., 1993), genomic fragments were also introduced into ttg1-9 plants. Plants were covered for 2 days to maintain humidity, then grown for 4 weeks to produce as much seed as possible. Mutant plants containing ttg1-10 in a WS background did not transform efficiently. Selection of transformants from seed of plants subjected to vacuum infiltration was conducted on 0.8% agar containing Murashige and Skoog salts (2.16 g L−1) and kanamycin (35 μg mL−1) for 10 days. Kanamycin-resistant seedlings were moved onto fresh kanamycin plates for 10 days and then transferred to soil to set seed.
Three-Dimensional Modeling
A search in the Brookhaven protein data bank (Bernstein et al., 1977) for sequences similar to that of TTG1 identified the G-β subunit of G proteins as the most similar protein. Whereas the G-β subunit has typical WD repeats, the percentage of sequence identity between TTG1 and G-β subunits is low (~18%), and this resulted in ambiguity in some regions of the alignment. Hence, an alignment of all known WD repeat proteins, including TTG1, was made (data not shown), and these data were used to make the alignment shown in Figure 4. The structure of the G-β subunit (Wall et al., 1995; Lambright et al., 1996; Sondek et al., 1996) was used as the basis to construct a model of TTG1.
The suite of programs encoded in COMPOSER (Blundell et al., 1987; Srinivasan and Blundell, 1993) and incorporated into SYBYL (Tripos Inc., St. Louis, MO) was used to generate a three-dimensional model of TTG1 (Figure 7). The structures of most of the conserved regions of G-β proteins have been extrapolated to the equivalent regions of TTG1. The variable regions in the alignment, which often involved the insertion or deletion of stretches of amino acids, were modeled by identifying a suitable segment from a known structure in the data bank. A search was made for segments having the desired number of residues and the proper end-to-end distances across the three “anchor” C-α atoms at either side of the putative loop, so that the segment could be fitted to the contiguous conserved regions. Side chains were modeled either by extrapolating from the equivalent positions in the basic structure, where appropriate, or by using rules derived from the analysis of known protein structures (Sutcliffe et al., 1987). Energy minimization of the model helped to rectify the inconsistencies in the stereochemistry and relieved the short contacts between nonbonded atoms.
ACKNOWLEDGMENTS
We thank Maarten Koornneef, David Oppenheimer, Chris Somerville, and John Larkin for gifts of seeds; Zoe Wilson and Renate Schmidt for sharing results of chromosome walks; and George Coupland, Ian Furner, and Diane Hatton for helpful discussions. Technical assistance was provided by Martyn Seekings, Sue Sanderson, and Rebecca Rochford. We thank members of the Cambridge Centre for Molecular Recognition for oligonucleotide synthesis and DNA sequencing. Computing facilities were provided by the Human Genome Mapping Project (Cambridge, UK). Libraries were supplied by the EEC-BRIDGE Arabidopsis DNA Stock Centre, and some seed stocks came from the NASC; the EST was obtained from the Arabidopsis Biological Resource Center (Ohio State University, Columbus). This work was funded by research grants from the Agricultural and Food Research Council (Grant Nos. PMB00535 and PMB00658) and the Biotechnology and Biological Sciences Research Council (BBSRC) (Grant No. GER00618) to J.C.G., and a BBSRC Research Studentship to A.C.B.-W. T.L.B. and N.S. thank the Wellcome Trust for support. Some aspects of the work were initiated by J.J.E. and M.D.M. and sponsored by the National Science Foundation (Grant No. IBN 9506192 to M.D.M.).
Footnotes
-
↵2 Current address: Department of Molecular Biology and Biotechnology, University of Sheffield, Western Bank, Sheffield S10 2TN, UK.
-
↵3 Current address: Plant Breeding and Biotechnology, Horticulture Research International, East Malling, Kent ME19 6BJ, UK.
-
↵4 Current address: Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India.
-
↵5 Current address: Department of Botany, University of Wisconsin, Madison, WI 53706.
-
↵6 Current address: Department of Genetics and Cell Biology, University of Minnesota, St. Paul, MN 55108.
- Received January 20, 1999.
- Accepted April 9, 1999.
- Published July 1, 1999.