The Plant Cell, Vol. 14, 1033-1052,
May 2002, Copyright © 2002,
American Society of Plant Biologists
Structural Basis for Broad Substrate Specificity in Higher Plant -D-Glucan Glucohydrolases
Maria Hrmovaa,
Ross De Gorib,
Brian J. Smithc,
Jon K. Fairweatherd,
Hugues Driguezd,
Joseph N. Vargheseb and
Geoffrey B. Fincher1,a
a Department of Plant Science, University of Adelaide, Waite Campus, Glen Osmond, South Australia 5064, Australia
b Commonwealth Scientific and Industrial Research Organization, Division of Health Sciences and Nutrition, 343 Royal Parade, Parkville, Victoria 3052, Australia
c The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria 3050, Australia
d Centre de Recherches sur les Macromolécules Végétales, Centre National de la Recherche Scientifique (affiliated with Université Joseph Fourier), BP 53, 38041 Grenoble cedex 09, France
1 To whom correspondence should be addressed. E-mail geoff.fincher{at}adelaide.edu.au; fax 61-8-8303-7109
 |
Abstract
|
|---|
Family 3 -D-glucan glucohydrolases are distributed widely in higher plants. The enzymes catalyze the hydrolytic removal of -D-glucosyl residues from nonreducing termini of a range of -D-glucans and -D-oligoglucosides. Their broad specificity can be explained by x-ray crystallographic data obtained from a barley -D-glucan glucohydrolase in complex with nonhydrolyzable S-glycoside substrate analogs and by molecular modeling of enzyme/substrate complexes. The glucosyl residue that occupies binding subsite -1 is locked tightly into a fixed position through extensive hydrogen bonding with six amino acid residues near the bottom of an active site pocket. In contrast, the glucosyl residue at subsite +1 is located between two Trp residues at the entrance of the pocket, where it is constrained less tightly. The relative flexibility of binding at subsite +1, coupled with the projection of the remainder of bound substrate away from the enzyme's surface, means that the overall active site can accommodate a range of substrates with variable spatial dispositions of adjacent -D-glucosyl residues. The broad specificity for glycosidic linkage type enables the enzyme to perform diverse functions during plant development.
 |
INTRODUCTION
|
|---|
-D-Glucan glucohydrolases have been purified and characterized from barley seedlings, maize coleoptiles, soybean cultures, Acacia cells, nasturtium cells, and cultured tobacco cells (Cline and Albersheim, 1981 ; Nari et al., 1982 ; Lienart et al., 1986 ; Labrador and Nevins, 1989 ; Hrmova et al., 1996 ; Kotake et al., 1997 ; Crombie et al., 1998 ; Kim et al., 2000 ; Koizumi et al., 2000 ). They can hydrolyze glycosidic linkages in several -D-glucans, in -D-oligoglucosides containing (1 2)-, (1 3)-, (1 4)-, or (1 6)-linkages, in aryl -D-glucosides such as 4'-nitrophenyl -D-glucopyranoside (4NPGlc), and in some -D-oligoxyloglucosides (Crombie et al., 1998 ; Hrmova and Fincher, 1998 ; Kim et al., 2000 ). The barley -D-glucan glucohydrolases also hydrolyze cyanogenic -D-glucosides, albeit with low activity (M. Hrmova and G.B. Fincher, unpublished data). Single Glc molecules are released from the nonreducing termini of these substrates, with retention of the anomeric configuration (Hrmova et al., 1996 ). Their broad substrate specificity makes it difficult to assign these higher plant -D-glucan glucohydrolases to current Enzyme Commission classes; therefore, they have been described variously as -D-glucan glucohydrolases, (1 3)- -D-glucan exohydrolases, and -D-glucosidases. Nevertheless, they can be classified according to the structural criteria of Henrissat (1991) and fall into the family 3 group of glycoside hydrolases (http://afmb.cnrs-mrs.fr/CAZY/).
The distribution of the broad-specificity -D-glucan glucohydrolases in various tissues of monocotyledons and dicotyledons, together with the presence of expressed sequence tags in gymnosperm sequence databases (e.g., Pinus taeda), suggests that they may play a fundamental role in plant growth and development. They have been implicated in wall loosening during cell elongation, in wall remodeling, in defense reactions against fungal pathogens, in the release of Glc from wall polysaccharides as an energy source in dark-grown seedlings, and in the general recovery of Glc from different classes of polysaccharides and oligosaccharides (Hrmova and Fincher, 2001 ; Roulin et al., 2002 ).
The three-dimensional structure of the barley -D-glucan glucohydrolase isoenzyme ExoI has been solved by x-ray crystallography to a resolution of 2.2 Å (Hrmova et al., 1998a ; Varghese et al., 1999 ). The enzyme adopts a globular, two-domain conformation. The first domain of 357 amino acid residues has an ( / )8 triosephosphate isomerase-barrel fold and is joined by a 16amino acid, helix-like linker to the second domain, which consists of residues 374 to 559 arranged in an ( / )6 sandwich. A 13-Ådeep pocket at the interface of the two domains has been identified as the active site of the enzyme. The catalytic nucleophile of the barley enzyme is amino acid residue Asp-285, and the catalytic acid/base is Glu-491 (Varghese et al., 1999 ; Harvey et al., 2000 ; Hrmova et al., 2001 ). The crystal structure revealed that a Glc molecule remains tightly bound in the active site pocket and is probably the product of the enzyme-catalyzed reaction that is not released after hydrolysis (Varghese et al., 1999 ).
The availability of procedures to crystallize the barley enzyme (Hrmova et al., 1998a ), together with the previously solved three-dimensional structure (Varghese et al., 1999 ), allowed us to examine the structural basis for its broad specificity. In view of the structural similarities of the barley and other plant -D-glucan glucohydrolases (Harvey et al., 2000 ), the barley enzyme can be used as a model to explain the broad specificity of these enzymes more generally in higher plants. In particular, a structural rationale was sought for the ability of the barley enzyme to hydrolyze the (1 2)-, (1 3)-, (1 4)-, or (1 6)- -Dlinked disaccharides sophorose, laminaribiose, cellobiose, and gentiobiose (Hrmova and Fincher, 1998 ).
Here, -D-glucan glucohydrolase crystals were soaked with nonhydrolyzable S-glycoside substrate analogs of the preferred disaccharide substrate, laminaribiose, and the structure of the resulting complex was compared with that of the most slowly hydrolyzed substrate, cellobiose (Hrmova et al., 2001 ). The enzyme binds these analogs, but the S-glycosidic linkage is not hydrolyzed (Sulzenbacher et al., 1996 ; Driguez, 2001 ; Hrmova et al., 2001 ). As a result, the molecular interactions between amino acid residues on the enzyme's active site and the substrates can be defined precisely. S-Glycoside substrate analogs of sophorose and gentiobiose, the substrates that are hydrolyzed at intermediate rates, were not available for crystallography, but reliable molecular models of the corresponding enzyme/substrate complexes have been generated. In addition, a structural rationale was sought for the substrate specificity of the -D-xylosidaselike group of family 3 enzymes from higher plants.
Substrate binding by the barley -D-glucan glucohydrolase also was investigated by subsite-mapping techniques. The substrate binding regions of polysaccharide hydrolases are envisaged as a series of tandemly arranged subsites in which each subsite binds a single glucosyl residue of the polymeric or oligomeric substrate (Hiromi, 1970 ; Thoma et al., 1970 ). The catalytic efficiency (kcat·Km-1) values for oligoglucosides of increasing chain length have been used here to define the number of subsites and to calculate the binding affinities of individual -D-glucosyl binding subsites in the barley -D-glucan glucohydrolase (Hrmova et al., 1995 , 1998b ).
 |
RESULTS
|
|---|
Catalytic Efficiencies during Hydrolysis of -D-Oligoglucosides
The barley -D-glucan glucohydrolase is capable of catalyzing both hydrolytic and glycosyl transfer reactions (Figure 1)
, depending on substrate concentrations (Hrmova and Fincher, 1998 ; Kim et al., 2000 ). Therefore, care was exercised in steady state kinetic analyses of hydrolytic reactions to ensure that high substrate concentrations were avoided and that initial reaction rates were always measured. The kinetic parameters of hydrolysis of -D-glucopyranosyl-(1 2)-D-glucose (sophorose or G2OG), -D-glucopyranosyl-(1 3)-D-glucose (laminaribiose or G3OG), -D-glucopyranosyl-(1 4)-D-glucose (cellobiose or G4OG), and -D-glucopyranosyl-(1 6)-D-glucose (gentiobiose or G6OG) by the barley -D-glucan glucohydrolase are shown in Table 1. The catalytic efficiency was highest for G3OG, which was identified previously as the preferred disaccharide substrate (Hrmova and Fincher, 1998 ), and lowest for G4OG. These values reflect earlier estimates of the relative rates of hydrolysis of the enzyme against various -D-oligoglucosides (Hrmova and Fincher, 1998 ).

View larger version (12K):
[in this window]
[in a new window]
|
Figure 1. Kinetics of Hydrolytic or Glycosyl Transfer Reactions Catalyzed by a Plant Family 3 -D-Glucan Glucohydrolase.
After enzyme containing a noncovalently bound Glc product in the active site (E Glc) binds the first molecule of substrate (S), the Michaelis complex (E - S) is formed (k1), and the Glc product of the previous reaction is released from the active site. In the second step, the glycosidic bond is cleaved (k2), and the glycone part of the substrate becomes attached covalently to the enzyme to produce a metastable covalent glycosyl-enzyme intermediate (E . Glc). At the same time, the aglycone part of the substrate (A) is released. In the third step, the covalent glycosyl-enzyme intermediate is subjected (k3) to cleavage by a water molecule (H-OH), and a noncovalent E Glc product complex is formed, which is ready to interact (k1) with the second substrate molecule (S) to generate the next Michaelis complex (E - S), and again, the Glc molecule (Glc) is released from the active site. Alternatively, in the third step, the covalent glycosyl-enzyme intermediate (E . Glc) can be cleaved by an activated substrate molecule (R-OH), leading to a glycosyl transfer product (E Glc-OR), which remains noncovalently bound to the enzyme and is released (k4) when a second substrate molecule approaches the active site and forms the next Michaelis complex (E - S).
|
|
The Gibbs free energy of activation ( G ) values for G4OG and G3OG were used to calculate the difference in activation energies for the glycosylation steps (Namchuk and Withers, 1995 ), where ( G ) = -RT ln [(kcat·Km-1)G3OG/(kcat·Km-1)G4OG] (Fersht, 1999 ). This value is 9 kJ·mol-1 (Table 1), which represents a small but significant difference between the two positional isomers. The nonreducing -D-disaccharides , -trehalose, , -trehalose, and , -trehalose were not hydrolyzed, nor were xyloglucan or the xyloglucan oligosaccharides XXXG, with degree of polymerization 7 (DP 7), XXLG (DP 8), and XLLG (DP 9) (data not shown). Nomenclature and abbreviations for the xyloglucan oligosaccharides are as described by Fry (1995) .
Glycosyl Transfer Reaction in the Presence of 4NPGlc
When incubated with 4NPGlc at concentrations greater than 10 mM, the barley -D-glucan glucohydrolase (Hrmova and Fincher, 1998 ; Kim et al., 2000 ) and a putative family 3 soybean -D-glucosidase (Nari et al., 1982 ) form various oligoglucosides through glycosyl transfer reactions. Here, close to 25% of products released by the enzyme after 8 hr of incubation with 100 mM 4NPGlc were oligomeric (Table 2). The major product was 4NP- -gentiobioside, and the trisaccharide 4NP- -gentiotrioside also was detected. The structures of these oligoglucosides were confirmed (data not shown) by electrospray ionization mass spectrometry and 13C-NMR spectroscopy (Hrmova et al., 1998b ).
View this table:
[in this window]
[in a new window]
|
Table 2. Properties of Glycosyl Transfer Products Synthesized by Barley -D-Glucan Glucohydrolase Isoenzyme ExoI with 100 mM 4NPGlc
|
|
Subsite Mapping
Subsite maps were determined by kinetic analysis using (1 3)- -D-oligoglucosides and (1 4)- -D-oligoglucosides (Table 3, Figure 2)
, based on the observation that the enzyme hydrolyzes G3OG at the highest rate and G4OG at the lowest rate (Table 1) (Hrmova and Fincher, 2001 ). In both cases, subsite binding affinities, or "transition-state interaction energies" (Malet and Planas, 1997 ), were highest at subsite +1, and the affinity for the penultimate glucosyl residue in (1 3)- -D-oligoglucosides was higher than that for the same glucosyl residue in (1 4)- -D-oligoglucosides. Binding was detected at subsite +2, particularly for the (1 4)- -D-oligoglucosides, but binding affinities beyond subsite +2 were close to zero (Figure 2). It can be concluded from these data that there are three subsites in the active site of the barley -D-glucan glucohydrolase.
Synthesis of 4-NP S-( -D-Glucopyranosyl)-(1 3)- (3-Thio- -D-Glucopyranosyl)-(1 3)- -D-Glucopyranoside
The molecular and structural analyses of substrate specificity of the barley -D-glucan glucohydrolase required oligomeric substrates that would be bound in the active site of the enzyme but that would resist hydrolysis and thus allow the collection of x-ray diffraction data of the enzyme/substrate com-plexes. A nonhydrolyzable (1 4)- linked substrate analog 4I,4III,4V-S-trithiocellohexaose (G4SG4OG4SG4OG4SG) was synthesized previously (Driguez, 2001 ) and was used to examine the S-cellobioside/enzyme complex (Hrmova et al., 2001 ). Here, the (1 3)- linked substrate analog 4-NP S-( -D-glucopyranosyl)-(1 3)-(3-thio- -D-glucopyranosyl)-(1 3)- -D- glucopyranoside (4NP-G3SG3OG; Figure 3A)
was synthesized using "glycosynthase" methodology (Mackenzie et al., 1998 ; Malet and Planas, 1998 ; Fort et al., 2000 ). A mutant barley (1 3)- -D-glucan endohydrolase isoenzyme GII (E231G), in which the catalytic nucleophile has been altered to a Gly residue, acts as a highly efficient glycosynthase for the generation of (1 3)- -D-glucan polymers (Hrmova et al., 2000 ).
Using this mutant E231G (1 3)- -D-glucan glycosynthase, thiolaminaribiosyl fluoride was condensed with 4NPGlc to generate the trisaccharide 4NP-G3SG3OG (Figure 3A) in 50% yield. The condensation reaction with thiolaminaribiosyl fluoride was less efficient than for other syntheses using glycosynthases, for which yields of 80 to 100% have been reported (Mackenzie et al., 1998 ; Fort et al., 2000 ). Nevertheless, NMR analysis of the product clearly indicated that the newly formed linkage was in the -anomeric configuration (1H 4.45, J 7.5 Hz) and joined to C3 of the D-glucopyranosyl unit bearing the aromatic 4NP aglycone (13C 86.3) (Bock et al., 1984 ). Further proof of its identity was obtained by electrospray ionization mass spectrometry and by x-ray crystallography of -D-glucan glucohydrolase crystals soaked with the synthesized 4NP-G3SG3OG.
In using these S-glycosides for structural studies, it is acknowledged that the geometries of S- and O-glycosidic linkages are not identical and that there are differences in the C1-S(O)-C3'(4') bond angles and the C1-S(O) and C3'(C4')-S(O) bond lengths (Perez and Vergelati, 1984 ). Nevertheless, it is unlikely that the glycosidic O atom of the native substrate would be placed more than 0.35 Å away from the position of the S atom in the nonhydrolyzable substrate analogs (Driguez, 2001 ).
Inhibition Kinetics
The compound 4NP-G3SG3OG inhibited the barley -D-glucan glucohydrolase isoenzyme ExoI competitively, with a Ki value of 243.2 µM (Figure 3B). This value is 2.5 times lower than the inhibition found with G4SG4OG4SG4OG4SG (Ki = 614.6 µM) (Hrmova et al., 2001 ) and parallels the findings on hydrolytic preferences (Table 1). The Ki values of thiooligosaccharide inhibitors for -D-glycosidase hydrolases vary within the micromolar (Sulzenbacher et al., 1996 ; Czjzek et al., 2001 ; Fort et al., 2001 ; Hrmova et al., 2001 ) (Figure 3B) and millimolar (Moreau et al., 1996 ; Reverbel-Leroy et al., 1998 ) ranges.
The Ki values of both inhibitors were used to calculate the difference in contributions to binding free energies ( G ) of the two inhibitors according to ( G ) = -RT ln [(Ki)G4SG4OG4SG4OG4SG/(Ki)4NP-G3SG3OG] (Fersht, 1999 ). The value obtained was 2.4 kJ·mol-1.
Crystal Structure of the S-Laminaribioside/ -D-Glucan Glucohydrolase Complex
The S-glucosyl substrate analog 4NP-G3SG3OG was diffused into crystals of the barley -D-glucan glucohydrolase. The three-dimensional structure of the enzyme/S-laminaribioside complex was solved subsequently to 2.40 Å resolution with an R factor of 20.08%, using the rigid body refinement technique (Table 4). The coordinates have been deposited in the Protein Data Bank (http://www.rcsb.org/pdb/; Berman et al., 2000 ).
View this table:
[in this window]
[in a new window]
|
Table 4. Data Collection and Refinement Statistics of the Three-Dimensional Structure of Barley -D-Glucan Glucohydrolase with Bound S-Laminaribioside Moiety
|
|
The Glc molecule that remains bound in the active site pocket of the enzyme after hydrolysis (Varghese et al., 1999 ) was displaced by the substrate analog, and the two nonreducing terminal (1 3)- -glucosyl residues were defined clearly in the difference Fourier electron density map. These two residues occupy binding subsites -1 and +1. The remainder of the substrate analog molecule was disordered and therefore not visible in the electron density map (Figures 4C and 5B)
because it probably projects from the surface of the enzyme without extensive molecular binding to the enzyme. For this reason, the structure is referred to as an S-laminaribioside/enzyme complex. The data showed that the enzyme was fully occupied with the S-laminaribioside moiety.

View larger version (100K):
[in this window]
[in a new window]
|
Figure 4. Stereo Representation of the Active Site of Barley -D-Glucan Glucohydrolase with Bound S-Cellobioside Moiety (A), and the Superposed S-Cellobioside Moiety with G2OG (B), S-Laminaribioside Moiety (C), and G6OG (D).
A MOLSCRIPT (Kraulis, 1991 ) diagram of the S-cellobioside moiety is shown in cyan, and the superposed G2OG, S-laminaribioside moiety, and G6OG are shown in yellow. Fluorescent green indicates glycosidic O or S atoms in the superposed G2OG, S-laminaribioside moiety, and G6OG. Transparent green and magenta represent the molecular surfaces (Nicolls et al., 1991 ) of domains 1 and 2, respectively. Black, red, blue, and yellow spheres represent C, O, N, and S atoms, respectively. The structures were superposed over the C atoms of Asp-95, Phe-144, Arg-158, Lys-206, His-207, Glu-220, Tyr-253, Asp-285, Trp-286, Glu-287, Arg-291, Met-316, Trp-434, and Glu-491, with root mean square deviations in the C chain of 0.995 Å for G2OG and G4SG, 0.158 Å for G3SG and G4SG, and 1.062 Å for G6OG and G4SG. Subsites -1 and +1 are indicated. To improve the clarity of the diagrams, only amino acid residues Arg-158, Asp-285, Trp-286, Trp-434, and Glu-491 are shown. The entrance to the active site is located toward the bottom right corner. This figure is best viewed using a three-dimensional stereo viewer.
|
|

View larger version (41K):
[in this window]
[in a new window]
|
Figure 5. Bonding Interactions of Barley -D-Glucan Glucohydrolase with G2OG (A), S-Laminaribioside Moiety (B), S-Cellobioside Moiety (C), and G6OG (D).
Ligands are shown in the 4C1 conformation with atomic numbering of the C atoms. Dashed lines indicate hydrogen bonding interactions between the ligands and amino acid residues. All distances are expressed in angstroms and are drawn to scale where possible.
|
|
The glucopyranosyl residue at subsite -1 adopts the low-energy 4C1 conformation, without any apparent ring distortion, and is held in position by extensive hydrogen bonding interactions with six amino acid residues on the enzyme surface (Figures 4C and 5B). The C1-S-C3' bond angle is 102.45°, and the C1-S and S-C3' bond lengths are both 1.81 Å. The heteroglycosidic S atom is located 2.64 Å from the O 1 of the catalytic acid/base Glu-491 (Figure 5B), again indicating that Glu-491 is the catalytic acid/base. The glucopyranosyl residue at subsite +1 is held by hydrophobic -stacking interactions with Trp-286 and Trp-434 and by hydrogen bonds between C2'OH and Tyr-253 and Arg-291 (Figures 4C and 5B). More specifically, the glucopyranosyl residue at subsite +1 is clamped between the two Trp residues that form the entrance of the substrate binding pocket (Varghese et al., 1999 ; Hrmova et al., 2001 ). Furthermore, the C1-S-C3'-C4' dihedral angle is 128.05°, compared with the dihedral angle of crystalline laminaribiose of 77.71° (Noguchi et al., 1992 ). Thus, the glucopyranosyl ring of the bound substrate analog at subsite +1 is rotated and translated substantially; the difference between the torsion angles is 50°.
Comparison of the S-Laminaribioside/ and S-Cellobioside/ -D-Glucan Glucohydrolase Complexes
The three-dimensional structure of the S-cellobioside/enzyme complex was solved previously (Hrmova et al., 2001 ) using the nonhydrolyzable substrate analog G4SG4OG4SG-4OG4SG (Driguez, 2001 ). As in the S-cellobioside/enzyme complex, only the two glucosyl residues at the nonreducing end of the analogs were visible in the electron density maps (Figure 4A). No ring distortion could be detected (Hrmova et al., 2001 ). It was then possible to compare directly the three-dimensional conformations of the S-laminaribioside/enzyme and S-cellobioside/enzyme complexes to provide a structural rationale for the broad specificity of the enzyme and for the differences in hydrolytic efficiencies (Table 1).
For both S-substrate analogs, the glucopyranosyl residues that are bound at subsite -1 are in almost identical positions (Figures 4C and 5B; cf. Figure 5C). The only differences are small changes in hydrogen bonding distances between the OH groups of the bound substrate analogs and two (Asp-95 and Lys-206) of the six amino acid residues involved in binding at subsite -1 (Figures 5B and 5C). These distances are shorter with the S-laminaribioside moiety than with the S-cellobioside moiety. In contrast, the glucopyranosyl residues of the S-laminaribioside and S-cellobioside moieties that occupy subsite +1 are located between the Trp-286 and Trp-434 residues, but in significantly different positions.
In the case of the S-laminaribioside moiety, the more hydrophobic -face of the glucopyranosyl residue (apolar face) at subsite +1 is geometrically complementary with the pyrrole ring of Trp-286, whereas the more hydrophilic -face of the glucopyranosyl residue (polar face) sits over the phenyl ring of Trp-434. In the S-cellobioside moiety, the hydrophilic -face and the hydrophobic -face of the glucopyranosyl residue at subsite +1 are in contact with the pyrrole ring of Trp-286 and the phenyl ring of Trp-434, respectively (Figure 6)
. The -face versus -face designation of the glucopyranosyl ring in (1 4)- -Dlinked glycoside polymers is based on a clockwise versus an anti-clockwise numbering of the carbons, respectively. Thus, the -face of a glucopyranosyl ring is slightly more hydrophobic than the -face (Johnston et al., 1988 ). Additionally, differences of 0.5 Å are observed in hydrogen bond lengths between NH1 of Arg-291 and C2'OH of the S-laminaribioside moiety and between NH1 of Arg-291 and C6'OH of the S-cellobioside moiety.

View larger version (32K):
[in this window]
[in a new window]
|
Figure 6. Positions of G2OG (A), S-Laminaribioside Moiety (B), S-Cellobioside Moiety (C), and G6OG (D) in the Active Site of the Barley -D-Glucan Glucohydrolase with Respect to the Two Trp Amino Acid Residues That Constitute Binding Subsite +1.
Bound carbohydrates and carbohydrate moieties are shown in atom colors, and the stacked Trp-286 (magenta) and Trp-434 (cyan) amino acid residues at subsite +1 are rotated to the positions where their pyrrole (Trp-286) and phenyl (Trp-434) portions overlap. Substrate binding subsites -1 and +1 are marked.
|
|
Another difference between the bound substrate analogs is that the interresidue hydrogen bond between C6OH and C3'OH is much shorter (2.62 Å) in the S-cellobioside/enzyme complex than between C6OH and C4'OH (3.00 Å) of the S-laminaribioside/enzyme complex (Figures 5B and 5C). These values can be compared with a C6OH-C3'OH distance of 3.12 Å in crystalline cellobiose (Chu and Jeffrey, 1968 ) and with a C6OH-C4'OH distance of 3.30 Å in crystalline laminaribiose (Noguchi et al., 1992 ). The crystallographic data indicate that the flexibility of the bound S-laminaribioside and S-cellobioside moieties could be relatively high. Furthermore, the difference in torsion angles C1-O(S)-C4'-C3' of the two glucopyranosyl rings in the bound S-cellobioside moiety and crystalline cellobiose is only 7°, compared with a difference of 50° between the torsion angles C1-O(S)-C3'-C4' in the S-laminaribioside moiety and crystalline laminaribiose.
The overall differences in conformations of the glucopyranosyl rings in the bound S-laminaribioside and S-cellobioside moieties are illustrated further by superposing the two difference electron density maps (Figure 7)
. Again, the close coincidence of binding of the two substrate analogs at subsite -1 is evident. On the other hand, the noncoincidence of electron densities at subsite +1 indicates small, but significant, differences between the two rings at this subsite (Figure 7).
Molecular Modeling of G2OG and G6OG in the Active Site
The possible binding conformations of these oligoglucosides were determined by molecular modeling, based on the crystal structures of sophorose (G2OG) (Ikegami et al., 1995 ) and gentiobiose (G6OG) (Rohrer et al., 1980 ), and the structure of the barley -D-glucan glucohydrolase (Hrmova et al., 2001 ). Clearly, there are interpretative limitations associated with molecular modeling based on crystal structures of the disaccharide substrates, and these were exemplified by the large difference between torsion angles observed by x-ray crystallography in the bound S-laminaribioside substrate analog and those in crystalline laminaribiose, as noted above.
Within these interpretative limitations, the modeled structures with G2OG and G6OG again indicated that the glucopyranosyl residues at subsite -1 were held in place by hydrogen bonding interactions with the six key amino acid residues and were located in almost exactly the same positions as the corresponding residues in the S-laminaribioside/enzyme and S-cellobioside/enzyme complexes (Figures 4B, 4D, 5A, and 5D). Differences of 1 Å were observed in the lengths of the hydrogen bonds from O 1 of Asp-95 to C4OH and O 2 of Asp-285 to C2OH for G2OG. Similar differences were observed in the lengths of hydrogen bonds from NH1 and NH2 of Arg-291 to C3'OH of G2OG and to C4'OH of G6OG. At the +1 subsite, again, the glucopyranosyl rings were located between the two hydrophobic Trp-286 and Trp-434 residues, but in slightly different positions (Figure 6).
For G2OG, the more hydrophilic -face of the glucopyranosyl ring at subsite +1 was superposed with the pyrrole region of Trp-286, and the more hydrophobic -face was superposed with the phenyl region of Trp-434 (Figure 6A). During the molecular modeling process, the extra rotatable bond between the two glucopyranosyl rings in the G6OG substrate resulted in the outward displacement of the subsite +1 glucopyranosyl ring from between the two Trp residues (data not shown). However, preliminary crystallographic evidence now indicates that the glucopyranosyl ring of G6OG that occupies subsite +1 is aligned between the pyrrole ring of Trp-286 and the pyrrole/phenyl region of Trp-434 (M. Hrmova, R. De Gori, J.N. Varghese, and G.B. Fincher, unpublished data), presumably because of the flexibility of the interresidue linkage in G6OG. Therefore, the glucosyl residue of G6OG at subsite +1 is aligned between the two Trp residues in Figures 4D and 6D.
Molecular Dynamics of the Binding of -D-Glucopyranosyl-(1 3)- -D-Glucopyranosyl- (1 3)-D-Glucose in the Active Site
To investigate further the possible existence of subsite +2 in the active site of the barley -D-glucan glucohydrolase (Figure 2), molecular dynamics calculations were performed. The molecular model of the -D-glucopyranosyl-(1 3)- -D-glucopyranosyl-(1 3)-D-glucose (G3OG3OG)/enzyme complex was calculated in a protein environment that included amino acid residues within a 10-Å radius of the bound glucosyl residue in subsite -1 of the S-laminaribioside/enzyme complex. The model of the G3OG3OG substrate was built manually, and predicted interactions with the active site were calculated. The average structure of the trisaccharide from the dynamics simulation is shown in Figure 8
. The atoms are colored according to the variance in their mean positions. Atoms of the G3OG3OG substrate in subsites -1 and +1 showed low variation in their mean positions (blue), whereas atoms of the third residue at the reducing end of G3OG3OG showed significantly greater variation (red). A small difference (0.9 Å) in the distance between the O5 ring O atom of the glucopyranose in subsite +1 and the O4 hydroxyl atom of the reducing end glucopyranose ring of the G3OG3OG substrate was observed during the molecular dynamics simulation (Figure 8). This difference indicates that a hydrogen bond is formed between these two atoms, and as a result, a more constrained conformation of the trisaccharide G3OG3OG exists within the protein environment.

View larger version (29K):
[in this window]
[in a new window]
|
Figure 8. Average Structures of the Nonhydrogen Atoms of G3OG3OG Substrate Evaluated by Molecular Dynamics Calculations.
Atoms of the putative subsite +2 show larger variation in their mean positions than those in subsites -1 and +1. Color scale: blue, <0.1 Å2; red, >0.5 Å2. The graph illustrates the variation in the distance between the O5 ring O atom of the glucopyranose in subsite +1 and the O4 hydroxyl atom of the glucopyranose in the putative subsite +2 (connected by a black dotted line) during the molecular dynamics simulation, either in a protein environment (blue) or a nonprotein environment (red). The larger variation in the O5 to O4 hydroxyl atom distance in the nonprotein environment is apparent.
|
|
On the other hand, calculations under similar conditions with the trisaccharide G3OG3OG in a water box (i.e., in a nonprotein environment) showed a much larger variation in the distance between the O5 ring O atom of the glucopyranose in subsite +1 and the O4 hydroxyl atom of the reducing end glucopyranose ring of the G3OG3OG substrate. The distance in this case varied between 2.6 and 4.3 Å (Figure 8), and the difference of 1.7 Å indicates that the hydrogen bond in the nonprotein environment would be formed only transiently.
Thus, the molecular dynamics calculations suggest that it is the protein environment that confers stability on the hydrogen bond formed between the second and third glucopyranosyl rings of the trisaccharide G3OG3OG. Furthermore, the calculations provide no evidence for direct or indirect interactions between the enzyme and the reducing end glucopyranosyl residue of the G3OG3OG substrate.
 |
DISCUSSION
|
|---|
Comparative x-ray crystallographic analyses of the barley -D-glucan glucohydrolase isoenzyme ExoI in complex with nonhydrolyzable S-glucoside substrate analogs, coupled with molecular modeling of other disaccharide/enzyme complexes, have provided a structural rationale for the broad specificity of this group of higher plant enzymes. The enzymes hydrolyze barley -D-glucans or -D-glucosides with (1 2)-, (1 3)-, (1 4)-, or (1 6)-linkages (Table 1) (Cline and Albersheim, 1981 ; Hrmova et al., 1996 ; Kotake et al., 1997 ; Hrmova and Fincher, 1998 ; Kim et al., 2000 ), despite the different relative orientations of adjacent glucopyranosyl rings imposed on the substrates by the various linkage positions. The active site pocket of the enzyme is 13 Å deep, which is enough to accommodate approximately two -D-glucosyl residues, and is 15 Å wide at its entrance (Varghese et al., 1999 ). The overall kinetic schemes for both the hydrolytic and glycosyl transfer reactions catalyzed by the enzyme, together with the three-dimensional structures of key intermediates in the hydrolytic reaction pathway, have been described in detail (Figure 1) (Hrmova et al., 2001 ). Other glycoside hydrolases from microbial and plant sources exhibit a breadth of specificity against substrates with different glycosidic linkage positions (Frandsen and Svensson, 1998 ; Hashimoto et al., 1998 ; Stubbs et al., 1999 ), but no structural data are available to explain the molecular basis for this broad specificity.
The key feature of the barley -D-glucan glucohydrolase that allows it to hydrolyze the range of (1 2)-, (1 3)-, (1 4)-, or (1 6)-linked -D-glucosides is the presence of a relatively broad hydrophobic clamp, constituted by Trp-286 and Trp-434, which are placed 8 Å apart at the entrance of the active site pocket and which bind the -D-glucosyl residue that occupies substrate binding subsite +1. Trp residues play key roles in many other types of carbohydrateprotein interactions (Quiocho, 1986 ; Vyas et al., 1988 ; Divne et al., 1998 ). The x-ray data presented here show that the nonreducing terminal -D-glucosyl residue of the substrate is locked firmly in position at subsite -1 by extensive cooperative and bidentate hydrogen bonding interactions (Quiocho, 1986 ), with six amino acid residues at the bottom of the active site pocket. Therefore, all hydroxyl groups of the accessible hydrophilic surface area (Vyas et al., 1988 ) of the glucosyl residue at subsite -1 are bound to the enzyme (Figures 4 and 5). As a result, the glycosidic O atom would be held in a tightly fixed position with respect to the catalytic amino acid residues Asp-285 and Glu-491.
However, to explain the broad specificity of the enzyme for -D-glucoside substrates, the enzyme would need to accommodate, in subsite +1, the penultimate nonreducing -D-glucosyl residue of the various (1 2)-, (1 3)-, (1 4)-, or (1 6)-linked substrates with a significant degree of positional flexibility. The x-ray data show that this positional flexibility at subsite +1 is achieved by hydrophobic -stacking interactions with Trp-286 and Trp-434, which sit above and below the -D-glucosyl residue at subsite +1. Moreover, relatively few hydrogen bonds are formed between the enzyme and the -D-glucosyl residue at subsite +1. Thus, the -D-glucosyl residue in subsite +1 is not fixed as firmly in position as the -D-glucosyl residue in subsite -1. The crystallographic analyses of the two S-glycoside/enzyme complexes show that the -D-glucosyl residue in subsite +1 can be rotated and translated partially yet still remain located between the indole moieties of the two Trp residues (Figure 6). The flexibility in binding positions at subsite +1 presumably is allowed because the glucopyranosyl ring is 3 Å wide, whereas the indole moiety of the Trp residues is 5 Å wide. The Trp-286 and Trp-434 residues at the entrance of the substrate binding pocket of the barley -D-glucan glucohydrolase also might play a role in the binding of acceptor molecules during glucosyl transfer reactions (R-OH; Figure 1).
The crystallographic studies presented here and elsewhere (Figures 4, 6, and 7) (Hrmova et al., 2001 ) show that glucosyl residues at subsites -1 and +1 adopt 4C1 conformations. This is in contrast to several well-characterized exo- and endo-acting -D-glycoside hydrolases in which the subsite -1 glycosyl residues are distorted significantly (Sulzenbacher et al., 1996 ; Tews et al., 1996 , 1997 ; Davies et al., 1998 ; Zou et al., 1999 ; Fort et al., 2001 ; Papanikolau et al., 2001 ).
The flexibility in substrate positioning allowed by the two relatively wide Trp residues at subsite +1, and hence the broad specificity of these family 3 enzymes for -D-glucoside substrates, can be contrasted to the situation in a family 5 (1 3)- -D-glucan glucohydrolase from Candida albicans. The latter enzyme also has an active site pocket that accommodates two -D-glucosyl residues, but in this case, the penultimate -D-glucosyl residue at subsite +1 is sandwiched between two Phe residues (Phe-144 and Phe-258) at the entrance to the pocket (Cutfield et al., 1999 ). One might predict, based on the central role of the Trp-286/Trp-434 clamp in allowing binding of substrates with different relative orientations of adjacent glucopyranosyl residues in the barley -D-glucan glucohydrolase, that the relatively narrow Phe-144/Phe-258 clamp of the Candida enzyme would tighten its substrate specificity significantly. Indeed, this is the case. The relative catalytic efficiencies for the Candida (1 3)- -D-glucan glucohydrolase during hydrolysis of G3OG, G4OG, and G6OG are 100, 0.06, and 0.14%, respectively (Stubbs et al., 1999 ), whereas the corresponding values for the barley enzyme are 100, 3.1, and 19%, respectively (Table 1). The catalytic efficiency value for G2OG for the Candida enzyme was not reported, but for the barley enzyme, it is 25% of the efficiency for G3OG (Table 1). Conversely, changing the Trp residues to amino acid residues with much smaller side chains might broaden the substrate specificity and alter catalytic efficiencies. Site-directed mutagenesis of the barley Trp-286 and Trp-434 residues now can be used to test further the effect of these residues on substrate specificity.
Although the structural basis for the broad specificity of the barley -D-glucan glucohydrolase has become evident, it is not so easy to account for small differences in relative hydrolytic rates and catalytic efficiencies during hydrolysis of the dimeric -D-oligoglucosides (Table 1). The preference for G3OG, compared with G4OG, is reflected in the generally higher subsite binding affinities for (1 3)- linked substrates (Figure 2), but the differences are not large. Similarly, the differences in G values are small, but they also parallel the differences between subsite binding energy values obtained during subsite mapping with the (1 3)- -D-oligoglucosides and the (1 4)- -D-oligoglucosides (Figure 2, Table 1). Additional techniques, such as time-resolved crystallography, together with site-directed mutagenesis, the use of transition-state analogs, and pre-steady state kinetic analysis, will be necessary to define the rate-limiting step during catalysis. In many cases for family 3 enzymes, the rate-limiting step is the formation or hydrolysis of the glycosyl enzyme intermediate (Legler et al., 1980 ; Li et al., 2001 ).
Additional data also will be necessary to explain the apparent discrepancy between the number of -D-glucosyl binding subsites indicated by the subsite mapping analyses, which suggested three -D-glucosyl binding subsites on the enzyme (Figure 2), and the x-ray crystallography and molecular dynamics predictions that there are only two subsites (Figures 4, 5, and 8). It is possible that there is weak affinity for a portion of or the entire third residue of the bound substrate beyond the +1 subsite (Stone and Svensson, 2002 ) and that this is responsible for the subsite mapping result. Bound substrates also might adopt unexpected curved conformations (Parsiegla et al., 2000 ), which would allow them to interact with other parts of the enzyme outside of the active site pocket. However, the molecular dynamics calculations provide no evidence for direct or indirect interactions between the enzyme and the third glucopyranosyl residue of the G3OG3OG; rather, they suggest that the binding energy at the +2 site observed in subsite mapping is attributable to the stabilization of intermolecular hydrogen bonding in the ligand (Figure 8). The kinetics of substrate binding now can be investigated further by isothermal titration microcalorimetry (Creagh et al., 1996 ) or differential scanning microcalorimetry (Creagh et al., 1998 ).
The x-ray crystallography data presented here to describe in detail the molecular interactions at the -1 and +1 subsites of the barley -D-glucan glucohydrolase can be applied more generally to examine the structural basis for substrate specificity in other family 3 glycoside hydrolases from higher plants. There are >180 known members of this family, most of which are classified as -D-glucosidases, -D-xylosidases, or N-acetyl -D-glucosaminidases and are of microbial origin (Henrissat, 1991 ; Harvey et al., 2000 ; http://afmb.cnrs-mrs.fr/CAZY/). However, when the sequences derived exclusively from higher plant enzymes are used to generate a phylogenetic tree, two quite distinct groups of enzymes become evident (Figure 9A)
. One group contains the -D-glucan glucohydrolases or -D-glucan glucohydrolaselike enzymes from plants, for which sequence identities lie in the range of 60 to 80%. The second group of higher plant enzymes from family 3 contains -D-xylosidaselike enzymes, with sequence identities of 60% (data not shown).
Although none of the higher plant -D-xylosidaselike group in the databases have been characterized, they are assigned this specificity on the basis of their similarity to well-characterized microbial enzymes. In addition, we recently isolated cDNAs corresponding to purified family 3 -D-xylosidases from barley seedlings, and these barley enzymes share a high level of sequence similarity to the other members of the higher plant -D-xylosidaselike group (R.C. Lee, M. Hrmova, R.A. Burton, and G.B. Fincher, unpublished data). The clear separation of the two groups of higher plant family 3 enzymes seen in Figure 9A is reflected in sequence identities of <27% between members of the two groups (data not shown). No plant N-acetyl -D-glucosaminidase sequences from family 3 have been reported (http://afmb.cnrs-mrs.fr/CAZY/).
In the case of the family 3 enzymes from barley, the -D-glucan glucohydrolases do not hydrolyze -D-xylosides (Hrmova and Fincher, 1998 ), and the -D-xylosidases do not hydrolyze -D-glucosides (R.C. Lee, M. Hrmova, R.A. Burton, and G.B. Fincher, unpublished data). The three-dimensional data presented here to explain the broad specificity of the barley -D-glucan glucohydrolase also might explain the apparent differences in substrate specificity between the two groups (Figure 9B). The catalytic amino acid residues, corresponding to the barley Asp-285 and Glu-491, are conserved absolutely across the two groups. Similarly, amino acid residues involved in hydrogen bonding to the C2OH, C3OH, and C4OH groups (Lys-206, His-207, Arg-158, and Tyr-253) are conserved completely in both the -D-glucan glucohydrolaselike and -D-xylosidaselike groups (Figure 9B). However, amino acid residues corresponding to Asp-95, Arg-291, and Met-316 are replaced by other amino acid residues in the -D-xylosidaselike group of family 3 enzymes (Figure 9B).
It is noteworthy that Asp-95 and Met-316 interact with the CH2OH group and C5 of the glucosyl residue bound at subsite -1, and Arg-291 forms a hydrogen bond with C6'OH of the glucosyl residue bound at subsite +1, at least in the case of bound G4SG. The pentose -D-xylopyranose has no C5CH2OH group, and amino acids necessary for binding this hydroxyl group in the -D-glucan glucohydrolaselike group could be replaced in the -D-xylosidaselike group of family 3 enzymes. Before too many conclusions are drawn in relation to the molecular and structural basis for the differences in substrate specificity between the -D-glucan glucohydrolaselike group and the -D-xylosidaselike group of family 3 enzymes, it should be emphasized that members of the plant -D-xylosidaselike group often have associated -L-arabinofuranosidase activity (R.C. Lee, M. Hrmova, R.A. Burton, and G.B. Fincher, unpublished data). Therefore, the different sizes and conformations of the pyranosyl and furanosyl rings, together with the various orientations of projecting hydroxyl groups, need to be taken into account. In some enzymes, substrate binding amino acid residues simply adjust sterically to accommodate the binding of different groups of glycosides, as shown for a (1 4)- -glycanase (cellulase/xylanase) from Cellulomonas fimi (White et al., 1996 ; Notenboom et al., 1998 ).
Another major difference between the higher plant -D-glucan glucohydrolaselike and -D-xylosidaselike groups of family 3 enzymes is that the -D-xylosidaselike enzymes do not have the Trp-286 or Trp-434 residues that constitute the conserved hydrophobic clamp at subsite +1 in all of the -D-glucan glucohydrolaselike enzymes (Figures 4 to 6). In the -D-xylosidaselike group, a Cys residue replaces Trp-286 and Pro, Ala, or Met residues replace Trp-434 (Figure 9B). Whether these substitutions dramatically affect the specificity of binding at the +1 subsite of the -D-xylosidaselike group remains to be demonstrated.
We conclude from the data presented here that the family 3 -D-glucan glucohydrolases from higher plants can hydrolyze a range of -D-glucans and -D-oligoglucosides because of their ability to bind the glucosyl residues at subsite +1 in a relatively flexible manner. The -D-xylosidaselike group of family 3 enzymes in higher plants does not have the amino acid residues necessary for hydrogen bonding to the C5 CH2OH substituent of bound substrates and also might have evolved a different mechanism for binding the glycosyl residue that occupies binding subsite +1. Clearly, the three-dimensional structure of a family 3 plant -D-xylosidase is required to explain these possibilities further, together with the structure of the enzyme in complex with its substrates. In a more general sense, detailed structural information of the type provided here could prove useful in the functional annotation of genes discovered in genomics and genome-sequencing programs. For example, examination of sequences encoding substrate binding regions could discriminate between plant family 3 xylosidases, glucosidases, and, if they are present, N-acetyl -D-glucosaminidases.
 |
METHODS
|
|---|
Materials
The Glc diagnostic kit, 4'-nitrophenyl -D-glucopyranoside (4NPGlc), gentiobiose, sophorose, , -trehalose, , -trehalose, , -trehalose, esculin, salicin, BSA, and orcinol were purchased from Sigma (St. Louis, MO). Microcon microconcentrators were from Amicon (Beverly, MA), Sep-Pak Plus cartridges were from Waters (Milford, MA), Kieselgel 60 thin layer plates and sodium 2,2-dimethyl-2-silapentane-5-sulfonic acid were from Merck (Darmstadt, Germany), chromatography paper (3 MM Chr) was from Whatman (Maidstone, Kent, UK), and (1 3)- -D-oligoglucosides of degree of polymerization (DP) 2 to 7 and (1 4)- -D-oligoglucosides of DP 2 to 6 were from Seikagaku Kogyo (Tokyo, Japan). Tamarind (Tamarindus indica) xyloglucan and the xyloglucan oligosaccharides XXXG (DP 7), XXLG (DP 8), and XLLG (DP 9), were provide |