A fragment screen of a library of 560 commercially available fragments using a kinetic assay identified a small molecule that increased the activity of the fungal glycoside hydrolase TrBgl2. An analogue by catalogue approach and detailed kinetic analysis identified improved compounds that behaved as nonessential activators with up to a 2-fold increase in maximum activation. The compounds did not activate the related bacterial glycoside hydrolase CcBglA demonstrating specificity. Interestingly, an analogue of the initial fragment inhibits both TrBgl2 and CcBglA, apparently through a mixed-model mechanism. Although it was not possible to determine crystal structures of activator binding to 55 kDa TrBgl2, solution NMR experiments demonstrated a specific binding site for the activator. A partial assignment of the NMR spectrum gave the identity of the amino acids at this site, allowing a model for TrBgl2 activation to be built. The activator binds at the entrance of the substrate-binding site, generating a productive conformation for the enzyme–substrate complex.

The depletion of fossil fuel in combination with the increasing demand for energy worldwide has stimulated research on alternative and sustainable energy sources. β-glucosidases (EC 3.2.1.21), which hydrolyse β-1,4-glycosidic bonds, have received considerable attention due to their essential role in bioethanol production from lignocellulosic (LC) biomass such as wood, agricultural residues and dedicated energy crops [1]. LC biomass is composed mainly of cellulose (40–50%), in combination with hemicellulose (25–30%) and lignin (15–20%) [1]. A mixture of enzymes catalyses cellulose degradation and comprises three categories; endoglucanases (EC 3.2.1.4), exoglucanases or cellobiohydrolases (EC 3.2.1.91) and β-glucosidases (EC 3.2.1.21) [2,3]. Endoglucanases cleave the internal β-1,4-glycosidic bonds of cellulose releasing small fragments. Subsequently, exoglucanases or cellobiohydrolases (CBH) act on the reducing and non-reducing ends resulting in short-chain cello-oligosaccharides such as cellobiose, which are then hydrolysed into glucose by the action of β-glucosidases [2]. However, the low activity of β-glucosidase results in the accumulation of cellobiose and subsequent inhibition of other cellulases [4,5,6,7]. To counter these effects biomass hydrolysis requires high concentrations of β-glucosidases, increasing the cost and reducing the efficiency of large-scale conversions [6,7]. There is great interest therefore in identifying β-glucosidases with improved catalytic activity and thermostability.

The fungus Trichoderma reesei (TrBgl2) produces thermostable cellulolytic enzymes, which make them attractive targets for industrial applications [8]. The glycosyl hydrolase, TrBgl2 from this organism, belongs to the β-retaining glycoside hydrolase family 1 according to the classification of carbohydrate-active enzymes (CAZY) [9,10]. The structure and biochemical properties of TrBgl2 have been elucidated by Jeng et al. [11]. As shown in Figure 1, TrBgl2 adopts a (α/β)8-ΤΙΜ barrel-like fold as a typical member of the GH1 enzyme family [11]. The active site of the enzyme is in a 15–20 Å deep slot-like cleft and is surrounded by negatively charged residues. The catalytic acid/proton donor is Glu165 (E165) while the catalytic nucleophile/base is Glu367 (E367) for TrBgl2 [11]. Mutation of amino acids in the substrate entrance region away from the active site such as P172L, L167W, T178V and P172L/F250A gave enhanced kcat/Km values by 5.3-fold compared with the wild type (WT) [12]. Similarly, modified β-glucosidases from T. reesei are included in cellulolytic enzyme cocktails that have been commercialised.

The crystal structure of TrBgl2.

Figure 1.
The crystal structure of TrBgl2.

(A) Cartoon and (B) Molecular surface representation of the TrBgl2 structure (PDB ID 3AHY) [11]. The two catalytic glutamic acid residues of TrBgl2 are shown in yellow. The helices are shown in cyan, the strands in magenta and loops in green. The activity-enhanced mutants are shown in red. The figures were created in PyMOL.

Figure 1.
The crystal structure of TrBgl2.

(A) Cartoon and (B) Molecular surface representation of the TrBgl2 structure (PDB ID 3AHY) [11]. The two catalytic glutamic acid residues of TrBgl2 are shown in yellow. The helices are shown in cyan, the strands in magenta and loops in green. The activity-enhanced mutants are shown in red. The figures were created in PyMOL.

Close modal

Although the majority of applications of fragment-based ligand discovery (FBLD) approaches focus on enzyme inhibition, our group has previously described the identification of the first small molecule that directly activates a glucosidase, the bacterial O-GlcNAc hydrolase from Bacteroides thetaiotaomicron (BtGH84) [13]. This provided the experience to use FBLD methods to explore the activation of TrBgl2 by small molecules. The work reported here is the identification and validation of a novel activator of the fungal TrBgl2. Ligand-observed NMR and thermal shift assays were used to further characterise the binding of the activator to TrBgl2, whilst protein-observed NMR experiments combined with molecular docking were used to generate a model to give insight into the mechanism of small molecule activation of TrBgl2.

Chemicals and reagents

The fragment library was assembled as described by Schulz et al. [14], dissolved in DMSO-d6 to a concentration of 200 mM and quality controlled using 1H NMR spectroscopy. The library is a subset of the Maybridge Ro3 diversity library (https://www.maybridge.com/portal/alias__Rainbow/lang__en/tabID__230/DesktopDefault.aspx). The entire library contains 2500 compounds of which 560 compounds were selected for maximum diversity in chemical representation and higher solubility. Additional neighbours were purchased from Chembridge and Enamine.

Cloning

The gene T. reesei (Trbgl2; AB003110) was optimised to the favoured codon usage for Escherichia coli and synthesised by Invitrogen. The Trbgl2 gene fragment was amplified by polymerase chain reaction (PCR) with forward 5′-CCAGGGACCAGCA ATGCTGCCGAAAGATTTTCAGTGG-3′ and reverse 5′-GAGGAGAAGGCGCGTTATGCTGCTGCAATCAGTTCATCAAAC-3′ primers (overhangs underlined). The PCR product was cloned into the expression vector of pET-YSBLIC3C using the NEBuilder HiFi DNA Assembly Cloning kit (Biolabs). The resulting DNA construct pET-YSBLIC3C_Trbgl2 was sequenced and verified.

Site-directed mutagenesis

The appropriate codon in the previously obtained recombinant plasmid pET-YSBLIC3C_Trbgl2 was mutated by PCR in order to obtain the E367Q substitution in TrBgl2. The E367Q Trbgl2 gene fragment was produced by PCR with forward 5′-CCTATTTATGTTACCCAAAATGGCACCAG-3′ and reverse 5′-GCCATTTTGGGTAACATAAATAGGCGGA-3′ primers (overhangs underlined). The PCR product was cloned into the expression vector of pET-YSBLIC3C using a NEBuilder HiFi DNA Assembly Cloning kit (Biolabs). The resulting DNA construct pET-YSBLIC3C_E367QTrbgl2 was sequenced and verified.

Trbgl2 and E367Q TrBgl2 expression and purification

Unlabelled protein

TrBgl2 and the mutant E367Q TrBgl2 were expressed and purified in an analogous manner to that previously described by Jeng et al. [11].

15N Isotope-labelled TrBgl2 and E367Q TrBgl2 protein

Production of 15N-labelled TrBgl2 and E367Q TrBgl2 followed a similar protocol to the unlabelled proteins. An overnight TrBgl2/E367Q TrBgl2 LB medium preculture from a freshly transformed colony was pelleted by centrifugation at room temperature (4000 rpm, 10 min) and resuspended in baffled flasks with 0.5 L of M9 medium containing 3 g Na2HPO4, 1.5 g KH2PO4, 0.25 g NaCl, 0.5 g 15NH4Cl, 2 mM MgSO4, 100 μM CaCl2, 0.4% glucose, 5 mL Gibco MEM vitamins solution (SIGMA) and 0.5 mL trace elements solution (50 mM FeCl3, 50 mM ZnSO4, 100 mM MnCl2, 10 mM CuSO4) and 100 µg/mL Kanamycin. Baffled flasks increase oxygenation of the medium during shaking for optimal protein expression. The 0.5 L M9 culture was subsequently incubated at 30°C with shaking until the medium reached an optical density, OD600 of 0.8–1.0. The temperature was then reduced to 13°C for one hour before adding the isopropyl β-thiogalactopyranoside (IPTG) to a final concentration of 0.5 mM. The cells were pelleted by centrifugation after incubation at 13°C overnight with shaking at 200 rpm.

TrBgl2 and the mutant E367Q TrBgl2 were purified in an analogous manner to that previously described by Jeng et al. [11].

15N, 13C, 2H and IVL methyl groups specifically labelled TrBgl2 protein

15N, 13C, 2H and IVL methyl groups specifically labelled TrBgl2 protein was expressed and purified in an analogous manner to that previously described by Makraki (BMRB 50158) [15].

Enzyme activity assay

A fluorescent kinetic assay was performed for library screening of TrBgl2 using 500 µM 4-methylumbelliferyl-β-d-glucopyranoside (MUG) in a total volume of 150 µL per well. Compounds dissolved in DMSO were added to each well to yield a final concentration from 1 to 2 mM. The reactions were performed in 160 mM phosphate–citrate buffer in pH 6.0, initiated by adding 20 µL of the enzyme to give a final concentration of 100 nM and were assayed at 40°C. The fluorescent 4-methylumbelliferyl release was recorded continuously at 355 nm excitation and 460 nm emission for 20 s using a BMG Labtech POLARstar OPTIMA plate reader. Control experiments were performed to evaluate the possibility of interference from the intrinsic fluorescence of compounds in the library by repeating the assay in the absence of the enzyme.

AC50 measurements of fragment activators were performed using a final concentration of 100 nM protein and 500 µM MUG with various concentrations of compounds depending on the solubility limits of each compound in a total volume of 150 µL per well. Assays were carried out in a 96 well plate (Black Nunc F96 MicroWell Plates) at 40 and 45°C for TrBgl2 and CcBglA, respectively, in 160 mM phosphate–citrate buffer in pH 6.0 and 2.5% DMSO. Reactions were initiated by the addition of 20 µL of protein to a final concentration of 100 nM, followed by 5 s of shaking. The fluorescent 4-methylumbelliferyl release was recorded continuously at 355 nm excitation and 460 nm emission for 20 s using a BMG Labtech POLARstar OPTIMA plate reader. Rates were calculated using GraphPad Prism 5.0 as the slope of the curve of the collected data. Corrections were made to the detected gradients to adjust for the inner filter effect of MUG and convert the gradients into units of µmol min−1 according to a standard curve for MUG. The initial slopes were fitted to sigmoidal dose-response curves using GraphPad Prism 5.0 and the AC50 values extracted. Inhibition of TrBgl2 by isofagomine (IC50) was determined in similar manner to the AC50 values.

Apparent KM and kcat values were calculated from a Michaelis–Menten curve fit to 9-point substrate titrations at a constant concentration of compounds. The covariation experiments used the same range of substrate concentrations across an activator and inhibitor concentration range. The data obtained from these experiments was fit to the nonessential activator and mixed-model inhibition equation, respectively.

Thermal shift assay (TSA)

Thermal shift screens were performed in 96-well polypropylene PCR plates (Agilent 401334) with a final volume 25 µL/well. Well components were E367Q TrBgl2 (300 nM), SYPRO orange dye (8X), DMSO (5.2%), substrate, MUG (4 mM) and compound at 4 mM in the same buffer as used for the enzymatic assays. Melting data were collected using an Agilent Stratagene Mx3005P rtPCR machine ramping from 25 to 95°C at 30 s per degree. Data were analysed with a combination of the Agilent MxPro software, MTS [16], JTSA (http://paulsbond.co.uk/jtsa) and the DSF analysis spreadsheet in Excel [17].

NMR spectroscopy

All NMR experiments were collected at 298 K on a Bruker Avance Neo 700 MHZ spectrometer equipped with a triple-resonance cryogenic probe. Data were processed using Topspin 3.2 (Bruker) and analysed with Sparky [18].

STD NMR

The samples contained 20 µM unlabelled TrBgl2, fragments at a concentration of 500 µM per compound and 100 µM sodium trimethylsilylpropanesulfonate (DSS) in 50 mM phosphate buffer containing 25 mM NaCl and 5% D2O at pH 6.0. The experiments were recorded by using a Bruker standard pulse sequence (stddiffesgp.3 [19]). The on-resonance frequency was set to 0.25 ppm and off-resonance irradiation was applied at 40 ppm. STD NMR spectra were collected with a total of 16 scans (NS), 5 s delay between the scans (d1), a saturation time of 4 s (d20) and 1.5 s acquisition time (AQ).

NMR titrations

For backbone chemical shift perturbations (CSPs), each sample was prepared with 120 µM 15N isotope-labelled TrBgl2 or E367Q TrBgl2, 8 mM or 16 mM of compound 8 or 10, 5 mM of MUG substrate and 100 µM DSS in 50 mM TRIS buffer containing 100 mM NaCl and 10% D2O at pH 8.0. The final DMSO concentration was usually not above 3.8%(v/v). 1H-15N TROSY spectra of the enzyme with DMSO in the buffer were collected as controls. 1H-15N TROSY spectra were collected by using a Bruker standard pulse sequence (trosyetf3gpsi [20,21]) with 128 scans (NS), 1.5 s delay between the scans (d1), 35 ppm spectra width (SW), 256-time domain data size (TD) and 117 ppm irradiation carrier frequency offset (O1P).

For IVL methyl group CSPs, each sample was prepared with 250 µM of 15N, 13C, 2H and IVL methyl group specifically labelled TrBgl2, 8 mM of compound 8, in 50 mM TRIS buffer containing 100 mM NaCl and 10% D2O at pH 8.0. The final DMSO concentration was usually not above 3.8% (v/v). 1H-13C HSQC spectra of the enzyme with DMSO in the buffer were collected as controls. 1H-13C HSQC spectra were recorded by using a Bruker standard pulse sequence (hsqcctetgpsp [22]) with 8 scans (NS), 1.5 s delay between the scans (d1), 30 ppm spectra width (SW), 256-time domain data size (TD) and 15 ppm irradiation carrier frequency offset (O1P).

Protein–ligand docking

The HADDOCK (High Ambiguity Driven DOCKing) Web server [23,24] used 1000 protein structures for initial rigid-body docking solutions. The best 200 solutions based on their intermolecular energy were used for a semi-flexible refinement. After initial docking iterations, a final explicit solvent refinement was performed on all 200 models which then clustered based on root mean square deviation criteria. Analysis of HADDOCK scores was performed to select the best docking solutions. HADDOCK calculations were driven by the available CSPs recorded after binding of compound 8 to TrBgl2. Three-dimensional (3D) conformations of the substrate, MUG and ligand (8) were generated using SMILES [25] as input.

Library screening

A 560 compound (<250 Da) fragment library was screened in an enzymatic assay using 4-MUG as a substrate. In the initial screen, one compound (1) enhanced the ability of TrBgl2 to cleave MUG (Figure 2A). Kinetic characterisation of the hit compound using MUG as substrate showed an activation of TrBgl2 with Amax and AC50 value of ∼155% and 1 ± 2 mM, respectively (Figure 2B; Amax = maximal activation obtained, AC50 = concentration of compound eliciting 50% of maximal activation), whilst Michaelis–Menten analysis of 1 yielded an approximately 2-fold improvement in kcat/KM values (Figure 2C; kcat = catalytic rate constant, KM = Michaelis constant).

Compound 1 activates TrBgl2.

Figure 2.
Compound 1 activates TrBgl2.

(A) Chemical structure of the initial hit (1) identified by fragment screening. (B) MUG cleavage assay AC50 curve for 1. Substrate concentration = 500 μM. (C) MUG cleavage assay Michaelis–Menten plot for TrBgl2 in the absence and presence of 1. [S] = substrate concentration.

Figure 2.
Compound 1 activates TrBgl2.

(A) Chemical structure of the initial hit (1) identified by fragment screening. (B) MUG cleavage assay AC50 curve for 1. Substrate concentration = 500 μM. (C) MUG cleavage assay Michaelis–Menten plot for TrBgl2 in the absence and presence of 1. [S] = substrate concentration.

Close modal

Ligand-observed NMR spectroscopy was used for binding characterisation of the hit compound (Figure 3). Hit compound (1) showed evidence of binding to the enzyme in saturation transfer difference (STD) NMR [19]; binding was also observed in the presence of the known glycoside hydrolase inhibitor isofagomine (2) with an IC50 value of 0.53 ± 0.12 μM (Supplementary Figure S1). This allowed the categorisation of the hit compound as non-competitive with isofagomine and suggested that binding must occur at an allosteric site.

Compound 1 binds to TrBgl2.

Figure 3.
Compound 1 binds to TrBgl2.

(A) 1D proton reference spectrum of 1. (B) STD NMR spectrum of 1 (500 µM) in the presence of TrBgl2 (20 µM) and (C) the STD NMR spectrum of the same sample following the addition of isofagomine (2) (40 µM).

Figure 3.
Compound 1 binds to TrBgl2.

(A) 1D proton reference spectrum of 1. (B) STD NMR spectrum of 1 (500 µM) in the presence of TrBgl2 (20 µM) and (C) the STD NMR spectrum of the same sample following the addition of isofagomine (2) (40 µM).

Close modal

To improve upon the initial hit, commercially available analogues of compound 1 were identified through an ‘analogue-by-catalogue’ approach. Many these analogues gave an increased activation of TrBgl2. Replacement of the thiourea group by urea or thio-amide yielded an inactive analogue. Other available compounds that retained the thiourea moiety but had a methyl group in the meta or para (rather than ortho) position were also inactive. As summarised in Table 1, groups other than methyl (such as chloro, methoxy) ortho to the thiourea also yielded active compounds (Supplementary Figure S2). Groups at the ortho position twist the aryl ring out of the plane to avoid steric clashes. This twist of aryl ring might be essential for the activation of TrBgl2 and could explain why compounds that had a group meta or para instead of ortho appeared to be inactive. Attempts to alter the morpholino moiety were more successful and led to more active compounds. Compound 8 retains the important thiourea and the ortho methyl group generating an activator of TrBgl2 with significantly improved Amax up to 188% in comparison with 1 and the other analogues identified. Compound 8 also had improved solubility (up to 16 mM) compared with other tested compounds and therefore was taken forward for further characterisation. Interestingly, one of the fragment analogues (10) that retains the thiourea but does not have ortho substituents appears to be a potent TrBgl2 inhibitor with an IC50 value of 312 ± 32 μΜ (Table 2). Evidence of fragment binding was also observed by STD NMR with a clear signal intensity for both compound 8 (Supplementary Figure S3) and 10 (Supplementary Figure S4) in the presence and absence of isofagomine (2). NMR experiments were also attempted to characterise further the orientation of 8 binding; the difference in peak intensities in the STD measurements were not consistent enough for epitope mapping and an NOE experiment was unable to detect any intermolecular NOEs, probably because of the weak affinity of binding.

Table 1.
Activation of TrBgl2 by analogues of compound 1
CompoundChemical structureAC50 (µM)Amax (%)
3   >40001 1551 
4   1809±961 1391 
5  >40001 1601 
6  >40002 1432 
7   1534 ± 3122 1612 
8   >40002 1882 
9   >40001 1281 
CompoundChemical structureAC50 (µM)Amax (%)
3   >40001 1551 
4   1809±961 1391 
5  >40001 1601 
6  >40002 1432 
7   1534 ± 3122 1612 
8   >40002 1882 
9   >40001 1281 
1

Maximum activation obtained with a titration of compound from 30 µM to 3.8 mM (substrate conc. = 500 µM);

2

Maximum activation obtained with a titration of compound from 63 µM to 8 mM (substrate conc. = 500 µM).

Table 2.
Inhibition of TrBgl2 by compound 10.
CompoundChemical structureIC50 (µM)Maximum inhibition (%)
10   312 ± 321 961 
CompoundChemical structureIC50 (µM)Maximum inhibition (%)
10   312 ± 321 961 
1

Maximum activation obtained with a titration of compound from 63 µM to 8 mM (substrate conc. = 500 µM)

Kinetic characterisation

The kinetics of TrBgl2 activation revealed a nonessential activation model [26], similar to a reversible mixed inhibition model in which the compound displays binding affinity for both the free enzyme and the enzyme–substrate complex. In the nonessential type activation model, the dissociation constant of the substrate and activator are modified in the presence of activator and substrate respectively by the constant α. In addition, the kcat of the enzyme in the presence of an activator is modified by the constant. Values for the α and β modifiers were calculated as 0.64 and 1.5, respectively, for activator 8. The change in substrate binding and catalytic rate constant reflects an approximately 1.5-fold enhancement of substrate affinity and an increase of kcat in the presence of activator 8 (Figure 4). Therefore, both values contribute to an enhanced catalytic rate through the enzyme–substrate–activator complex.

Covariation curve fits to the nonessential activator model for activator 8.

Figure 4.
Covariation curve fits to the nonessential activator model for activator 8.
Figure 4.
Covariation curve fits to the nonessential activator model for activator 8.
Close modal

Inhibitor 10 has an influence on enzyme kinetics that is consistent with a reversible mixed inhibition model. In this situation, two dissociation constants are defined, one for the binary enzyme–inhibitor complex (KI) and one for the ternary enzyme–substrate–inhibitor complex (αkI). The modifier α was calculated as 3.65 for inhibitor 10 which indicates an increased dissociation constant for the MUG substrate in the presence of 10 (Supplementary Figure S5).

Thermal shift assay (TSA)

To investigate the possible role of the substrate (MUG) in fragment binding, an inactive mutant TrBgl2 was generated (E367Q TrBgl2). The influence of fragment binding on E367Q TrBgl2 stability was assessed by TSA, measuring the melting temperature (Tm) with and without the substrate (MUG) (Figure 5). Both 8 and 10 destabilise apo E367Q TrBgl2, decreasing the Tm by ∼1 and 4°C, respectively. In contrast, the destabilising effect of 8 is significantly diminished in the presence of MUG (Tm increases by ∼1.2°C) whereas the addition of MUG does not affect the significantly large destabilising effect on E367Q TrBgl2 caused by 10 (Tm decreases by ∼4°C). This further suggests a contribution of the substrate to fragment binding as the activator affects enzyme activity.

Maximum shift in melting temperature from reference (ΔTm) for selected compounds and MUG.

Figure 5.
Maximum shift in melting temperature from reference (ΔTm) for selected compounds and MUG.
Figure 5.
Maximum shift in melting temperature from reference (ΔTm) for selected compounds and MUG.
Close modal

Selectivity

The bacterial β-glycosidase Clostridium cellulovorans (CcBglA) is homologous to the fungal TrBgl2 sharing 54% sequence similarity (Supplementary Figure S6). None of the fragments activated CcBglA demonstrating that the effect is specific. Interestingly, 10 inhibited both enzymes to a similar degree (Supplementary Figure S7).

NMR chemical shift perturbation (CSP) experiments

Although the apo structure of TrBgl2 is known (PDB ID 3AHY) [11], extensive efforts to obtain a crystal structure of 8 bound to either TrBgl2 or E367Q TrBgl2 with or without MUG were not successful. We have previously reported the sequential backbone and IVL methyl group partial assignment of the 55 kDa apo TrBgl2 (BMRB 50158) [15]. Therefore, NMR spectroscopy was used to generate information on the binding site of the activator.

First, we investigated the effect of 8 and 10 binding to the protein by recording a 2D 1H-15N TROSY spectrum in the presence and absence of the compounds. The quality of apo TrBgl2 1H-15N TROSY spectrum obtained, with well-dispersed peaks, is unusually good for such a large protein. Following addition of 8 at 8 and 16 mM some changes in amide chemical shifts occurred across the protein, indicating a specific binding event (Figure 6A,B). Backbone 1H-15N TROSY CSPs are limited by the fact that most ligand–protein interactions are mediated by sidechains which are distant from the detected backbone amides. Therefore, a sample of deuterated TrBgl2 with Ile, Val and Leu (IVL) sidechain methyl groups protonated was produced. A constant-time (CT)-[1H-13C HSQC] spectrum was recorded on this sample in the presence and absence of 8. Following the addition of 8 at 8 mM, several methyl CSPs were observed reconfirming the binding of 8 to TrBgl2 (Figure 6C).

NMR analysis of compound 8 binding to TrBgl2.

Figure 6.
NMR analysis of compound 8 binding to TrBgl2.

(A) 1H-15N TROSY spectrum (700 MHz, 298 K) of apo TrBgl2 (blue) overlaid with spectra of TrBgl2 with 8 at 8 mM (green) and 16 mM (magenta). The peaks corresponding to selected backbone amide groups are indicated. (B) Expanded panel of 1H-15N TROSY spectrum. Red arrows indicate the direction of chemical shifts on increasing ligand concentration for selected amino acids. (C) CT 1H-13C HSQC spectrum of an IVL methyl protonated sample (700 MHz, 298 K) of apo TrBgl2 (purple) overlaid with spectrum of TrBgl2 with 8 at 8 mM (red). Spectra collected under identical conditions. (D) Weighted backbone and methyl CSPs induced by 8; the dashed line is the experimentally determined significance threshold. (E) Assigned backbone and IVL methyl CSPs caused by the presence of 8 mapped onto the crystal structure of TrBgl2 (PDB ID 3AHY). Assigned backbone and methyl CSPs are coloured green and blue respectively. Unassigned residues are coloured grey. The protein backbone is shown as a ribbon diagram in cyan. (F) The above backbone and IVL methyl CSPs mapped on the protein surface, shaded as in (E). The figure was created in PyMOL.

Figure 6.
NMR analysis of compound 8 binding to TrBgl2.

(A) 1H-15N TROSY spectrum (700 MHz, 298 K) of apo TrBgl2 (blue) overlaid with spectra of TrBgl2 with 8 at 8 mM (green) and 16 mM (magenta). The peaks corresponding to selected backbone amide groups are indicated. (B) Expanded panel of 1H-15N TROSY spectrum. Red arrows indicate the direction of chemical shifts on increasing ligand concentration for selected amino acids. (C) CT 1H-13C HSQC spectrum of an IVL methyl protonated sample (700 MHz, 298 K) of apo TrBgl2 (purple) overlaid with spectrum of TrBgl2 with 8 at 8 mM (red). Spectra collected under identical conditions. (D) Weighted backbone and methyl CSPs induced by 8; the dashed line is the experimentally determined significance threshold. (E) Assigned backbone and IVL methyl CSPs caused by the presence of 8 mapped onto the crystal structure of TrBgl2 (PDB ID 3AHY). Assigned backbone and methyl CSPs are coloured green and blue respectively. Unassigned residues are coloured grey. The protein backbone is shown as a ribbon diagram in cyan. (F) The above backbone and IVL methyl CSPs mapped on the protein surface, shaded as in (E). The figure was created in PyMOL.

Close modal

CSPs were detected on titration with compound 8 and previously determined assignments [15] allowed some to be mapped onto the known crystal structure of TrBgl2 (PDB ID 3AHY) (Figure 6E,F). Τhe assignment was not complete [15] but it was sufficient to reveal defined patches of interaction between 8 and TrBgl2, consistent with a specific binding event at the entrance of the active site. Although most of the CSPs are located at the substrate entrance, residues at the surrounding part of the active site such as D22, G27, T33, S46, Y130 and Y440 are also significantly affected, potentially indicating that conformational changes in the protein are associated with binding of activator 8. The addition of 10 at 8 mM caused smaller but similar changes as seen for 8 (Supplementary Figure S8). Due to solubility issues 1H-15N TROSY spectra of 10 at a concentration higher than 8 mM were not recorded.

The possible role of substrate in fragment binding was further investigated by 1H-15N TROSY NMR using the inactive mutant, E367Q TrBgl2. Although the peaks of the spectra are well dispersed, characteristic of a fully folded protein, the resonances were broadened compare to the WT TrBgl2 spectra. Following the addition of MUG at 5 mM and 8 at 8 mM, the substrate-specific chemical shifts were further enhanced indicating the interaction site remains the same in the ternary complex. Moreover, the magnitude of the substrate-specific chemical shifts is larger in the ternary complex E367Q TrBgl2–MUG-8 complex (Figure 7), compared with the binary E367Q TrBgl2–MUG complex, suggesting that more enzyme is in complex with the substrate. Therefore, considering that the conditions and concentration of MUG were kept identical, these results also indicate higher substrate affinity to the enzyme in the presence of 8. On the other hand, following the addition of 10 at 4 mM, the substrate-specific chemical shifts were degraded which suggests lower substrate affinity to the enzyme in the presence of 10 (Supplementary Figure S9). Both observations are in full agreement with the kinetic experiments.

NMR TROSY spectra of 8 and MUG binding to E367Q TrBgl2.

Figure 7.
NMR TROSY spectra of 8 and MUG binding to E367Q TrBgl2.

(A) 1H-15N TROSY spectrum (700 MHz, 298 K) of apo E367Q TrBgl2 (black) overlaid with spectrum of E367Q TrBgl2 with MUG at 5 mM concentration (orange) and spectrum of E367Q TrBgl2 bound with both MUG and 8 at 5- and 8-mM concentration, respectively (green). (B) Expanded panel of representative portion of the overlaid spectra. Red arrows indicate the direction of chemical shifts. Spectra collected under identical conditions.

Figure 7.
NMR TROSY spectra of 8 and MUG binding to E367Q TrBgl2.

(A) 1H-15N TROSY spectrum (700 MHz, 298 K) of apo E367Q TrBgl2 (black) overlaid with spectrum of E367Q TrBgl2 with MUG at 5 mM concentration (orange) and spectrum of E367Q TrBgl2 bound with both MUG and 8 at 5- and 8-mM concentration, respectively (green). (B) Expanded panel of representative portion of the overlaid spectra. Red arrows indicate the direction of chemical shifts. Spectra collected under identical conditions.

Close modal

TrBgl2–8-MUG complex structure

The crystal structure of the closely related NkBgl–E193D in complex with pNPG was used to generate models of the TrBgl2–MUG and TrBgl2–8-MUG complexes (PDB ID 3IA0) [11] using the program HADDOCK [23,24]. After binding of the substrate (pNPG) to the inactive NkBgl–E193D, significant changes in the structure occur at the side chains of residues Glu/Asp193, Asp253, Trp374, Trp444, Asn449 and Glu451 [11]. The substrate pNPG was observed to form direct hydrogen bonds to Gln45, His148, Asn255, Glu402, Trp444, Glu451 and Trp452 of NkBgl–E193D. A structural alignment suggests that these residues in NkBgl are equivalent to Gln16, His119, Asp227, Glu367, Trp417, Glu424 and Trp425 of TrBgl2 (Supplementary Figure S10). The equivalent hydrogen bonding interactions between NkBgl and pNPG can therefore be used as restraints to model the structure of the TrBgl2–MUG complex (Supplementary Figure S10) using HADDOCK. Asp227, the amino acid that directly interacts with the nitrophenyl part of the substrate, pNPG, was excluded. The HADDOCK calculations generated a single cluster (Model 1) which exhibited a HADDOCK score of −60 and a minimum restraint violation energy of 1.1 kcal/mol (Supplementary Figure S11).

Next, HADDOCK was also used to generate a model of 8 docked into the TrBgl2–MUG cluster using model 1, the chemical structure of 8 and the most significant backbone amide and methyl CSPs as ambiguous restraints (assigned as backbone CSPs: G175, S176, T178, F179, G226, M295, N296, N301, W339 and W425; IVL methyl group CPSs: L167, I171, V316 and V321). Because the NMR experiments did not provide sufficient information for epitope mapping or intermolecular NOEs, there is insufficient information to define an orientation for the ligands. However, the HADDOCK calculations generated two clusters indicating the region where the ligand binds. The lowest energy cluster (Model 2) exhibited a HADDOCK score of −19.5 and a minimum restraint violation energy of 35.4 kcal/mol, whereas the next ligand cluster (Model 3) exhibited a HADDOCK score and minimal restraint violation energy of −8.1 and 72.1 kcal/mol, respectively. Both models suggest that the binding site of 8 is at the substrate entrance of TrBgl2 with a direct contact between activator 8 and MUG (Figure 8). The binding of 8 seems to close the active site entrance, trapping the bound substrate resulting in a more suitable orientation for enzyme activity. Interestingly, the previously identified mutations that lead to enhanced activity of TrBgl2, such as L167W and T178V [12] are clustered at the same site where the TrBgl2 activator interacts.

Structural model of compound 8 binding to TrBgl2.

Figure 8.
Structural model of compound 8 binding to TrBgl2.

Overlays of the lowest energy HADDOCK model structures of 8 docked with TrBgl2–MUG complex using the backbone amide and methyl CSPs interpreted to locate at the TrBgl2's active site. The figure was created in PyMOL.

Figure 8.
Structural model of compound 8 binding to TrBgl2.

Overlays of the lowest energy HADDOCK model structures of 8 docked with TrBgl2–MUG complex using the backbone amide and methyl CSPs interpreted to locate at the TrBgl2's active site. The figure was created in PyMOL.

Close modal

The pattern of mapped CSPs of 10 is quite similar to that of 8 indicating that the two compounds bind in the same binding site (Supplementary Figure S8). It is not clear why 10 is an inhibitor — a possible explanation that is consistent with the kinetic models and the TSA measurements is that 10 significantly destabilises binding of the substrate generating a less productive conformation for enzyme activity. This switch between activation and inhibition with just small modifications of the compound has been observed in other systems, such as for resveratrol-like compounds on sirtuin [27] and in compounds that modulate the interaction between KRas and SOS [28].

We have used fragment-based methods to identify a selective, small molecule compound that increases the activity of the industrial glycoside hydrolase, TrBglA by 188%. Structure–activity relationship (SAR) of closely related compounds suggest that activation is achieved by substituents inducing a particular conformation on the activators which results in a productive conformation for the enzyme–substrate complex; a closely related compound that was unable to achieve such a conformation was an inhibitor. This work would need to be extended to the covalent linkage of the activators (as previously demonstrated for BtGH84, [29]) to generate enzymes that could feasibly be used for industrial applications. However, it does demonstrate an alternate approach to traditional genetic approaches for generating enzymes with improved activity.

The authors declare that there are no competing interests associated with the manuscript.

E.M. was supported by the European Union's Horizon2020 MSCA Programme under grant agreement 675899 (FRAGNET); research in the group of REH was additionally supported by research grants from the BBSRC (BB/N008332/1) and institutional infrastructure support from funds provided by the Welcome Trust and EPSRC.

Open access for this article was enabled by the participation of University of York in an all-inclusive Read & Publish pilot with Portland Press and the Biochemical Society under a transformative agreement with JISC.

Roderick E. Hubbard: Supervision. Eleni Makraki: Resources, Formal analysis, Investigation, Visualization, Methodology, Writing — original draft. John F. Darby: Supervision. Marta G. Carneiro: Software, Methodology. James D. Firth: Investigation. Alex Heyam: Methodology. Eiso AB: Software, Methodology. Peter O'Brien: Supervision. Gregg D. Siegal: Supervision.

CBH

cellobiohydrolases

CSPs

chemical shift perturbations

FBLD

fragment-based ligand discovery

LC

lignocellulosic

MUG

methylumbelliferyl-β-d-glucopyranoside

PCR

polymerase chain reaction

STD

saturation transfer difference

TSA

thermal shift assay

1
Service
,
R.F.
)
Cellulosic ethanol. Biofuel researchers prepare to reap a new harvest
.
Science
315
,
1488
1491
2
Sticklen
,
M.B.
(
2008
)
Plant genetic engineering for biofuel production: towards affordable cellulosic ethanol
.
Nat. Rev. Genet.
9
,
433
443
3
Brethauer
,
S.
and
Studer
,
M.H.
(
2015
)
Biochemical conversion processes of lignocellulosic biomass to fuels and chemicals – a review
.
Chimia
69
,
572
581
4
Lynd
,
L.R.
,
Weimer
,
P.J.
,
van Zyl
,
W.H.
and
Pretorius
,
I.S.
(
2002
)
Microbial cellulose utilization: fundamentals and biotechnology
.
Microbiol. Mol. Biol. Rev.
66
,
506
577
5
Zhang
,
Y.H.
and
Lynd
,
L.R.
(
2004
)
Toward an aggregated understanding of enzymatic hydrolysis of cellulose: noncomplexed cellulase systems
.
Biotechnol. Bioeng.
88
,
797
824
6
Bommarius
,
A.S.
,
Katona
,
A.
,
Cheben
,
S.E.
,
Patel
,
A.S.
,
Ragauskas
,
A.J.
,
Knudson
,
K.
et al (
2008
)
Cellulase kinetics as a function of cellulose pretreatment
.
Metab. Eng.
10
,
370
381
7
Resa
,
P.
and
Buckin
,
V.
(
2011
)
Ultrasonic analysis of kinetic mechanism of hydrolysis of cellobiose by beta-glucosidase
.
Anal. Biochem.
415
,
1
11
8
Gao
,
J.
,
Qian
,
Y.
,
Wang
,
Y.
,
Qu
,
Y.
and
Zhong
,
Y.
(
2017
)
Production of the versatile cellulase for cellulose bioconversion and cellulase inducer synthesis by genetic improvement of Trichoderma reesei
.
Biotechnol. Biofuels
10
,
272
9
Cantarel
,
B.L.
,
Coutinho
,
P.M.
,
Rancurel
,
C.
,
Bernard
,
T.
,
Lombard
,
V.
and
Henrissat
,
B.
(
2009
)
The carbohydrate-active enzymes database (CAZy): an expert resource for glycogenomics
.
Nucleic Acids Res.
37
,
D233
D238
10
Henrissat
,
B.
(
1991
)
A classification of glycosyl hydrolases based on amino acid sequence similarities
.
Biochem. J.
280
(Pt
2
),
309
316
11
Jeng
,
W.Y.
,
Wang
,
N.C.
,
Lin
,
M.H.
,
Lin
,
C.T.
,
Liaw
,
Y.C.
,
Chang
,
W.J.
et al (
2011
)
Structural and functional analysis of three beta-glucosidases from bacterium Clostridium cellulovorans, fungus Trichoderma reesei and termite Neotermes koshunensis
.
J Struct. Biol.
173
,
46
56
12
Lee
,
H.L.
,
Chang
,
C.K.
,
Jeng
,
W.Y.
,
Wang
,
A.H.
and
Liang
,
P.H.
(
2012
)
Mutations in the substrate entrance region of beta-glucosidase from Trichoderma reesei improve enzyme activity and thermostability
.
Protein Eng. Des. Sel.
25
,
733
740
13
Darby
,
J.F.
,
Landstrom
,
J.
,
Roth
,
C.
,
He
,
Y.
,
Davies
,
G.J.
and
Hubbard
,
R.E.
(
2014
)
Discovery of selective small-molecule activators of a bacterial glycoside hydrolase
.
Angew. Chem. Int. Ed. Engl.
53
,
13419
13423
14
Schulz
,
M.N.
,
Landstrom
,
J.
,
Bright
,
K.
and
Hubbard
,
R.E.
(
2011
)
Design of a fragment library that maximally represents available chemical space
.
J. Comput. Aided Mol. Des.
25
,
611
620
15
Makraki
,
E.
,
Carneiro
,
M.G.
,
Heyam A
,
A.B.E.
,
Siegal
,
G.
and
Hubbard
,
R.E.
(
2020
)
1H, 13C, 15N backbone and IVL methyl group resonance assignment of the fungal β-glucosidase from Trichoderma reesei
.
Biomol NMR Assign.
14
,
265
268
16
Schulz
,
M.N.
,
Landstrom
,
J.
and
Hubbard
,
R.E.
(
2013
)
MTSA–a Matlab program to fit thermal shift data
.
Anal. Biochem.
433
,
43
47
17
Niesen
,
F.H.
,
Berglund
,
H.
and
Vedadi
,
M.
(
2007
)
The use of differential scanning fluorimetry to detect ligand interactions that promote protein stability
.
Nat. Protoc.
2
,
2212
2221
18
Lee
,
W.
,
Tonelli
,
M.
and
Markley
,
J.L.
(
2015
)
NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy
.
Bioinformatics
31
,
1325
1327
19
Mayer
,
M.
and
Meyer
,
B.
(
1999
)
Characterization of ligand binding by saturation transfer difference NMR spectroscopy
.
Angew. Chem. Int. Ed. Engl.
38
,
1784
1788
20
Czisch
,
M.
and
Boelens
,
R.
(
1998
)
Sensitivity enhancement in the TROSY experiment
.
J. Magn. Reson.
134
,
158
160
21
Pervushin
,
K.V.
,
Wider
,
G.
and
Wuthrich
,
K.
(
1998
)
Single transition-to-single transition polarization transfer (ST2-PT) in [15N,1H]-TROSY
.
J. Biomol. NMR
12
,
345
348
22
Vuister
,
G.W.
and
Bax
,
A.
(
1992
)
Measurement of two-bond JCOH alpha coupling constants in proteins uniformly enriched with 13C
.
J. Biomol. NMR
2
,
401
405
23
de Vries
,
S.J.
,
van Dijk
,
M.
and
Bonvin
,
A.M.
(
2010
)
The HADDOCK web server for data-driven biomolecular docking
.
Nat. Protoc.
5
,
883
897
24
van Zundert
,
G.C.P.
,
Rodrigues
,
J.
,
Trellet
,
M.
,
Schmitz
,
C.
,
Kastritis
,
P.L.
,
Karaca
,
E.
et al (
2016
)
The HADDOCK2.2 web server: user-friendly integrative modeling of biomolecular complexes
.
J. Mol. Biol.
428
,
720
725
25
Weininger
,
D.
(
1988
)
SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules
.
J. Chem. Inf. Comput. Sci.
28
,
31
36
26
Sebastian
,
J.S.
(
1987
)
Reversible activators of enzymes
.
J. Chem. Edu.
64
,
1031
1032
27
Gertz
,
M.
,
Nguyen
,
G.T.
,
Fischer
,
F.
,
Suenkel
,
B.
,
Schlicker
,
C.
,
Franzel
,
B.
et al (
2012
)
A molecular mechanism for direct sirtuin activation by resveratrol
.
PLoS One
7
,
e49761
28
Liao
,
T.J.
,
Jang
,
H.
,
Fushman
,
D.
and
Nussinov
,
R.
(
2018
)
Allosteric KRas4B can modulate SOS1 fast and slow Ras activation cycles
.
Biophys. J.
115
,
629
641
https://doi.org/10.1016/j.bpj.2018.07.016
29
Darby
,
J.F.
,
Atobe
,
M.
,
Firth
,
J.D.
,
Bond
,
P.
,
Davies
,
G.J.
,
O'Brien
,
P.
et al (
2017
)
Increase of enzyme activity through specific covalent modification with fragments
.
Chem. Sci.
8
,
7772
7779
This is an open access article published by Portland Press Limited on behalf of the Biochemical Society and distributed under the Creative Commons Attribution License 4.0 (CC BY).

Supplementary data