Specific isotopic labelling and reverse labelling for protein NMR spectroscopy: using metabolic precursors in sample preparation

Rowlinson, Benjamin; Crublet, Elodie; Kerfah, Rime; Plevin, Michael J.

doi:10.1042/BST20210586

The study of protein structure, dynamics and function by NMR spectroscopy commonly requires samples that have been enriched (‘labelled') with the stable isotopes ¹³C and/or ¹⁵N. The standard approach is to uniformly label a protein with one or both of these nuclei such that all C and/or N sites are in principle ‘NMR-visible'. NMR spectra of uniformly labelled proteins can be highly complicated and suffer from signal overlap. Moreover, as molecular size increases the linewidths of NMR signals broaden, which decreases sensitivity and causes further spectral congestion. Both effects can limit the type and quality of information available from NMR data. Problems associated with signal overlap and signal broadening can often be alleviated though the use of alternative, non-uniform isotopic labelling patterns. Specific isotopic labelling ‘turns on' signals at selected sites while the rest of the protein is NMR-invisible. Conversely, specific isotopic unlabelling (also called ‘reverse' labelling) ‘turns off' selected signals while the rest of the protein remains NMR-visible. Both approaches can simplify NMR spectra, improve sensitivity, facilitate resonance assignment and permit a range of different NMR strategies when combined with other labelling tools and NMR experiments. Here, we review methods for producing proteins with enrichment of stable NMR-visible isotopes, with particular focus on residue-specific labelling and reverse labelling using Escherichia coli expression systems. We also explore how these approaches can aid NMR studies of proteins.

Introduction

Unlike many other spectroscopic techniques, NMR spectroscopy produces spectra in which it is possible to identify signals that correspond to specific atoms in the target molecule. NMR spectroscopy can, therefore, provide information about molecular structure, dynamics, interactions and biological function at atomic resolution even in molecules as large as proteins or nucleic acids. The full power of the technique is unlocked through the process of resonance assignment: That is, linking a signal in an NMR spectrum to a specific nucleus in the molecule. With assignments in hand, it is possible to determine information about local and global chemistry (i.e. structure, interactions, modifications, etc.) and how this changes with time (i.e. dynamics, kinetics, etc.), and to link this information to prior knowledge of the molecule (e.g. its basic chemical structure, experimental conditions, etc.). Resonance assignment requires being able to resolve individual signals in NMR spectra. Two major issues that complicate the assignment process are signal overlap (resolution) and signal-to-noise ratio (sensitivity). These two factors, which are often linked, can be addressed by combining specialised sample preparation techniques (i.e. isotopic labelling) with appropriate spectroscopic methods.

Biomolecular NMR spectroscopy and low abundance stable isotopes

Spin-½ nuclei are particularly important for the NMR spectroscopy of biomacromolecules. The ¹H isotope (natural abundance level: 99.985%) is spin-½, which means that ¹H NMR spectra can be recorded of proteins without the need for any special isotopic enrichment schemes. However, the dispersion of ¹H chemical shifts typically found in spectra of proteins is relatively narrow and consequently it is common for ¹H NMR spectra to suffer from signal overlap. In practice, it becomes challenging to resolve individual ¹H NMR signals in NMR spectra of proteins larger than 10 kDa.

Spin-½ isotopes of carbon (¹³C) and nitrogen (¹⁵N) offer improved signal dispersion over ¹H. However, both have low natural abundance (¹³C at 1.1% and ¹⁵N at 0.05%) and it is, therefore, necessary to isotopically enrich proteins with these nuclei. Using a protein labelled with ¹³C and/or ¹⁵N affords better separation of signals by enabling the use of heteronuclear NMR experiments [1]. These experiments can correlate different sets of ¹H, ¹³C and/or ¹⁵N nuclei (spin systems) across different dimensions to generate multi-dimensional datasets, which can greatly reduce signal overlap seen in n-dimensional ¹H spectra of proteins [1].

The combination of uniform isotopic labelling of proteins and multi-dimensional heteronuclear experiments is the foundation of modern biomolecular NMR spectroscopy. That said, the application of NMR spectroscopy to proteins is not always straightforward and it is often necessary to overcome significant obstacles.

Too many signals

NMR spectroscopy can potentially detect signals from each NMR-visible nucleus in a protein, which can mean that even spectra of small proteins will have several hundred observable signals. This problem scales with the size of the protein: As molecular size increases so does the number of observable signals and the likelihood of nuclei having overlapping resonance frequencies.

Size is not the only issue that causes spectral congestion. Many proteins have low or reduced complexity sequences which can lead to signal overlap due to high numbers of residues of the same type or repeats of the same or similar sequence. Transmembrane proteins are often enriched in aliphatic and aromatic amino acids, and regions of NMR spectra where nuclei from these residues resonate can become highly congested [2]. Intrinsically disordered sequences can also present congested NMR spectra due to the lack of chemical shift dispersion that results from secondary and tertiary structure, and that fact that such sequences are often enriched in certain amino acid types and depleted of others [3]. Extreme examples are proteins carrying repeats of certain amino acid types, e.g. poly-glutamine stretches.

Bigger is not always better

As well as issues relating to signal overlap, NMR spectra of larger proteins also suffer from reduced sensitivity. Larger molecules tumble more slowly in solution, which causes more rapid decay (relaxation) of the NMR signal, broadening of the signal linewidth and concomitant decrease in signal intensity (sensitivity). The principal problem here is that ¹H nuclei efficiently relax other ¹H, ¹³C and ¹⁵N nuclei in their local vicinity. This effect increases with the size of the molecule and becomes highly detrimental for biomolecular NMR studies of proteins larger than 20–25 kDa.

The density of protons in a protein can be reduced by producing the sample in a deuterated expression medium. Deuteration replaces protons with deuterium atoms, which have a smaller impact on the relaxation rates of nearby nuclei [4]. Protein deuteration levels of up to 86% can be achieved using minimal media prepared in deuterium oxide (D₂O) rather than water [5]. Should higher levels be required then all protons in the culture medium need to be replaced with deuterium, which can be achieved through the use of deuterated carbon sources (e.g. [U-²H,¹³C] glucose) [5]. While this approach successfully reduces the levels of ¹H nuclei in proteins, it creates another problem: Protons are commonly used in NMR experiments to excite the system and to detect the final signal, and so retaining some ¹H nuclei at select sites is beneficial. Protons can be selectively re-introduced into deuterated proteins by exposing the sample to water-based buffers (e.g. during purification) to facilitate the exchange of deuterium for protons in labile sites. Alternatively, protonated (i.e. natural abundance) molecules such as amino acids or metabolic precursors can be added to deuterated expression medium to introduce protons at specific sites in the protein (see below).

Combining deuteration with relaxation-optimised spectroscopic approaches such as TROSY can enable NMR characterisation of much larger proteins and protein assemblies, potentially exceeding MDa sizes (under favourable situations) [6,7].

Isotopic labelling approaches for protein NMR spectroscopy

Many sample preparation approaches have been developed to alleviate issues of signal overlap and low sensitivity in the NMR spectra of proteins. At the base of most of these is the process of preparing a protein that is uniformly enriched in ¹³C and/or ¹⁵N nuclei. Protocols for uniform ¹³C and/or ¹⁵N enrichment of recombinant proteins produced in Escherichia coli are well established [8]. The isotopic labelling pattern of a recombinant protein over expressed from E. coli can be easily modified by adjusting the labelling pattern of the proton, nitrogen and carbon sources and by supplementing the medium with amino acids or metabolic precursors with the desired level and type of isotopic enrichment. These approaches can be applied to smaller proteins (<15 kDa) by over-expressing from media prepared in water, or to larger proteins (>20 kDa) by over-expression from media prepared with ²H₂O.

Uniform or site-specific labelling with ¹⁵N and/or ¹³C

An expression medium containing ¹⁵NH₄Cl as the sole nitrogen source would produce a [U-¹⁵N]-labelled protein, i.e. a protein in which each nitrogen site is ¹⁵N labelled. A 2D (¹H,¹⁵N) HSQC spectrum of such a protein should show a single crosspeak for each residue (excluding proline) as well as crosspeaks for side-chain amine groups (Figure 1a). The enrichment pattern of the protein can be modified by adjusting the composition of the expression medium used to produce it. For example, supplementing unlabelled minimal medium with [U-¹⁵N]-labelled lysine introduces ¹⁵N label at nitrogen sites in lysine residues [9]. A 2D (¹H,¹⁵N) HSQC spectrum of a protein with this labelling pattern would only show crosspeaks for each lysine NH group in the protein and hence be considerably simpler than that of a uniformly labelled sample (Figure 1a). Similar approaches can be considered for metabolic precursors of amino acids.

Turning NMR signals on and off using selective labelling or reverse labelling.

Figure 1.

(a) 2D (1H,15N) HSQC spectra of ubiquitin prepared with [U-15N] labelled expression medium (uniform labelling; left), unlabelled medium supplemented with [15N]-lysine (residue-specific labelling; centre), or [U-15N] labelled expression medium supplemented with unlabelled lysine (reverse labelling; right; red arrows indicate missing crosspeaks). (b) 2D (1H,13C) HSQC spectra of ubiquitin prepared with [U-13C] labelled expression medium (uniform labelling; left) or unlabelled medium supplemented with 2-[13C]-methyl acetolactate (specific labelling of leucine and valine proS methyl groups; right). (c) 2D (1H,13C) HSQC spectra of ubiquitin prepared with [U-13C] labelled expression medium (uniform labelling; left) or [U-13C] labelled expression medium supplemented with unlabelled phenylpyruvate (reverse labelling of phenylalanine; right). Data taken from Rasia et al. 2012 (figure 1A,C) [12] and Gans et al. 2010 (figure 1B) [10].

View large Download slide

Turning NMR signals on and off using selective labelling or reverse labelling.

(a) 2D (¹H,¹⁵N) HSQC spectra of ubiquitin prepared with [U-¹⁵N] labelled expression medium (uniform labelling; left), unlabelled medium supplemented with [¹⁵N]-lysine (residue-specific labelling; centre), or [U-¹⁵N] labelled expression medium supplemented with unlabelled lysine (reverse labelling; right; red arrows indicate missing crosspeaks). (b) 2D (¹H,¹³C) HSQC spectra of ubiquitin prepared with [U-¹³C] labelled expression medium (uniform labelling; left) or unlabelled medium supplemented with 2-[¹³C]-methyl acetolactate (specific labelling of leucine and valine proS methyl groups; right). (c) 2D (¹H,¹³C) HSQC spectra of ubiquitin prepared with [U-¹³C] labelled expression medium (uniform labelling; left) or [U-¹³C] labelled expression medium supplemented with unlabelled phenylpyruvate (reverse labelling of phenylalanine; right). Data taken from Rasia et al. 2012 (figure 1A,C) [12] and Gans et al. 2010 (figure 1B) [10].

If a protein is produced using [U-¹³C] glucose (CAS number of the unlabelled molecule: 50-99-7) as the sole carbon source, a 2D (¹H,¹³C) HSQC spectrum should show crosspeaks for each CH group in the protein (Figure 1b). Supplementing an unlabelled expression medium with 2-[¹³C]-methyl acetolactate (CAS: 71698-08-3), which is a precursor in the biosynthesis of leucine and valine [10], produces a protein with [¹H,¹³C] labelling of proS methyl groups. The resulting 2D (¹H,¹³C) HSQC spectrum is substantially simplified compared to the uniformly labelled protein as it only shows crosspeaks for pro-S methyl groups of leucine and valine (Figure 1b). All other CH sites are not ¹³C labelled and so are not observed in the spectrum.

Selective reverse labelling of sites in ¹⁵N and/or ¹³C labelled proteins

An alternative approach is to reverse label specific residues or atoms in an otherwise uniformly labelled protein [11–13]. Supplementing a minimal medium containing ¹⁵NH₄Cl with unlabelled lysine produces a protein in which all nitrogen sites are ¹⁵N labelled except for those in lysine. The resulting 2D (¹H,¹⁵N) HSQC spectrum shows that crosspeaks corresponding to lysine residues have disappeared (Figure 1a). Likewise, supplementing ¹³C and/or ¹⁵N labelled media with unlabelled precursors can turn off signals of specific sets of atoms. For example, adding natural abundance phenylpyruvate (CAS: 114-76-1), a precursor of phenylalanine, to a medium containing [U-¹³C]-glucose produces a [¹³C]-labelled protein in which the side-chain groups of phenylalanine are unlabelled [12]. Comparing 2D (¹H,¹³C) HSQC spectra of uniform and reverse labelled samples show select peaks disappear in the reverse labelled sample (Figure 1c).

Isotopic labelling using amino acids and metabolites

Specific labelling and reverse labelling of residues and atom subsets is not uniformly applicable. Not every amino acid or precursor will be incorporated ‘as is' into the target protein. Many amino acids and their precursors are metabolised by E. coli, which results in the scrambling of the isotopic labelling pattern of the molecule added to the culture medium. There has been considerable research over the last 30–40 years to identify amino acids and metabolic precursors that can be used to manipulate the isotopic enrichment pattern of recombinant proteins overexpressed in E. coli with no or minimal isotopic scrambling [4,14,15]. Only certain amino acids are compatible, typically those with an isolated biosynthesis pathway that includes one or more irreversible step. Biosynthetic precursors have emerged as an alternative to using the full amino acid, as precursors often lack stereochemical sites that render the full amino acid expensive to synthesise. Below, we summarise current procedures for residue-specific isotopic labelling and reverse labelling of proteins.

Routes for targeting aliphatic residues

Aliphatic residues (Leu, Ile, Val and Ala) represent a highly desirable target for specific isotope labelling or reverse labelling due to their relatively high abundance (Leu: 9%, Ile: 5.2%, Val: 6.6% and Ala: 8.3%) and broad distribution across proteins [16,17]. Moreover, these residues contain methyl groups, which are excellent NMR probes for studying larger proteins [18].

Specific labelling and reverse labelling of branched-chain aliphatic amino acids has been used for generating backbone, side-chain and stereospecific assignments as well as for measuring NOEs [4,12,13,19–21].

The carbon atoms of leucine and valine can be labelled or reverse labelled with minimal scrambling through the use of the biosynthetic precursors α-ketoisovalerate (CAS: 759-05-7) or acetolactate (CAS: 71698-08-3) (Figure 2) [4,10,12,22–24]. Both precursors are chemically synthesised as a racemic mixture, which impacts how they label prochiral methyl groups. Only the 2S stereoisomer of acetolactate is a substrate of ketol-acid reductoisomerase (EC: 1.1.1.86), which means that it is possible to use acetolactate to stereospecifically target the prochiral methyl groups of leucine and valine [10,19]. α-ketoisovalerate can be used for applications that require (or can tolerate) labelling of both prochiral methyl groups. Leucine alone can be labelled by the addition of 2-ketoisocaproate (CAS: 4502-00-5) (Figure 2), a precursor of leucine that sits after the divergence of leucine and valine biosynthesis [4,25]. Selectively deuterated versions of these precursors can be used to label larger proteins in combination with deuterated glucose and ²H₂O [10,25].

Metabolic precursors that can be used for isotopic labelling and reverse labelling of the carbon sites in branched-chain aliphatic amino acids.

Figure 2.

Sites corresponding to labelled or unlabelled groups are coloured to show their starting and end positions. The CAS numbers of the unlabelled precursors are given.

View large Download slide

Metabolic precursors that can be used for isotopic labelling and reverse labelling of the carbon sites in branched-chain aliphatic amino acids.

Sites corresponding to labelled or unlabelled groups are coloured to show their starting and end positions. The CAS numbers of the unlabelled precursors are given.

The carbon atoms of isoleucine can be labelled with 2-ketobutyrate (CAS: 2013-26-5) or 2-hydroxy-2-ethyl-3-ketobutyrate (CAS: 595-85-7) (Figure 2). These molecules also differentially target the methyl groups of Ile: 2-ketobutyrate is used to selectively label the Ile-δ₁ methyl group, while 2-(S)-hydroxy-2-ethyl-3-ketobutyrate can be used to target Ile-δ₁ and/or Ile-γ₂ methyl groups [26–28].

Biosynthetic precursors that target isoleucine, leucine and valine can be used in combination with other metabolites or amino acids which suppress scrambling of carbon sites and off-target effects. For instance, prochiral methyl groups of valine can be selectively labelled by the addition of labelled pro-R acetolactate-¹³C₄ (CAS: 71698-08-3) or pro-S acetolactate-¹³C₃ together with L-Leucine at natural abundance [29]. More details on methyl labelling of isoleucine, leucine and valine can be found in Kerfah et al. 2015 [4] and Schultz and Sprangers 2020 [6].

The methyl group of alanine provides a good probe for monitoring the local structure and dynamics of the protein backbone [30]. Specific isotopic labelling or reverse labelling of alanine is hampered by the presence of alanine transaminases which convert alanine into the widely used metabolite pyruvate [9]. Pyruvate is an early precursor in isoleucine, valine and leucine biosynthesis, which means that the labelling pattern of alanine will be scrambled into other aliphatic residues in the target protein. The addition of 1 g/L natural abundance alanine (CAS: 56-41-7) will result in approximately 50% loss of signal from valine [31]. That said, scramble-free ¹³C labelling of the carbonyl of alanine is possible via the addition of 1-[¹³C]-alanine [32], while labelling of the alanine methyl group can be achieved by adding other metabolites to suppress crosstalk between biosynthesis pathways [33,34]. In principle, reverse labelling can be achieved using similar approaches by adding alanine at the natural abundance and other precursors with ¹³C labelling, though this would be expensive and impractical. [¹⁵N] labelling or reverse labelling of the backbone amine group of alanine, isoleucine, leucine and valine is problematic due to the action of various transaminases [35].

Routes for targeting aromatic residues

Aromatic residues are found at interaction interfaces and in the hydrophobic core of proteins and hence can serve as excellent reporters of protein structure, dynamics and interactions [36]. Tryptophan can be used as a sole carbon source by E. coli and so significant scrambling of carbon atoms occurs when tryptophan is added to the culture medium [37]. Additionally, tryptophanase (EC:4.1.99.1) can convert tryptophan to indole, pyruvate and ammonia which leads to nitrogen scrambling as ammonia is used in amino acid synthesis [31]. Aromatic amino acid transaminases cause significant nitrogen scrambling between tyrosine and phenylalanine when attempting to label or reverse label with either amino acid [31].

An early example of the selective labelling of aromatic residues through metabolic precursors was the use of shikimic acid (CAS: 138-59-0) to label aromatic protons of phenylalanine, tyrosine and tryptophan against a deuterated background (Figure 3) [38]. However, the synthesis of isotopically labelled shikimic acid is complicated, which has precluded widespread use. Phenylpyruvate (CAS: 156-06-9) and 4-hydroxy phenylpyruvate (CAS: 156-39-8) have been used to reverse label the carbon atoms phenylalanine and tyrosine, respectively (Figure 3) [12]. ¹³C labelled versions of these precursors were later reported for isotopic labelling of phenylalanine and tyrosine [39]. Indole (CAS: 120-72-9) can be used for selective tryptophan labelling and reverse labelling of the tryptophan side chain (Figure 3) [40]. Isotopic labelling and reverse labelling of tryptophan can also be achieved through the use of indolepyruvate (CAS: 392-12-1) (Figure 3), which is part of the tryptophan degradation pathway rather than the biosynthesis pathway [41]. Anthranilic acid (CAS: 118-92-3) (Figure 3) has also been reported as an alternative tryptophan labelling precursor, which allows both ¹⁵N and ¹³C labelling of side-chain sites with minimal scrambling [42].

Metabolic precursors used for isotopic labelling and reverse labelling of phenylalanine, tryptophan and tyrosine.

Figure 3.

Sites corresponding to labelled or unlabelled groups are coloured to show their starting and end positions. The CAS numbers for the unlabelled precursors are given.

View large Download slide

Metabolic precursors used for isotopic labelling and reverse labelling of phenylalanine, tryptophan and tyrosine.

Sites corresponding to labelled or unlabelled groups are coloured to show their starting and end positions. The CAS numbers for the unlabelled precursors are given.

Histidine can be labelled or reverse labelled by the addition of the amino acid itself to the expression media. Histidine can also be labelled, without scrambling, by the metabolic precursor imidazolepyruvate (CAS: 2504-83-8; Figure 4) [43].

Histidine (un)labelling by the metabolic precursor imidazolepyruvate with incorporated atoms shown in red.

Figure 4.

The CAS number for the unlabelled precursor is given.

View large Download slide

Histidine (un)labelling by the metabolic precursor imidazolepyruvate with incorporated atoms shown in red.

The CAS number for the unlabelled precursor is given.

Routes for targeting polar residues

Serine is connected to glycine via a glycine-hydroxymethyltransferase (EC: 2.1.2.1), which is in turn linked to threonine. Serine is also a precursor of tryptophan and cysteine biosynthesis and can be converted into pyruvate by serine dehydratase (EC: 4.3.1.17). Currently, there is no protocol for scramble-free specific labelling or reverse labelling of serine using traditional expression hosts. Similarly, cysteine is converted to pyruvate by cysteine desulfhydrases (EC: 4.4.1.28) which leads to significant scrambling when cysteine is added to the culture medium.

Threonine is connected to glycine, serine, cysteine, tryptophan and isoleucine, which leads to significant scrambling for nitrogen labelling or reverse labelling [44]. For labelling or reverse labelling of carbon sites, threonine is connected to isoleucine and glycine biosynthesis and can cause significant scrambling when added to the expression media. This scrambling effect has been overcome by the addition of the isoleucine precursor 2-ketobutyrate (CAS: 2013-26-5) (or isoleucine) and glycine to the expression media [44,45].

Asparagine and glutamine are particularly difficult amino acids to specifically label or reverse label. Specific ¹⁵N labelling of these amino acids has been achieved through the use of media supplemented with ¹⁵NH₄Cl and all amino acids at natural abundance apart from asparagine or glutamine [46]. Specific labelling of side-chain sites of these residues using metabolic precursors has not been possible due to their position in metabolic pathways. Asparagine and glutamine synthesis is closely linked to aspartate and glutamate synthesis, which are used in the synthesis of many amino acids. In addition, glutamate is the primary nitrogen donor in amino acid biosynthesis. This means that scramble-free specific labelling of these amino acids with the residues themselves or their precursors is not possible without the addition of a full amino acid complement to the media or the use of auxotrophic strains or cell-free systems [37,47].

Routes for targeting charged residues

The final steps of both lysine and arginine biosynthesis are irreversible, which means that both amino acids can be used directly for labelling and reverse labelling (Figure 1a), thus negating the need for supplemental precursors to reduce isotopic scrambling.

For reasons discussed above, scrambling free specific labelling of aspartate and glutamate either with the amino acids themselves or with metabolic precursors has not been achieved.

Special cases

Methionine is commonly used to introduce methyl labelled probes for NMR analyses of proteins [7,48,49]. The relatively low abundance of methionine in proteins (2.4%) can reduce the chance of spectral overlap than with aliphatic residues [17]. Methionine can be both isotopically labelled and reverse labelled with minimal scrambling by the addition of the amino acid itself to the media [31,50]. An alternative approach uses the metabolic precursor methylthio-2-oxobutanoate (CAS: 583-92-6) (Figure 5) for labelling without nitrogen [51].

Methionine (un)labelling by methylthio-2-oxobutanoate with isotopically labelled sites indicated in red.

Figure 5.

View large Download slide

Methionine (un)labelling by methylthio-2-oxobutanoate with isotopically labelled sites indicated in red.

Glycine is linked to serine and threonine by glycine-hydroxymethyltransferase (EC: 2.1.2.1) and threonine aldolase (EC: 4.1.2.5), respectively, meaning extensive scrambling for both carbon and nitrogen sites occurs when attempting to label or reverse label with glycine.

Proline can be used as a sole carbon and nitrogen source in bacterial cell culture and so induces significant scrambling when added to the media [12,31,37,52]. Metabolic precursors of proline have not been used for the production of proteins with proline labelling or reverse labelling. The interconnectivity of the proline and other amino acid biosynthetic pathways makes this unlikely.

Discussion

In this review, we have summarised approaches for manipulating the isotopic labelling patterns of recombinant proteins for NMR studies. We have focused on approaches that allow labelling or reverse labelling of specific sets of atoms or residues and discussed how this can be achieved through the addition of amino acids or their precursors to bacterial cell culture media. We have highlighted applications to solution NMR spectroscopy, but the labelling approaches described would also benefit solid-state NMR spectroscopy of proteins.

Extensive research into isotopic labelling protocols means that today's protein NMR scientist can make use of a wide range of labelling and reverse labelling schemes and enrichment patterns. The task now is to utilise these schemes to the greatest effect. In addition to helping with spectral crowding and line broadening, specific isotope labelling or reverse labelling can provide residue-type assignment. These approaches also place site-specific probes into protein that can report on the structure, dynamics and binding (often bypassing the need for full resonance assignment) [12]. For example, a study of the interaction of yeast ubiquitin hydrolase with ubiquitin used a sample prepared with specific ¹³C labelling of Met-Cε, Ala-Cβ, His-Cε, Tyr-Cε and Trp-Cδ and ¹⁵N labelling of Arg backbone amide groups [53]. Residues involved at the interface were predicted based on the number and type of amino acid chemical shifts that were perturbed on complex formation. These assignment predictions were then used as an input for computational docking using HADDOCK [54] and the resulting models were consistent with a crystal structure of a related complex. The potential of site-specific isotopic labelling or reverse labelling to provide useful information without full assignments can be particularly useful where full assignment may not be possible or be too time consuming to obtain. In the post-AlphaFold2 world [55], lower levels of resonance assignment will likely often be sufficient to confirm a predicted structure or to link protein structure to function [56].

This review has exclusively focussed on the over-expression of recombinant proteins using standard E. coli-based techniques, i.e. cytosolic protein production under the control of an IPTG-inducible T7-based expression system. However, not all proteins can be produced using E. coli-based approaches and eukaryotic hosts are often required. There has been considerable progress in isotopic labelling using eukaryotic cell types, including yeast, insect and mammalian cells [57–63]. Differences in the metabolic processing of amino acids and their precursors, and different requirements for cell culture media have meant that the production of labelled proteins using eukaryotic systems is still not widely reported in the literature. An overriding concern for isotopic labelling in insect or mammalian cells is the complexity of the cell culture medium compared to the minimal recipes that can be used for E. coli. Broadly speaking, modifying eukaryotic cell culture media composition to support isotopic labelling is considerably more expensive than bacterial cell culture. That said, a number of impressive studies have been reported that use eukaryotic expression systems to generate labelled proteins for NMR studies [60,64].

In vitro or ‘cell free' protein synthesis is an alternative and highly adaptable approach for producing labelled protein, providing it is compatible with the protein of interest. Working with an S30 cell extract significantly reduces issues from metabolic scrambling compared to protein expression from live cells, and allows residue-specific labelling of each of the 20 proteinogenic amino acids [65,66]. Issues remain with the scrambling of Asp, Asn, Gln and Glu but these can be alleviated through the use of small molecule inhibitors or by using an auxotrophic strain of E. coli to produce the cell extract [67,68]. Moreover, the smaller scale of in vitro protein synthesis reactions compared to cell culture reduces the costs of reagents. Consequently, the use of amino acids with isotopic enrichment patterns that are complicated or expensive to synthesise becomes feasible because only very low milligram-level quantities are required [69]. Lastly, in vitro protein synthesis using amber stop codons and pre-charged tRNA molecules can allow isotopic labelling of a single residue position in the protein of interest [66,70].

NMR spectra can be extremely rich in information. A drawback from selective labelling or reverse labelling of proteins is that it greatly reduces the number of NMR reporters in the target molecule, which can limit some applications and the types of question that can be addressed. Despite that, there are multiple examples of where specific labelling of proteins for NMR studies has considerable benefit over uniform labelling. Ultimately, the isotopic labelling pattern chosen for a particular protein target will depend on the parameters of the system being studied and the biological questions of interest. Many projects do not require a high number of NMR-visible sites or complete resonance assignment to provide relevant information. As NMR spectroscopy moves away from being a method for solving the 3D structures of proteins and embraces a role as a site-resolved spectroscopic technique, quick and efficient access to site-specific NMR probes will become more and more important. Specific isotopic labelling or reverse labelling can provide these important site-specific NMR-visible probes.

Perspectives

Uniform enrichment of recombinantly produced samples with ¹³C and/or ¹⁵N isotopes is a fundamental platform for NMR studies of proteins. Approaches that allow site-specific modulation of the enrichment pattern of a protein can simplify the process of data analysis, provide resonance assignment information and unlock experimental strategies for NMR analysis of structure, dynamics and function.
Amino acids and/or their biosynthetic precursors can be added to bacterial culture media to achieve site- or residue-specific labelling or reverse labelling of proteins either alone or in combination with standard ¹³C and ¹⁵N labelling protocols. We review the approaches for achieving different isotopic labelling patterns, outline the motivations for employing these approaches and discuss the experimental benefits for their use.
Combining experiments that exploit proteins with site-specific labelling patterns with the improved prediction of 3D structure now available via tools such as AlphaFold promises to greatly broaden the range of targets and biological questions that can be addressed by NMR spectroscopy.

Competing Interests

NMR-Bio manufacture reagents for isotopic labelling of samples for NMR studies of proteins. The authors declare no other competing interests associated with this manuscript.

Funding

B.R. receives PhD funding from White Rose BBSRC DTP (BB/M011151/1). M.J.P. acknowledges funding from EPSRC (EP/W024063/1).

Open Access

Open access for this article was enabled by the participation of the University of York in an all-inclusive Read & Publish agreement with Portland Press and the Biochemical Society under a transformative agreement with JISC.

Author Contributions

B.R. and M.J.P. outlined the review. B.R. wrote the first draft and made the figures. M.J.P., E.C. and R.K. revised the text.

References

1

Sattler

,

M.

,

Schleucher

,

J.

and

Griesinger

,

C.

(

1999

)

Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients

.

Prog. Nucl. Magn. Reson. Spectrosc.

34

,

93

–

158

https://doi.org/10.1016/s0079-6565(98)00025-9

Google Scholar

Crossref

2

Danmaliki

,

G.I.

and

Hwang

,

P.M.

(

2020

)

Solution NMR spectroscopy of membrane proteins

.

Biochim. Biophys. Acta-Biomembr.

1862

,

183356

https://doi.org/10.1016/j.bbamem.2020.183356

Google Scholar

Crossref

PubMed

3

Konrat

,

R.

(

2014

)

NMR contributions to structural dynamics studies of intrinsically disordered proteins

.

J. Magn. Reson.

241

,

74

–

85

https://doi.org/10.1016/j.jmr.2013.11.011

Google Scholar

Crossref

PubMed

4

Kerfah

,

R.

,

Plevin

,

M.J.

,

Sounier

,

R.

,

Gans

,

P.

and

Boisbouvier

,

J.

(

2015

)

Methyl-specific isotopic labeling: a molecular tool box for solution NMR studies of large proteins

.

Curr. Opin. Struct. Biol.

32

,

113

–

122

https://doi.org/10.1016/j.sbi.2015.03.009

Google Scholar

Crossref

PubMed

5

Leiting

,

B.

,

Marsilio

,

F.

and

O'Connell

,

J.F.

(

1998

)

Predictable deuteration of recombinant proteins expressed in Escherichia coli

.

Anal. Biochem.

265

,

351

–

355

https://doi.org/10.1006/abio.1998.2904

Google Scholar

Crossref

PubMed

6

Schutz

,

S.

and

Sprangers

,

R.

(

2020

)

Methyl TROSY spectroscopy: a versatile NMR approach to study challenging biological systems

.

Prog. Nucl. Magn. Reson. Spectrosc.

116

,

56

–

84

https://doi.org/10.1016/j.pnmrs.2019.09.004

Google Scholar

Crossref

PubMed

7

Mas

,

G.

,

Guan

,

J.Y.

,

Crublet

,

E.

,

Debled

,

E.C.

,

Moriscot

,

C.

,

Gans

,

P.

et al (

2018

)

Structural investigation of a chaperonin in action reveals how nucleotide binding regulates the functional cycle

.

Sci. Adv.

4

,

eaau4196

https://doi.org/10.1126/sciadv.aau4196

Google Scholar

Crossref

PubMed

8

McIntosh

,

L.P.

and

Dahlquist

,

F.W.

(

1990

)

Biosynthetic incorporation of ¹⁵N and ¹³C for assignment and interpretation of nuclear-magnetic-resonance spectra of proteins

.

Q. Rev. Biophys.

23

,

1

–

38

https://doi.org/10.1017/s0033583500005400

Google Scholar

Crossref

PubMed

9

Muchmore

,

D.C.

,

McIntosh

,

L.P.

,

Russell

,

C.B.

,

Anderson

,

D.E.

and

Dahlquist

,

F.W.

(

1989

)

Expression and ¹⁵N labeling of proteins for proton and ¹⁵N nuclear-magnetic-resonance

.

Methods Enzymol.

177

,

44

–

73

https://doi.org/10.1016/0076-6879(89)77005-1

Google Scholar

Crossref

PubMed

10

Gans

,

P.

,

Hamelin

,

O.

,

Sounier

,

R.

,

Ayala

,

I.

,

Dura

,

M.A.

,

Amero

,

C.D.

et al (

2010

)

Stereospecific isotopic labeling of methyl groups for NMR spectroscopic studies of high-molecular-weight proteins

.

Angew. Chem.-Int. Ed.

49

,

1958

–

1962

https://doi.org/10.1002/anie.200905660

Google Scholar

Crossref

11

Krishnarjuna

,

B.

,

Jaipuria

,

G.

,

Thakur

,

A.

,

D'Silva

,

P.

and

Atreya

,

H.S.

(

2011

)

Amino acid selective unlabeling for sequence specific resonance assignments in proteins

.

J. Biomol. NMR

49

,

39

–

51

https://doi.org/10.1007/s10858-010-9459-z

Google Scholar

Crossref

PubMed

12

Rasia

,

R.M.

,

Brutscher

,

B.

and

Plevin

,

M.J.

(

2012

)

Selective isotopic unlabeling of proteins using metabolic precursors: application to NMR assignment of intrinsically disordered proteins

.

ChemBioChem

13

,

732

–

739

https://doi.org/10.1002/cbic.201100678

Google Scholar

Crossref

PubMed

13

Vuister

,

G.W.

,

Kim

,

S.J.

,

Wu

,

C.

and

Bax

,

A.

(

1994

)

2D and 3D NMR-study of phenylalanine residues in proteins by reverse isotopic labeling

.

J. Am. Chem. Soc.

116

,

9206

–

9210

https://doi.org/10.1021/ja00099a041

Google Scholar

Crossref

14

Schorghuber

,

J.

,

Geist

,

L.

,

Platzer

,

G.

,

Feichtinger

,

M.

,

Bisaccia

,

M.

,

Scheibelberger

,

L.

et al (

2018

)

Late metabolic precursors for selective aromatic residue labeling

.

J. Biomol. NMR

71

,

129

–

140

https://doi.org/10.1007/s10858-018-0188-z

Google Scholar

Crossref

PubMed

15

Lacabanne

,

D.

,

Meier

,

B.H.

and

Bockmann

,

A.

(

2018

)

Selective labeling and unlabeling strategies in protein solid-state NMR spectroscopy

.

J. Biomol. NMR

71

,

141

–

150

https://doi.org/10.1007/s10858-017-0156-z

Google Scholar

Crossref

PubMed

16

Janin

,

J.

(

1979

)

Surface and inside volumes in globular proteins

.

Nature

277

,

491

–

492

https://doi.org/10.1038/277491a0

Google Scholar

Crossref

PubMed

17

McCaldon

,

P.

and

Argos

,

P.

(

1988

)

Oligopeptide biases in protein sequences and their use in predicting protein coding regions in nucleotide-sequences

.

Proteins

4

,

99

–

122

https://doi.org/10.1002/prot.340040204

Google Scholar

Crossref

PubMed

18

Ruschak

,

A.M.

and

Kay

,

L.E.

(

2010

)

Methyl groups as probes of supra-molecular structure, dynamics and function

.

J. Biomol. NMR

46

,

75

–

87

https://doi.org/10.1007/s10858-009-9376-1

Google Scholar

Crossref

PubMed

19

Kerfah

,

R.

,

Hamelin

,

O.

,

Boisbouvier

,

J.

and

Marion

,

D.

(

2015

)

CH₃-specific NMR assignment of alanine, isoleucine, leucine and valine methyl groups in high molecular weight proteins using a single sample

.

J. Biomol. NMR

63

,

389

–

402

https://doi.org/10.1007/s10858-015-9998-4

Google Scholar

Crossref

PubMed

20

Tugarinov

,

V.

,

Choy

,

W.Y.

,

Orekhov

,

V.Y.

and

Kay

,

L.E.

(

2005

)

Solution NMR-derived global fold of a monomeric 82-kDa enzyme

.

Proc. Natl Acad. Sci. U.S.A.

102

,

622

–

627

https://doi.org/10.1073/pnas.0407792102

Google Scholar

Crossref

PubMed

21

Sprangers

,

R.

and

Kay

,

L.E.

(

2007

)

Quantitative dynamics and binding studies of the 20S proteasome by NMR

.

Nature

445

,

618

–

622

https://doi.org/10.1038/nature05512

Google Scholar

Crossref

PubMed

22

Tugarinov

,

V.

and

Kay

,

L.E.

(

2004

)

An isotope labeling strategy for methyl TROSY spectroscopy

.

J. Biomol. NMR

28

,

165

–

172

https://doi.org/10.1023/B:JNMR.0000013824.93994.1f

Google Scholar

Crossref

PubMed

23

Lichtenecker

,

R.

,

Ludwiczek

,

M.L.

,

Schmid

,

W.

and

Konrat

,

R.

(

2004

)

Simplification of protein NOESY spectra using bioorganic precursor synthesis and NMR spectral editing

.

J. Am. Chem. Soc.

126

,

5348

–

5349

https://doi.org/10.1021/ja049679n

Google Scholar

Crossref

PubMed

24

Tugarinov

,

V.

and

Kay

,

L.E.

(

2003

)

Ile, Leu, and Val methyl assignments of the 723-residue malate synthase G using a new labeling strategy and novel NMR methods

.

J. Am. Chem. Soc.

125

,

13868

–

13878

https://doi.org/10.1021/ja030345s

Google Scholar

Crossref

PubMed

25

Lichtenecker

,

R.J.

,

Coudevylle

,

N.

,

Konrat

,

R.

and

Schmid

,

W.

(

2013

)

Selective isotope labelling of leucine residues by using alpha-ketoacid precursor compounds

.

ChemBioChem

14

,

818

–

821

https://doi.org/10.1002/cbic.201200737

Google Scholar

Crossref

PubMed

26

Ruschak

,

A.M.

,

Velyvis

,

A.

and

Kay

,

L.E.

(

2010

)

A simple strategy for ¹³C,¹H labeling at the Ile-gamma 2 methyl position in highly deuterated proteins

.

J. Biomol. NMR

48

,

129

–

135

https://doi.org/10.1007/s10858-010-9449-1

Google Scholar

Crossref

PubMed

27

Gardner

,

K.H.

and

Kay

,

L.E.

(

1997

)

Production and incorporation of ¹⁵N, ¹³C, ²H (¹H-δ1 methyl) isoleucine into proteins for multidimensional NMR studies

.

J. Am. Chem. Soc.

119

,

7599

–

7600

https://doi.org/10.1021/ja9706514

Google Scholar

Crossref

28

Ayala

,

I.

,

Hamelin

,

O.

,

Amero

,

C.

,

Pessey

,

O.

,

Plevin

,

M.J.

,

Gans

,

P.

et al (

2012

)

An optimized isotopic labelling strategy of isoleucine-γ(2) methyl groups for solution NMR studies of high molecular weight proteins

.

Chem. Commun.

48

,

1434

–

1436

https://doi.org/10.1039/c1cc12932e

Google Scholar

Crossref

29

Mas

,

G.

,

Crublet

,

E.

,

Hamelin

,

O.

,

Gans

,

P.

and

Boisbouvier

,

J.

(

2013

)

Specific labeling and assignment strategies of valine methyl groups for NMR studies of high molecular weight proteins

.

J. Biomol. NMR

57

,

251

–

262

https://doi.org/10.1007/s10858-013-9785-z

Google Scholar

Crossref

PubMed

30

Godoy-Ruiz

,

R.

,

Guo

,

C.Y.

and

Tugarinov

,

V.

(

2010

)

Alanine methyl groups as NMR probes of molecular structure and dynamics in high-molecular-weight proteins

.

J. Am. Chem. Soc.

132

,

18340

–

18350

https://doi.org/10.1021/ja1083656

Google Scholar

Crossref

PubMed

31

Bellstedt

,

P.

,

Seiboth

,

T.

,

Hafner

,

S.

,

Kutscha

,

H.

,

Ramachandran

,

R.

and

Gorlach

,

M.

(

2013

)

Resonance assignment for a particularly challenging protein based on systematic unlabeling of amino acids to complement incomplete NMR data sets

.

J. Biomol. NMR

57

,

65

–

72

https://doi.org/10.1007/s10858-013-9768-0

Google Scholar

Crossref

PubMed

32

Takeuchi

,

K.

,

Ng

,

E.

,

Malia

,

T.J.

and

Wagner

,

G.

(

2007

)

1-¹³C amino acid selective labeling in a (HN)-²H¹⁵N background for NMR studies of large proteins

.

J. Biomol. NMR

38

,

89

–

98

https://doi.org/10.1007/s10858-007-9152-z

Google Scholar

Crossref

PubMed

33

Ayala

,

I.

,

Sounier

,

R.

,

Use

,

N.

,

Gans

,

P.

and

Boisbouvier

,

J.

(

2009

)

An efficient protocol for the complete incorporation of methyl-protonated alanine in perdeuterated protein

.

J. Biomol. NMR

43

,

111

–

119

https://doi.org/10.1007/s10858-008-9294-7

Google Scholar

Crossref

PubMed

34

Isaacson

,

R.L.

,

Simpson

,

P.J.

,

Liu

,

M.

,

Cota

,

E.

,

Zhang

,

X.

,

Freemont

,

P.

et al (

2007

)

A new labeling method for methyl transverse relaxation-optimized spectroscopy NMR spectra of alanine residues

.

J. Am. Chem. Soc.

129

,

15428

–

15429

https://doi.org/10.1021/ja0761784

Google Scholar

Crossref

PubMed

35

Waugh

,

D.S.

(

1996

)

Genetic tools for selective labeling of proteins with α-¹⁵N-amino acids

.

J. Biomol. NMR

8

,

184

–

192

https://doi.org/10.1007/BF00211164

Google Scholar

Crossref

PubMed

36

Perez

,

L.M.

,

Ielasi

,

F.S.

,

Bessa

,

L.M.

,

Maurin

,

D.

,

Kragelj

,

J.

,

Blackledge

,

M.

et al (

2022

)

Visualizing protein breathing motions associated with aromatic ring flipping

.

Nature

602

,

695

–

700

https://doi.org/10.1038/s41586-022-04417-6

Google Scholar

Crossref

PubMed

37

Reitzer

,

L.

(

2005

)

Catabolism of amino acids and related compounds

1

,

1

–

56

https://doi.org/10.1128/ecosalplus.3.4.7

Google Scholar

38

Rajesh

,

S.

,

Nietlispach

,

D.

,

Nakayama

,

H.

,

Takio

,

K.

,

Laue

,

E.D.

,

Shibata

,

T.

et al (

2003

)

A novel method for the biosynthesis of deuterated proteins with selective protonation at the aromatic rings of Phe, Tyr and Trp

.

J. Biomol. NMR

27

,

81

–

86

https://doi.org/10.1023/a:1024710729352

Google Scholar

Crossref

PubMed

39

Lichtenecker

,

R.J.

,

Weinhaupl

,

K.

,

Schmid

,

W.

and

Konrat

,

R.

(

2013

)

α-Ketoacids as precursors for phenylalanine and tyrosine labelling in cell-based protein overexpression

.

J. Biomol. NMR

57

,

327

–

331

https://doi.org/10.1007/s10858-013-9796-9

Google Scholar

Crossref

PubMed

40

Rodriguez-Mias

,

R.A.

and

Pellecchia

,

M.

(

2003

)

Use of selective TRP side chain labeling to characterize protein-protein and protein-ligand interactions by NMR spectroscopy

.

J. Am. Chem. Soc.

125

,

2892

–

2893

https://doi.org/10.1021/ja029221q

Google Scholar

Crossref

PubMed

41

Schroghuber

,

J.

,

Sara

,

T.

,

Bisaccia

,

M.

,

Schmid

,

W.

,

Konrat

,

R.

and

Lichtenecker

,

R.J.

(

2015

)

Novel approaches in selective tryptophan isotope labeling by using Escherichia coli overexpression media

.

ChemBioChem

16

,

746

–

751

https://doi.org/10.1002/cbic.201402677

Google Scholar

Crossref

PubMed

42

Schorghuber

,

J.

,

Geist

,

L.

,

Bisaccia

,

M.

,

Weber

,

F.

,

Konrat

,

R.

and

Lichtenecker

,

R.

(

2017

)

Anthranilic acid, the new player in the ensemble of aromatic residue labeling precursor compounds

.

J. Biomol. NMR

69

,

13

–

22

https://doi.org/10.1007/s10858-017-0129-2

Google Scholar

Crossref

PubMed

43

Schorghuber

,

J.

,

Geist

,

L.

,

Platzer

,

G.

,

Konrat

,

R.

and

Lichtenecker

,

R.J.

(

2017

)

Highly selective stable isotope labeling of histidine residues by using a novel precursor in E. coli-based overexpression systems

.

ChemBioChem

18

,

1487

–

1491

https://doi.org/10.1002/cbic.201700192

Google Scholar

Crossref

PubMed

44

Ayala

,

I.

,

Chiari

,

L.

,

Kerfah

,

R.

,

Boisbouvier

,

J.

,

Gans

,

P.

and

Hamelin

,

O.

(

2020

)

Asymmetric synthesis of methyl specifically labelled L-threonine and application to the NMR studies of high molecular weight proteins

.

ChemistrySelect

5

,

5092

–

5098

https://doi.org/10.1002/slct.202000827

Google Scholar

Crossref

45

Velyvis

,

A.

,

Ruschak

,

A.M.

and

Kay

,

L.E.

(

2012

)

An economical method for production of ²H, (CH₃)-¹³C-threonine for solution NMR studies of large protein complexes: application to the 670 kDa proteasome

.

PLoS One

7

,

e43725

https://doi.org/10.1371/journal.pone.0043725

Google Scholar

Crossref

PubMed

46

Cao

,

C.

,

Chen

,

J.L.

,

Yang

,

Y.

,

Huang

,

F.

,

Otting

,

G.

and

Su

,

X.C.

(

2014

)

Selective ¹⁵N-labeling of the side-chain amide groups of asparagine and glutamine for applications in paramagnetic NMR spectroscopy

.

J. Biomol. NMR

59

,

251

–

261

https://doi.org/10.1007/s10858-014-9844-0

Google Scholar

Crossref

PubMed

47

Goux

,

W.J.

,

Strong

,

A.A.D.

,

Schneider

,

B.L.

,

Lee

,

W.N.P.

and

Reitzer

,

L.J.

(

1995

)

Utilization of aspartate as a nitrogen-source in Escherichia-coli - analysis of nitrogen flow and characterization of the products of aspartate catabolism

.

J. Biol. Chem.

270

,

638

–

646

https://doi.org/10.1074/jbc.270.2.638

Google Scholar

Crossref

PubMed

48

Nygaard

,

R.

,

Zou

,

Y.Z.

,

Dror

,

R.O.

,

Mildorf

,

T.J.

,

Arlow

,

D.H.

,

Manglik

,

A.

et al (

2013

)

The dynamic process of β(2)-adrenergic receptor activation

.

Cell

152

,

532

–

542

https://doi.org/10.1016/j.cell.2013.01.008

Google Scholar

Crossref

PubMed

49

Stoffregen

,

M.C.

,

Schwer

,

M.M.

,

Renschler

,

F.A.

and

Wiesner

,

S.

(

2012

)

Methionine scanning as an NMR tool for detecting and analyzing biomolecular interaction surfaces

.

Structure

20

,

573

–

581

https://doi.org/10.1016/j.str.2012.02.012

Google Scholar

Crossref

PubMed

50

Gelis

,

I.

,

Bonvin

,

A.

,

Keramisanou

,

D.

,

Koukaki

,

M.

,

Gouridis

,

G.

,

Karamanou

,

S.

et al (

2007

)

Structural basis for signal-sequence recognition by the translocase motor SecA as determined by NMR

.

Cell

131

,

756

–

769

https://doi.org/10.1016/j.cell.2007.09.039

Google Scholar

Crossref

PubMed

51

Fischer

,

M.

,

Kloiber

,

K.

,

Hausler

,

J.

,

Ledolter

,

K.

,

Konrat

,

R.

and

Schmid

,

W.

(

2007

)

Synthesis of a ¹³C-methyl-group-labeled methionine precursor as a useful tool for simplifying protein structural analysis by NMR spectroscopy

.

ChemBioChem

8

,

610

–

612

https://doi.org/10.1002/cbic.200600551

Google Scholar

Crossref

PubMed

52

Frank

,

L.

(

1963

)

Proline metabolism in Escherichia coli II. Regulation of total growth of proline auxotroph by a proline-oxidizing system

.

J. Bacteriol.

86

,

781

–

784

https://doi.org/10.1128/jb.86.4.781-784.1963

Google Scholar

Crossref

PubMed

53

Kodama

,

Y.

,

Reese

,

M.L.

,

Shimba

,

N.

,

Ono

,

K.

,

Kanamori

,

E.

,

Dotsch

,

V.

et al (

2011

)

Rapid identification of protein-protein interfaces for the construction of a complex model based on multiple unassigned signals by using time-sharing NMR measurements

.

J. Struct. Biol.

174

,

434

–

442

https://doi.org/10.1016/j.jsb.2011.04.001

Google Scholar

Crossref

PubMed

54

Dominguez

,

C.

,

Boelens

,

R.

and

Bonvin

,

A.

(

2003

)

HADDOCK: a protein-protein docking approach based on biochemical or biophysical information

.

J. Am. Chem. Soc.

125

,

1731

–

1737

https://doi.org/10.1021/ja026939x

Google Scholar

Crossref

PubMed

55

Jumper

,

J.

,

Evans

,

R.

,

Pritzel

,

A.

,

Green

,

T.

,

Figurnov

,

M.

,

Ronneberger

,

O.

et al (

2021

)

Highly accurate protein structure prediction with AlphaFold

.

Nature

596

,

583

–

589

https://doi.org/10.1038/s41586-021-03819-2

Google Scholar

Crossref

PubMed

56

Zweckstetter

,

M.

(

2021

)

NMR hawk-eyed view of AlphaFold2 structures

.

Protein Sci.

30

,

2333

–

2337

https://doi.org/10.1002/pro.4175

Google Scholar

Crossref

PubMed

57

Clark

,

L.

,

Dikiy

,

I.

,

Rosenbaum

,

D.M.

and

Gardner

,

K.H.

(

2018

)

On the use of Pichia pastoris for isotopic labeling of human GPCRs for NMR studies

.

J. Biomol. NMR

71

,

203

–

211

https://doi.org/10.1007/s10858-018-0204-3

Google Scholar

Crossref

PubMed

58

Franke

,

B.

,

Opitz

,

C.

,

Isogai

,

S.

,

Grahl

,

A.

,

Delgado

,

L.

,

Gossert

,

A.D.

et al (

2018

)

Production of isotope-labeled proteins in insect cells for NMR

.

J. Biomol. NMR

71

,

173

–

184

https://doi.org/10.1007/s10858-018-0172-7

Google Scholar

Crossref

PubMed

59

Sugiki

,

T.

,

Ichikawa

,

O.

,

Miyazawa-Onami

,

M.

,

Shimada

,

I.

and

Takahashi

,

H.

(

2012

)

Isotopic labeling of heterologous proteins in the yeast Pichia pastoris and Kluyveromyces lactis

831

,

19

–

36

https://doi.org/10.1007/978-1-61779-480-3_2

Google Scholar

60

Kofuku

,

Y.

,

Yokomizo

,

T.

,

Imai

,

S.

,

Shiraishi

,

Y.

,

Natsume

,

M.

,

Itoh

,

H.

et al (

2018

)

Deuteration and selective labeling of alanine methyl groups of β(2)-adrenergic receptor expressed in a baculovirus-insect cell expression system

.

J. Biomol. NMR

71

,

185

–

192

https://doi.org/10.1007/s10858-018-0174-5

Google Scholar

Crossref

PubMed

61

Sitarska

,

A.

,

Skora

,

L.

,

Klopp

,

J.

,

Roest

,

S.

,

Fernandez

,

C.

,

Shrestha

,

B.

et al (

2015

)

Affordable uniform isotope labeling with ²H, ¹³C and ¹⁵N in insect cells

.

J. Biomol. NMR

62

,

191

–

197

https://doi.org/10.1007/s10858-015-9935-6

Google Scholar

Crossref

PubMed

62

Yagi

,

H.

,

Nakamura

,

M.

,

Yokoyama

,

J.

,

Zhang

,

Y.

,

Yamaguchi

,

T.

,

Kondo

,

S.

et al (

2015

)

Stable isotope labeling of glycoprotein expressed in silkworms using immunoglobulin G as a test molecule

.

J. Biomol. NMR

62

,

157

–

167

https://doi.org/10.1007/s10858-015-9930-y

Google Scholar

Crossref

PubMed

63

Yanaka

,

S.

,

Yagi

,

H.

,

Yogo

,

R.

,

Yagi-Utsumi

,

M.

and

Kato

,

K.

(

2018

)

Stable isotope labeling approaches for NMR characterization of glycoproteins using eukaryotic expression systems

.

J. Biomol. NMR

71

,

193

–

202

https://doi.org/10.1007/s10858-018-0169-2

Google Scholar

Crossref

PubMed

64

Werner

,

K.

,

Richter

,

C.

,

Klein-Seetharaman

,

J.

and

Schwalbe

,

H.

(

2008

)

Isotope labeling of mammalian GPCRs in HEK293 cells and characterization of the C-terminus of bovine rhodopsin by high resolution liquid NMR spectroscopy

.

J. Biomol. NMR

40

,

49

–

53

https://doi.org/10.1007/s10858-007-9205-3

Google Scholar

Crossref

PubMed

65

Kigawa

,

T.

,

Muto

,

Y.

and

Yokoyama

,

S.

(

1995

)

Cell-free synthesis and amino acid-selective stable-isotope labeling of proteins for NMR analysis

.

J. Biomol. NMR

6

,

129

–

134

https://doi.org/10.1007/bf00211776

Google Scholar

Crossref

PubMed

66

Yabuki

,

T.

,

Kigawa

,

T.

,

Dohmae

,

N.

,

Takio

,

K.

,

Terada

,

T.

,

Ito

,

Y.

et al (

1998

)

Dual amino acid-selective and site-directed stable-isotope labeling of the human c-Ha-Ras protein by cell-free synthesis

.

J. Biomol. NMR

11

,

295

–

306

https://doi.org/10.1023/a:1008276001545

Google Scholar

Crossref

PubMed

67

Su

,

X.C.

,

Loh

,

C.T.

,

Qi

,

R.H.

and

Otting

,

G.

(

2011

)

Suppression of isotope scrambling in cell-free protein synthesis by broadband inhibition of PLP enymes for selective ¹⁵N-labelling and production of perdeuterated proteins in H₂O

.

J. Biomol. NMR

51

,

409

–

409

https://doi.org/10.1007/s10858-011-9562-9

Google Scholar

Crossref

68

Calhoun

,

K.A.

and

Swartz

,

J.R.

(

2006

)

Total amino acid stabilization during cell-free protein synthesis reactions

.

J. Biotechnol.

123

,

193

–

203

https://doi.org/10.1016/j.jbiotec.2005.11.011

Google Scholar

Crossref

PubMed

69

Kainosho

,

M.

,

Torizawa

,

T.

,

Iwashita

,

Y.

,

Terauchi

,

T.

,

Ono

,

A.M.

and

Guntert

,

P.

(

2006

)

Optimal isotope labelling for NMR protein structure determinations

.

Nature

440

,

52

–

57

https://doi.org/10.1038/nature04525

Google Scholar

Crossref

PubMed

70

Urbanek

,

A.

,

Morato

,

A.

,

Allemand

,

F.

,

Delaforge

,

E.

,

Fournet

,

A.

,

Popovic

,

M.

et al (

2018

)

A general strategy to access structural information at atomic resolution in polyglutamine homorepeats

.

Angew. Chem. Int. Ed.

57

,

3598

–

3601

https://doi.org/10.1002/anie.201711530

Google Scholar

Crossref

2022

This is an open access article published by Portland Press Limited on behalf of the Biochemical Society and distributed under the Creative Commons Attribution License 4.0 (CC BY). Open access for this article was enabled by the participation of the University of York in an all-inclusive Read & Publish agreement with Portland Press and the Biochemical Society under a transformative agreement with JISC.

Specific isotopic labelling and reverse labelling for protein NMR spectroscopy: using metabolic precursors in sample preparation

Introduction

Biomolecular NMR spectroscopy and low abundance stable isotopes

Too many signals

Bigger is not always better

Isotopic labelling approaches for protein NMR spectroscopy

Uniform or site-specific labelling with ¹⁵N and/or ¹³C

Turning NMR signals on and off using selective labelling or reverse labelling.

Selective reverse labelling of sites in ¹⁵N and/or ¹³C labelled proteins

Isotopic labelling using amino acids and metabolites

Routes for targeting aliphatic residues

Metabolic precursors that can be used for isotopic labelling and reverse labelling of the carbon sites in branched-chain aliphatic amino acids.

Routes for targeting aromatic residues

Metabolic precursors used for isotopic labelling and reverse labelling of phenylalanine, tryptophan and tyrosine.

Histidine (un)labelling by the metabolic precursor imidazolepyruvate with incorporated atoms shown in red.

Routes for targeting polar residues

Routes for targeting charged residues

Special cases

Methionine (un)labelling by methylthio-2-oxobutanoate with isotopically labelled sites indicated in red.

Discussion

Perspectives

Competing Interests

Funding

Open Access

Author Contributions

References

Cited By

Get Email Alerts

CONNECT

EXPLORE

Cover Image

Specific isotopic labelling and reverse labelling for protein NMR spectroscopy: using metabolic precursors in sample preparation

Introduction

Biomolecular NMR spectroscopy and low abundance stable isotopes

Too many signals

Bigger is not always better

Isotopic labelling approaches for protein NMR spectroscopy

Uniform or site-specific labelling with 15N and/or 13C

Turning NMR signals on and off using selective labelling or reverse labelling.

Selective reverse labelling of sites in 15N and/or 13C labelled proteins

Isotopic labelling using amino acids and metabolites

Routes for targeting aliphatic residues

Metabolic precursors that can be used for isotopic labelling and reverse labelling of the carbon sites in branched-chain aliphatic amino acids.

Routes for targeting aromatic residues

Metabolic precursors used for isotopic labelling and reverse labelling of phenylalanine, tryptophan and tyrosine.

Histidine (un)labelling by the metabolic precursor imidazolepyruvate with incorporated atoms shown in red.

Routes for targeting polar residues

Routes for targeting charged residues

Special cases

Methionine (un)labelling by methylthio-2-oxobutanoate with isotopically labelled sites indicated in red.

Discussion

Perspectives

Competing Interests

Funding

Open Access

Author Contributions

References

Cited By

Get Email Alerts

CONNECT

EXPLORE

This Feature Is Available To Subscribers Only

Uniform or site-specific labelling with ¹⁵N and/or ¹³C

Selective reverse labelling of sites in ¹⁵N and/or ¹³C labelled proteins