The design and development of selective ligands for the human OT (oxytocin) and AVP (arginine vasopressin) receptors is a big challenge since the different receptor subtypes and their native peptide ligands display great similarity. Detailed understanding of the mechanism of OT's interaction with its receptor is important and may assist in the ligand- or structure-based design of selective and potent ligands. In the present article, we compared 69 OT- and OT-like receptor sequences with regards to their molecular evolution and diversity, utilized an in silico approach to map the common ligand interaction sites of recently published G-protein-coupled receptor structures to a model of the human OTR (OT receptor) and compared these interacting residues within a selection of different OTR sequences. Our analysis suggests the existence of a binding site for OT peptides within the common transmembrane core region of the receptor, but it appears extremely difficult to identify receptor or ligand residues that could explain the selectivity of OT to its receptors. We remain confident that the presented evolutionary overview and modelling approach will aid interpretation of forthcoming OTR crystal structures.
The neurohypophyseal nonapeptides OT (oxytocin) and AVP (arginine vasopressin) play an important role in many physiological functions through GPCR (G-protein-coupled receptor) signal transduction. In humans and other mammalian species, OT and AVP target the OTR (oxytocin receptor) and the three vasopressin receptors V1aR, V1bR and V2R . Peripherally, OT acts as a hormone that triggers uterine smooth muscle contraction during childbirth, facilitates milk ejection for breastfeeding and is involved in male ejaculation. Centrally, it functions as a neurotransmitter where it is involved in complex social behaviour, maternal care, stress and anxiety . AVP regulates fluid balance and blood pressure and is mainly involved in memory, learning, stress regulation and aggressive behaviour (summarized in ). Although OT and AVP elicit distinct physiological functions, they display a certain degree of cross-reactivity, which is caused by the high ligand similarity (OT and AVP only differ by two residues) and the high extracellular homology (~80%) between the OT and AVP receptors. Contrary to many other signalling systems, selectivity is not achieved via the ligands, but via interplay of factors including receptor up- or down-regulation, release of specific ligand-degrading enzymes, local ligand production and receptor clustering. In terms of drug development, the extracellular receptor homology represents a major hurdle since a lack of selectivity can cause undesired side effects [3,4]. To address the need for receptor-selective ligands, we decided to review the molecular evolution of the OT/AVP ligand–receptor system to gain insight into the mechanism of receptor–ligand recognition and binding.
Evolutionary aspects of the OT–vasopressin signalling system
OT/AVP-related peptides have been found in a wide range of species and are thought to have evolved via gene duplication from the ancestral VT (vasotocin) peptide some 600 million years ago . VT-like peptide hormones have been found in non-mammalian vertebrates, fish, mammals, molluscs, annelids, arthropods and several insects, including social ants . All members of the OT, AVP and VT peptide family share high sequence similarity, namely an N-terminal six-residue ring, formed by a disulfide bond between the two cysteine residues at positions 1 and 6, and a flexible C-terminal three-residue tail with a highly conserved Pro7 and a glycine amide at position 9 (CXXXXCPXG). Residue variability is observed at positions 2–5 and at position 8 . These amino acid variations are presumably responsible for species-selective recognition, binding and activation of the different receptors. Although the four human receptors display high extracellular sequence similarity, there exists low inter-species receptor correlation reflected by the fact that many ligands selective in rat or mice are not selective in humans . Hence it was of interest to us whether inter-species differences of the individual native ligands could be correlated to receptor sequence variations and vice versa. We hypothesized that a detailed molecular understanding of recognition, binding and activation of OT and AVP receptors by their native ligands could assist the design and development of novel selective ligands.
OTR sequence comparison of residues Arg34, Phe103, Tyr209 and Phe284
A total of 69 known receptor sequences of OT, OT-like, VT and human AVP GPCRs with their native ligands were selected from a wide range of species for sequence comparison. The phylogenetic tree analysis based on these sequences was performed with ClustalW and Figure 1 illustrates the comparison of the ligand sequence variation with the receptor sequence evolution. Consistent with the high receptor sequence similarity across closely related species (e.g. bony fish, amphibians or primates), it was found that the respective ligands within these clusters were also highly conserved. This is in agreement with the evolution of OT/AVP and related nonapeptides [9,10]. Residues believed to be responsible for OT–AVP ligand–receptor binding were analysed and their degree of conservation was compared.
Previous studies determined that the residues Arg34, Phe103, Tyr209 and Phe284 were important for ligand binding and selectivity at the human OTR [11–15]. Additionally, Asp85 and Lys270 are involved in receptor signalling . Interestingly, all OT and OT-related receptors share the Asp85 residue in the TM (transmembrane) domain 2 and the Lys270 residue in ICL (intracellular loop) 3 (Figures 2c and 2h), and these residues were reported to be important for receptor activation [17,18]. Particularly, the Asp85 is also conserved in other class A GPCRs and is thought to play a more general role in receptor–G-protein signalling .
The N-terminal Arg34 is highly conserved (only three receptors have a different residue in that position: Asp34 in chicken OTR, Tyr34 in cephalotocin receptor CTR2 and Val34 in conopressin receptor LSCPR1; see Figure 2a), indicating its importance for ligand binding . Phe103 was demonstrated to be important for ligand selectivity in the OTR and is thought to interact with the residue at position 8 of the peptide ligand . OT receptors with a phenylalanine in that position generally bind native ligands that have either a leucine or an isoleucine residue at position 8 (in contrast to Arg8 of AVP, with human V1aR containing a tyrosine at corresponding position 115). Exceptions are the OT-like peptide from the New World monkeys Saimiri sciureus and Callithrix jacchus as well as inotocin from the water flea Daphnia pulex, which have a proline at position 8. Interestingly, the receptors of C. jacchus and D. pulex have a hydrophobic phenylalanine at corresponding position 103 whereas the receptor of S. sciureus has a hydrophilic tyrosine in that position, indicating that a direct interaction of this residue in the receptor and the residue at position 8 of the ligand may not be present in all OT receptor–ligand pairs, since the interaction of the polar tyrosine and the hydrophobic proline residues is much weaker compared with the hydrophobic phenylalanine–proline interaction. Further examples of ligand–receptor variation in position 103 of the receptor and position 8 of the ligand include the red flour beetle Tribolium castaneum (tyrosine in receptor, arginine in ligand), the earthworm Eisenia foetida (histidine in receptor, threonine in ligand), the great pond snail Lymnea stagnalis (isoleucine in receptor and ligand) and the common octopus Octopus vulgaris (valine in receptor, isoleucine in ligand). Tyr209 and Phe284, located in the TM region are two other important residues for ligand–receptor binding . In the human receptor, Tyr209 and Phe284 interact with residues at positions 2 and 3 of OT [15,16]. Tyr209 is highly conserved among all OT-like receptors, being a tyrosine in most sequences or another aromatic residue (Figure 2g). All native peptide ligands contain an aromatic residue (tyrosine or phenylalanine) in position 2 and a hydrophobic or aromatic residue in position 3 (isoleucine, phenylalanine, valine or tryptophan)  indicating that this ligand–receptor interaction may indeed be conserved throughout evolution. This is also true for VT and human AVP receptors that we analysed; the receptors contain a tyrosine in position 209 and their respective ligands contain a tyrosine in position 2 and a phenylalanine or isoleucine in position 3 (Figure 1). Accordingly, the residue at position 284 of the receptor is, in most instances, aromatic (phenylalanine or tyrosine) and the only exception is the CTR1 from O. vulgaris that contains a cysteine in that position (Figure 2i). Again, these residues are possibly involved in hydrophobic receptor–ligand interactions. The importance of N-terminal hydrophobicity of the ligands is further supported by the identification of the superagonist desamino-OT, where the deletion of the N-terminal amine led to a more hydrophobic and potent OT analogue . The analysis of the ligand–receptor evolution with respect to the residues at positions 34, 103, 209 and 284 of OT-like receptors is in agreement with previous studies (summarized in ), but it is likely that these molecular contacts may not be present in all native ligand–receptor pairs.
To gain further insight into the molecular interaction of OT and its receptors, we created a homology model of the human OTR based on recently published GPCR structures. Based on the information from various ligand co-crystal structures of human or mammalian GPCRs [20–30], one may anticipate that not only certain residues, but also the overall three-dimensional receptor architecture of the human OTR plays a significant role in ligand–receptor interactions. Residues that interact with GPCR ligands were extracted from crystal structures and were mapped on to the homology model of the OTR as discussed in the next paragraph.
Ligand–GPCR interaction: lessons learned from recent crystal structures
Ligands are recognized by the extracellular region of the receptor and interact with residues in the three-dimensional environment of the receptor TM domain. This has been observed for orthosteric ligands that interact with class A GPCRs in recent crystal structures . With the aim of understanding the binding mode of OT to its receptor in the context of its three-dimensional architecture, we developed a homology model of the human OTR on the template of the mouse μ-opioid receptor crystal structure with the intention of identifying whether and which residues are oriented towards the binding pocket and interact with the ligand. The mouse μ-opioid receptor was selected owing to its high sequence similarity, high resolution and canonical receptor conformation. We analysed the publicly available crystal structures (for details see caption of Figure 3) and determined the contacts of the OTR side chains with the respective ligand (Supplementary Table S1 at http://www.biochemsoctrans.org/bst/041/bst0410197add.htm) by overlaying the GPCR crystal structures with the OTR and then transferring each ligand from the crystal structures into the OTR model. The observed frequency of residue–ligand contacts (i.e. the number of structures in which a direct interaction of the receptor residue with the ligand was observed) was mapped on to the OTR model. With this approach we were able to define a common binding site, deep within the vestibule of the receptor structure that is shared by all GPCR ligands (agonists and antagonists) that were used for this analysis. The positions with the highest frequency of direct contact with ligands include Gln119 and Met123 in helix 3, Ile204 and Val208 in helix 5, Trp288, Phe291, Phe292 and Gln295 in helix 6 and residue Met315 in helix 7 (numbering according to human OTR). The α-carbon atoms of all these residues are within 16 Å (1 Å=0.1 nm) from each other. The side chains are all orientated towards the ligand-binding vestibule and many of them are in direct contact with each other. Residues at the rim of the vestibule displayed lower interaction frequency and it appears that this strongly depends on the size and orientation of the ligand (Figures 3a and 3b). Of note, we did not distinguish between different types of ligand such as agonist or antagonist.
Contrary to what we expected, the four residues described in the literature to be important for human OT ligand–receptor interaction (Arg34, Phe103, Tyr209 and Phe284) are not within the group of frequently occurring interacting residues. Arg34 and Phe103 are located at the rim of the vestibule, above the common interacting residues, on the extracellular side, whereas Tyr209 and Phe284 (TM5 and TM6 respectively) are located below the common binding site. Phe284 is located below Trp288 in the centre of the TM core region. Phe284 can form π interactions with Trp288 and is believed to contribute to orienting the side chain of Trp288, which faces towards the proposed ligand-binding pocket and was found to be in contact with ligands in high frequency in GPCR structures. The four residues (Arg34, Phe103, Tyr209 and Phe284) that are important for OT binding are located below and above the site in which the model structure provided information on the interaction between ligands and GPCRs with high frequency. However, the ligands in the analysed crystal structures were all small organic molecules, significantly smaller than the nonapeptides in this study. The size of the nonapeptides would allow them to interact simultaneously with Tyr209 and Phe284 at the bottom of the ligand binding pocked and with Arg34 and Phe103 on the extracellular side. It is therefore very likely that OT binds to a common binding site that is located in between those four residues (Arg34, Phe103, Tyr209 and Phe284) as proposed in the OTR model. Recently, the first crystal structure of a peptide-agonist-bound GPCR, namely the neurotensin receptor 1 has been reported [30a]. In agreement with our finding that the binding pocket involves residues in the common binding motif as well as on the extracellular side, White et al. [30a] found that neurotensin (residues 8–13) binds closer to the surface of its receptor compared with other small-molecule agonists.
The research groups of Slusarz et al.  and Fanelli et al.  have previously studied the interaction of human OT with its receptor by molecular dynamics simulation and identified a number of residues, mainly within the TM region of the receptor, considered important for interaction with the ligand. A comparison of these studies with our approach to map GPCR ligand-binding residues on to the homology model of the human OTR identified the common residues Lys116, Gln119, Met123 and Gln295 that display a high frequency contact in our model (Figure 3 and Supplementary Table S1). Following the analysis of the human OTR model and the common GPCR ligand-binding sites, we return now to the receptor–ligand sequence comparison in order to discuss the significance of the residues that were identified in our modelling approach.
Comparison of the structural model and the OTR–ligand sequence pairs
On the basis of the three largest clusters of receptor–ligand evolution (Figure 1) and with a focus on the residues forming the common binding site in GPCR–ligand structures (Figure 3 and Supplementary Table 1), we compared the sequences of OT, isotocin and mesotocin receptors and their respective ligands (Figure 2). Residues of the seven TM helices that were oriented towards the ligand-binding site were found to be more conserved compared with the membrane exposed residues. Interestingly, the degree of conservation was unevenly distributed: helices 2, 3 and 6 were the most conserved, whereas residues in helix 7 displayed the largest variability. This was also observed in the comparison of the human OTR with the human V1aR and in the analysis of the binding pockets of the opioid receptors (T. Stockner, unpublished work), whereby interacting residues of TM6 are highly conserved and seem to be important for efficacy, whereas ligand-interacting residues of TM7 are highly variable and determine the selectivity of the receptor for its ligand [20,25]. Residues in TM7 show particular variation in the mesotocin receptor sequences, but we were not able to correlate these changes with variations in the ligand sequence. Four residues 49, 98, 204 and 208 in the receptor displayed some degree of correlation with the respective ligand and these residues are highlighted in Figures 2(c), 2(d), 2(g), 3(c) and 3(d).
The sequence of residue 98 (shown in magenta in Figures 3c and 3d) correlates with changes in the ligand: all analysed receptors have a non-polar aliphatic amino acid at this position (isoleucine or leucine) and the same amino acid in position 8 of the ligand. Residues in positions 49, 204 and 208, i.e. Ile49, Ile204 and Val208 in OT and mesotocin receptors (shown in green in Figures 3c and 3d) and Val49, Ile204 or Met204 and Ile208 in isotocin receptors show correlation with residue 4 in the nonapeptide sequence, which is either a glutamine in OT and mesotocin or a serine in isotocin. Since the amino acid changes in the receptor are conservative, i.e. amino acid type and hydrophobicity remain almost unchanged, we believe that these differences mainly affect residue size and could potentially result in a gentle variation in the size of the binding pocket.
Summary and conclusion: only a ligand-bound structure can tell…
Biochemical studies with the human OT receptor and its native ligand have identified four residues that are important for ligand binding and recognition, namely Arg34, Phe103, Tyr209 and Phe284 (summarized in ). In the present review, we compared 69 OT, OT-like and VT receptor sequences to gain further insight into OT ligand binding. We utilized an in silico approach to map the common ligand interaction sites of recently published GPCR structures to a model of the human OT receptor and compared the interacting residues within different receptor sequences. Our analysis suggests the existence of a binding site for OT peptides within the TM core region. Previous evolutionary studies of OTRs by van Kesteren and Geraerts  and Cho et al.  pointed out a number of conserved residues in TM2, TM3, TM4, TM6 and TM7 that may be important for ligand binding, but their analysis included only a few different receptor sequences. It is evident from our comparison that TM7 displays the greatest sequence variability, which might be the source for interspecies selectivity. This helix also experiences the biggest structural movements when the receptor is activated and is certainly important for G-protein signalling . Although this review brings together empirical data, molecular sequence comparison and homology modelling, there is still a lack of structure–activity and mutagenesis studies to propose a working OTR model that could explain selectivity differences observed in binding studies. We conclude that only a ligand-bound crystal structure of OT and its receptor will be able to shed light on to the mechanism of interaction and provide the means for future design of novel and selective ligands.
G-Protein-Coupled Receptors: from Structural Insights to Functional Mechanisms: A Biochemical Society Focused Meeting co-organized with Monash University held at Monash University Prato Centre, Prato, Italy, 12–14 September 2012. Organized and Edited by Bice Chini (CNR Institute of Neuroscience, Italy), Marco Parenti (University of Milano–Bicocca, Italy), David Poyner (Aston University, U.K.) and Mark Wheatley (University of Birmingham, U.K.)
The work on bioactive peptides in our laboratory was financially supported by the FWF Austrian Science Fund [grant number P22889-B11]. The research leading to the results of the review has received funding from the European Union Seventh Framework Programme (FP7/2007–2013) under grant agreement number 254897.