Soluble hydrolases represent the main proteins of lysosomes and vacuoles and are essential to sustain the lytic properties of these organelles typical for the eukaryotic organisms. The sorting of these proteins from ER residents and secreted proteins is controlled by highly specific receptors to avoid mislocalization and subsequent cellular damage. After binding their soluble cargo in the early stage of the secretory pathway, receptors rely on their own sorting signals to reach their target organelles for ligand delivery, and to recycle back for a new round of cargo recognition. Although signals in cargo and receptor molecules have been studied in human, yeast and plant model systems, common denominators and specific examples of diversification have not been systematically explored. This review aims to fill this niche by comparing the structure and the function of lysosomal/vacuolar sorting receptors (VSRs) from these three organisms.
Discovery and function of lysosomes and vacuoles
The plant vacuole was first discovered in 1676 by a Dutch scientist Antonie van Leeuwenhoek. Considered as the ‘father of microbiology’, he contributed to the development of a number of lenses for microscopes, which allowed him to be the first to observe living cells . Because vacuoles appear as optically empty sacs filling the volume of the cell they have been named from the Latin ‘vacuus’ meaning empty. It was only 300 years later that Christian de Duve, a Belgian biochemist, discovered mammalian lysosomes entirely based on biochemical methods  and as he calls it himself by ‘a gift of serendipity’ while working on insulin . de Duve named lysosomes respective of their digestive properties (from the Greek lysis- digestive and soma-body) and their existence was confirmed the following year by electron microscopy . This was the scientific breakthrough that would lead to the understanding of the physiological basis of autophagy and lysosomal storage diseases (LSD) [5,6]. It is now clear that vacuoles in fungi and plants [7,8] have a lot in common with mammalian lysosomes and are highly dynamic in the cell, constantly remodelling with cell cycle stages, growth conditions and cell types [9–13]. There are a few differences that can be noted. Although lysosomes are quite small (0.1–1 um) and numerous (50–1000 per cells depending on the cell type) [14–16] most fungal vacuoles are larger than lysosomes (5 um) and are present in a smaller number (1–5), occupying as much as 20% of the cell volume [10,17]. In contrast, plant vacuoles are large compartments and they can fill 30% and up to 90% of the cell volume . Although it was generally accepted that plant cells have at least two types of vacuoles: a storage vacuole (SV) and a lytic vacuole (LV)  this is now being challenged by new findings which favour the one unique vacuole type per cell.
Due to the large size of yeast and plant vacuoles, fluorescence microscopy has become the most widely used tool to study vacuolar trafficking.
Despite a diversity of morphologies and numbers across kingdoms and cell types, lysosomes and vacuoles share a unique and common feature: they represent the essential digestive compartment of the cell. Vacuoles are therefore often referred to as ‘fungal or plant lysosomes’ due to their common role in the degradation of cell components such as proteins, nucleic acids, polysaccharides and lipids. To perform this function, they maintain an acidic pH, between 4.5 and 5.5 for lysosomes and 5.5–6.2 for yeast and plant vacuoles [19–21]. Lysosomes and vacuoles also play roles in protein storage, cell homoeostasis, responses to pathogens, cell signalling, as well as the maintenance of turgor pressure and maintaining general cell shape [22,23]. Typically under conditions of glucose starvation, lysosomes and vacuoles are also involved in autophagy, a mechanism to retrieve energy and building blocks from surplus internal organelles and protein complexes that can be replaced later when glucose is no longer limiting. Whereas vacuoles engulf autophagic vesicles to internalize them, lysosomes are too small to do so and thus fuse with autophagosomes .
Lysosomal and vacuolar lumenal sorting signals
The principal function of lysosomes and vacuole in degradation implies that a number of proteins such as transporters and acid hydrolases have to be efficiently sorted to these compartments. Soluble proteins are diverted from the secretory bulk-flow of proteins due to the presence of sorting signals that are recognized by specific membrane spanning receptors to initiate transport to the degradative organelles. These signals are surface structures either directly displayed by the folded polypeptide (often in fungi and plants) or indirectly via post-translational modification of sugar chains (in vertebrates). Polypeptide signals can vary from 4 to 17 amino acids and although they are often found in an N-terminal propeptide region downstream of the signal peptide for translocation across the ER membrane, C-terminal and even internal domains have been shown to mediate vacuolar sorting.
In the yeast Saccharomyces cerevisiae, an N-terminal QRPL motif has been found to target carboxypeptidase (CPY) to the vacuole via the type I membrane spanning receptor Vps10p . Although the glutamate and the leucine seem to be important, the two other amino acids are dispensable, suggesting QXXΦ as consensus signal . However, not all vacuolar proteins in yeast contain such a motif, indicating that alternative signals may exist in yeast [27,28].
Similarly, in plants, a sequence specific vacuolar sorting signal (ssVSS) has been identified in a class of proteins such as barley Aleurain and sweet potato Sporamin, a cysteine protease and a storage protein respectively . It consists of a four amino acids NPIR motif, although some variation may exist. Two additional signals have been described but are still not well understood: a C-terminal vacuolar sorting signal (ctVSS), associated with stretches of hydrophobic residues, and internal targeting sequences forming signal patches [30,31].
In mammals, cathepsin D (aspartyl protease), prosaposin C and acid sphingomyelinase can also interact with their receptor sortilin via stretches of amino acids. In addition, β-glucocerebrosidase (β-GC) is transported to lysosomes by interacting in a similar manner with receptor LIMP-2 . Nevertheless, no consensus motif has been found for these types of proteins.
Mammals, avians, amphibians and invertebrates have developed a non-protein signal , based on a modified glycan, the so-called mannose-6-phosphate (M6P) pathway. Lysosomal proteins are translocated into the ER lumen, where the Asparagine in the motif Asn-X-Ser/Thr is glycosylated with an oligosaccharide high in mannose. To distinguish them from secreted glycosylated proteins, M6P formation occurs after arrival in the cis-Golgi in two steps: a GlcNac-1-phosphate is transferred from UDP-GlcNac by a phosphotransferase on to selected C6 hydroxyl groups of mannoses generating phosphodiester forms. In the trans-Golgi network, the M6P is then exposed by the action of a glucosaminidase: the uncovering enzyme, UCE. The modified glycan is subsequently recognized by M6P receptors (MPRs), large type I membrane spanning proteins that initiate protein export from the Golgi [33,34]. Isosteric analogues of M6P containing two negative charges have also been shown to be a substrate of MPRs .
Lumenal structure and function of lysosomal/vacuolar receptors
Apart from the mammalian receptor LIMP-2, all lysosomal and vacuolar receptors have the same general structure: one large lumenal domain, which is involved in the interaction with the cargo, one transmembrane segment and a C-terminal tail which contains signals for trafficking of the receptor. The N-terminal domain represents the region that is mostly different between receptors among different kingdoms, probably due to the variety of lysosomal/vacuolar proteins to be recognized. Receptors can be classified in four families (Figure 1): i) the MPR family, ii) the sortilin / Vps10 (vacuolar protein sorting 10) family, iii) the plant vacuolar sorting receptor (VSR) family and iv) the LIMP-2 (lysosomal integral membrane protein) family.
Comparison of lysosomal and vacuolar receptors
Mammalian M6P receptors recognize and bind to a M6P group present on most soluble lysosomal cargoes . A distant MPR related protein (Mrl1p) has been described in yeast but appears to be glycosylation independent . Two receptors have been characterized in mammals: a cation-independent M6P receptor which is approximately 300 kDa (CI-MPR or MPR300) and a cation-dependent M6P receptor which is approximately 46 kDa (CD-MPR or MPR46). CD-MPR contains one domain that can recognize M6P signal, whereas 15 repeats of this one domain are found in the lumenal domain of CI-MPR (Figure 1) . Only two of them however (module 3 and 9) can bind to M6P to similar affinities than the module in CD-MPR . Binding of CD-MPR is enhanced by the presence of divalent ions (hence the name cation-dependent receptor) in contrast with CI-MPR which lacks the critical aspartic acid that coordinates the cation in CD-MPR . Recently, two additional modules (5 and 15) have been found to display weak interaction for M6P and additionally for M6P_GlcNac, the glycosylation intermediate form before M6P. Hence this weak interaction probably represents a rescue mechanisms of lysosomal transport in the event of mutations in the M6P addition process [40,41]. CI-MPR also contains a binding site for insulin growth factor II on module 11, which can inhibit lysosomal cargo binding, suggesting binding site competition. CI-MPR is therefore sometimes called insulin-like growth factor II receptor (IGF II R) [35,42].
The second family, the Sortilin/Vps10 receptor family, has been named after the isolated yeast mutants defective for vacuolar transport (vacuolar protein sorting defective 10). Yeast Vps10p can be considered a specialized form of the much larger and more widespread family called sortilins (Figure 1, family 2). Mammals have five sortilins (sortilin/neurotensin receptor-3, SorC1, SorCS2, SorCS3 and SorLA), but sortilins are also found in most eukaryotic kingdoms including protists, green and red algae, except higher plants. The N-terminal moiety of the yeast Vps10p consists of two domains, currently called Vps10 domains (Vps10d). In Vps10p, these can bind differentially to ligands . The sortilins differ mainly in exhibiting just one single Vps10d. In mammals, sortilin has been found to bind nerve growth factors and neurotrophin (NT) precursors, and has also recently been involved in the regulation of LDL transport . The four sortilin variants (SorCS1–3 and SorLA) contain other domains among which leucine repeats, EGF repeats, LDLR repeats and β-propeller domain for SorLA, but their presence depends on the type of sortilin and their function is still being examined .
The third class of receptors, the plant VSRs [31,42], appears to be separated from the previous mentioned groups and are only found in vascular plants (including mosses and ferns), green unicellular algae, diatoms, watermolds, and the group of protists termed alveolates, but not red algae or other protists. VSRs may have evolved and diversified to compensate for the loss of the sortilin-class receptors in vascular plants. Seven VSRs isoforms exist in Arabidopsis thaliana, and they are classified in three groups: VSR1, VSR2 and VSR3 . Genetic studies have shown that the three groups are not functionally equivalent and suggest that VSR3 group might contribute to the transport of a different subset of vacuolar cargoes. The reason for this difference however is not understood as the three families share a high degree of protein identity and structure. The N-terminal region of all VSRs is composed of a PA (protease associated) domain and a central domain (CD), both of which contribute to cargo binding. These are followed by three EGF repeats, which are suggested to be involved in modulating cargo binding through calcium coordination . Finally a single transmembrane connects the lumenal N-terminal part of the protein to a short C-terminal tail of approximately 30 amino acids. The mechanism of cargo binding is still not well understood but it has been shown in vitro to depend on the pH and possibility calcium concentrations . A consistently short length of the cytosolic tail has greatly facilitated analysis of the sorting signals that control VSR recycling. A conserved YMPL motif resembles the YXXØ class of clathrin mediated sorting motifs for endocytic recycling from the plasma membrane. However, its role in VSR transport is to prevent arrival at the plasma membrane by directing it from the Golgi apparatus to the multivesicular body and back. Genetic evidence supports a requirement for the tyrosine residue in anterograde trafficking to multivesicular bodies, whereas the hydrophobic leucine is required for retrograde trafficking to earlier compartments . Deletion of the YMPL motif leads to cycling via the plasma membrane to the multivesicular bodies via a conserved IM motif immediately upstream of the YMPL motif that mediates endocytosis . The two sorting signals and their relative positions are extremely conserved among the VSR family (Figure 1, family 3) and the available mutants with distinct sorting properties can now be used to study plant specific aspects of the biosynthetic and endocytic pathways to the vacuole.
The fourth family of lysosomal/vacuolar transporter is uniquely represented by the mammalian model protein LIMP-2 protein (Figure 1, family 4). This is the only lysosomal receptor that is a type III membrane protein with both N- and C-termini in the cytosol and does not appear to have homologues involved in vacuolar trafficking in yeast or plants. LIMP-2 is member of the CD36 scavenger protein family at the cell surface. It is a multicargo receptor: although it can bind to its main cargo, β-GC, through a coiled-coil motif, it can also bind to other cargoes such as TSP-1 and EV71 via a different site localized upstream to the coiled-coil region . Interestingly, LIMP2 lumenal domain is highly glycosylated and a recent study has demonstrated that CI-MPR can interact with the LIMP2–β-GC complex through a specific glycosylation site on LIMP2. This will in turn form a ternary complex which would be the trafficking competent complex . Yet, this hypothesis is in disagreement with earlier reports showing that β-GC trafficking is MPR independent, but could indicate that β-GC could actually use two different trafficking pathways .
A striking feature of this four families of receptors is that they all have been suggested to bind their substrates in a pH-dependent manner, where binding would happen at high pH (such as the ER and/or the Golgi) and release would be promoted by acidic pH [late endosomes/multi vesicular bodies (MVBs)] [30,34,38,51,52]. Although the importance of pH in ligand binding/release has been questioned, it is still the preferred model for soluble protein transport to the lysosome and vacuole.
In contrast with differences in the nature of cargo and the structure of the lumenal domain of receptors, trafficking routes for lysosomal and vacuolar targeting are surprisingly very similar across kingdoms. Besides LIMP-2, which has been reported to bind its substrate in the ER , it is generally accepted that most receptors bind lysosomal/vacuolar substrates in the late Golgi compartments, although it is still controversial for the plant VSR [47,53]. After binding to their cargo, the loaded receptors are sorted to the degradative organelles via two routes: the canonical or the alternative route. In the canonical pathway, the receptors are recognized by the adaptor complexes AP1 and/or AP4 at the TGN level. This triggers vesicular targeting to the endosomes in a clathrin dependent pathway [30,34,38,47]. When they reach the late endosomes, the acidic pH triggers the release of the cargo from the receptor, which is then recycled back to the late Golgi by a complex called retromer. In the alternative route, so called ALP pathway in yeast, receptors are recognized by AP3 and reach the lysosome or vacuole by skipping the endosomes, either via the plasma membrane or not . Therefore, receptors using this route are not recycled and only perform one direction delivery . On their way to Lysosomes in either the canonical or alternative pathway, mammalian receptors have been shown to visit the plasma membrane to bind extracellular cargo, and then be endocytosed in an AP2-dependent manner . This is particularly true for multicargo receptors such as CI-MPR, sortilin and LIMP-2 [32,43]. In contrast, yeast Vps10p and plant VSRs do not normally reach the plasma membrane as this will trigger cargo release in the external media, although one plant VSR variant has been found in the plasma membrane in germinating pollen tubes [47,55].
Most receptors using the canonical pathway localize mainly to the endosomes (mostly late Golgi and MVB) with slight variations in ratios, i.e. Vps10p is found in late Golgi whereas VSR is mostly localizing in MVBs [30,56]. Interestingly, CD-MPR and CI-MPR appear to localize to the same region in late Golgi, but different regions in endosomes . Sortilin colocalizes with CI-MPR on endosomes, and both receptors are also partially found in the PM (approximately 10%). This suggests that late endosomes possess specialized areas, one for recycling to the late Golgi (CD-MPR) and one for recycling to the PM (CI-MPR, sortilin). LIMP-2, which uses the alternative pathway and therefore does not reach the MVBs, similarly partially localizes at the PM, but is mostly found in lysosomes .
In all trafficking steps, the sorting efficiency depends on the presence of targeting signals in the C-terminal tail of receptors. These signals which will permit interaction with various adaptor protein (AP) complexes. Although minor signals have been reported, two main sorting signals exist across kingdoms: tyrosine signals based YXXΦ motif and dileucine based signals (D/E)XXLL [30,34,38]. Intriguingly in plants, the same YMPL motif seems to mediate anterograde (Golgi to MVBs) and retrograde (MVBs to early trafficking organelles) transport, whereas a second hydrophobic motif (IM: Figure 1) acts as backup mechanism for receptors mistargeted to the plasma membrane. It is not clear if this IM motif relates to the above mentioned dileucine based signals or if it represents a new class of sorting signals [47,58]. In the yeast Vps10p and in lysosomal receptors, anterograde and retrograde sorting appear to be dependent on different motifs . Additionally, the same motifs can be used during endocytosis. The role of adjacent amino acids to the identified sorting signals is now being examined to explain how identical signals can be used to differentially interact with APs involved in different routes. One explanation is the secondary structure that the tail could adopt upon binding to the cargo. This could change depending on the oligomerization state of receptors, as most of these have been shown to dimerize or oligomerize . Moreover, additional signals such as phosphorylations and palmitoylation have been found to play a major role in lysosomal trafficking, but have not been exploited in vacuolar trafficking yet [38,59]. Although cysteines are not present in the C-terminal tail of yeast Vps10p and plant VSRs as potential sites of palmitoylation, they do contain many serines which could be modified by phosphorylation.
Although lysosomes and vacuoles are morphologically different, they do share a number of crucial functions in human, yeast and plants cells. Their main feature is the lytic degradation of a number of metabolites, which is performed mainly by soluble hydrolases. These proteins are transported to the lysosome and the vacuole by specific receptors. The majority are type I transmembrane proteins composed of a large N-terminal domain, that binds the cargo in early compartments and in a pH sensitive manner, one transmembrane segment and a C-terminal tail, which contains conserved signals for trafficking. Receptors can follow two pathways: a canonical and an alternative pathway. The first route is AP1-dependent and traffics through late endosomes, whereas the latter is AP3-dependent and traffics independent of late endosomes. Hence, despite a large diversification of the lumenal cargo binding domains of the sorting receptors, the three kingdoms appear to be very similar in terms of trafficking cycle, trafficking signals and complexes they interact with along the way (such as AP complexes).
Consequently, yeast and plants are emerging as good alternative organisms for the expression and the purification of human lysosomal proteins with important impacts on pharmaceuticals and enzyme replacement therapy (ERT) . Plants in particular seem to represent a promising system thanks to the cost-effective and easy production of lysosomal proteins with already successful attempts of expressing human hydrolases such as mannosidases and glucocerebrosidase [61,62]. Most importantly, plant-produced (carrot) recombinant human glucocerebrosidase has been shown to naturally possess terminal mannose residues, a step which has to be additionally performed in vitro for Chinese hamster ovary (CHO)-derived glucocerebrosidase .
Therefore, exploring more interactions between lysosomal and vacuolar fields would provide a step forward into the understanding of trafficking to the lytic organelles and future strategies towards enzyme replacement therapies.
We thank Dr Vincent Postis for reading the manuscript and critical discussion and the Denecke lab for useful comments and discussions.
This work was supported by the Biotechnology and Biological Sciences Research Council [grant number BB/K007181/1].
Membrane Proteins From A to Z: Held at University of Leeds, U.K., 16–17 December 2015