The development of B lymphocytes into antibody-secreting plasma cells is central to the adaptive immune system in that it confers protective and specific antibody response against invading pathogen. This developmental process involves extensive morphological and functional alterations that begin early after antigenic stimulation. These include chromatin restructuring that is critical in regulating gene expression, DNA rearrangement and other cellular processes. Here we outline the recent understanding of the three-dimensional architecture of the genome, specifically focused on its contribution to the process of B cell activation and terminal differentiation into antibody-secreting cells.
Introduction
Upon recognition of microbial-derived antigen, quiescent naïve B lymphocytes undergo rapid clonal expansion. This generates a large population of B cells with identical antigen specificity, engineered to combat the activating pathogen [1,2]. Ultimately proliferation is accompanied by differentiation, where activated B cells terminally differentiate into mature effector cells, such as antibody-secreting plasma cells and memory cells. Concurrent with this clonal expansion, B cells undergo somatic hypermutation (SHM) and class switch recombination (CSR) to diversify their antibody affinity and function [3]. Furthermore, activation induces dramatic transcriptional and morphological changes ensuring the B cells become solely dedicated to their antibody production function. Recent technological advances have shown that the two metres of DNA in every nucleus is packed in an intricate, non-random and critically important three-dimensional network [4,5]. As such, this organisation influences fundamental cellular processes, from transcription, DNA replication and repair to recombination and mitosis. Here we discuss how genome organisation might influence, and adapt to, the molecular and cellular rigors of rapid clonal expansion and differentiation into antibody-secreting plasma cells.
Loops, TADs and compartments—from linear genetics to 3D epigenetics
Enhancers have long been known to regulate gene expression over vast linear distances in a position- and orientation-independent manner [6]. However, for many years the mode of action remained enigmatic until it was shown in erythroid cells that enhancers of the β-globin locus, located 40–60 kb away from the globin promoter, come into close physical proximity with the promoter [7,8]. This DNA loop was erythroid specific. This work was the first evidence that enhancers exert their functions by recruiting transcription factors to stabilise RNA polymerase at the promoter through a three-dimensional contact, with the intervening sequence being looped out.
The communication between enhancers and promoters can be disrupted when a cis-element, traditionally known as insulator, is located between the two [9]. Over decades of active searching by scientists, the CCCTC-binding factor (CTCF) appears to be the only example of vertebrate insulator protein [10]. Unlike an enhancer, CTCF works in a position- and orientation-dependent manner and in some cases DNA methylation status is critical in its function [11,12]. Again, the underlying molecular mechanism of insulation remained unclear until the development of the chromosome conformation capture (3C) technique, which utilises a proximity ligation assay to evaluate the three-dimensional proximity of two genomic regions [13]. A high throughput approach using next-generation sequencing (Hi-C) was later developed to assess all DNA–DNA interactions genome-wide [14,15]. Hi-C revealed that complex genomes are spatially segregated into discrete globules of DNA, known as topologically associating domains (TADs). TADs are characterised by preferential interaction within and not between TADs [16,17]. Intriguingly, CTCF binding sites are enriched at the margins, or boundaries, of TADs. Deletion or disruption of these sites dramatically alters TAD structure [17–19], while acute degradation of CTCF completely removes TADs [20,21], indicating that CTCF is essential in the establishment of TADs. Recently it was demonstrated that CTCF, together with cohesin, act as architectural proteins and directly help in the formation of TADs through a loop-extrusion mechanism [18,22–24] (Figure 1A). Therefore, concurring with the traditional view as an insulator protein, CTCF can prevent cross-talk between promoters and enhancers by partitioning them into two different TADs. Thus TAD boundaries play a critical role in gene regulation as they facilitate appropriate interaction between enhancer and promoter, while insulating against inappropriate interactions. For example, recent studies have shown that disruption of TADs can lead to inappropriate interactions between enhancers and promoters, resulting in gene deregulation and disease [25,26].
Two major mechanisms of chromatin organisation.
At a broader scale, TADs with similar chromatin state tend to aggregate and form compartments [14]. Early Hi-C studies described the genome being partitioned into A- and B-compartments, where cross-talk is common among the same compartment type but minimal between them [14]. Compartment A generally associates with euchromatic regions and harbours active genes, whereas compartment B generally contains inactive heterochromatin. Subsequent Hi-C studies with greater resolution further divided the compartments into six different sub-compartments with distinct histone marks and epigenetic properties [15]. It is therefore speculated that the compartmentalisation is mediated by the similar physical properties of different chromatin states, possibly via phase separation [27–31] (Figure 1B). In support of this, several studies have demonstrated HP1α and other heterochromatic H3K9me3 recognition complexes are able to form phase-separated liquid droplets, containing other macromolecules, such as DNA [29,32,33]. Interestingly, other studies have shown the compartments remain entirely unaffected upon the degradation of CTCF or cohesin, which would completely abrogate TADs [20,21,34,35], demonstrating that compartmentalisation is independent of CTCF and the loop-extrusion mechanism.
While TADs are thought to be largely conserved between cell types, compartments and loops display lineage-specificity [14–16,36]. Genes can switch compartments and this generally associates with the entire TAD flipping from one compartment to another. A compartmental switch from A to B almost always associates with shutting down of transcription. However, switching from B to A does not necessarily link to transcriptional up regulation, but simply endows a permissive environment for looping to occur [37]. Thus promoter-enhancer contact is highly lineage-specific and is believed to be the ultimate factor in contributing cell type and stage specific transcriptional changes.
Early activation—discrete genome reorganisation partitioned from DNA replication and cell division
Shortly after antigen stimulation, naïve quiescent B cells experience a tremendous change in both morphology and function. Early studies documented an exit of G0 phase and entry into cell cycle, accompanied with increases in synthesis of RNA and protein, histone acetylation, and an increase in cell size so dramatic that it is easily observable under a conventional microscope [38–41]. Several recent studies have further shown during this early activation chromatin decondenses, and this requires the transcription factor Myc and energy input in the form of ATP, indicating it is a highly energy-dependent process [42–44]. Intriguingly, the first cell division does not occur until ∼30 h post-activation, relative to the subsequent rapid cell division every 6–10 h thereafter [45,46]. Transcriptomic profiling from our group at different time points post-activation has further stratified the cellular events during this pre-mitotic activation phase [47]. In the hours shortly after activation, gene ontology reveals that B cells dedicate their energies to RNA processing and ribosome biogenesis, and various metabolic processes (Figure 2). Later, just prior to the first division, the focus changes to genes that control DNA replication, packaging, chromatin remodelling and DNA conformational changes.
Cellular events during B cell clonal expansion and differentiation.
In striking contrast with the large number of early transcriptional changes after B cell activation, Hi-C analyses have revealed that genome organisation is relatively unaltered for the first 10 h after activation and undergoes a large genome reorganisation thereafter (Figure 2). Further investigation reveals that the majority of this genome reorganisation event occurs in the G1 phase of the cell cycle, prior to the initiation of DNA replication. Thus, it appears that genome architecture remodelling is independent of DNA synthesis and also mitosis, suggesting a partitioning of genome reorganisation from these physically challenging cellular processes. Like a number of previous studies on B cells [42,47–51], we have found pre-mitotic genome restructuring was associated with remarkable compartment stability [47], suggesting that a majority involve DNA loops. This is in striking contrast with stem cells, where there is extensive A/B compartment switch during lineage specification [36]. Presumably, this is a slower process as genes in the inactive compartment will need to flip to the permissive environment before any transcription could happen.
B cell activation induces the loss of long-range chromosomal contacts and the formation of thousands of DNA loops [42,47]. These short-range contacts mainly consist of CTCF-mediated loops and other enhancer-promoter contacts. It was demonstrated that the promoters gain interactions with multiple non-coding regions at this late stage of pre-mitotic phase and gene expression level is positively correlated with interaction frequency [42,47,49,52]. Thus, it appears that the predominant function of these loops is to drive and augment gene expression. Previously it has be proposed that enhancer-promoter interactions can modulate gene expression in two ways, being either instructive or permissive [53]. The positive correlation between loop formation and gene expression immediately before the first division suggests that these de novo loops are presumably largely instructive [47]. In contrast, in the hours shortly after activation the huge transcriptional burst with that is associated with minimal architectural changes suggests that pre-existing chromatin structures or loops are already poised and permissive, enabling rapid gene activation upon antigen stimulation.
Since the architectural changes are independent of DNA synthesis and mitosis, it is possible that the process of transcription or the action of transcription factors mediates genome reorganisation. There is growing evidence that the transcription of long non-coding RNA can mediate the 3D restructuring of chromatin [54,55]. For instance, during early T cell development, the transcription of a non-coding RNA ThymoD at an enhancer of the T cell lineage-specifying factor, Bcl11b, leads to the relocation of locus from nuclear periphery to interior, subsequently allowing proper enhancer-promoter interaction and gene up-regulation [56]. On the other hand, transcription factors can themselves remodel chromatin conformation in a transcription-independent manner. The aforementioned CTCF and cohesin are two examples [20,21,34,35]. In addition, our group has previously demonstrated that the B cell factor Pax5 does not require active transcription in order to wire the genome towards commitment to the B cell lineage [50]. This suggests that some lineage-specific transcription factors are also able to reorganise genome without the need for transcription.
Expansion phase—de novo loop formation during class switch recombination
Once the activated B cells undergo the first cell division, they rapidly divide and expand for many days until terminal differentiation [57]. Despite the time and number of generations that each cell has experienced, Hi-C analyses demonstrated the genome architecture stays remarkably similar [47] (Figure 2). This suggests the loops and structures that were established prior to the first division are preserved over many successive cell divisions. This is in line with cellular studies showing that in B and T lymphocytes fate decisions for terminal differentiation are made within a day after antigen stimulation [46,58–66], probably prior to the first cell division, and the stimuli received afterwards could not intervene. We speculate that the loops and other 3D structures established prior to the first division would pre-program subsequent transcription strength and profile, and the cells thereafter become somewhat refractory to external signals as the genome architecture is already established.
Although the global chromatin conformation remains very similar through successive cell divisions, some local architectural changes are critical. Class switch recombination is the molecular hallmark of B cell activation, where deletional DNA recombination elicits the replacement of the expressed immunoglobulin (Ig) µ and δ heavy chain constant regions (CH) with the downstream γ, ε, or α CH regions [67,68]. This is a tightly regulated process involving the interplay of non-coding transcription, double strand breaks (DSBs), DNA repair and 3D chromatin looping. In a resting B cell, a DNA loop is formed at the IgH locus, between the 3′ regulatory region (3′RR) and the intronic enhancer (Eµ) just upstream of the constant regions (Figure 3 left), which are ∼200 kb apart [69–71]. Downstream of 3′RR are multiple CTCF binding sites and recent studies have suggested that the DNA–DNA interaction is mediated by CTCF via loop-extrusion [72]. Additionally, the interaction co-occurs with a non-coding germline transcription (GLT) around the µ switch (S) region, which plays an important role in unwinding the S region DNA for activation-induced deaminase (AID), a ssDNA mutator, to induce DSBs [3,73]. Once the B cell is activated, depending on the stimulation and cytokines it received, for example LPS with IL-4 will result in a switch to IgG1, the corresponding S region (Sγ1 herein) will come into close proximity to the pre-existing loop, creating a three-point DNA synapsis [71] (Figure 3 right). In the meantime, the stimuli also trigger GLT at the acceptor switch region (Sγ1) followed by recruitment of AID to the interacting foci. DSBs are then induced at the S regions and subsequently resolved by non-homologous end joining (NHEJ) with the intervening DNA deleted. How such activation-induced chromosomal looping is established and its relationship with germline transcription remains unclear, despite several attempts in addressing the question [69,74–78]. As the induced germline transcription and 3D looping is antigen and cytokine dependent, it is conceivable that the downstream factors which elicit the GLT are also contributing to the juxtaposition of the acceptor switching region to the Eµ-3′RR loop [76] (Figure 3, middle path). However, given the emerging evidence about the role of transcription in shaping genome architecture [54,55,78], the germline transcript itself or the process of transcription could plausibly assist the formation of the DNA synapsis (Figure 3, top path). Also, conversely, it is possible that the looping itself allows the 3′RR to gain access and activate the GLT, herein the downstream protein factors induced by cytokine signals play an additional architectural role [76,78] (Figure 3, bottom path).
DNA synapsis during class switch recombination.
Other than CTCF, recent studies have unravelled the architectural roles of some other proteins in mediating CSR. The DSB damage response protein p53-binding protein 1 (53BP1) was once thought to be critical in resolving the DSB [79], however, several studies have demonstrated an additional role of 53BP1 in mediating the chromosomal loop in IgH locus [80,81]. In the absence of 53BP1, interaction between 3′RR and Eµ is greatly diminished. Furthermore, Mediator, an evolutionarily conserved protein complex that is essential for PolII transcription, has been shown to facilitate long-range DNA–DNA interactions between the switch regions and the 3′RR, and also the transcriptional activation of GLT at those switch regions [78]. Lastly, a ubiquitous transcription factor YY1 was also revealed to mediate the looping between Eµ and 3′RR [82]. This loop is still present when YY1 lacks its transcriptional activation domain, suggesting YY1 has a direct architectural role in establishing the long-distance loop.
Terminal differentiation—colocalization of genes for maximal antibody production
The prominent endoplasmic reticulum, extensive Golgi apparatus and eccentric position of the nucleus with wheel-like chromatin configuration are all defining features of plasma cells, thought to enhance their massive antibody production [83,84]. In contrast, the plasma cell genome organisation during differentiation and its functional significance to massive antibody synthesis and secretion is poorly understood. Similar to other terminally differentiating cells, heterochromatinization is thought to contribute to locking in the plasma cell transcriptional programme, while shutting down other unnecessary genes, for instance the regular B cell signature genes [85,86]. Deposition of H3K27me3 by EZH2 and the removal of H3K4me1 by LSD1 are implicated in this process [87–89]. Consistent with this, our Hi-C experiments documented a wave of chromatin reconfiguration during this final stage of differentiation, accompanied by substantial transcriptional changes [47] (Figure 2). The restructuring involves enhancer-promoter loops and also a considerable amount of compartment switching [47,90]. For example, the Prdm1 locus, which codes for the master regulator for plasma cell differentiation Blimp-1, exhibits the strongest enhancer-promoter interaction at the plasmablast stage [47]. Whereas the DNA contacts for Bcl6, an important transcription factor during B cell activation, are essentially lost.
Three-dimensional DNA modelling from conformation capture assays revealed that the chromosomes of plasma cells adopt a unique elongated, α-helical structure, resulting in the characteristic cartwheel appearance [90,91] (Figure 4). This is in striking contrast with the spherical-like structure exhibited by the chromosomes in naïve B cells. The cartwheel, elongated configurations are mainly contributed by the larger chromosomes (chr1–9) where the long-range intra-chromosomal interactions are severely diminished. Instead, intriguingly, there is an enrichment of inter-chromosomal contacts between chromosome ends.
Chromosome reorganisation during terminal plasma cell differentiation.
Using 3D imaging along with chromosome conformation capture, another study has shown that in plasma cells the active immunoglobulin genes (Igκ, IgH and IgJ), from three different chromosomes, are colocalised forming an apparent transcription factory [92] (Figure 4). Even more strikingly, this active cluster is located at the nuclear periphery, a region generally associated with gene silencing. More than 30 years ago, Blobel [93] proposed a gene gating hypothesis in which active genes are recruited near the nuclear pores for efficient transcript export. With recent technological advances in sequencing-based methods and electron microscopy, nuclear pores appear to be an exception at the nuclear periphery where gene activation is allowed [94,95]. In humans it has also been demonstrated that the nuclear pore complexes control gene expression by tethering to super-enhancers [96], suggesting an important regulatory role. In plasma cells, the colocalized Ig genes often have their transcripts share the same trafficking channel to the cytoplasm [92]. Therefore, it appears Ig genes colocalize near the nuclear pore for maximal transcript export, which further facilitates efficient and massive antibody production.
To accommodate massive protein synthesis and secretion, plasma cells elicit the unfolded protein response (UPR) to cope with the ER stress [97]. It is found that in plasma cells the genes involved in the UPR pathway, for instance Xbp1, Atf4 and Bax, also aggregate from different chromosomes and form transcription hubs [90] (Figure 4). Collectively, the inter-chromosomal interactions of Ig and UPR genes in plasma cells enable a co-ordinated and dedicated production of antibody.
Concluding remarks—Long-lived memory formation?
While a large body of research has focussed on understanding the cellular and molecular events governing B cell activation and terminal differentiation into antibody-secreting cells, very little is known about how the genome is wired in long-lived memory cells and how this contributes to their rapid recall upon rechallenge, due to the lack of good surface markers and their low abundance. Current literature suggests that memory B cells still possess a considerable amount of B cell signature and gene programme. For example, transcriptional profiling has shown a relatively similar gene expression landscape between memory and naïve B cells, with noticeable elevations of some anti-apoptotic genes and costimulatory molecules in memory cells [98]. Moreover, during a recall, memory B cells are able to promptly differentiate into plasmablasts and produce antigen-specific antibody, yet surprisingly they are also capable of undergoing a second round of somatic hypermutation and class switch recombination to modify the antibody specificity and isotype [99]. Thus, it is conceivable that the genome organisation of memory B cells would be different from terminally differentiated plasmablasts, while arranging similarly to naïve or activated B cells. Nevertheless, how such chromatin conformation facilitates a rapid memory response is entirely unknown. However, it is likely that it would involve some pre-existing loops poised for rapid activation.
Although the study of transcriptional regulation of memory B cells has long been hampered by their scarcity and heterogeneity, with recent advances in single cell techniques it is now possible to investigate them in greater details. Single-cell RNA-seq [100] and ATAC-seq [101] allow us to characterise the dynamics of transcription and chromatin accessibility, whereas high throughput microscopies and genome architecture mapping (GAM) [102] are able to resolve the chromatin conformation in each individual cell. These single-cell approaches could therefore provide valuable insights into the molecular identity of memory cells, and also other processes during B cell activation.
Perspectives
Three-dimensional genome organisation is critical in regulating a plethora of intricate cellular processes including gene transcription, DNA recombination, repair, replication and mitosis. This is of particular importance to B lymphocytes which, upon antigenic challenge, undergo a massive transcriptional burst, class switch recombination and rapid clonal expansion within a short period of time.
Naïve B cells possess pre-existing chromatin loops poised for rapid transcriptional burst upon stimulation and the subsequent pre-mitotic genome organisation is partitioned from DNA replication and mitosis. The newly established genome organisation will then dictate the transcriptional profiles over the successive clonal expansion until final differentiation into antibody-secreting cells.
Understanding the role of genome architecture in memory B cells is of particular interest to decipher the molecular principles behind their much more rapid secondary response. Also, individual cis-elements such as enhancers, loop anchors and long non-coding RNAs should be studied in depth to elucidate their contributions to genome organisation during B cell activation and differentiation.
Competing Interests
The authors declare that there are no competing interests associated with the manuscript.
Open Access
Open access for this article was enabled by the participation of Walter and Eliza Hall Institute in an all-inclusive Read & Publish pilot with Portland Press and the Biochemical Society under a transformative agreement with CAUL.