Article Text

PDF

DNA methylation 101: what is important to know about DNA methylation and its role in SLE risk and disease heterogeneity
  1. Cristina M Lanata,
  2. Sharon A Chung and
  3. Lindsey A Criswell
  1. Russell/Engleman Rheumatology Research Center, Department of Medicine, University of California San Francisco, San Francisco, California, USA
  1. Correspondence to Professor Lindsey A Criswell; lindsey.criswell{at}ucsf.edu

Abstract

SLE is a complex autoimmune disease that results from the interplay of genetics, epigenetics and environmental exposures. DNA methylation is an epigenetic mechanism that regulates gene expression and tissue differentiation. Among all the epigenetic modifications, DNA methylation perturbations have been the most widely studied in SLE. It mediates processes relevant to SLE, including lymphocyte development, X-chromosome inactivation and the suppression of endogenous retroviruses. The establishment of most DNA methylation marks occurs in utero; however, a small percentage of epigenetic marks are dynamic and can change throughout a person’s lifetime and in relation to exposures. In this review, we discuss the current understanding of the biology of DNA methylation and its regulators, the measurement and interpretation of methylation marks, the effects of genetics on DNA methylation and the role of environmental exposures with relevance to SLE. We also summarise research findings associated with SLE disease risk and heterogeneity. The robust finding of hypomethylation of interferon-responsive genes in patients with SLE and new associations beyond interferon-responsive genes such as cell-specific methylation abnormalities are described. We also discuss methylation changes associated with lupus nephritis, autoantibody status and disease activity. Lastly, we explore future research directions, emphasising the need for longitudinal studies, cell tissue and context-specific profiling, as well as integrative approaches. With new technologies, DNA methylation perturbations could be targeted and edited, offering novel therapeutic approaches.

  • DNA methyation
  • systemic lupus erythematosus
  • environmental exposures

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/

Statistics from Altmetric.com

Introduction

In the last 20 years, there have been major advances in our understanding of the underlying genetic risk for SLE, with >80 established susceptibility loci.1 As our knowledge advances, the estimated heritability of SLE has increased from 12% to 43.9%.2 Nevertheless, genetic factors do not fully explain disease susceptibility, and environmental exposures such as smoking, ultraviolet (UV) light, infections, the microbiome, virome, diet and stressful events have all been associated with disease risk.3 Epigenetic modifications may link environmental exposures to changes in gene regulation that lead to disease, and it has been hypothesised as a mechanism contributing to the missing heritability of SLE. Among all the epigenetic modifications, DNA methylation perturbations have been the most widely studied in SLE.4 DNA methylation can be transmitted from parent to daughter cells,5 indicating that this form of epigenetic modification could represent a molecular mediator capable of propagating the memory of past cellular perturbations.6 In this review, we summarise the current understanding of mechanisms of DNA methylation, technical considerations related to its measurement and interpretation, and the current understanding relating it to SLE risk and disease heterogeneity.

Definition of epigenetics

Epigenetic processes modify gene expression without changing the DNA sequence of the gene. The definition of epigenetics has evolved substantially. In the 1950s, the concept of epigenetic systems as mediators of cellular memory and cellular identity arose.7 In the 1970s, DNA methylation was proposed to be a transcriptional regulatory mechanism that could be maintained through cell division.8 9 By linking DNA methylation to an epigenetic phenomenon, the definition of epigenetics became a molecular one. The development of next-generation sequencing for studying DNA methylation and other regulatory processes genome-wide has resulted in further evolution of the definition of epigenetics to encompass virtually all genome-wide chromatin modifications that do not change the underlying DNA sequence, including DNA methylation, histone modifications, chromatin accessibility, microRNA regulations and two-dimensional chromatin interactions.10 Such changes in DNA and chromatin structure correlate with changes in chromatin accessibility and transcription factor binding, which can lead to changes in gene expression. Among these, DNA methylation is the best-studied epigenetic modification in SLE.

DNA methylation

DNA methylation occurs when a methyl group is added to the fifth carbon of cytosine residues that are linked by a phosphate to a guanine nucleotide (a CpG dinucleotide) by DNA methyltransferases (DNMT1, DNMT3A and DNMT3B). This addition forms 5-methylcytosine (figure 1).11 The methyl group is obtained from the methyl donor S adenosine methionine (SAM). SAM levels are influenced by dietary intake of vitamins such as folic acid, vitamin B12 and pyridoxal phosphate (vitamin B6). CpG dinucleotides are methylated in a tissue and cell-type-specific manner, and their methylation critically influences cell differentiation and tissue development.12 13

Figure 1

DNA methylation and demethylation. Representation of DNA methylation: The addition of a methyl group at the fifth carbon position of the cytosine base. This process is mediated by the DNA methyltransferase (DNMT) enzyme family. DNA demethylation is mediated by the ten eleven translocation (TET) enzyme family. Active demethylation is a sequential process in which 5-methylcytosine is converted to 5-hydroxymethylcytosine (5-hmC), which is converted to 5-formylcytosine (5-fC) and finally 5-carboxylcytosine (5-caC). This process readies the sites for thymine-DNA glycosylase (TDG) to remove both 5-fC and 5-caC. BER, base excision repair; OG, oxoglutarate; SAH, S-adenosyl-L-homocysteine; SAM, S-adenosyl-L-methionine. Adapted with permission from Martin et al.105

DNA methylation and transcriptional regulation

Key points

  • DNA methylation occurs in CpG nucleotides that are distributed unevenly across the genome.

  • DNA methylation patterns are cell and tissue specific.

  • Methylation of CpGs in the promoter region is associated with gene repression.

  • Hypomethylation in the promoter region and hypermethylation in the gene body are associated with gene expression.

  • Methyl-binding proteins localise to methylated DNA to form inactive and compact heterochromatin.

DNA methylation effects on transcriptional regulation differ depending on the location of the CpG site (intragenic vs promoter region vs enhancer). Most of the genome does not contain CpG sites. However, clusters of CpG sites, termed ‘CpG islands’, occur and commonly span promoters of house-keeping genes. These promoter CpG islands typically remain unmethylated, resulting in active gene expression (CpG island, promoter; figure 2).14 Methylated CpG island promoters are associated with gene repression. CpG island ‘shores’ are regions of comparatively low CpG density, located approximately 2 kb from CpG islands. Shores also exhibit tissue-specific differential methylation and methylation of shores is associated with gene silencing. Gene bodies tend to have intermediate CpG densities. Unlike CpG island promoters, extensive exonic or genic methylation is typically associated with active gene expression. Beyond these regions, the genome has a lower-than-expected frequency of CpG sites which are typically methylated (intergenic; figure 2). This includes CpG-poor promoters and distal enhancers that regulate tissue-specific genes.15 DNA methylation frequently occurs in repeated genetic sequences and helps to suppress the expression and mobility of transposable elements and retroviral elements such as long interspersed nuclear element 1 (LINE-1).16

Figure 2

A schematic representation of the genome. White CpG sites reflect non-methylated CpG sites. Black represents methylated CpG sites. Gene expression can occur in the setting of unmethylated CpG sites in the promoter region and methylated CpG sites at the gene body (genic). Adapted with permission from Stirzaker et al.14

Demethylated promoter regions of genes are thought to facilitate gene transcription by allowing a mechanical opening and accessibility of DNA to transcription factor binding and therefore gene expression. However, recent studies have shown that transcription factor binding can induce DNA hypomethylation in some situations, therefore making it difficult to establish a generalised rule about the relationship between DNA methylation and gene transcription.17 DNA methylation is also associated with other epigenetic mechanisms such as histone modifications, particularly the absence of histone H3 lysine 4 methylation (H3K4me0) and the presence of H3 lysine 9 demethylation (H3K9me2). These histone marks are associated with gene silencing.

DNA methylation can also influence transcription of genes through methyl binding domain proteins (MBDs).18 MBDs recruit histone-modifying and chromatin-remodelling complexes to methylated sites, thereby forming compact, inactive heterochromatin. MBD2 has been linked to immune system function and studied in the context of autoimmunity. Although loss of MBD2 results in reduced numbers of T regulatory (Treg) cells, MBD2 null mice surprisingly do not develop autoimmunity. In humans, increased levels of MBD2 and global demethylation in CD4+T cells have been observed in several autoimmune disorders, including SLE.19 20 However, considerable effort will be required to fully understand the complexities of MBD2 function in autoimmunity.

Establishing and maintaining DNA methylation marks

Key points

  • DNA methylation occurs during gametogenesis and after fertilisation.

  • After each cell division, methylation marks are maintained by the DNA methyltransferases DNMT1, DNMT3A and DNMT3B.

  • The majority of methylation marks are stable; however, between 10% and 20% on average vary over time.

  • It is unclear how methylation marks vary over time in patients with SLE.

DNA is methylated by DNA methyltransferases DNMT1, DNMT3A and DNMT3B.21 22 They are highly expressed in undifferentiated cells and germ cell precursors, but present at much lower levels in somatic cells. During each cell replication, fully methylated CpG sites are converted into hemimethylated sites, which are then re-methylated primarily by methyltransferase DNMT1.23 These methylation marks are preserved for the rest of the life of an organism, with small tissue-specific changes.23 Of note, recent studies suggest that rearrangement of DNMT protein domains influences their enzymatic activity, which may contribute to the development of disease (both autoimmune and non-autoimmune).24–27

Methylation patterns of the germline and somatic cell lineages are established during early embryonic development. Methylation of CpG sites can also persist from the germline of one parent to the zygote, marking the region as being inherited from one parent or the other, known as genetic imprinting.28 Increasing evidence from animal models and human observational studies suggests that the 'programming' for various adverse health outcomes occurs during the in utero and early postnatal period, likely by disturbing DNA methylation marks. For example, maternal exposure to environmental polybrominated diphenyl ethers has been associated with decreased methylation of the tumour necrosis factor alpha (TNFα) promoter and increased TNFα protein levels in umbilical cord blood obtained at infant delivery.29 Although multiple studies have associated DNA methylation aberrations with in utero environmental exposures (table 1), their role in SLE remains unknown.

Table 1

Exposures associated with DNA methylation changes and their associations with SLE as well as other diseases

How stable are DNA methylation marks?

Studies of the temporal stability of DNA methylation in healthy individuals indicate that the majority of CpG sites are stable over time, with only 8%–18% of CpGs varying in methylation status over 6–10 years.30–32 A study comparing monozygotic twins of different ages (3 and 50 years) revealed that epigenetic differences between twins increase throughout life, presumably as a consequence of different environmental exposures. A recent twin study indicated that among CpGs exhibiting variation, variation of 90% of the sites was explained solely by the individual’s unique environmental factors and only 10% of these sites were influenced by familial factors (genetic or shared environment).33 A recent study examined the stability of methylation marks in the promoter region of two genes previously associated with SLE, CD3Z and VHL, in 16 patients at two points in time (median 24 months between assessments). This study found that methylation of the CD3Z promoter between the two time points from each patient was modestly correlated (r=0.488). In contrast, DNA methylation levels were highly correlated for CpGs at the VHL promoter (r=0.942), suggesting that some methylation marks change over time while others do not.34 Further longitudinal studies of DNA methylation marks in SLE are needed to assess their relationship to disease activity and treatment response.

Important functions of DNA methylation relevant to SLE

Lymphocyte development

During their development, lymphocytes undergo a series of cell-fate decisions that are mediated by changing patterns of DNA methylation. Many genes that are initially methylated in haematopoietic stem cells undergo selective demethylation in a tissue-specific or lineage-specific manner. DNMT1 is required for proper maturation of thymic progenitors,35 and conditional deletion of DNMT1 in early double-negative (CD4, CD8) thymocytes is followed by cellular and DNA replication and DNA demethylation. This is relevant since naïve T cells from patients with SLE have been reported to have global hypomethylation due to decreased DNMT1 activity. In addition, DNA methylation influences the expression of T cell cytokines interleukin (IL)-2, IFN-gamma, IL-4 and IL-13,35 36 and the Treg-canonical transcription factor, FOXP3.37

X chromosome inactivation

X chromosome inactivation (XCI) in female embryos, which occurs early in zygote development, is also mediated by DNA methylation. In females, one X chromosome is randomly selected for chromosome-wide transcriptional silencing, which equalises the expression of X-linked genes between genders.38 During development and ageing, varying degrees of escape from XCI via demethylation can occur in cells within a tissue resulting in phenotypic differences.39 The female bias observed in SLE has been associated with genes that escape XCI. The X chromosome has the greatest density of immunity-related genes outside of the major histocompatibility complex (MHC) region, and overexpression of these genes is a critical factor in the breakdown of self-tolerance. Individuals with Turner syndrome (XO) have a low risk of developing SLE compared with XX females.40 Individuals with Klinefelter’s syndrome (XXY) have a 14-fold increased risk of developing SLE compared with XY males,41 suggesting that gene dosage from the X chromosome influences SLE susceptibility. Consistently, CD4+ T and B cells from women with SLE are found to have demethylated promoters and subsequent overexpression of the X-linked genes CD40LG, TLR7, CXCR3 OGT, miR-98, let-7f-2, miR 188–3 p, miR-421 and miR-503, suggesting that their overexpression results from a reactivated X chromosome.38 42 43

Suppression of endogenous retroviruses and other transposable elements

The high interferon (IFN) signature seen in SLE may be due to over-transcription of retrotransposable elements, resulting from DNA methylation aberrations. Retrotransposable elements or endogenous retroviruses (ERVs) comprise as much as 40% of the human genome. Some retrotransposable elements have substantial copy number, such as 300 000 for Alu elements, and 10 000 for LINEs.2 Most ERVs in the genome are silenced by DNA methylation.44 45 Sensors of nucleic acids that provide protection from viral infection are implicated in sensing ERVs and can trigger an IFN response. Therefore, the role of ERVs as potential triggers of IFN production in SLE has been examined. A recent study showed that the transcription of LINE-1 is increased in kidney tissue from patients with lupus nephritis and that increased LINE-1 transcription correlates with greater expression of type I IFN.46 Furthermore, methylation of CpG sites in the LINE-1 promoter is negatively correlated with LINE-1 expression. Thus, demethylation may play a critical role in the overexpression of LINE-1.46 HRES-1, another ERV, has been shown to be overexpressed in SLE. SLE B cells have been characterised by their incapacity to methylate the HRES-1 promoter, both in unstimulated and in anti-IgM stimulated B cells.47

How does aberrant DNA methylation occur?

Key points

  • Methylation changes can occur due to the loss of maintenance of methylation marks by the DNMT enzymes during cell division (passive demethylation) or by active removal of methylation marks by the TET enzymes (active demethylation).

  • DNMT dysfunction has been linked to oxidative stress and environmental exposures.

  • The role of active demethylation by the TET enzymes in SLE remains to be defined.

  • Research regarding the regulators of DNMTs and TET enzymes will likely shed light into disease pathogenesis as well as therapeutic interventions.

Passive DNA demethylation: role of DNMTs and oxidative stress

Transcription factors, microRNAs and other proteins may influence DNMT activity and could cause DNA methylation aberrations. In SLE, changes in DNMT1 expression have been linked to aberrations in DNA methylation. Significantly lower DNMT1 and DNMT3A transcript levels in patients with SLE were observed compared with healthy controls.48 With lower levels of DNMT1, DNA methylation patterns are not completely copied from parent to daughter cells during mitosis, resulting in passive DNA demethylation. DNMT1 is regulated by the mitogen-activated protein kinase/ERK (MAPK/ERK) signalling pathway. Impairment of this pathway causes a failure to upregulate DNMT1 during mitosis and has been shown in vitro and in vivo to contribute to SLE flares.49 In SLE, T and B cells with a defective ERK signalling pathway have demethylation and overexpression of methylation-sensitive genes.47 50 Many studies have linked human SLE flares with heightened oxidative stress.50 51 Increased oxidative stress has been implicated in the deregulation of the ERK signalling pathway, leading to DNMT1 downregulation and global DNA demethylation in T cells from patients with SLE.50 Environmental exposures such as UV light, cigarette smoke, silica and other stressors stimulate reactive oxygen species production, which in turn could lead to DNA methylation aberrations by disrupting the MAPK/ERK pathway.52

Active DNA demethylation: the role of 5-TET proteins

Methylated cytosine can be converted back to unmethylated cytosine by ten eleven translocation (TET) proteins. Unmethylated cytosines are then restored back to the genome through base excision repair.53–57 This has been defined as active modification-active removal and is independent of DNA replication (figure 1). The regulation of TET proteins is just beginning to be studied, but factors such as hypoxia, iron, vitamin C and post-transcriptional and post-translational modifications by microRNAs have been implicated.58 Generally, studies have shown that key tissue-specific transcription factors recruit TET proteins to sites where CpGs need to be demethylated. Its role has been primarily studied in preimplantation development. This mechanism of demethylation is of interest in processes such as SLE.

How does one assess DNA methylation aberrations?

Key points

  • Whole genome bisulfite sequencing can profile the ~28 million CpG sites inthe human genome, but it is costly and computationally burdensome.

  • Arrays are a cost-effective way to asses for genome-wide methylation marks; however they only cover ~4% of all CpG sites across the genome.

The current ‘gold standard’ technique for identifying methylated cytosines across the genome is whole-genome bisulfite sequencing (WGBS).14 In this process, DNA is treated with sodium bisulfite, which converts unmethylated cytosine to uracil, leaving methylated cytosines unchanged. Treated DNA then undergoes whole-genome sequencing. WGBS can provide a complete map of the ~28 million CpG sites in the human genome.59 However, the high cost of this approach and significant technical expertise required to generate and process WGBS data precludes application of this method in large cohort studies at this time, and thus no WGBS studies have been performed in SLE.

In recent years, the Illumina HumanMethylation450 (HM450) BeadChip has provided a user-friendly platform to profile DNA methylation in human samples. The HM450 uses bisulfite-converted genomic DNA hybridised to arrays containing predesigned probes to distinguish methylated (cytosine) and unmethylated (uracil) CpG sites. A single-base step incorporates a labelled nucleotide that is fluorescently stained. Scanning of the array detects the ratio of fluorescent signal arising from the unmethylated probe to the methylated probe. The proportion of DNA methylation at a particular CpG site (the methylation beta-value (β)) is calculated by taking the ratio of the methylated to the total (unmethylated+unmethylated) signal. A β-value of 0 represents a completely unmethylated CpG site, and a β-value approaching 1 represents a fully methylated CpG site.60 Probes have been preferentially designed to target CpG islands due to the established relationship between DNA methylation at promoter CpG islands and gene expression. The HM450 BeadChip interrogates 482 422 cytosines, which represents approximately 1.7% of all CpG sites in the human genome. However, these sites are enriched for CpG (99.3%) islands and almost half of the probes on the array cover intergenic regions, such as bioinformatically predicted enhancers, DNase I hypersensitive sites and validated differentially methylated regions (DMRs).60 61 However, the HM450 platform may miss important regulatory regions. Thus, Illumina released the MethylationEPIC (EPIC) BeadChip, with new content specifically designed to target these regions. EPIC covers >850 000 CpG sites, including >90% of the CpGs from the HM450 and an additional 413 743 CpGs. Even though the additional probes improve the coverage of regulatory elements, only 7% of distal and 27% of proximal ENCODE regulatory elements are represented.60 An important caveat to remember is that array design is heavily biased due to inclusion of probes that interrogate only CpG sites that have been previously identified in methylation-based assays. Thus, there is a probe selection bias.

Epigenome-wide association studies (EWAS): caveats and pitfalls

Taking advantage of these new assays to interrogate DNA methylation marks across the genome, studies have been performed comparing individuals with SLE to healthy individuals. These EWAS are modelled after genome-wide association studies and test whether the level of DNA methylation at individual cytosines in the genome differs among individuals with SLE compared with healthy controls. It is inferred that differential methylation of CpG sites likely influences the expression of nearby genes, and thus differential methylation identifies specific genes that may affect SLE risk or the development of specific disease manifestations. This traditional EWAS approach has been criticised by several investigators62–66 based in part on complexities related to appropriate quality control, normalisation procedures and adjustments for background noise. These issues have been discussed in depth in several reviews.63 66 67 Below we discuss the issue of the interpretability of DNA methylation changes.

What is the hypothesis being tested in EWAS of SLE?

Most studies examine the association between level of DNA methylation and the outcome of interest. The primary hypothesis is generally that the change in methylation influences changes in gene expression that leads to disease or the change in methylation reflects an exposure associated with disease. However, given the cross-sectional nature of these studies, causality for these associations cannot be determined. The changes in methylation may be a manifestation of the disease itself. While these changes may reflect important pathogenic mechanisms, the inference of causality assumed in these hypotheses may not be valid.

Confounders are not fully accounted for in most SLE EWAS

Key points

When performing an epigenome-wide association study, there are important confounders to be considered:

  • Differences in cell proportions: When profiling unsorted peripheral blood, DNA methylation marks may reflect changes in cell proportions in cases and controls, rather than disease-associated changes.

  • Genetic ancestry as well as genetic variation influence methylation marks.

  • Medications can alter DNA methylation patterns.

Cell proportions in blood samples

A well-recognised source of variability of DNA methylation is the difference in cell-subtype proportions between experimental groups. Even when cells are isolated according to their surface markers, single-cell transcriptomics has revealed significant heterogeneity between cells that share the same surface markers.68 69 Even though bioinformatic methods exist to adjust for these differences, these methods likely are not able to account for all the methylation differences attributable to cell proportion differences seen between groups.70–72 Most EWAS report modest changes in methylation (eg, 20% change), after correcting for cell-subtype variation. This can be explained if the cellular reprograming is only happening in a subpopulation of cells or if the effects of cell subtypes have not been fully accounted for.

Genetic variation

Most associated single-nucleotide polymorphisms identified in genome-wide association studies of SLE are in non-coding regions, and thefunctional effects for many are unknown. DNA sequence variations can alter DNA methylation states and gene expression, linking the regulatory genome with individual genetic risk loci. In a recent study examining three major human immune cell types (CD14+ monocytes, CD16+ neutrophils and naive CD4+ T cells) in 197 healthy individuals, genetic variation explained the majority of DNA methylation variance for most genes, leaving a relatively modest independent proportion of DNA methylation changes that can be attributed to environmental effects.73

Variation in methylation can be attributed to genetic and non-genetic effects. Non-genetic effects can be further categorised as familial (shared environment) and non-familial (unique environment) effects. Studies of SLE discordant twins can also provide evidence of the genetic effect on DNA methylation, assuming the discordant twins share the same environment. Several twin studies have reported the heritability of DNA methylation between 18% and 37%.74–76 Discordant twin studies of SLE have identified several hypomethylated epigenetic marks associated with SLE in genes such as ITGAL (CD11a), PRF1 (perforin), CD70, GADD45A and IFN-responsive genes, strongly implicating an aetiological role for epigenetic factors in SLE.11 77 Overall, these studies highlight the importance of adequately accounting for effects of genetic variation for proper interpretation of EWAS results. To date, this has only been performed in one SLE EWAS described below.

Effects of population genetics

Emerging EWAS in various fields has revealed thousands of CpG methylation sites correlated with genetic ancestry.73 75 76 In SLE, a recent study comparing African-American (AA) to European American patients identified ethnic-specific clustering of DNA methylation patterns in naive CD4+ T cells. Pro-apoptotic and pro-inflammatory genes were significantly enriched with hypomethylated CpG sites in healthy AAs, with genetic variants within and directly disrupting CpG sites accounting for some ethnic-specific variability in DNA methylation.78 These ethnic-specific differences in methylation may help explain racial disparities in outcomes and disease manifestations observed in SLE. Further work is warranted in this area.

Medication effects

Medications can alter DNA methylation and are not accounted for in the majority of studies.79 In one SLE EWAS, the investigators tested whether methylation levels were associated with the most commonly prescribed medications in SLE.80 They found that 5196 differentially methylated CpGs (primarily hypomethylated) were associated with glucocorticoid use and 125 differentially methylated CpGs were associated with azathioprine (p<1.3×10-7, |Δβ|>0.05). Methylation changes were not associated with antimalarials, mycophenolate mofetil or methotrexate use. The negative findings associated with methotrexate and mycophenolate are likely a reflection of the small number of patients on these medications (n=22 and n=36, respectively). Methotrexate use has been shown to reverse methylation changes observed in B and T cells of patients with rheumatoid arthritis.81 Cyclophosphamide and biologics were not studied. Overall, there are sparse data on the effects of medications on methylation and more studies are needed to confirm these findings.

What have we learned about how changes in DNA methylation affect SLE risk and disease heterogeneity?

Two types of studies have been performed that address this issue: in vitro studies using targeted methylation-based assays and genome-wide studies using microarray data (eg, HM450).

Initial studies

The relationship between SLE and abnormal DNA methylation status was first described >20 years ago, when T cells from active patients with SLE were observed to have decreased global DNA methylation levels (15%–20% reduction).82 In vitro studies revealed that inhibitors of DNMT1 (5-azacytidine, procainamide or hydralazine) could induce hypomethylation with subsequent autoreactivity in naive T cells,83 providing strong evidence for causation in SLE. This was further supported by studies showing that disease activity in patients with SLE inversely correlated with global DNA methylation.84 In subsequent experiments, antigen-specific CD4+ T cells epigenetically modified with DNA methylation inhibitors overexpressed genes normally suppressed by this mechanism, including CD11a, CD70, CD40L, PRF1 and the KIR gene family. The altered cells became autoreactive, responding to self-class II MHC molecules without added antigen. These cells were sufficient to cause lupus-like disease in syngeneic mice. T cells overexpressing the same genes were found in patients with active SLE.42 85 These T cells have been further characterised as a specific subset with the following markers: CD3+CD4+CD28+CD11ahiCD70+CD40LhiKIR+. The proportion of these cells correlated with the severity of SLE flares.86

EWAS in SLE

Key points

  • The most robust finding from EWAS is the hypomethylation of interferon-responsive genes in patients with SLE compared to healthy controls; this has been observed in unsorted peripheral blood as well as lymphoid and myeloid cells.

  • In a twin study of discordant SLE twins, B cells had more robust methylation differences than other cell types, with hypermethylation of CpGs in TRAF5, CXCR5 and DDR1 as novel findings.

  • With larger sample sizes, new differentially methylated CpGs are being described beyond interferon-responsive genes.

  • These CpGs lie in enhancer and regulatory regions of the genome and implicate genes that are relevant to immune function, such as PDCD1 and TLR8.

  • EWAS with larger sample sizes and in diverse populations will likely continue to shed light into the role of DNA methylation in SLE.

Demethylation of IFN-responsive genes

One of the first EWAS in SLE compared methylation marks in naive CD4+ T cells between 36 patients with SLE and 36 controls, using the HM450 array. The investigators identified 86 CpGs that were differentially methylated, with <20% difference in methylation between patients and controls. Multiple IFN-regulated genes were hypomethylated in naive T cells in patients with SLE, without a corresponding increase in gene expression. The authors introduced the concept of naive T cells being 'epigenetically poised' to respond to type I IFN upon T cell activation. This builds on the concept that unmethylated CpGs reflect a permissive but not prescriptive transcription state, and gene activation requires additional epigenetic modifiers or transcription factors. Although this study has limitations including failure to control for potential confounders, the findings have been replicated by subsequent SLE EWAS described below.87

A second study examined CD19+, CD4+ and CD14+ cells in 49 patients with SLE and 58 controls using the HM450 array and applied more rigorous methods for quality control, including adjustment for ethnicity, age and gender.88 Several loci were differentially methylated with significance levels reaching the genome-wide threshold of p value <1E-08. Methylation differences as large as 40% were identified in adjusted analyses. Fifty genes were significantly differentially methylated in both T and B cells, and differential methylation of 19 genes was highly significant across all three cell types. Although there was significant overlap of demethylated genes across cell types, the authors could detect numerous cell-specific DNA methylation changes. Pathway analyses of the top 100 differentially methylated CpG sites for each cell type identified IFN signalling as the top pathway for CD4+, CD 19+ and CD14+cells. They also observed that the SLE-associated methylation changes in T cells were heterogeneous, concluding that these effects are likely mediated by different proportions of subpopulations of CD4+ T cells. To examine these possibilities, they sorted CD4+ T cells from an independent cohort of 26 patients with SLE and 18 controls into naive, memory and regulatory T cells. Only the IFN-related genes remained hypomethylated in all T cell subtypes, with other CpG sites restricted to specific subpopulations of T cells. This same phenomenon was also observed in neutrophils and low-density granulocytes in a study of 15 patients with SLE and 15 healthy matched controls.89

The most recently reported EWAS applied the HM450 array to peripheral blood samples from 548 Swedish patients with SLE and 587 matched healthy controls. With a discovery and replication study design, the largest methylation differences (>10%) were observed almost exclusively at IFN-regulated genes, with the effect being more prominent in active SLE.80 Only one EWAS in patients with SLE of non-European descent has been published, which focused on 12 Chinese patients with SLE and 12 controls. In this study, hypomethylation of MX1, IFI44L, NLRC5 and PLSCR1, all IFN-responsive genes were replicated in patients with SLE.

Although these studies have limitations, the consistent findings of widespread hypomethylation of IFN-responsive genes in cells from lymphoid and myeloid lineages are compelling. There are at least two potential hypotheses for the observed phenomena. One hypothesis is that these perturbations occurred in an early multipotent progenitor population and were subsequently propagated to lineages that are programmed to respond to IFN. This hypothesis suggests that the in utero environment might be of critical importance since most methylation marks are established during the embryonic period. Another hypothesis is that these studies are capturing the effects of high serum IFN on circulating peripheral immune cells, a hallmark of SLE pathogenesis.

More robust DNA methylation changes in B cells in SLE: the role of hypermethylation and gene silencing

Fifteen SLE twin pairs were recently studied, including six monozygotic, six same-sex dizygotic and three opposite-sex dizygotic twin pairs. Using the HM450 array, four cell types were profiled: CD4+ T cells, B cells, monocytes and granulocytes. Due to the small sample size, the authors focused on gene promoter and transcription start regions and defined a less stringent significance threshold. Among the 13 discordant SLE twin pairs, 35%–41% of all differentially methylated CpGs corresponded to IFN-responsive genes, of which most (but not all) overlapped in four cell types. Differential methylation was more widespread in B cells with predominantly hypermethylation, followed by monocytes, then granulocytes, and finally CD4+ T cells. The authors focused on 14 CpG probes of interest for pyrosequencing: 5 IFN-regulated genes, 3 non-IFN genes and 5 genes only differentially methylated in B cells. A CpG site in IFI44L had the largest overall difference observed in discordant SLE twins, with a median hypomethylation of 49.3% in monocytes and granulocytes. In addition, CpGs from TRAF5, CXCR5 and DDR1 were hypermethylated in B cells from discordant SLE twins.90 Of particular interest is TRAF5, which has been recently described as a negative regulator of TLR signalling in B cells.91

New findings with larger sample sizes: beyond IFN-responsive genes

In the latest Swedish SLE EWAS, 4034 differentially methylated sites were novel and annotated to 1638 unique genes that have not previously been associated with SLE. This likely reflects the larger sample size available for that study. Gene ontology enrichment analyses of these sites revealed enrichment in molecular functions related to regulatory function of the genome, such as DNA binding and transcription factor activity. Interestingly, these associated methylation marks were enriched in regions with a histone mark for active enhancers (H3K4me1) in B and T cells. The authors highlighted several CpG sites in genes found to be differentially methylated in lupus such as PDCD1 and AK2. PDCD1 is a confirmed SLE susceptibility locus that encodes the programmed cell death 1 protein that prevents autoimmunity by inhibiting activation of self-reactive lymphocytes. AK2 plays a role in apoptotic processes. When examining only females, 27 differentially methylated CpGs were found on the X chromosome, including CpGs in genes TLR8 and VSIG4, a negative regulator of T-cell proliferation. In males, three X chromosome CpGs in genes SH2D1A and SEPT6 were differentially methylated. This study indicates that there is still much to be learned from DNA methylation profiling not only with larger samples, but from diverse populations.

Genetic control of methylation marks

GWAS variants associated with risk of SLE have been found to influence methylation levels, suggesting a functional mechanism for these genetic variants. In an integrative analysis of genotype and DNA methylation data, 466 CpG sites of the 7245 differentially methylated sites identified in the Swedish SLE EWAS showed evidence of genetic control.92 Six SLE GWAS risk loci and the MHC class III region were associated with differentially methylated CpG sites: PTPRC (CD45), UHRF1BP1, IRF5, IRF7, IKZF3 and UBE2L3. This suggests that variants at SLE risk loci may in part exert their influence on SLE risk through alteration of DNA methylation levels at regulatory regions of target genes. An example was the UBE2L3 locus: the SLE-associated GWAS SNP is located downstream of the gene, but influences methylation for an SLE-associated CpG of the UBE2L3 promoter region. This study suggests that a small proportion of DNA methylation changes can be attributed to genetic variation.

Methylation changes in relationship to disease activity: transcription factors highlighted

There are only a few studies examining DNA methylation changes associated with disease activity defined by the SLE Disease Activity Index (SLEDAI) score. A cross-sectional study examining 12 pairs of active patients with SLE and healthy controls found that CpGs in the promoter regions of IL10 and IL1R2 were significantly hypomethylated in the samples of patients with SLE relative to the healthy controls.93 Another study of naive CD4+ T cells from 74 patients with SLE examined methylation changes associated with the SLEDAI score using the HM450 array. Using a less stringent threshold of significance, 4839 methylation sites were hypomethylated and 1568 methylation sites were hypermethylated with disease activity in lupus. Transcription factor binding site analysis revealed that hypermethylated sites associated with disease activity were enriched for binding sites of two repressive transcription factors, EZH2 and SUZ12. The authors also showed that reduced methylation in key cytokine genes such as IL4, IL5, IL13, IL12B and IL17F was associated with increased disease activity, suggesting an epigenetic shift towards a Th2 and possibly Th17 response. Increased SLEDAI scores were associated with hypomethylation of key transcription factors involved in T-cell differentiation such as PU.1, (RORγt) and BCL-6, which promote Th9, Th17 and follicular helper T cell responses, respectively.94 One interpretation is that in the setting of an SLE flare the circulating cytokine milieu could induce these subtle methylation changes, which could then programme the naive CD4 T cell response towards a certain fate once it encounters its cognate antigen. It is unclear whether this would be relevant to the flare itself or perhaps to future flares.

Based on the above finding, recent work explored the role of transcription factor EZH2 in SLE pathogenesis.95 EZH2 is an epigenetic regulator that mediates histone H3K27 trimethylation (H3K27me3) and modulates DNA methylation. The authors found that EZH2 was overexpressed in CD4+ T cells in SLE, resulting in hypomethylation of genes involved in leucocyte adhesion and migration, including F11R (which encodes junctional adhesion molecule A [JAM-A]), G11R, SELP and SELPLG. This was further tested in vitro, where overexpression of EZH2 was associated with increased adhesion of activated CD4+ T cells to endothelial cells, mediated by JAM-A. The results of this study identified a novel role of EZH2 in T cell adhesion in SLE, mediated by epigenetic remodelling.

Methylation changes in relationship to disease phenotype

Key points

  • DNA methylation changes have been associated with disease activity, antibody status and lupus nephritis, shedding light on genomic drivers of disease heterogeneity.

  • Transcription factors such as EZH2 have been shown to mediate methylation changes leading to increased T-cell adhesion in SLE flares.

  • Differentially methylated CpGs in TNK2, DUSP5, MAN1C1, PLEKHA1, IRF7, HIF3A, IFI44 and PRR4 have been associated with lupus nephritis.

  • Overall, hypomethylation of interferon-responsive genes is associated with higher disease activity, renal disease, as well as autoantibodies.

  • Replication of these findings in larger studies that capture active disease manifestations is needed to further define the role of DNA methylation in disease heterogeneity.

SLE is a complex disease, with disease manifestations that can range from mild to life-threatening. Although significant efforts have been made to understand the genomic causes of SLE risk, there are increasing efforts to better characterise the genomic drivers of disease heterogeneity and outcomes. DNA methylation has been studied in relation to several SLE phenotypes that we discuss below.

Lupus nephritis

Two EWAS have been published examining lupus nephritis using the HM450 chip. One study looked at naive CD4+ T cells in a multiethnic cohort of 28 SLE patients with nephritis, 28 SLE patients without nephritis and 56 healthy controls. Out of the 191 differentially methylated CpG sites unique to lupus patients with renal involvement, 64 were hypomethylated and 127 were hypermethylated compared with healthy matched controls. The two most hypomethylated genes were TNK2 and DUSP5, and the two most hypermethylated genes were MAN1C1 and PLEKHA1, with all CpGs having an overall methylation difference of 20%. IFN-responsive genes were hypomethylated to a greater extent in patients with renal involvement, and the authors highlighted the IFN regulatory factor family member gene IRF7, which was demethylated only in SLE patients with renal disease. TNK2 encodes a non-receptor tyrosine kinase that modulates many downstream effector molecules and is involved in cell trafficking, endocytosis, cell migration and tissue invasion. DUSP5 encodes a phosphatase that can dephosphorylate MAPK/ERK, a pathway implicated in the pathogenesis of SLE.96

The second study analysed 322 women of European descent with SLE, of whom 80 had a history of lupus nephritis (LN). The investigators analysed peripheral blood mononuclear cells and applied methods to adjust for population substructure and leucocyte cell proportions. The investigators identified differential methylation of 19 sites in 18 genomic regions that were associated with nephritis among patients with SLE (false discovery rate q<0.05), with the largest differences approaching 10%. Associations with hypomethylation in four sites in HIF3A, IFI44 and PRR4 were replicated when examining methylation data derived from CD4+ T cells collected from the EWAS described above. The authors highlighted HIF3A. Its gene product, HIF, can activate the transcription of >100 genes involved in the cell's response to hypoxia. Chronic hypoxia in the tubulointerstitium of the kidney has been proposed as a common pathway that leads to end-stage renal disease, and thus HIF plays a critical pathophysiological role in this process. Of the other CpG sites, only four (IFI44, IFIT1, DDX60 and RSAD2) are known to be induced by type 1 IFN.97 Limitations of both studies include the fact that LN was defined based on a history of meeting the American College of Rheumatology (ACR) renal criterion and not the presence of active lupus nephritis or histological features. Further, neither study fully accounted for medication use at the time of DNA collection.

Autoantibody production

In a study of 326 women of European descent with SLE, methylation differences were compared between patients with and without a history of anti-dsDNA antibody positivity. Peripheral blood mononuclear cells were profiled with the HM450 array. Using a discovery and replication study design, 16 CpG sites in 11 genes were found to be differentially methylated. Seven of these genes are either induced by type 1 IFN (IFIT1, IFI44L, MX1, RSAD2, OAS1, EIF2AK2) or regulate type 1 IFN signalling (NLRC5). These CpG sites were also associated with anti-Sm, anti-RNP and anti-SSA status, with anti-Sm and anti-RNP having a stronger association with hypomethylation than anti-dsDNA or anti-SSA. The PARP9/DTX3L locus was associated with all four autoantibodies studied, and mediates ubiquitination of histone H4 in response to DNA damage to perform DNA-damage repair.98 Hypomethylation of RABGAP1L was also associated with the four autoantibodies, and copy number variants of this gene have been associated with SLE risk.99 This gene encodes for a tyrosine kinase that regulates the activity of the mTOR complex 1 signalling pathway.100 However, another study of similar size examining methylation differences between patients according to the ACR immunologic criteria did not find statistically significant differentially methylated CpGs.92 It is unclear whether an analysis focusing specifically on ds-DNA antibody status or other specific autoantibodies would have revealed different results.

Further studies are needed to fully elucidate the contribution of DNA methylation to disease heterogeneity. This will likely aid in subtyping patients with SLE according to their most prominent methylation marks or patterns with the potential of informing therapeutic approaches and stratification in clinical trials. Limitations of the aforementioned studies include small sample sizes and different methodologic approaches. But perhaps the main limitation is the difficulty of capturing patients with active disease manifestations rather than historical disease manifestations.

Environmental exposures, DNA methylation and SLE

Many lifestyle factors, including diet, obesity, physical activity, tobacco smoking, alcohol consumption, environmental pollutants and psychological stress, have been shown to modify DNA methylation patterns.101–104 For example, environmental chemicals directly act on the function of DNMT and TET enzyme families. In addition, chemicals may affect the availability of SAM. Still, what drives gene-specific DNA methylation patterns remains unclear. A failure to maintain epigenetic homeostasis in the immune response due to factors including environmental influences leads to aberrant gene expression, contributing to immune dysfunction and, in some cases, the development of autoimmunity in genetically predisposed individuals. An exposure or multiple exposures could also contribute to the flare/relapse nature of SLE. Table 1 summarises the relationship between exposures and changes in methylation, as well as their independent associations with SLE.105 Although DNA methylation perturbations have been viewed mostly as a mechanism to explain how exposures can lead to disease, there is increasing evidence that DNA methylation marks or 'signatures' can be used as biomarkers of exposure. As most environmental and residential exposures are difficult to accurately measure, this area of research is of great value. For example, studies of adults have shown that methylation levels at specific smoking-associated loci reflect cigarette pack years and time since quitting.106

Future directions

Future EWAS and DNA methylation studies in SLE should be designed to examine not only relevant cell type(s) and tissues, such as skin and kidney, but also the specific context (active vs inactive disease). Larger samples sizes and longitudinal studies of DNA methylation changes in SLE may clarify some of the findings discussed in this review. Finally, SLE EWAS in different populations and different geographic regions or environments could also shed light on disease pathogenesis. Single-cell profiling has emerged as a potentially powerful way to ascertain cell subtype composition at higher resolution. Single-cell epigenomic methods have been developed107–109 and will likely be implemented in future SLE studies. Integrating these approaches with parallel methods, such as ATAC-seq, which provides a comprehensive map of chromatin accessibility where DNA methylation changes might be important, and RNA-seq that provides comprehensive transcriptomic data, will allow for correlation of epigenetic modifications with causative and downstream effects. These studies are currently costly and analytically challenging to undertake; however, the relevant technologies are developing rapidly and will likely be feasible for application to SLE in the near future. Most EWAS results will probably require experiments with animal or cellular models to confirm causality.

Therapeutic implications

Gene-editing technologies such as those that use the CRISPR/CAS9 system have now have been applied to the enzymatic activities of both DNMTs and TET enzymes, to actively methylate or demethylate specific CpG sites, paving the way for new therapeutic approaches. As our knowledge advances, further understanding of methyl-binding domain proteins as well as the allosteric regulators of DNMTs, coupled with a better understanding of specific methylation aberrations in SLE, could lead to potential targeting and correction of DNA methylation marks. As oxidative stress continues to be implicated as a mediator of DNA methylation perturbations, antioxidants such as N-acetyl cysteine have been suggested as safe interventions. Finally, if a link between exposures, DNA methylation and SLE disease risk is firmly established, preventive measures could be undertaken.

Acknowledgments

The authors thank Kimberly Taylor and Joanne Nititham for their review and comments on the manuscript.

References

  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
  18. 18.
  19. 19.
  20. 20.
  21. 21.
  22. 22.
  23. 23.
  24. 24.
  25. 25.
  26. 26.
  27. 27.
  28. 28.
  29. 29.
  30. 30.
  31. 31.
  32. 32.
  33. 33.
  34. 34.
  35. 35.
  36. 36.
  37. 37.
  38. 38.
  39. 39.
  40. 40.
  41. 41.
  42. 42.
  43. 43.
  44. 44.
  45. 45.
  46. 46.
  47. 47.
  48. 48.
  49. 49.
  50. 50.
  51. 51.
  52. 52.
  53. 53.
  54. 54.
  55. 55.
  56. 56.
  57. 57.
  58. 58.
  59. 59.
  60. 60.
  61. 61.
  62. 62.
  63. 63.
  64. 64.
  65. 65.
  66. 66.
  67. 67.
  68. 68.
  69. 69.
  70. 70.
  71. 71.
  72. 72.
  73. 73.
  74. 74.
  75. 75.
  76. 76.
  77. 77.
  78. 78.
  79. 79.
  80. 80.
  81. 81.
  82. 82.
  83. 83.
  84. 84.
  85. 85.
  86. 86.
  87. 87.
  88. 88.
  89. 89.
  90. 90.
  91. 91.
  92. 92.
  93. 93.
  94. 94.
  95. 95.
  96. 96.
  97. 97.
  98. 98.
  99. 99.
  100. 100.
  101. 101.
  102. 102.
  103. 103.
  104. 104.
  105. 105.
  106. 106.
  107. 107.
  108. 108.
  109. 109.
  110. 110.
  111. 111.
  112. 112.
  113. 113.
  114. 114.
  115. 115.
  116. 116.
  117. 117.
  118. 118.
  119. 119.
  120. 120.
  121. 121.
  122. 122.
  123. 123.
  124. 124.
  125. 125.
  126. 126.
  127. 127.
  128. 128.
  129. 129.
  130. 130.
  131. 131.
  132. 132.
  133. 133.
  134. 134.
  135. 135.
  136. 136.
  137. 137.
  138. 138.
  139. 139.
  140. 140.
  141. 141.
  142. 142.
  143. 143.
  144. 144.
  145. 145.
  146. 146.
  147. 147.
  148. 148.
  149. 149.
  150. 150.
  151. 151.
  152. 152.
  153. 153.
  154. 154.
  155. 155.
  156. 156.
  157. 157.
  158. 158.
  159. 159.
  160. 160.
  161. 161.
  162. 162.
  163. 163.
  164. 164.
  165. 165.
  166. 166.
  167. 167.
  168. 168.
  169. 169.
  170. 170.
  171. 171.
  172. 172.
  173. 173.
  174. 174.
  175. 175.
  176. 176.
  177. 177.
View Abstract

Footnotes

  • Contributors CML contributed to the main manuscript preparation. All authors contributed to the discussions and data interpretation that were used to form the review contained in this paper and provided input to the manuscript and approved its final version.

  • Funding CML received grant/research support from the Rheumatology Research Foundation Scientist Development Award 128849A. CML, SAC and LAC are also supported by NIH P30AR070155, and CML and LAC by the PHS Centers for Disease Control and Prevention DP005120, Alliance for Lupus Research and the Lupus Research Institute/Lupus Foundation of America.

  • Competing interests None declared.

  • Patient consent Not required.

  • Provenance and peer review Commissioned; internally peer reviewed.

  • Data statement All data presented in this article are already published and are not in possession of the authors.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.