Theory Article Volume 13, Issue 12 pp 15699—15749

Shifting epigenetic contexts influence regulatory variation and disease risk

Daniel Richard1, , Terence D. Capellini1,2, ,

  • 1 Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
  • 2 Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA

Received: April 8, 2021       Accepted: June 1, 2021       Published: June 16, 2021      

https://doi.org/10.18632/aging.203194
How to Cite

Copyright: © 2021 Richard and Capellini. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Epigenetic shifts are a hallmark of aging that impact transcriptional networks at regulatory level. These shifts may modify the effects of genetic regulatory variants during aging and contribute to disease pathomechanism. However, these shifts occur on the backdrop of epigenetic changes experienced throughout an individual’s development into adulthood; thus, the phenotypic, and ultimately fitness, effects of regulatory variants subject to developmental- versus aging-related epigenetic shifts may differ considerably. Natural selection therefore may act differently on variants depending on their changing epigenetic context, which we propose as a novel lens through which to consider regulatory sequence evolution and phenotypic effects. Here, we define genomic regions subjected to altered chromatin accessibility as tissues transition from their fetal to adult forms, and subsequently from early to late adulthood. Based on these epigenomic datasets, we examine patterns of evolutionary constraint and potential functional impacts of sequence variation (e.g., genetic disease risk associations). We find that while the signals observed with developmental epigenetic changes are consistent with stronger fitness consequences (i.e., negative selection pressures), they tend to have weaker effects on genetic risk associations for aging-related diseases. Conversely, we see stronger effects of variants with increased local accessibility in adult tissues, strongest in young adult when compared to old. We propose a model for how epigenetic status of a region may influence the effects of evolutionary relevant sequence variation, and suggest that such a perspective on gene regulatory networks may elucidate our understanding of aging biology.

Introduction

It has been suggested that the process of aging, and the concomitant manifestation of aging-related disease, is subject to both genetic and non-genetic factors impacting the regulatory networks (and subsequent behaviors) of aging cells [1, 2]. Nongenetic regulation of aging refers to epigenetics; chemical changes to the genome (e.g., at the chromatin level) that impact transcriptional programs [1], and which have been shown to accumulate with age [35]. The epigenetic state of chromatin can be broadly classified into activating or repressing modifications [6], referring, in part, to the increased/decreased accessibility of DNA to gene-regulatory machinery (e.g., transcription factors), and is established, maintained, and reset to switch between states [6]. Ample evidence suggests a causal relationship between changes in epigenetic state with age and hallmarks of aging in cells [3, 5]. Much recent work has focused on elucidating this relationship and how, ultimately, this contributes to age-related tissue decline and adult diseases [7].

As aging can also be considered a continuation of development [8], the epigenetic changes that are retained from early-life development may have important consequences for the adult epigenome – establishing the context within which epigenetic aging occurs [9]. A ‘fetal programming’ model has been suggested whereby early epigenetic plasticity in response to environmental and nutritional stimuli, while being adaptive and beneficial to fetal and early post-natal growth, has deleterious consequences later in life by contributing to adult disease risk [912]. This serves as one possible mechanism for the theorized consequences of selection favoring early-life development at the cost of late-life function [9, 13, 14]. Evidence supporting this model has been largely limited to DNA methylation [9, 1517], though replication of important loci findings has been difficult [18].

Epigenetic marks established during development can persist into adulthood [9], but they do so in the context of shifts in epigenetic states (see below) as tissues transition into their adult forms and functions. This transition process has been characterized with respect to DNA methylation, chromatin state, and gene expression across multiple tissues [1921]. Furthermore, these fetal to adult epigenetic shifts can be compounded by additional modifications through aging-associated epigenetic changes. Such epigenetically-regulated biological pathways involved in development, such as Wnt signalling, subsequently take on a role in tissue homeostasis in adults and are implicated in age-related tissue decline [22, 23] – suggesting a molecular link between processes mediating growth and aging [8, 24]. Thus, an important component of understanding the contributions of fetal programming as well as epigenetic aging to disease biology and risk is characterizing the epigenetic changes between fetal and adult tissues and how these might interact with subsequent aging-associated modifications.

While epigenomes vary between cell types [25, 26] and changes to epigenetic state with age may be expected to manifest differently, similar aging epigenetic have been repeatedly observed across tissues epigenetic shifts have been repeatedly observed across tissues [1, 5, 20, 27, 28]. Similarly, while age-related expression changes do exhibit tissue-specificity, there is evidence of potential synchronized changes across different sets of tissues [29], particularly for certain sets of genes and pathways [29, 30], and these changes may integrate at multiple different epigenetic levels [31]. Together, these findings suggest that a central trajectory for epigenetic state that reflects innate aging processes may exist [20], upon which extrinsic and cell-type effects are layered. Similarly, studies between fetal and adult tissues have found that, while epigenetic change is observed within individual tissues, there are also common trends of development (e.g., chromatin restriction, particularly at loci involved in early development) [21, 32, 33].

Importantly, the epigenetic state of genetic variants (e.g., single nucleotide polymorphisms) influences their regulatory effects, and subsequent association with heritable disease risk [34]. Thus, general epigenetic trends across early development and later aging may influence the phenotypic effects of regulatory mutations, albeit the extent to which this occurs is unknown. These phenotypes, if impacting an individual’s fitness, may be acted upon by natural selection. Evolutionary theories have been proposed which suggest that mutations contributing to aging pathologies are ‘allowed’ to accumulate due to the reduced fitness consequences of disease in older, post-reproductive individuals [35], or that beneficial mutations selected for early development become deleterious with age [3638]. Studying the added dimension of epigenetic context may provide a fresh perspective on theories of aging and selection. For example, deleterious mutations that change epigenetic context later in life may have different regulatory effects, and thus different fitness consequences, which alter the selective pressures acting on them.

In the present study, we seek to characterize common epigenetic trends between fetal and adult tissues, and subsequently examine the potential interaction of these developmental changes with later changes associated with epigenetic aging in adult tissues. We utilize our findings to propose a model for how evolutionary forces may have acted at these loci in humans, and how these forces in turn influence the distribution of mutations conferring heritable disease risk across a number of age-associated pathologies.

Results

Defining chromatin accessibility change, its genomic context, and loci subject to change

To investigate epigenetic changes occurring over the course of post-natal development and aging, we focused on chromatin accessibility, as it reflects the regulatory potential of a genetic locus and can be considered a property of the epigenome which integrates a number of possible epigenetic phenomena (e.g., regulatory factor binding, chromatin remodelling, etc.) [39]. We thus consider regions with altered chromatin accessibility as being indicative of epigenetic modifications or ‘shifts’ in context. As a read-out of accessibility we analyzed DNase-I hypersensitivity datasets acquired from primary human tissue, and obtained fetal/adult sample pairs for eight distinct primary tissues (spleen, lung, muscle, stomach, kidney, brain, heart, and skin; see Supplementary Table 1) [6, 40]. For each tissue at each time-point, called accessible or “open” chromatin regions were consolidated across biological replicates, then further aggregated by tissue and stage (see Supplementary Methods).

We first identified chromatin regions exhibiting recurrent accessibility changes between fetal and adult samples across tissue types (see Supplementary Methods, Figure 1A, 1B and Supplementary Figures 1, 2). We define regions as ‘adult-biased’ if they exhibit increased differential accessibility in adults compared to in fetuses. Conversely, we define regions as ‘fetal-biased’ if they exhibited decreased differential accessibility in adults compared to in fetuses. These ‘pan-tissue’ altered regions were compared to those defined in individual tissue comparisons, showing substantial but not complete overlap (Figure 1C) – suggesting that our approach captures cross-tissue signals of broader developmental changes and not tissue-specific effects. We next explored possible signals of epigenetic aging occurring in the context of fetal to adult changes, by further dividing our adult tissue samples into ‘younger’ and ‘older’ age categories (Methods, Supplementary Figure 3). We then assessed accessibility change between young and old occurring within the ‘adult-biased’ and ‘fetal-biased’ regions defined above (Figure 1D). This approach identified regions for which young-old differences mirrored fetal-adult differences, as well as regions where aging changes appear to counter developmental patterns. We observed a tendency for shared directionality in gains or losses of accessibility; i.e., adult-biased regions tended to also have increased accessibility in older adult samples, while regions losing accessibility in adult samples (i.e., are fetal-biased) continued this trend in older samples (chi-sq test, p < 0.05, Supplementary Table 1). In this text, we therefore refer to regions with greater accessibility in older samples as ‘old-biased’ and regions with lower accessibility in older samples as ‘young-biased’. As described in the Supplementary Information, we considered histone mark and DNA methylation changes, key features of developmental [21, 26, 33] and aging epigenetic changes [3, 20] as additional means to validate the behavior of these region sets (see also Supplementary Table 1).

Cross-tissue accessibility. (A) Representative heatmap of Dnase-I accessibility for regions significantly different between fetal/adult tissues. Color scale indicates magnitude of chromatin accessibility signal (see Supplementary Methods). Horizontal lines denote defined fetal-biased (left) and adult-biased regions. (B) Genomic distribution of regions changing accessibility in fetal and adult comparison. Red/blue: density of defined differentially-accessible regions. Solid black line: relative proportion of regions more accessible in adult (top) or fetal (bottom) tissues. First five autosomes shown (see Supplementary Figure 2). (C) The proportion of defined altered-accessibility regions between adult and fetal samples for indicated tissues which are unique to that tissue, or captured in the pan-tissue set. (D) Overlaps between regions defined as differentially-accessible in fetal/adult comparison and those defined in the young/old-age comparison. Directionality in accessibility change is significantly shared (see Supplementary Table 1). Related content can be found in Supplementary Information, Supplementary Figures 1–6 and Supplementary Tables 1, 2.

Figure 1. Cross-tissue accessibility. (A) Representative heatmap of Dnase-I accessibility for regions significantly different between fetal/adult tissues. Color scale indicates magnitude of chromatin accessibility signal (see Supplementary Methods). Horizontal lines denote defined fetal-biased (left) and adult-biased regions. (B) Genomic distribution of regions changing accessibility in fetal and adult comparison. Red/blue: density of defined differentially-accessible regions. Solid black line: relative proportion of regions more accessible in adult (top) or fetal (bottom) tissues. First five autosomes shown (see Supplementary Figure 2). (C) The proportion of defined altered-accessibility regions between adult and fetal samples for indicated tissues which are unique to that tissue, or captured in the pan-tissue set. (D) Overlaps between regions defined as differentially-accessible in fetal/adult comparison and those defined in the young/old-age comparison. Directionality in accessibility change is significantly shared (see Supplementary Table 1). Related content can be found in Supplementary Information, Supplementary Figures 16 and Supplementary Tables 1, 2.

To gain insights into the roles these region sets have in transcriptional regulation, we next characterized the genomic distribution of our adult-biased and fetal-biased region sets using adult tissue epigenetic states [26] (Supplementary Methods). We found that our region sets preferentially fell within different epigenetic states (e.g., enhancers, heterochromatin) depending on the nature of their accessibility shift (e.g., adult-biased, old-biased), suggesting that these shifts may be associated with altered regulatory biology at different loci, and that the interaction between fetal and adult shifts as well as young and old-age shifts heavily favors developmental changes to accessibility (see Supplementary Figures 4, 5, Supplementary Information).

As accessible chromatin regions often serve to regulate gene expression [39] by acting as cis-regulatory sequences, we next sought to identify the potential role of our regions in regulatory changes occurring during development and aging. We did this by considering promoter-level accessibility (see Methods, Supplementary Figure 6), promoter-capture (Hi-C) interactions [41], and regulatory-domain annotations [42] for genes which may be subject to control by these regulatory regions. We found a general pattern for enrichment of immune-related gene sets with the adult-biased set, while development-related (e.g., cellular proliferation) terms were enriched with fetal-biased regions, patterns echoed when considering old-biased and young-biased region sets, respectively (see Supplementary Information, Supplementary Table 2).

We next incorporated tissue expression datasets looking for general gene expression trends between fetal and adult tissues (see Methods). We observe similar enrichment terms as well as significant overlaps with gene sets defined on the regulatory level (see Supplementary Information, Supplementary Table 2). Similarly, we utilized GTEx (gene tissue expression) datasets [43] to look for corresponding shifts in gene expression with age, similar to previous work [44] (see Methods)(Supplementary Table 2). While we did not observe significant overlaps between these aging-expression gene sets and those defined using aging-accessibility changes, we did see significant overlaps with the fetal/adult expression comparisons, along with enriched gene sets with relevance to aging biology (see Supplementary Information, Supplementary Table 2).

As development and aging are phenomena subjected to the actions of random and directed evolutionary forces [13, 45, 46], we next develop expectations for how these epigenetically-altered regions may have evolved over time.

Sequence evolution of epigenetically-altered regions

Development and aging are simultaneously very ancient and variable [45, 47] biological processes and are particularly divergent in key species [48]. Thus, it may be the case that both development and age-associated regions have been shaped by a mix of evolutionary forces acting to either maintain or modify genetic sequences (e.g., regulatory enhancers). To address this possibility, we examined patterns of sequence conservation in our epigenetic datasets using phyloP [49], a measure of nucleotide conservation and/or acceleration across species (Figure 2A). Across primates, we observed that fetal-biased regions tended to have greater sequence conservation than adult-biased regions, and furthermore that both sets differed significantly from those regions not defined as developmentally-altered (Supplementary Table 3). These patterns were similarly observed when comparing age-associated regions (Supplementary Table 3). These findings of conservation differences between sets suggests that the greater regulatory and developmental role associated with fetal-biased and young-biased regions (e.g., enriched for enhancer elements) exerts functional sequence constraint while adult-biased and old-biased regions (e.g., enriched for repressed segments) are less conserved across species.

Sequence evolution of age-altered regions. (A) Distribution of average per-region sequence conservation (phyloP20ways) in differentially-accessible regions (see color legend). (B) Overlaps of developmental and age-related region sets and human acceleration regions. Overlaps shown relative to set size (per bp of sequence) for background (gray) and target (colored) sets; labels correspond to Supplementary Table 3. (C) Intersections of common human variants (per bp of sequence) for target (colored) and randomized (grey) region sets; labels correspond to results in Supplementary Table 3. (D) Diagram summarizing results of evolutionary sequence analyses. Accessible regions, here diagrammed as an upstream enhancer element (thick blue box), which either gain or lose accessibility over development (left) or ageing (right) exhibit different patterns of evolutionary sequence behavior. Created with BioRender.com. See also Supplementary Figures 7, 8.

Figure 2. Sequence evolution of age-altered regions. (A) Distribution of average per-region sequence conservation (phyloP20ways) in differentially-accessible regions (see color legend). (B) Overlaps of developmental and age-related region sets and human acceleration regions. Overlaps shown relative to set size (per bp of sequence) for background (gray) and target (colored) sets; labels correspond to Supplementary Table 3. (C) Intersections of common human variants (per bp of sequence) for target (colored) and randomized (grey) region sets; labels correspond to results in Supplementary Table 3. (D) Diagram summarizing results of evolutionary sequence analyses. Accessible regions, here diagrammed as an upstream enhancer element (thick blue box), which either gain or lose accessibility over development (left) or ageing (right) exhibit different patterns of evolutionary sequence behavior. Created with BioRender.com. See also Supplementary Figures 7, 8.

Within this broader context of species diversity and evolution, humans and chimpanzees display marked and obvious differences in development and longevity [45, 50]. This relatively recent divergence is thought to be driven largely by non-coding changes to cis-regulation [51]. We therefore next looked for evidence of regulatory modifications to biological processes that may contribute to these human/chimp differences. To do this, we intersected our regions sets with sequences demonstrating significant divergence along the human lineage (e.g., ‘human accelerated regions’ [52]). We found that fetal-biased regions were enriched for signals of acceleration while adult-biased regions were depleted (Figure 2B). Similar patterns were seen for young versus old-biased regions (Figure 2B and Supplementary Table 3). We found a number of genes involved in development and aging processes with putative nearby regulatory elements intersecting accelerated regions, two examples of which are shown in Supplementary Figure 7 (see also Supplementary Information).

To gauge the evolutionary interaction between sequence constraint across species and within-species variation, we next assessed modern-day human diversity within region sets (Methods). We found that genetic diversity in fetal-biased regions was markedly reduced (i.e., constrained) compared to genomic backgrounds, as well as to intronic and promoter-TSS elements (Figure 2C and Supplementary Table 3). Conversely, adult-biased regions were enriched for sequence diversity, at the level of annotated repeat elements. These patterns were accentuated when examining young- and old-biased region datasets, and comparing region sets directly (Supplementary Table 3, Supplementary Information). Importantly, when we considered sequence diversity within other ape species, we also observed a decrease in fetal-biased, and young-biased sequence diversity (relative to adult-biased and old-biased, respectively). This latter finding further suggests that fetal-biased regions are associated with conserved regulatory function that discourages mutation and/or drift (Supplementary Figure 8 and Supplementary Table 3).

Overall, natural selection appears to have acted upon regions subject to accessibility shifts in development and aging, modifying some loci (i.e., accelerated divergence indicative of ancient positive selection) while protecting others (i.e., reduced variation indicative of more recent negative selection) (Figure 2D).

Importantly, selective forces, both positive and negative, manifest phenotypically through the effects of random genetic mutations, which act to modify gene regulatory networks to varying degrees. We next examine this relationship.

Epigenetic shifts in age-associated trait associations

In our above analyses, the observed signals of consistent inter- and intra-species conservation in regions most associated with early development (i.e., the fetal-biased set) follows with the expectation that variants negatively impacting early-life would be subject to stronger purifying selection [9, 53, 54]. Conversely, variants with later-manifesting effects, i.e., those within regions increasing in local accessibility with age (i.e., the adult-biased set), would be subjected to substantially weaker selection and may therefore be ‘tolerated’ [55, 56]. To test expectations of the possible deleterious effects of variants subject to accessibility change over development and aging, we utilized GWAS datasets available from the UK Biobank [57]. We extracted summary-statistics for a collection of 127 complex diseases/pathologies falling into aging-related categories [58], including metabolic disorders, cancers, cardiovascular disease, and musculoskeletal impairment (Supplementary Table 4, Methods). We similarly analyzed a set of developmental trait GWAS to act as a control for our fetal/adult accessibility comparisons, and finally considered longevity GWAS data (Supplementary Table 4, see Supplementary Information).

It has been suggested that the highly polygenic nature of complex traits and diseases reflects cumulative regulatory modification to a ‘core’ set of genes which functions most proximately in relevant biology [59]. Across age-associated diseases, this may reflect general aging processes, and regulatory variants impacting these would be expected to contribute to heritable aging-disease risk broadly. Given this rationale, we first considered the behavior of individual SNPs nearby accessibility-altered regions across diseases, and subsequently these behaviors at the gene-locus level (below). We aggregated per-SNP associations across diseases as a singular cross-set metric of risk association (Supplementary Methods). We confirmed that ClinVar variants, variants for which possible clinical significance have been described [60], tended to be more risk-associated by this metric, as we would expect (Supplementary Table 4). Additionally, across all diseases we individually performed enrichment tests for strongly-associated variants nearby our region sets, which corroborated the cross-disease results described below (Figure 3A, see Supplementary Information).

Epigenetic context and heritable disease associations. (A) Adjusted p-values for hypergeometric tests showing enrichment/depletion (positive/negative) for GWAS variants nearby regions increasing (blue) or decreasing (red) accessibility across adult tissues for a number of age-associated diseases (see Supplementary Table 4). (B) Cross-set disease associations, and additional per-SNP metrics, for variants nearby developmental and age-altered region sets along with unaltered DNase sites and variants not nearby accessible regions. See Supplementary Table 4 and Methods. (C) Model for the effects of epigenetic context on disease association and sequence evolution. (Top): Example enhancer elements more accessible in fetal, adult, and old-adult tissues (left-right) which have been modified by mutations. (Left): Deleterious mutations disrupting regulation in development stand to have the biggest impact on fitness, while having a moderate effect on tissue homeostasis. (Middle): Mutations disrupting regulation in young-adult tissues have a moderate impact on fitness, but a larger effect on tissue homeostasis (particularly over adulthood). (Right): Mutations disrupting regulation in old-adult tissues have weak impacts on fitness, and a weak effect on tissue homeostasis (which has already deteriorated with age). (Bottom): Illustrating patterns of accessibility, disease association, sequence constraint and variant allele age for these sets of regions changing accessibility over time.

Figure 3. Epigenetic context and heritable disease associations. (A) Adjusted p-values for hypergeometric tests showing enrichment/depletion (positive/negative) for GWAS variants nearby regions increasing (blue) or decreasing (red) accessibility across adult tissues for a number of age-associated diseases (see Supplementary Table 4). (B) Cross-set disease associations, and additional per-SNP metrics, for variants nearby developmental and age-altered region sets along with unaltered DNase sites and variants not nearby accessible regions. See Supplementary Table 4 and Methods. (C) Model for the effects of epigenetic context on disease association and sequence evolution. (Top): Example enhancer elements more accessible in fetal, adult, and old-adult tissues (left-right) which have been modified by mutations. (Left): Deleterious mutations disrupting regulation in development stand to have the biggest impact on fitness, while having a moderate effect on tissue homeostasis. (Middle): Mutations disrupting regulation in young-adult tissues have a moderate impact on fitness, but a larger effect on tissue homeostasis (particularly over adulthood). (Right): Mutations disrupting regulation in old-adult tissues have weak impacts on fitness, and a weak effect on tissue homeostasis (which has already deteriorated with age). (Bottom): Illustrating patterns of accessibility, disease association, sequence constraint and variant allele age for these sets of regions changing accessibility over time.

As variants in accessible non-coding regions likely have regulatory impacts generally [39], we confirmed that variants within or nearby regions not classified as strictly developmental nor age-altered tended to have greater association than non-accessible variants. However, this control set had significantly less association than variants nearby sets of both developmentally- and age-altered regions (i.e., fetal/adult-biased, and young/old-biased regions) (Supplementary Table 4).

Considering first developmental change, we found that variants in regions gaining nearby accessibility in adults (i.e., adult-biased) have greater association with disease than those in regions losing nearby accessibility (i.e., fetal-biased) (Figure 3B). Unexpectedly, when looking at aging accessibility changes, we observed that variants in regions gaining nearby accessibility in older-samples actually have lower cross-disease associations than those in regions becoming more accessible in younger samples (Figure 3B and Supplementary Table 4). Furthermore, we found that for intersections of development and age-altered regions that the increased disease association with adult-biased regions was abrogated when intersected with old-biased regions. The magnitude of region-set differences in disease associations was also greater in the young/old-biased comparisons (see Figure 3B, Supplementary Information).

Taken together, these results would suggest that those variants most accessible in younger adults stand to have the greatest impact (in terms of association p-value) on late-life disease risk – a finding that may have important implications for understanding the development of disease over adulthood (see Figure 3C, Discussion).

We next considered these disease-association patterns at the gene-locus level. Briefly, for a given disease we assign the most significant nearby SNP to all genes, and subsequently rank genes based on their assigned GWAS signal. Gene ranks are then aggregated across diseases, looking for genes consistently ranked higher across sets (Supplementary Methods). To confirm the behavior of this gene-ranking method, we compared the cross-set ranking of genes associated with homeostatic processes (based on GO annotations) to randomized gene sets, finding that these gene loci tend to harbor stronger genetic variants across a larger number of diseases than expected (compared to randomized sets), but not so for genes involved in reproductive organ development (see Supplementary Information, Supplementary Table 4).

We applied this method to the sets of development and age-associated genes we defined above and asked whether they tended to have more or less cross-disease GWAS signals than expected. Our sets defined by accessibility-region contacts supported our earlier findings on strong GWAS signals nearby development and age-altered regions – namely, loci of both adult-biased and young-biased gene sets were enriched for strong GWAS signals across diseases, while fetal-biased and old-biased gene sets were associated with relatively weaker GWAS signals (Supplementary Table 4). Sets defined by RNA-seq data showed more of a mix of enriched/depleted GWAS signals across developmental and age comparisons, reflecting the possibility that a mixture of genes increasing and decreasing expression over time may additively contribute to aging disease biology (see Supplementary Information).

Given our results, found at both genome-wide and gene-locus set levels, we finally sought to take an unbiased approach to identify relevant ‘core’ aging-related genes solely on the basis of aggregate GWAS signals (see Supplementary Methods, Supplementary Information). Overall, we had limited success in defining a set of genes with clear, pan-tissue biological relevance, suggesting that, if such a core does exist, that it may be too broad, or the per-locus signals too moderate, for our method to robustly detect. However, since our results suggest the importance of altered epigenetics in modifying GWAS associations, we performed a similar gene-prioritization analysis using variants occurring nearby altered-accessibility regions (Supplementary Methods). This yielded markedly different enrichments for terms relating to immune processes and gene regulation (see Supplementary Information, Supplementary Table 4). One particular set of genes, involved in histone deacetylation, has repeatedly been linked to aging and epigenetics [61, 62] and was identified using our set of young-age regions (see Figure 4). We explore this set in more detail below.

Altered-accessibility regions identify relevant aging biology. (A) (Left) Distribution of cross-disease ranks for all protein-coding genes, when ranking by local variants independent of accessibility data (see Supplementary Methods). Red lines indicate genes within the ‘histone deacetylation’ (HDAC) GO term; top ranked genes (by geometric mean) are indicated. (Right) Similar distribution of cross-disease ranks, ranking genes with variants nearby young-biased regions. Red lines indicate top HDAC genes by rank. (B) (Left) Fold-change of normal cumulative distribution function (CDF) p-values of variants within HDAC gene loci associated with different region sets, relative to CDF test performed using all variants, for cross-disease Z-score metric (see Supplementary Methods). (Right) Similar plot for per-variant LINSIGHT scores. See Supplementary Table 4. (C) Variants directly intersecting young-biased regions which interact with the SIRT6 promoter. (Top) Visualized promoter-capture data [41] across multiple cell-types. (Bottom) Tracks indicating variants which overlap young-biased regions within the SIRT6 gene locus.

Figure 4. Altered-accessibility regions identify relevant aging biology. (A) (Left) Distribution of cross-disease ranks for all protein-coding genes, when ranking by local variants independent of accessibility data (see Supplementary Methods). Red lines indicate genes within the ‘histone deacetylation’ (HDAC) GO term; top ranked genes (by geometric mean) are indicated. (Right) Similar distribution of cross-disease ranks, ranking genes with variants nearby young-biased regions. Red lines indicate top HDAC genes by rank. (B) (Left) Fold-change of normal cumulative distribution function (CDF) p-values of variants within HDAC gene loci associated with different region sets, relative to CDF test performed using all variants, for cross-disease Z-score metric (see Supplementary Methods). (Right) Similar plot for per-variant LINSIGHT scores. See Supplementary Table 4. (C) Variants directly intersecting young-biased regions which interact with the SIRT6 promoter. (Top) Visualized promoter-capture data [41] across multiple cell-types. (Bottom) Tracks indicating variants which overlap young-biased regions within the SIRT6 gene locus.

Sequence evolution and disease association

Our previous analyses found that patterns of inter- and intra-species sequence conservation depended on epigenetic status (i.e., degree of accessibility) of regulatory elements. Subsequently, we found that the risk association of variants across a number of age-associated diseases also varied based on accessibility change in the vicinity of the variant. Much work has been done on understanding the relationship between sequence conservation and disease risk [6366]. For example, a transcription factor (TF) binding site may be subjected to negative selection to conserve its sequence and hence function. Mutations that occur within this site would more likely impact cis-regulatory biology, and therefore manifest an association with disease. If this disease impacts fitness, then over time, such mutations will be eliminated, so that genetically ‘older’ mutations are less prevalent [65, 67]. Given our interest in the evolution of development and aging processes, we wanted to investigate the role that epigenetics has on this disease-evolution relationship - and whether this holds with our data. By comparing the cross-trait associations of variants falling within and outside primate-conserved sequences (phastCons) [68] (Supplementary Methods), we found that variants within conserved sequences tend to have greater disease associations, along with younger estimated allele age (Supplementary Table 4). These patterns also hold true for phastCons sequences within age- and developmentally-altered regions (Supplementary Table 4), and follow with previously observed enrichments for GWAS associations of conserved, younger (allele age) variants [65, 67, 69].

We next considered primate conservation, estimated allele age, and cross-set association of variants, looking for the effects of nearby accessibility change on these metrics (Supplementary Methods). As an additional metric for predicted fitness consequences, particularly of non-coding variants, we also included per-bp LINSIGHT scores [70], which integrates data on chromatin accessibility, TF binding motifs, and comparative genomics.

First, we found that variants falling near fetal-biased regions were more conserved, younger, and had lower cross-set association, while variants near adult-biased regions behaved oppositely (Figure 3B and Supplementary Table 4). We also found that the predicted functional consequences associated with fetal-biased regions were greater than with adult-biased regions, despite the lower cross-set association with aging-associated diseases (Figure 3C, see Discussion). Variants falling near old-biased regions were less conserved, had older allele age, and had lower cross-set association than their young-biased counterparts (Supplementary Table 4). These old-biased regions were also associated with the lowest predicted functional consequences (in aggregate) of any set, while the set of young-biased regions had the second-highest average. To compare these behaviors with variants of annotated clinical significance we independently examined ClinVar variants, which while demonstrating increased cross-set association, tended to also be more conserved, younger, and have stronger predicted fitness consequences (Supplementary Table 4).

Collectively, our results indicate that variants stratified by nearby accessibility change violate the expected relationship between sequence conservation and disease association (behaviors instead observed for ClinVar variants). Namely, those regions exhibiting the highest sequence constraint (Figure 3C, left) do not also exhibit the strongest aging-disease associations, nor do those regions exhibiting the weakest constraint, as might be expected in a ‘mutation accumulation’ theory of aging [35]. However, when considering predicted functional consequences (LINSIGHT), which are not defined based on aging demographic data, this pattern is reversed (i.e., the most constrained set, fetal-biased regions, had the strongest predicted consequences despite weaker aging-disease associations). This unexpected behavior may have important implications for evolutionary models of late-onset complex disease genetics. Based on our results, we propose such a model suggesting the outsized impact of regulatory sequences active in early adulthood on genetic contributions to aging-associated disease risk (see Figure 3C, Discussion).

Our proposed model suggests that focusing on disease risk loci containing such putative regulatory sequences (i.e., young-biased regions), should implicate sets of genes involved in aging biological processes. Our gene-level GWAS analyses using young-biased regions identified genes involved with histone deacetylation as being more consistently associated with aging-disease GWAS signal, a pattern which was diminished when considering gene-level associations in the absence of this epigenetic information (Figure 4A), and when using other region sets (e.g., old-biased regions) (Figure 4B). Histone deacetylation enzymes have known impacts on epigenetic aging biology [7, 61, 62] and aging diseases [71]. Within our young-biased enriched gene set we identified SIRT6 and SIRT7 as having multiple variants falling nearby young-biased regions which contacted their respective gene promoters (Figure 4C). Both these enzymes have been associated with maintaining heterochromatin during aging [7274]; SIRT7 decreases expression with age, and antagonizes hMSC epigenetic aging [73], while SIRT6 loss manifests an aging-like state [75]. It may be possible that decreased accessibility of regulatory regions controlling the expression of these genes are involved in decreases in sirtuin expression and heterochromatin [72].

Discussion

In this study we sought to describe how changing epigenetic context, defined here as changes to chromatin accessibility over both development and subsequent aging, influences the behavior of evolutionary forces and genetic disease risk at the sequence level. To address this question, we defined genomic regions whose chromatin accessibility consistently shift over the course of development and aging. Our approach to identify epigenetic shifts relies on the observation that chromatin accessibility broadly reflects the regulatory capacity at a given locus [39], though we acknowledge that more subtle epigenetic changes (e.g., post-translational modifications, CpG methylation) may not be well captured by this accessibility-based definition of epigenetic context.

We performed several analyses suggesting that these regions reflect developmental and aging signatures from previous literature, including genomic features (e.g., repeat elements, CpG sites), epigenetic states (e.g., euchromatin/heterochromatin) and histone mark data. Gene sets associated with developmentally-altered regions were enriched for immune system function and cellular proliferation terms, echoing an earlier study of fetal to adult epigenetic changes [21]. Furthermore, we found correspondence between these gene sets and genes whose RNA-seq expression generally shifted between fetal and adult tissues. Incorporating an independent RNA-seq dataset of adult age-stratified tissues we did not observe the same level of correspondence with age-altered regions – it is possible that some aspects of epigenetic aging (e.g., global de-repression [3, 5, 76]) may account for this disconnect, whereby local accessibility changes are less-directly linked to local expression changes. Interestingly, comparing patterns of expression change in our fetal-adult and young-old comparisons, we saw similar gene-set enrichments (i.e., cell-cycling biased towards fetal and younger-age samples, immune responses biasing towards adult and older-age samples), suggesting that the continuation of epigenetic shifts we observed across development and aging (Figure 1D) may be mirrored at the expression level.

Given that epigenetic state impacts the potential regulatory effects of deleterious variants [34], we looked to see if local development and/or ageing changes to epigenetic context impacts the strength of association between variants and aging-associated diseases. While it is possible that a number of these aging diseases share genetic correlations [77], that these variants are associated with multiple age-associated diseases is also a key expectation for the functional relevance of age-altered regions. In other words, it is the change in epigenetic context that modifies the regulatory potential of these variants, and this has direct impacts on individual associations with multiple diseases.

According to the fetal programming model, we would expect that regulatory regions most active during early development, both dictating developmental processes as well as responding to environmental perturbations [9], would have an out-sized impact on the manifestation of adult-onset diseases. This would be evident in the increased associations of nearby variants with heritable risk for these diseases. However, we found that such fetal-biased regions were not those having the greatest impact with regards to aging disease associations, despite having greater predicted fitness consequences – finding instead that fetal-biased regions are depleted for aging disease GWAS signals, and associated more with developmental diseases/traits (Supplementary Table 4). A recent study of fetal chromatin accessibility at the single-cell level similarly found genetic associations with developmental traits (e.g., height) using regions accessible in different cell-types [78]. We suggest that the ‘fetal programming’ of epigenetic status during early development, genome-wide, has a more moderate impact on aging disease biology than has been previously suggested – though we note that certain developmental loci (e.g., Wnt genes) can and do play a role in aging [8, 24].

According to a model wherein epigenetic aging influences the phenotypic effects of regulatory mutations, we would expect that mutations with increased local accessibility in adult tissues, particularly aged adult tissues, would have stronger impacts on aging disease biology in these tissues (reflected in increased association with heritable disease risk). Here, we found that variants gaining nearby accessibility (i.e., adult-biased regions) have stronger associations across a number of aging-related diseases including several kinds of neoplasms, arthritis, and atherosclerosis. This finding suggests that the regulatory effects of deleterious variants may become ‘uncovered’ as tissues mature and follows with proposed links between development and ageing processes [8, 20, 24]. However, we also found that regions most accessible later in life, when these diseases manifest, are actually associated with weaker GWAS variants. This young/old bias in aging-disease GWAS signal was far stronger than the fetal/adult bias (i.e., the young-biased set was more strongly enriched than adult-biased, and vice-versa). Taken together, these results suggest that (1) accessibility changes in aging tissues have a greater effect on aging tissue diseases, but (2) that variants more accessible earlier in adult life play a bigger regulatory role in contributing to disease risk than do those which gain accessibility later on. Disruptions to regulation in younger tissues may act to set tissues down a path of increasing dysfunction and decline, especially if deleterious variants are able to (cumulatively) contribute to dysfunction as they gradually lose activity with age. In other words, by the time an individual reaches old-age their tissues have had sufficient time to accumulate these dysfunctional effects, ‘setting the stage’ for disease manifestation. Variants more active in old-age, by contrast, have less of an impact on disease manifestation, as their regulatory effects have had less time to integrate. It may be that the time at which disease prevention and/or intervention would be most effective is, perhaps non-intuitively, early in adult life rather than once phenotypes manifest.

We cannot rule out the effects of cell-type specific epigenetic (accessibility) shifts influencing the phenotypic impacts of regulatory sequence modifications on aging-associated disease risk. Similarly, it has been suggested that a facet of aging is ‘epigenetic drift’ – the accumulation of epigenomic aberrations that contribute to mis-regulation of gene regulatory networks, a component of which is tissue-specific [79, 80]. However, the pan-signals which we do observe with respect to evolutionary forces, disease associations, and sets of implicated gene loci indicates the relevance of our approach in understanding the broader components of development and aging-accessibility changes, which may be complemented with future research focusing on those more tissue-specific components.

Regulation of general aging-related mechanisms, as well as increases in heritable disease risk, represent phenotypes upon which evolutionary forces may act to modify aging and mortality rates. We found that young-biased regions were enriched for signals of positive selection, a number of which implicated relevant aging-associated genes, and exhibited increased phylogenetic and within-human sequence constraint. Given that these behaviors are intermediate between those observed with regions more accessible in fetal and older-adult tissues, we suggest the following model (Figure 3D).

Regulatory sequences most active during development are subjected to strong negative selection, both to maintain human-derived functional sequences and discourage subsequent modifications, as dysregulation of development would have the largest fitness consequences. Similarly, sequences most active during early adulthood are subjected to negative selection to maintain proper tissue maintenance and discourage disease. However, the strength of this selection is reduced, as we expect fitness benefits/costs to diminish with age as individuals reproduce less frequently [5355]. Thus, despite the fact that mutations within or nearby these functional sequences stand to have the greatest impact on disease risk (as noted above) they are less efficiently purged, and are allowed to accumulate over generations [35]. Finally, sequences most active in older adults are under relaxed selective pressures and allowed to drift – mutations are permitted and retained, particularly due to the reduced associations that these mutations have with heritable disease risk. Overall, this model suggests that considering the changing epigenetic context of disease-associated variants may help in prioritizing GWAS signals to loci involved in disease biology (e.g., as we saw for histone deacetylases) and, ultimately, the aging processes driving tissue decline and eventual manifestation of aging-associated disease.

Materials and Methods

Accessibility datasets

DNase-I hypersensitivity datasets were obtained from ENCODE [40] for eight different fetal and adult tissues (adrenal gland, brain, heart, lung, muscle, skin, spleen and stomach) (see Supplementary Table 1 for accessions and metadata). Raw data was processed as described in the Supplementary Methods, with called open-chromatin regions consolidated across replicates and tissues in order to define a final set of reproducible regions. This aggregated set of peaks was then used to assess both pan-tissue, as well as per-tissue, accessibility changes between fetal and adult datasets using the limma package (version 3.46) in R [81, 82]. Differentially-accessible regions were defined using a Benjamini-Hochberg FDR [83] cutoff of < 0.05.

Adult DNase samples were further stratified in order to define age-altered chromatin regions, splitting samples used in the above analysis into those individuals younger than 50 (‘young-adult’) and those older than 50 (‘old-adult’), this age representing a roughly equal split of sample numbers. Not all tissues used in the initial fetal/adult comparison were represented in these age-stratified sets – thus we restricted the tissue comparisons to brain, heart, lung, muscle and stomach tissues. A similar computational method as that used in defining fetal- and adult-biased regions was applied here (see Supplementary Methods). We compared accessibility changes between young and old-adult samples within those regions exhibiting fetal/adult biases, defining young-biased and old-biased regions (again, using an FDR cutoff of < 0.05).

Promoter accessibility change

All hg19 Refseq gene TSS were obtained from the UCSC Genome Browser [84] and padded 1kb up/downstream to define promoter regions. For each promoter region, DNase read coverage was compared between adult and fetal samples, with resulting data processed using a similar differential-accessibility method as that used above (see Supplementary Methods). Significantly differentially-accessible promoters were defined using an FDR cut-off of 0.05. As an additional, more stringent analysis, we also defined differentially-accessible promoters based on intersections with the above defined region sets (see Supplementary Methods).

Promoter capture datasets: Promoter-capture data was obtained from Jung et al., 2019 [41]; this dataset was generated from promoter-capture assays across a number of different tissues and cell-types. Given our pan-tissue approach, we considered all data (with the exception of OV2, as we excluded sex-specific tissues from all previous obtained datasets). To generate a set of genomic regions which show evidence of contacting gene promoters, we filtered interacting regions to those which contacted their respective promoters in at least two different tissues/cell-types. This moderate filter was used to exclude those regions for which interactions appear to be exclusive to one dataset, while allowing for regions that do not show such exclusivity.

Gene-set enrichment analyses: Gene sets generated in our analyses were tested for enrichment in different GO Biological Process terms using the ‘enrichGO’ function from the clusterProfiler [85] package version 3.16.1, with semantically-similar GO terms collapsed and significantly-enriched terms defined as adjusted p-value < 0.05.

ENCODE RNA-seq datasets: Processed per-gene quantification files, as generated by the ENCODE pipeline were obtained from the ENCODE web portal [40] (see Supplementary Table 2 for file accessions and metadata). Given the limited availability of adult tissue samples for use in differential-expression analysis, we instead defined a less-stringent method to identify broad changes in gene expression which demonstrate consistency across tissues (see Supplementary Methods).

GTEx RNA-seq processing: Processed RNA-seq quantification files were obtained from the GTEx web portal [43] for the following tissues (matching the above young/old-age accessibility comparison): brain (Brain - Cerebellum), heart (Heart – Left Ventricle), lung (Lung), muscle (Muscle - Skeletal) and stomach (Stomach). Similar to the processing performed in Benayoun et al [44], we applied quality filters to remove lowly-expressed and non-coding genes, and subsequently used the same definitions of ‘young-age’ and ‘old-age’ (as in the above analyses) to calculate differential expression using limma-voom (see Supplementary Methods).

Human sequence variation datasets: Variation data from the 1000 Genomes Project phase 3 (1KGP) [86] (n = 2504 individuals) in. vcf.gz format was obtained and intersected with our region sets using tabix [87] (version 1.9) to obtain variants occurring within these altered-accessibility regions. Common variants were defined using a minor allele frequency (MAF) threshold of >= 0.05. These sets of intersected variants were subsequently used to compare sequence variation across region sets, as well as comparing region-intersected variation to genomic backgrounds and feature sets (see Supplementary Methods).

GWAS summary statistics data: To define a set of aging-associated diseases for use in our analyses, we first used broadly-defined categories as described in Chang et al., 2019 [58]. This study described 92 age-related diseases grouped into broader disease categories based on analyses of large-scale demographic datasets. We took these classifications and manually extracted relevant GWAS phenotypes assessed by the UK Biobanks study [57], obtaining pre-processed summary statistics for these phenotypes provided by the Neale lab [77] (https://nealelab.github.io/UKBB_ldsc/downloads.html). These data were subsequently utilized across several bioinformatic analyses (see Supplementary Methods).

Additional computational methods, including implementations of statistical tests described in the Results, are described in detail in the Supplementary Methods document. The datasets supporting the findings of this study are publicly-available – accession codes and URLs are provided in the Supplementary Methods and Tables. Computational code for processing these datasets is available upon reasonable request from the Lead Contact.

Author Contributions

DR: Conceptualization, Data curation, Formal analysis, Investigation, Visualization, Methodology, Manuscript Writing; TDC: Conceptualization, Supervision, Funding acquisition, Investigation, Manuscript Writing, Project administration.

Acknowledgments

We thank Dr. Anne Brunet for meaningful discussions relating to concepts of aging, development and epigenetics. We thank three anonymous reviewers for their excellent insights into our work.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Funding

Research reported in this publication was supported by The American School of Prehistoric Research, Harvard University.

References

  • 1. Brunet A, Berger SL. Epigenetics of aging and aging-related disease. J Gerontol A Biol Sci Med Sci. 2014 (Suppl 1); 69:S17–20. https://doi.org/10.1093/gerona/glu042 [PubMed]
  • 2. Rodríguez-Rodero S, Fernández-Morera JL, Menéndez-Torre E, Calvanese V, Fernández AF, Fraga MF. Aging genetics and aging. Aging Dis. 2011; 2:186–95. [PubMed]
  • 3. Booth LN, Brunet A. The Aging Epigenome. Mol Cell. 2016; 62:728–44. https://doi.org/10.1016/j.molcel.2016.05.013 [PubMed]
  • 4. Criscione SW, Teo YV, Neretti N. The Chromatin Landscape of Cellular Senescence. Trends Genet. 2016; 32:751–61. https://doi.org/10.1016/j.tig.2016.09.005 [PubMed]
  • 5. López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell. 2013; 153:1194–217. https://doi.org/10.1016/j.cell.2013.05.039 [PubMed]
  • 6. ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012; 489:57–74. https://doi.org/10.1038/nature11247 [PubMed]
  • 7. Zhang W, Qu J, Liu GH, Belmonte JC. The ageing epigenome and its rejuvenation. Nat Rev Mol Cell Biol. 2020; 21:137–50. https://doi.org/10.1038/s41580-019-0204-5 [PubMed]
  • 8. Blagosklonny MV, Hall MN. Growth and aging: a common molecular mechanism. Aging (Albany NY). 2009; 1:357–62. https://doi.org/10.18632/aging.100040 [PubMed]
  • 9. Hanson M, Godfrey KM, Lillycrop KA, Burdge GC, Gluckman PD. Developmental plasticity and developmental origins of non-communicable disease: theoretical considerations and epigenetic mechanisms. Prog Biophys Mol Biol. 2011; 106:272–80. https://doi.org/10.1016/j.pbiomolbio.2010.12.008 [PubMed]
  • 10. Gicquel C, El-Osta A, Le Bouc Y. Epigenetic regulation and fetal programming. Best Pract Res Clin Endocrinol Metab. 2008; 22:1–16. https://doi.org/10.1016/j.beem.2007.07.009 [PubMed]
  • 11. Godfrey KM, Lillycrop KA, Burdge GC, Gluckman PD, Hanson MA. Non-imprinted epigenetics in fetal and postnatal development and growth. Nestle Nutr Inst Workshop Ser. 2013; 71:57–63. https://doi.org/10.1159/000342552 [PubMed]
  • 12. Rinaudo P, Wang E. Fetal programming and metabolic syndrome. Annu Rev Physiol. 2012; 74:107–30. https://doi.org/10.1146/annurev-physiol-020911-153245 [PubMed]
  • 13. Kirkwood TB. Understanding ageing from an evolutionary perspective. J Intern Med. 2008; 263:117–27. https://doi.org/10.1111/j.1365-2796.2007.01901.x [PubMed]
  • 14. Kirkwood TB. Evolution of ageing. Mech Ageing Dev. 2002; 123:737–45. https://doi.org/10.1016/s0047-6374(01)00419-5 [PubMed]
  • 15. Conradt E, Adkins DE, Crowell SE, Raby KL, Diamond LM, Ellis B. Incorporating epigenetic mechanisms to advance fetal programming theories. Dev Psychopathol. 2018; 30:807–24. https://doi.org/10.1017/S0954579418000469 [PubMed]
  • 16. Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, Heine-Suñer D, Cigudosa JC, Urioste M, Benitez J, Boix-Chornet M, Sanchez-Aguilera A, Ling C, et al. Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci USA. 2005; 102:10604–09. https://doi.org/10.1073/pnas.0500398102 [PubMed]
  • 17. Ollikainen M, Smith KR, Joo EJ, Ng HK, Andronikos R, Novakovic B, Abdul Aziz NK, Carlin JB, Morley R, Saffery R, Craig JM. DNA methylation analysis of multiple tissues from newborn twins reveals both genetic and intrauterine components to variation in the human neonatal epigenome. Hum Mol Genet. 2010; 19:4176–88. https://doi.org/10.1093/hmg/ddq336 [PubMed]
  • 18. Saffery R, Novakovic B. Epigenetics as the mediator of fetal programming of adult onset disease: what is the evidence? Acta Obstet Gynecol Scand. 2014; 93:1090–98. https://doi.org/10.1111/aogs.12431 [PubMed]
  • 19. Aagaard-Tillery KM, Grove K, Bishop J, Ke X, Fu Q, McKnight R, Lane RH. Developmental origins of disease and determinants of chromatin structure: maternal diet modifies the primate fetal epigenome. J Mol Endocrinol. 2008; 41:91–102. https://doi.org/10.1677/JME-08-0025 [PubMed]
  • 20. Horvath S, Raj K. DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat Rev Genet. 2018; 19:371–84. https://doi.org/10.1038/s41576-018-0004-3 [PubMed]
  • 21. Yan L, Guo H, Hu B, Li R, Yong J, Zhao Y, Zhi X, Fan X, Guo F, Wang X, Wang W, Wei Y, Wang Y, et al. Epigenomic Landscape of Human Fetal Brain, Heart, and Liver. J Biol Chem. 2016; 291:4386–98. https://doi.org/10.1074/jbc.M115.672931 [PubMed]
  • 22. Maiese K, Li F, Chong ZZ, Shang YC. The Wnt signaling pathway: aging gracefully as a protectionist? Pharmacol Ther. 2008; 118:58–81. https://doi.org/10.1016/j.pharmthera.2008.01.004 [PubMed]
  • 23. Salminen A, Kaarniranta K. Insulin/IGF-1 paradox of aging: regulation via AKT/IKK/NF-kappaB signaling. Cell Signal. 2010; 22:573–77. https://doi.org/10.1016/j.cellsig.2009.10.006 [PubMed]
  • 24. de Magalhães JP. Programmatic features of aging originating in development: aging mechanisms beyond molecular damage? FASEB J. 2012; 26:4821–26. https://doi.org/10.1096/fj.12-210872 [PubMed]
  • 25. Rivera CM, Ren B. Mapping human epigenomes. Cell. 2013; 155:39–55. https://doi.org/10.1016/j.cell.2013.09.011 [PubMed]
  • 26. Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, Amin V, Whitaker JW, Schultz MD, et al, and Roadmap Epigenomics Consortium. Integrative analysis of 111 reference human epigenomes. Nature. 2015; 518:317–30. https://doi.org/10.1038/nature14248 [PubMed]
  • 27. Day K, Waite LL, Thalacker-Mercer A, West A, Bamman MM, Brooks JD, Myers RM, Absher D. Differential DNA methylation with age displays both common and dynamic features across human tissues that are influenced by CpG landscape. Genome Biol. 2013; 14:R102. https://doi.org/10.1186/gb-2013-14-9-r102 [PubMed]
  • 28. Kubben N, Misteli T. Shared molecular and cellular mechanisms of premature ageing and ageing-associated diseases. Nat Rev Mol Cell Biol. 2017; 18:595–609. https://doi.org/10.1038/nrm.2017.68 [PubMed]
  • 29. Yang J, Huang T, Petralia F, Long Q, Zhang B, Argmann C, Zhao Y, Mobbs CV, Schadt EE, Zhu J, Tu Z, and GTEx Consortium. Synchronized age-related gene expression changes across multiple tissues in human and the link to complex diseases. Sci Rep. 2015; 5:15145. https://doi.org/10.1038/srep15145 [PubMed]
  • 30. Frenk S, Houseley J. Gene expression hallmarks of cellular ageing. Biogerontology. 2018; 19:547–66. https://doi.org/10.1007/s10522-018-9750-z [PubMed]
  • 31. Bou Sleiman M, Jha P, Houtkooper R, Williams RW, Wang X, Auwerx J. The Gene-Regulatory Footprint of Aging Highlights Conserved Central Regulators. Cell Rep. 2020; 32:108203. https://doi.org/10.1016/j.celrep.2020.108203 [PubMed]
  • 32. Yuen RK, Neumann SM, Fok AK, Peñaherrera MS, McFadden DE, Robinson WP, Kobor MS. Extensive epigenetic reprogramming in human somatic tissues between fetus and adult. Epigenetics Chromatin. 2011; 4:7. https://doi.org/10.1186/1756-8935-4-7 [PubMed]
  • 33. Zhu J, Adli M, Zou JY, Verstappen G, Coyne M, Zhang X, Durham T, Miri M, Deshpande V, De Jager PL, Bennett DA, Houmard JA, Muoio DM, et al. Genome-wide chromatin state transitions associated with developmental and environmental cues. Cell. 2013; 152:642–54. https://doi.org/10.1016/j.cell.2012.12.033 [PubMed]
  • 34. Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, Karczewski KJ, Park J, Hitz BC, Weng S, Cherry JM, Snyder M. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012; 22:1790–97. https://doi.org/10.1101/gr.137323.112 [PubMed]
  • 35. Charlesworth B. Patterns of age-specific means and genetic variances of mortality rates predicted by the mutation-accumulation theory of ageing. J Theor Biol. 2001; 210:47–65. https://doi.org/10.1006/jtbi.2001.2296 [PubMed]
  • 36. Kirkwood TB, Rose MR. Evolution of senescence: late survival sacrificed for reproduction. Philos Trans R Soc Lond B Biol Sci. 1991; 332:15–24. https://doi.org/10.1098/rstb.1991.0028 [PubMed]
  • 37. Richard D, Liu Z, Cao J, Kiapour AM, Willen J, Yarlagadda S, Jagoda E, Kolachalama VB, Sieker JT, Chang GH, Muthuirulan P, Young M, Masson A, et al. Evolutionary Selection and Constraint on Human Knee Chondrocyte Regulation Impacts Osteoarthritis Risk. Cell. 2020; 181:362–81.e28. https://doi.org/10.1016/j.cell.2020.02.057 [PubMed]
  • 38. Rodríguez JA, Marigorta UM, Hughes DA, Spataro N, Bosch E, Navarro A. Antagonistic pleiotropy and mutation accumulation influence human senescence and disease. Nat Ecol Evol. 2017; 1:55. https://doi.org/10.1038/s41559-016-0055 [PubMed]
  • 39. Klemm SL, Shipony Z, Greenleaf WJ. Chromatin accessibility and the regulatory epigenome. Nat Rev Genet. 2019; 20:207–20. https://doi.org/10.1038/s41576-018-0089-8 [PubMed]
  • 40. Davis CA, Hitz BC, Sloan CA, Chan ET, Davidson JM, Gabdank I, Hilton JA, Jain K, Baymuradov UK, Narayanan AK, Onate KC, Graham K, Miyasato SR, et al. The Encyclopedia of DNA elements (ENCODE): data portal update. Nucleic Acids Res. 2018; 46:D794–801. https://doi.org/10.1093/nar/gkx1081 [PubMed]
  • 41. Jung I, Schmitt A, Diao Y, Lee AJ, Liu T, Yang D, Tan C, Eom J, Chan M, Chee S, Chiang Z, Kim C, Masliah E, et al. A compendium of promoter-centered long-range chromatin interactions in the human genome. Nat Genet. 2019; 51:1442–49. https://doi.org/10.1038/s41588-019-0494-8 [PubMed]
  • 42. McLean CY, Bristor D, Hiller M, Clarke SL, Schaar BT, Lowe CB, Wenger AM, Bejerano G. GREAT improves functional interpretation of cis-regulatory regions. Nat Biotechnol. 2010; 28:495–501. https://doi.org/10.1038/nbt.1630 [PubMed]
  • 43. GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013; 45:580–85. https://doi.org/10.1038/ng.2653 [PubMed]
  • 44. Benayoun BA, Pollina EA, Singh PP, Mahmoudi S, Harel I, Casey KM, Dulken BW, Kundaje A, Brunet A. Remodeling of epigenome and transcriptome landscapes with aging in mice reveals widespread induction of inflammatory responses. Genome Res. 2019; 29:697–709. https://doi.org/10.1101/gr.240093.118 [PubMed]
  • 45. Carroll SB. Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution. Cell. 2008; 134:25–36. https://doi.org/10.1016/j.cell.2008.06.030 [PubMed]
  • 46. Kirkwood TB, Holliday R. The evolution of ageing and longevity. Proc R Soc Lond B Biol Sci. 1979; 205:531–46. https://doi.org/10.1098/rspb.1979.0083 [PubMed]
  • 47. Jones OR, Scheuerlein A, Salguero-Gómez R, Camarda CG, Schaible R, Casper BB, Dahlgren JP, Ehrlén J, García MB, Menges ES, Quintana-Ascencio PF, Caswell H, Baudisch A, Vaupel JW. Diversity of ageing across the tree of life. Nature. 2014; 505:169–73. https://doi.org/10.1038/nature12789 [PubMed]
  • 48. Ferris E, Abegglen LM, Schiffman JD, Gregg C. Accelerated Evolution in Distinctive Species Reveals Candidate Elements for Clinically Relevant Traits, Including Mutation and Cancer Resistance. Cell Rep. 2018; 22:2742–55. https://doi.org/10.1016/j.celrep.2018.02.008 [PubMed]
  • 49. Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 2010; 20:110–21. https://doi.org/10.1101/gr.097857.109 [PubMed]
  • 50. Kirkwood TB. The origins of human ageing. Philos Trans R Soc Lond B Biol Sci. 1997; 352:1765–72. https://doi.org/10.1098/rstb.1997.0160 [PubMed]
  • 51. Varki A, Altheide TK. Comparing the human and chimpanzee genomes: searching for needles in a haystack. Genome Res. 2005; 15:1746–58. https://doi.org/10.1101/gr.3737405 [PubMed]
  • 52. Hubisz MJ, Pollard KS. Exploring the genesis and functions of Human Accelerated Regions sheds light on their role in human evolution. Curr Opin Genet Dev. 2014; 29:15–21. https://doi.org/10.1016/j.gde.2014.07.005 [PubMed]
  • 53. Carnes BA, Olshansky SJ. Evolutionary Perspectives on Human Senescence. Popul Dev Rev. 1993; 19:793. https://doi.org/10.2307/2938414
  • 54. Ricklefs RE. The evolution of senescence from a comparative perspective. Funct Ecol. 2008; 22:379–92. https://doi.org/10.1111/j.1365-2435.2008.01420.x
  • 55. Williams GC. Pleiotropy, natural selection, and the evolution of senescence. Evolution. 1957; 11:398–411. https://doi.org/10.1111/j.1558-5646.1957.tb02911.x
  • 56. Wright A, Charlesworth B, Rudan I, Carothers A, Campbell H. A polygenic basis for late-onset disease. Trends Genet. 2003; 19:97–106. https://doi.org/10.1016/s0168-9525(02)00033-1 [PubMed]
  • 57. Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, Downey P, Elliott P, Green J, Landray M, Liu B, Matthews P, Ong G, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015; 12:e1001779. https://doi.org/10.1371/journal.pmed.1001779 [PubMed]
  • 58. Chang AY, Skirbekk VF, Tyrovolas S, Kassebaum NJ, Dieleman JL. Measuring population ageing: an analysis of the Global Burden of Disease Study 2017. Lancet Public Health. 2019; 4:e159–67. https://doi.org/10.1016/S2468-2667(19)30019-2 [PubMed]
  • 59. Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017; 169:1177–86. https://doi.org/10.1016/j.cell.2017.05.038 [PubMed]
  • 60. Landrum MJ, Lee JM, Benson M, Brown GR, Chao C, Chitipiralla S, Gu B, Hart J, Hoffman D, Jang W, Karapetyan K, Katz K, Liu C, et al. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Res. 2018; 46:D1062–67. https://doi.org/10.1093/nar/gkx1153 [PubMed]
  • 61. Hall JA, Dominy JE, Lee Y, Puigserver P. The sirtuin family’s role in aging and age-associated pathologies. J Clin Invest. 2013; 123:973–79. https://doi.org/10.1172/JCI64094 [PubMed]
  • 62. Sen P, Shah PP, Nativio R, Berger SL. Epigenetic Mechanisms of Longevity and Aging. Cell. 2016; 166:822–39. https://doi.org/10.1016/j.cell.2016.07.050 [PubMed]
  • 63. Cooper GM, Goode DL, Ng SB, Sidow A, Bamshad MJ, Shendure J, Nickerson DA. Single-nucleotide evolutionary constraint scores highlight disease-causing mutations. Nat Methods. 2010; 7:250–51. https://doi.org/10.1038/nmeth0410-250 [PubMed]
  • 64. Cooper GM, Shendure J. Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nat Rev Genet. 2011; 12:628–40. https://doi.org/10.1038/nrg3046 [PubMed]
  • 65. Hujoel ML, Gazal S, Hormozdiari F, van de Geijn B, Price AL. Disease Heritability Enrichment of Regulatory Elements Is Concentrated in Elements with Ancient Sequence Age and Conserved Function across Species. Am J Hum Genet. 2019; 104:611–24. https://doi.org/10.1016/j.ajhg.2019.02.008 [PubMed]
  • 66. MacArthur DG, Manolio TA, Dimmock DP, Rehm HL, Shendure J, Abecasis GR, Adams DR, Altman RB, Antonarakis SE, Ashley EA, Barrett JC, Biesecker LG, Conrad DF, et al. Guidelines for investigating causality of sequence variants in human disease. Nature. 2014; 508:469–76. https://doi.org/10.1038/nature13127 [PubMed]
  • 67. Gazal S, Finucane HK, Furlotte NA, Loh PR, Palamara PF, Liu X, Schoech A, Bulik-Sullivan B, Neale BM, Gusev A, Price AL. Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat Genet. 2017; 49:1421–27. https://doi.org/10.1038/ng.3954 [PubMed]
  • 68. Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM, Wilson RK, Gibbs RA, et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 2005; 15:1034–50. https://doi.org/10.1101/gr.3715005 [PubMed]
  • 69. Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh PR, Anttila V, Xu H, Zang C, Farh K, Ripke S, Day FR, Purcell S, et al, and ReproGen Consortium, Schizophrenia Working Group of the Psychiatric Genomics Consortium, and RACI Consortium. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015; 47:1228–35. https://doi.org/10.1038/ng.3404 [PubMed]
  • 70. Huang YF, Gulko B, Siepel A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat Genet. 2017; 49:618–24. https://doi.org/10.1038/ng.3810 [PubMed]
  • 71. Tang J, Yan H, Zhuang S. Histone deacetylases as targets for treatment of multiple diseases. Clin Sci (Lond). 2013; 124:651–62. https://doi.org/10.1042/CS20120504 [PubMed]
  • 72. Benayoun BA, Pollina EA, Brunet A. Epigenetic regulation of ageing: linking environmental inputs to genomic stability. Nat Rev Mol Cell Biol. 2015; 16:593–610. https://doi.org/10.1038/nrm4048 [PubMed]
  • 73. Bi S, Liu Z, Wu Z, Wang Z, Liu X, Wang S, Ren J, Yao Y, Zhang W, Song M, Liu GH, Qu J. SIRT7 antagonizes human stem cell aging as a heterochromatin stabilizer. Protein Cell. 2020; 11:483–504. https://doi.org/10.1007/s13238-020-00728-4 [PubMed]
  • 74. Tasselli L, Zheng W, Chua KF. SIRT6: Novel Mechanisms and Links to Aging and Disease. Trends Endocrinol Metab. 2017; 28:168–85. https://doi.org/10.1016/j.tem.2016.10.002 [PubMed]
  • 75. Mostoslavsky R, Chua KF, Lombard DB, Pang WW, Fischer MR, Gellon L, Liu P, Mostoslavsky G, Franco S, Murphy MM, Mills KD, Patel P, Hsu JT, et al. Genomic instability and aging-like phenotype in the absence of mammalian SIRT6. Cell. 2006; 124:315–29. https://doi.org/10.1016/j.cell.2005.11.044 [PubMed]
  • 76. Tsurumi A, Li WX. Global heterochromatin loss: a unifying theory of aging? Epigenetics. 2012; 7:680–88. https://doi.org/10.4161/epi.20540 [PubMed]
  • 77. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, Duncan L, Perry JR, Patterson N, Robinson EB, Daly MJ, Price AL, Neale BM, and ReproGen Consortium, and Psychiatric Genomics Consortium, and Genetic Consortium for Anorexia Nervosa of the Wellcome Trust Case Control Consortium 3. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015; 47:1236–41. https://doi.org/10.1038/ng.3406 [PubMed]
  • 78. Domcke S, Hill AJ, Daza RM, Cao J, O’Day DR, Pliner HA, Aldinger KA, Pokholok D, Zhang F, Milbank JH, Zager MA, Glass IA, Steemers FJ, et al. A human cell atlas of fetal chromatin accessibility. Science. 2020; 370:eaba7612. https://doi.org/10.1126/science.aba7612 [PubMed]
  • 79. Teschendorff AE, West J, Beck S. Age-associated epigenetic drift: implications, and a case of epigenetic thrift? Hum Mol Genet. 2013; 22:R7–15. https://doi.org/10.1093/hmg/ddt375 [PubMed]
  • 80. Zampieri M, Ciccarone F, Calabrese R, Franceschi C, Bürkle A, Caiafa P. Reconfiguration of DNA methylation in aging. Mech Ageing Dev. 2015; 151:60–70. https://doi.org/10.1016/j.mad.2015.02.002 [PubMed]
  • 81. R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria; 2008. http://www.r-project.org.
  • 82. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015; 43:e47. https://doi.org/10.1093/nar/gkv007 [PubMed]
  • 83. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodological). 1995; 57: 289–300. http://www.jstor.org/stable/2346101.
  • 84. Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ. The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 2004; 32:D493–96. https://doi.org/10.1093/nar/gkh103 [PubMed]
  • 85. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012; 16:284–87. https://doi.org/10.1089/omi.2011.0118 [PubMed]
  • 86. Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR, and 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015; 526:68–74. https://doi.org/10.1038/nature15393 [PubMed]
  • 87. Li H. Tabix: fast retrieval of sequence features from generic TAB-delimited files. Bioinformatics. 2011; 27:718–19. https://doi.org/10.1093/bioinformatics/btq671 [PubMed]