Research Paper Advance Articles

Characterization of DNA methylation clock algorithms applied to diverse tissue types

Mark Richardson1, , Courtney Brandt1, , Niyati Jain1, , James L. Li1, , Kathryn Demanelis2, , Farzana Jasmine1, , Muhammad G. Kibriya1, , Lin Tong1, , Brandon L. Pierce1,3,4, ,

  • 1 Department of Public Health Sciences, University of Chicago, Chicago, IL 60615, USA
  • 2 Department of Medicine, University of Pittsburgh, Pittsburgh, PA 15261, USA
  • 3 Department of Human Genetics, University of Chicago, Chicago, IL 60615, USA
  • 4 Comprehensive Cancer Center, University of Chicago, Chicago, IL 60615, USA

Received: December 5, 2023       Accepted: December 12, 2024       Published: January 3, 2025      

https://doi.org/10.18632/aging.206182
How to Cite

Copyright: © 2025 Richardson et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Background: DNA methylation (DNAm) data from human samples has been leveraged to develop “epigenetic clock” algorithms that predict age and other aging-related phenotypes. Some DNAm clocks were trained using DNAm obtained from blood cells, while other clocks were trained using data from diverse tissue/cell types. To assess how DNAm clocks perform across non-blood tissue types, we applied DNAm algorithms to DNAm data generated from 9 different human tissue types.

Methods: We generated array-based DNAm measurements for 973 samples from deceased tissue donors from the GTEx (Genotype Tissue Expression) project representing nine distinct tissue types: lung, colon, prostate, ovary, breast, kidney, testis, skeletal muscle, and whole blood. For all samples, we generated DNAm clock estimates for 8 epigenetic clocks and characterized these tissue-specific clock estimates in terms of their distributions, correlations with chronological age, correlations of clock estimates between tissue types, and association with participant characteristics.

Results: For each clock, the mean DNAm age estimate varied substantially across tissue types, and the mean values for the different clocks varied substantially within tissue types. For most clocks, the correlation with chronological age varied across tissue types, with blood often showing the strongest correlation. Each clock showed strong correlation across tissues, with some evidence of some residual correlation after adjusting for chronological age. In lung tissue, smoking generally had a positive association with epigenetic age.

Conclusions: This work demonstrates how differences in epigenetic aging among tissue types leads to clear differences in DNAm clock characteristics across tissue types. Tissue or cell-type specific epigenetic clocks are needed to optimize predictive performance of DNAm clocks in non-blood tissues and cell types.

Introduction

Changes in the human epigenome occur as humans age, and these epigenetic alternations are considered a hallmark (and a cause) of human aging [1]. One such epigenetic feature is DNA methylation (DNAm) at cytosine-guanine dinucleotides. DNAm plays a key role in gene regulation, particularly DNAm at CpG islands near gene promoters, which is indicative of gene silencing [2]. Early studies of aging and DNAm reported age-related changes in cancer cells [3, 4], human tissues [5], mouse tissues [6], as well as in blood samples from twin studies [7]. Subsequent studies identified regions enriched for aging-related DNAm changes, including promoters of polycomb group protein target genes [8] and bivalent chromatin domains [9]. Associations between age and DNAm at CpG sites across the human genome have been thoroughly characterized in many studies (most often based on DNAm data from blood cells) [2, 10, 11], and these age-related changes are likely accompanied by changes in other epigenetic features (e.g., histone modifications, nucleosome positioning, chromatin conformation) [8, 12]. While our understanding of these changes and their implications for cellular function is incomplete, the concepts of “loss of constitutive heterochromatin” and “epigenetic drift” have been used to describe how these changes relate to aging [12].

In human studies, DNAm data has often been generated using commercial arrays that measure DNAm at ~27,000 to ~850,000 CpG sites. These data have been used to develop DNA methylation (DNAm) clock algorithms that leverage data on many CpG sites to predict age and other aging-related phenotypes [2, 11, 13]. First generation DNAm clocks (i.e., Hannum clock and Horvath clock) were trained with the goal of accurately estimating chronological age of a biological sample. These clocks estimate acceleration of biological age (relative to chronological age), a measurement that has been shown to positively correlate with aging-related phenotypes [2]. Later generations of DNAm clocks (e.g., GrimAge [14], PhenoAge [15]) were trained on additional health- and aging-related variables (e.g., smoking, circulating biomarkers) in order to generate biological aging estimates that more strongly predict health and lifespan. Additional DNAm clocks of interest include “mitotic clocks”, such as EpiTOC (Epigenetic Timer of Cancer), which estimates the mitotic age of cells, how many stem cell divisions have occurred, and provides a subsequent estimate of cell age [16]. In recent years, clocks have been developed that use larger numbers of CpGs, and sophisticated variable selection methods, in order to improve prediction of chronological age, including Vijayakumar and Cho’s “EpiClock” (~7000 CpGs) [17], AltumAge (20,318 CpGs) [18], and the Zhang clock (514 CpGs) [19]. In addition, a clock that estimates “pace of aging” (DunedinPACE) was developed using two decades of longitudinal biomarker data (173 CpGs) [20].

The DNAm clocks developed to date were trained using either (1) DNAm obtained from blood cells only, or (2) DNAm obtained from diverse tissue types, but with the majority of training data coming from blood cells [11]. Thus, DNAm clocks may not perform equally well across tissue types, and few studies have compared the performance of DNAm aging clock algorithms across a variety of non-blood tissue types [21, 22]. It is important to understand clocks’ performance across different tissues, as clinical applications may rely on accessible tissues only to assess tissue-specific aging; and forensic settings may rely on accurate age prediction from a variety of tissue sources.

To assess differences across tissues with respect to biological age predictions, we used DNAm data from the Genotype-Tissue Expression (GTEx) Project to obtain epigenetic age estimates for DNAm clocks (Horvath, Hannum, PhenoAge, EpiTOC, EpiClock, AltumAge, Zhang, and Dunedin PACE) across 9 different tissue types (lung, colon, prostate, ovary, breast, kidney, testis, skeletal muscle, and whole blood). We characterize DNAm aging estimates across tissues in terms of the (1) distribution of DNAm clock estimates, (2) strength of each clock’s association with age, (3) correlation of clock estimates between tissue types, and (4) associations of participant characteristics with clock estimates.

Materials and Methods

Tissue samples

The GTEx Project is a publicly available biobank of >17,000 human tissue samples collected from ~950 post-mortem multi-tissue donors, with ~50 unique human tissue types represented [23]. The GTEx project has generated genome-wide data on genetic variation (using whole-genome sequencing) and gene expression (using RNA sequencing) for >15,000 tissue samples from >800 donors [24]. For each donor, characteristics such as age, race, BMI, and sex are also publicly available [25].

The “enhancing GTEx” (eGTEx) initiative built upon the core GTEx data by adding complementary layers of biological information, including proteomics, DNA and RNA modifications, and telomere length (TL) [26]. As a part of eGTEx, we generated array-based DNAm measurements for ~1,000 GTEx samples, described previously [27]. The selection of tissue types was based on several considerations, including inclusion of cancer-relevant tissues (lung, colon, prostate, ovary, breast, kidney), tissues with unique aging biology (testis, skeletal muscle), and tissues commonly used in epidemiological research (whole blood).

Genome-wide DNA methylation measurement and processing

We measured DNAm at 866,895 CpG sites throughout the epigenome in 1,000 GTEx tissue samples from 9 unique tissue types obtained from 424 GTEx subjects using the Infinium MethylationEPIC v1 array (Illumina, San Diego, CA, USA). DNA samples were extracted from GTEx tissue samples using Qiagen Gentra Puregene method at GTEx Laboratory Data, Analysis and Coordinating Center (LDACC), and sent to the Institute for Population and Precision Health Laboratory at the University of Chicago on 96-well plates. Neither tissue types nor individuals were not batched by plate. Bisulfite conversion was applied to 500 ng of DNA using EZ-96 DNA methylation kit (Zymo Research, Irvine, CA, USA). All samples were then prepared and analyzed in accordance with the manufacturer guidelines and protocol for the EPIC array.

DNAm data was processed with ChAMP software [28]. Raw DNAm values were background-adjusted using the single sample normal-exponential out-of-band (ssnoob) method with dye bias correction [29, 30]. DNAm beta values were normalized using the beta mixture quantile (BMIQ) method, adjusting for type I/II probe bias [31]. After normalization, we removed one additional sample with array-derived genotype profile not matching WGS-derived one. As described previously [27], principal component analysis (PCA) was conducted on DNAm beta values within each tissue type, and 3 samples were removed for being outliers with respect to the top 5 principal components (PCs) of the corresponding tissue. We additionally removed 13 breast samples obtained from males (leaving only female breast samples for analysis), resulting in 973 samples for analyses purposes (representing 9 tissue types and 424 donors). Due to missing telomere and smoking status data, an additional 99 samples were removed (from the regression analyses only) resulting in 874 samples.

DNAm clock estimates

For this study, we selected several commonly-used clocks, as well as recently developed clocks, including both pan-tissue and blood-specific clocks (Table 1). We excluded clocks that use chronological age as input (i.e., GrimAge). We acknowledge there are additional clocks described in the literature that we are not assessing in this work. DNAm clock estimates for all 973 GTEx tissues samples were obtained using the Horvath group’s online epigenetic clock calculator [32] and the R code provided [33]. Three of the DNAm clocks we analyzed were provided by the calculator: Horvath, Hannum, PhenoAge. Clock estimates for EpiTOC [16], EpiClock [17], AltumAge [34], Zhang clock [19], and DundinPACE [20] were obtained using code provided by the authors. For each of the clock algorithms examined, we provide information on the training data, arrays and CpGs used, the clock’s intended purpose and the number of clock CpGs that were missing from our dataset (i.e., removed during QC) (Table 1).

Table 1. Summary of the clock algorithms examined in this work.

ClockTissue type(s) for trainingArray(s) usedPurposeTraining data ageRef# CpGs used# CpGs missing
HorvathMulti-tissue (~8,000 samples)27K and 450KEstimate chronological ageMean: 43 y Range: 0-100 y3335323
HannumWhole blood (656 samples)450KEstimate chronological ageMean: ~65 y Range: 19-101y35719
Pheno AgeWhole blood (9,926 samples)27K, 450K, and 850KHealth + lifespan estimationMean age: 49y Range: 8-80y (NHANES)1551312
Altum AgeMulti-tissue27K, 450K, and 850KEstimate chronological ageMean: ~50 y Range: 0 to >100 y1820,318522
EpiTOCFetal tissues and blood450KCapture stem cell divisionsMean: ~65 y Range: 19-101y1638531
EpiClockMulti-tissue (3,114 samples)450K, and 850KEstimate chronological ageMedian: 36.5 y Range: 0-103 y176,761515
Dunedin PACEWhole blood (1,037 samples)805KEstimate pace of agingLongitudinal birth cohort (sampled at 26, 32, 38, 45 y)201730
ZhangBlood/saliva (13,661 samples)450K and 850KEstimate chronological ageMean: 15-82 y (14 cohorts) Range: 2–104 y195148

The Horvath clock [33] and Hannum clock [35], considered first-generation clocks, were trained with the goal of accurately estimating chronological age based on DNAm values measured from samples. PhenoAge is considered a second-generation clock that reflects both lifespan and health and was trained on age and 9 blood biomarkers [15]. Because the GrimAge clock [14] uses age as an input, we did not evaluate GrimAge in this work, as age prediction is a primary focus of this paper.

EpiTOC (Epigenetic timer of cancer) [16] is a “mitotic” clock was developed to capture the amount of stem cell division in a tissue. EpiTOC uses 385 CpGs selected based on (1) location within promoters that localize to Polycomb group target genes, (2) lack of methylation in a variety of fetal tissue types, and (3) increasing methylation with age in human blood samples.

The EpiClock algorithm was developed to predict chronological age across multiple tissue types [17]. The AltumAge algorithm, another multi-tissue clock, was trained using a neural network approach and is reported to have enhanced performance for older ages and for diverse tissue types.

The Zhang clock was developed from the stated goal of examining whether the association between the age acceleration residual of different predictors (developed by the researchers in the same paper) and death is affected by improving predictive power as its training set sample size increases and by correcting for confounders. The final CpG count was 514, chosen based on their associations with chronological age in blood samples.

DunedinPACE uses DNA methylation data from blood samples for 173 CpGs to determine the “Pace of Aging” of a person. This clock is based on biological changes observed for 19 indicators of organ-system integrity over two decades in the Dunedin birth cohort [20].

Among the CpGs in our post-QC dataset, we were missing a small percentage for the CpGs used by each clock (Table 1).

Estimation of immune cell infiltration

The presence of leukocytes in GTEx samples was estimated using the LUMP algorithm [36], which leverages data on 44 CpG sites that are specifically methylated specifically in immune cells. LUMP provides a percentage representing “purity”, so we subtracted this percentage from 1 to obtain a percentage represented immune cell infiltration (ICI). To determine if ICI impacts clock performance, we first examined the association between the average immune cell infiltration (for a specific tissue) and the correlation between chronological age and clock age (for a specific tissue), across all eight non-blood tissue types. We also examined the interaction between chronological age and ICI (in relation to clock age) for each clock, in each tissue type.

Statistical analyses

We examined the distributions of all five biological clock estimates (across tissue types) using histograms, ridge plots, and violin plots. Acceleration for each clock was calculated as the residuals from a linear regression of the clock estimates on chronological age. To examine the strength of associations of clock estimates with participant characteristics (and age), we used linear regression and/or Pearson’s correlation analysis. We calculated (1) the mean deviation for each clock applied to each tissue (i.e., average difference between clock age and chronological age) as well as (2) the median absolute error (i.e., median absolute difference between DNAm age and chronological age), similar to previous work [33]. We also used Pearson’s correlation and linear regression to examine the association between clock estimates obtained from different tissue types. Linear regression models included sex, smoking, BMI, and telomere length as covariates. Telomere length was measured for GTEx samples as previously described [37].

Results

The characteristics of the GTEx donors for each tissue type used for this project are described in Table 2. The sample sizes for each tissue type ranged from 38 (female breast) to 212 (lung), with a total of 424 donors contributing tissue samples.

Table 2. GTEx donor characteristics for each tissue type.

Total (n=973)Breast (n=38)Colon (n=224)Kidney (n=50)Lung (n=223)Muscle (n=47)Ovary (n=164)Prostate (n=123)Testis (n=50)Whole Blood (n=54)
Age (years)
Mean (SD)50.0 (11.9)56.1 (11.3)59.7 (8.26)55.2 (11.1)57.1 (10.5)50.6 (13.7)54.1 (13.0)54.2 (12.2)50.3 (12.9)
Median [Min, Max]50.5 [21, 70]59.0 [21, 70]61.5 [36, 70]57.0 [22, 70]60.0 [31, 70]52.0 [21, 70]57.0 [20, 70]56.0 [22, 70]52.5 [22, 70]
Sex
Male0 (0%)156 (69.6%)39 (78.0%)160 (71.7%)28 (59.6%)0 (0%)123 (100%)50 (100%)45 (83.3%)
Female38 (100%)68 (30.4%)11 (22.0%)63 (28.3%)19 (40.4%)164 (100%)0 (0%)0 (0%)9 (16.7%)
Race
Non-Hispanic White0 (0%)3 (1.3%)0 (0%)3 (1.3%)0 (0%)2 (1.2%)0 (0%)0 (0%)0 (0%)
African American6 (15.8%)22 (9.8%)6 (12.0%)28 (12.6%)6 (12.8%)27 (16.5%)10 (8.1%)3 (6.0%)5 (9.3%)
Other32 (84.2%)197 (87.9%)44 (88.0%)190 (85.2%)41 (87.2%)134 (81.7%)111 (90.2%)47 (94.0%)48 (88.9%)
Missing0 (0%)2 (0.9%)0 (0%)2 (0.9%)0 (0%)1 (0.6%)2 (1.6%)0 (0%)1 (1.9%)
BMI (kg/m2)
Mean (SD)25.4 (3.94)27.1 (3.94)26.4 (3.75)27.6 (3.91)26.8 (4.38)26.8 (4.23)27.1 (3.81)27.2 (3.82)27.4 (4.18)
Median [Min, Max]25.5 [18.9, 33.3]27.2 [18.8, 35.0]26.6 [18.8, 34.8]27.4 [18.6, 35.0]26.6 [18.6, 34.4]26.6 [18.5, 34.9]27.1 [18.8, 34.9]27.1 [19.0, 34.8]27.2 [19.8, 35.0]
Smoker
No11 (28.9%)57 (25.4%)10 (20.0%)62 (27.8%)12 (25.5%)57 (34.8%)38 (30.9%)12 (24.0%)12 (22.2%)
Yes27 (71.1%)152 (67.9%)37 (74.0%)150 (67.3%)34 (72.3%)96 (58.5%)73 (59.3%)36 (72.0%)40 (74.1%)
Missing0 (0%)15 (6.7%)3 (6.0%)11 (4.9%)1 (2.1%)11 (6.7%)12 (9.8%)2 (4.0%)2 (3.7%)
Telomere length
Mean (SD)1.06 (0.314)1.09 (0.386)1.01 (0.339)0.930 (0.228)1.39 (0.352)1.29 (0.291)1.09 (0.294)1.97 (0.522)0.836 (0.206)
Median [Min, Max]1.02 [0.37, 1.96]1.07 [0.31, 2.59]0.96 [0.32, 1.88]0.91 [0.35, 1.72]1.34 [0.89, 2.41]1.25 [0.71, 2.24]1.05 [0.19, 2.16]1.93 [1.18, 3.45]0.87 [0.37, 1.20]
Missing1 (2.6%)5 (2.2%)0 (0%)19 (8.5%)4 (8.5%)13 (7.9%)0 (0%)4 (8.0%)1 (1.9%)

The tissue-specific distributions of chronological age and estimated DNAm ages (for each clock examined) are shown in Figure 1. The mean chronological age for donors varied somewhat across tissue types (~50 to ~62 years). For each clock examined, estimates varied substantially across tissue types, with mean clock estimates close to 0 years in some scenarios (i.e., PhenoAge for ovary) and close to 100 years other scenarios (i.e., AltumAge for kidney). For each clock, testis and ovary tissue tended to have lower (younger) clock estimates, while colon and lung tended to have higher (older) clock estimates compared to other tissue types.

DNA methylation clock estimates vary across tissue types. The distributions of estimates for age and six clocks are color coded by tissue type. For each clock, tissue types are ranked by their median and color-coded. The median and inter-quartile ranges are shown as vertical lines.

Figure 1. DNA methylation clock estimates vary across tissue types. The distributions of estimates for age and six clocks are color coded by tissue type. For each clock, tissue types are ranked by their median and color-coded. The median and inter-quartile ranges are shown as vertical lines.

The distribution of each cock within tissue type is shown in Figure 2. In whole blood, the distribution of each clock (considering both the mean and the variance), was most similar to chronological age, compared to other tissue types. Whole blood had low median absolute errors (Supplementary Table 1) and mean deviations (Supplementary Table 2) compared to other tissue types (for all clocks except AltumAge). The distributions of the Horvath (light brown) and EpiClock (blue) clocks most closely resembled the chronological age distributions (across most tissue types), with correspondingly small median absolute errors and mean deviations (Supplementary Tables 1, 2). The means of the blood-based clocks (PhenoAge and Hannum) tended to be lower than chronological age, with large negative mean deviations, in most tissue types (except in whole blood), and high median absolute errors (Supplementary Tables 1, 2). In general, the clocks’ distributions deviated most from chronological age in muscle, testis, and ovary tissues.

Distributions of chronological age and DNAm clock estimates within each of nine GTEx tissue types.

Figure 2. Distributions of chronological age and DNAm clock estimates within each of nine GTEx tissue types.

The correlation between each tissue-specific clock estimate and individuals’ chronological age is shown in Table 3. For all clocks except EpiTOC, the clock estimates from blood showed the strongest correlation with chronological age, as compared to all other tissue types. For each clock, the correlation with chronological age varied substantially across tissue types. For most non-blood tissue types (including breast, colon, kidney, lung, ovary, prostate) the Hannum and PhenoAge clocks, both specifically designed for blood DNAm data, showed weaker correlation with chronological age than did the clocks designed for pan-tissue applications (Horvath, AltumAge, and EpiClock). EpiTOC clock was positively correlated with chronological age (P<0.05) in all tissue types except for breast and muscle.

Table 3. Peason correlations (and confidence intervals) between clock estimates and chronological age by tissue type.

TissueDNAm clocks
HorvathHannumPhenoAgeEpiClockAltumAgeEpiTOCZhangClock
Breast0.78 (0.62-0.88)0.73 (0.54-0.85)0.57 (0.30-0.75)0.86 (0.74-0.92)0.85 (0.74-0.92)0.10 (-0.23-0.41)0.86 (0.74-0.92)
Colon0.89 (0.86-0.91)0.39 (0.27-0.50)0.36 (0.25-0.47)0.78 (0.72-0.83)0.88 (0.85-0.91)0.33 (0.20-0.44)0.71 (0.63-0.77)
Kidney0.89 (0.81-0.93)0.82 (0.69-0.89)0.67 (0.47-0.80)0.90 (0.82-0.94)0.84 (0.73-0.91)0.49 (0.24-0.68)0.91 (0.84-0.95)
Lung0.87 (0.83-0.90)0.85 (0.81-0.88)0.71 (0.64-0.77)0.92 (0.90-0.94)0.93 (0.91-0.94)0.36 (0.24-0.47)0.84 (0.8-0.88)
Muscle0.54 (0.30-0.72)0.58 (0.35-0.74)0.46 (0.19-0.66)0.72 (0.54-0.83)0.84 (0.72-0.91)0.28 (-0.003-0.53)0.46 (0.2-0.66)
Ovary0.58 (0.47-0.67)0.53 (0.41-0.63)0.26 (0.12-0.40)0.61 (0.50-0.70)0.72 (0.64-0.79)0.16 (0.01-0.31)0.50 (0.37-0.61)
Prostate0.86 (0.81-0.90)0.60 (0.48-0.71)0.67 (0.56-0.76)0.81 (0.74-0.86)0.91 (0.87-0.93)0.38 (0.22-0.52)0.71 (0.61-0.79)
Testis0.69 (0.52-0.82)0.65 (0.45-0.78)0.77 (0.62-0.86)0.52 (0.28-0.70)0.62 (0.41-0.77)0.42 (0.16-0.62)0.45 (0.19-0.65)
Whole Blood0.92 (0.87-0.96)0.93 (0.89-0.96)0.88 (0.81-0.93)0.95 (0.92-0.97)0.94 (0.90-0.96)0.41 (0.16-0.61)0.94 (0.9-0.97)

For each clock, we attempted to estimate the correlation in clock estimates between pairs of tissue types, using data from donors with DNAm data available for both tissue types. For each clock, there was clear correlation across tissue types (Supplementary Tables 39 and Supplementary Figures 16), with an average tissue-tissue correlation of 0.71 for Horvath, 0.47 for Hannum, 0.47 for PhenoAge, 0.15 for EpiTOC, 0.78 for AltumAge, 0.65 for EpiClock, and 0.65 for Zhang (based on 32 possible tissue-tissue pairs) (Figure 3A). Since lung and colon had the largest sample size (146 donors with DNAm data for both tissue types) and the most power/precision for assessing correlation between clock estimates, we present those results in more detail in Figure 4. Specifically, we observed the strongest correlation between lung and colon using Horvath (r=0.82) and AltumAge (r=0.80) (Figure 4, top).

Distribution of between-tissue correlations for each clock, for all possible pairs of tissue types. (A) not adjusted for age (B) adjusted for age. Each of the five distributions shown (in both panels) has a mean greater than zero (P

Figure 3. Distribution of between-tissue correlations for each clock, for all possible pairs of tissue types. (A) not adjusted for age (B) adjusted for age. Each of the five distributions shown (in both panels) has a mean greater than zero (P<0.05).

Correlation between lung-based and colon-based DNAm clock estimates for 6 clocks, without adjustment for age (top) and with adjustment for age (bottom). EpiTOC showed very weak evidence for correlation and is not presented.

Figure 4. Correlation between lung-based and colon-based DNAm clock estimates for 6 clocks, without adjustment for age (top) and with adjustment for age (bottom). EpiTOC showed very weak evidence for correlation and is not presented.

As expected, after adjusting for age, the inter-tissue correlations for each clock were substantially attenuated. Examining age-adjusted associations across all possible pairs of tissue types (Supplementary Tables 310 and Supplementary Figures 16), we find evidence of weak residual correlation in clock estimates between tissues after age adjustment (Figure 3B). The means of the distributions of age-adjusted correlations for all possible tissue pairs correlations were greater than zero for each clock examined (EpiTOC P=0.003, Hannum P=0.02, Horvath P=0.0003, PhenoAge P=0.01, EpiClock P=0.02, AltumAge P <0.001, Zhang P=0.02). Figure 4 (bottom) shows the correlation between the age-adjusted clock estimates for lung and colon, with only the Horvath clock showing a correlation with P<0.05.

To assess the hypothesis that clock performance varies across tissue types due to differences in immune cell infiltration (ICI), we first examined the average leukocyte percentage in each tissue type. Based on visual inspection, there was suggestive evidence that tissue types with higher leukocyte percentage showed stronger correlation between chronological age and clock age (Supplementary Figure 7). However, given the small number of tissue types (8), we were underpowered for statistical testing, although AltumAge and EpiClock had P-values of 0.10 and 0.11, respectively.

To determine if ICI impacts clock performance, we examined with interaction between chronological age and leukocyte percentage (in relation to clock age) in a linear regression model. For some tissue types with small sample sizes (breast, kidney, muscle), the association between chronological age and clock age was no longer clear after including the interaction (P>0.05 for most clocks), suggesting lack of power for interaction testing in these tissue types (Supplementary Table 11). However, lung showed the clearest evidence that clock performance strengthened as ICI increased, with interaction P<0.05 for Horvath, Hannum, Zhang, EpiClock, PhenoAge, and EpiTOC (Supplementary Table 11).

We examined the association of participant characteristics (sex, BMI, smoking, and TL) with clock estimates for each tissue type (Supplementary Tables 12, 13). Of note, smoking showed association (P<0.05) with increased DNAm clock estimates for the Horvath clock (in testis), the Hannum clock (in lung and testis), the PhenoAge clock (in lung), EpiTOC (in lung), EpiClock (in lung and testis), and AltumAge (in lung and testis) (Table 4). For the smoking analyses, 72 tests were conducted, thus the Bonferroni-corrected P-value threshold is 0.0006. PACE (blood and lung), PhenoAge (lung), and EpiClock (lung) pass this threshold.

Table 4. Beta coefficients (and P-values) for the association between smoking and age acceleration, by tissue type and clock.

Blood (n=51)Breast (n=36)Colon (n=205)Kidney (n=47)Lung (n=194)Muscle (n=43)Ovary (n=142)Prostate (n=111)Testis (n=45)
EpiClock3.82 (0.026)0.02 (0.991)-1.52 (0.128)-0.98 (0.454)2.00 (<0.001)-1.96 (0.104)0.02 (0.973)0.12 (0.933)5.12 (0.037)
AltumAge6.96 (0.024)-5.55 (0.125)-0.69 (0.557)-3.49 (0.071)3.64 (0.002)-2.22 (0.313)2.02 (0.185)-1.27 (0.354)6.82 (0.103)
Horvath3.76 (0.055)-0.39 (0.841)0.13 (0.328)-4.13 (0.003)-0.39 (0.593)-3.34 (0.039)1.29 (0.078)-0.81 (0.438)4.52 (0.032)
Hannum1.03 (0.565)-2.28 (0.073)-3.43 (0.097)-0.04 (0.738)1.96 (0.005)-1.83 (0.152)-0.65 (0.339)-1.39 (0.234)1.23 (0.279)
PhenoAge1.62 (0.557)-4.20 (0.193)-3.53 (0.20)-1.92 (0.369)4.92 (<0.001)-0.97 (0.596)0.69 (0.449)-0.38 (0.838)1.94 (0.205)
EpiTOC0.00 (0.838)-0.01 (0.241)-0.01 (0.027)-0.00 (0.599)0.01 (0.003)-0.00 (0.883)0.00 (0.301)0.00 (0.979)0.00 (0.204)
Zhang-2.32 (0.214)-1.32 (0.339)-2.18 (0.038)-0.09 (0.940)0.02 (0.980)-3.82 (0.080)0.01 (0.051)-0.378 (0.221)6.45 (0.084)
PACE0.009 (<0.001)0.118 (0.003)0.064 (0.014)0.042 (0.224)0.083 (<0.001)0.006 (0.805)0.007 (0.547)0.034 (0.222)-0.008 (0.588)
Linear regression models include sex, BMI, and telomere length (TQI) as covariates. Association estimates with P-vales <0.05 are shown in bold text. Smoking is coded as never (0) and ever (1).

Discussion

In this study we have applied eight DNAm aging/clock algorithms to DNAm data derived from 9 different human tissue types, and we demonstrate that there are substantial differences in these clock estimates across tissue types. We found that the tissue types examined varied in terms of their mean clock estimates, as well as the strength of the correlation between the clock estimates and chronological age. These differences across tissue types were most apparent for clocks trained using DNAm from blood only (e.g., Hannum), but also present for clocks trained on multiple tissue types (e.g., Horvath, a clock designed for pan-tissue age prediction). When applied to different tissue types, each clock showed strong correlations of epigenetic age across tissues, but these correlations were drastically (but not completely) attenuated after adjustment for age, suggesting age acceleration estimates from a single tissue type (e.g., whole blood) can potentially serve as a proxy for other tissue types, although perhaps a weak proxy. Therefore, studies of additional tissue types are needed.

We acknowledge that some of the clocks examined here (Hannum, PhenoAge), were not developed with the goal of being applied to DNAm data from non-blood tissue types. Therefore, one should not expect these clocks to predict age (or other aging-related phenotypes) equally well across tissue types. The pan-tissue clocks tended to be better predictors of chronological age for non-blood tissues, but prediction for these clocks tended to be worse for muscle, ovary, and testis (when compared to lung, colon, breast, kidney, and prostate). Of note, ovary and testis were not in the Horvath training sample. All clocks showed the strongest correlation with chronological age when applied to DNAm data from blood (including the pan-tissue clocks), which likely reflects the fact that blood DNAm data is the most common type of data in the datasets used to train the clocks examined in this work. Furthermore, we provide suggestive evidence that higher immune cell infiltration into non-blood tissues leads to improved clock performance. We also acknowledge that mitotic clocks, such as EpiTOC, are not designed to predict chronological age, although clear correlation with age is observed across all tissue types.

The clocks varied with respect to their mean estimate across tissues, with AltumAge tending to provide the highest age estimates and PhenoAge tending to provide the lowest age estimates. For all clocks, the mean of the clock estimate tended to be closest to mean for chronological age when applied to blood DNAm data. It is possible that some differences in performance observed among clocks may be due in part to differences in the age distribution of the clock’s training dataset compared to our GTEx sample.

The effects of smoking on DNAm clock estimates appeared more pronounced in lung tissue (and perhaps in blood and testicular tissue) as compared to other tissues, with smoking being generally associated with increased DNAm aging across clocks. This observation likely reflects the fact that lung tissue is exposed to tobacco combustion products directly via inhalation, whereas other tissue types are primarily exposed to tobacco combustion byproducts from the blood stream. The association of smoking with accelerated aging in testicular tissue is novel and requires validation in future studies. No other donor characteristics showed clear evidence of consistent association with clock estimates across tissue types (Supplementary Tables 12, 13).

We observed that EpiTOC estimates generally showed weaker correlations with chronological age compared to the other DNAm clocks examined. EpiTOC differs from other clocks examined here in that it is a “mitotic clock” developed to reflect the number of stem cell divisions that have occurred in a tissue’s corresponding stem cell population over time rather than being trained to predict chronological age [38].

Our results suggest that forensic applications of DNAm clocks using non-blood tissue types will provide age estimates that are not as accurate as predictions based on blood, especially if using clocks algorithms trained on blood samples. Our results also suggest that tissue-specific DNAm aging measures may have utility for detecting biological differences in organ aging, as accelerated DNAm aging due to smoking was more pronounced in lung compared to other tissue types. However, in order to draw robust conclusions regarding the consistency of association between participant characteristics and DNAm clocks across tissue types, larger samples sizes are needed.

While our study comprehensively examines how chronological age associates with epigenetic age across different tissue types using 8 DNAm clocks, there were several limitations. First, our study included some tissue types with small sample sizes (~50 samples), which limited our power to detect associations between epigenetic age and donor characteristics. Larger studies of DNAm across multiple tissue/cell types (with more metadata on donors) are needed to better understand how DNAm clocks (and individual CpG sites) relate to donor characteristics. Second, we have a limited understanding of the cell type heterogeneity of bulk tissue samples from which our DNAm data is derived. This heterogeneity may influence the performance of the clock algorithms we are studying, as we demonstrate by examining the impact of leukocyte infiltration on clock performance. Cell type deconvolution approaches have been shown to be useful for adapting blood-based clocks to perform well in other tissues, such as saliva [39]. Third, the GTEx cohort of tissue donors, all of whom were selected for inclusion in GTEx post-mortem, is strongly enriched for smokers and individuals dying at younger ages; this potential selection bias may introduce bias into the associations between epigenetic and chronological age reported in this study. Furthermore, while our study examines DNAm clocks in nine tissue types, future studies that include additional tissue types will improve our understanding of how epigenetic aging varies across the human body.

Author Contributions

M.R., C.B., and L.T. performed analyses and prepared manuscript figures. M.R. and B.L.P. interpreted the data and wrote the main manuscript text. M.G.K. and F.J. generated DNA methylation data. N.J., J.L.L., and K.D. contributed to data processing, quality control, and statistical analysis. B.L.P. conceived the project and contributed to writing/editing and data interpretation.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Funding

This work was supported by grants U01 HG007601 (to B.L.P.) and R35ES028379 (to B.L.P.) and was completed in part with computational resources provided by the Center for Research Informatics at the University of Chicago. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health and by NCI, NHGRI, NHLBI, NIDA, NIMH, and NINDS. We thank the donors and their families for their generous gifts of biospecimens to the GTEx research project; the Genomics Platform at the Broad Institute for data generation; F. Aguet, J. Nedzel, and K. Ardlie for sample-delivery logistics and data-release management. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell. 2013; 153:1194–217. https://doi.org/10.1016/j.cell.2013.05.039 [PubMed]
  • 2. Horvath S, Raj K. DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat Rev Genet. 2018; 19:371–84. https://doi.org/10.1038/s41576-018-0004-3 [PubMed]
  • 3. Ahuja N, Issa JP. Aging, methylation and cancer. Histol Histopathol. 2000; 15:835–42. https://doi.org/10.14670/HH-15.835 [PubMed]
  • 4. Toyota M, Ahuja N, Ohe-Toyota M, Herman JG, Baylin SB, Issa JP. CpG island methylator phenotype in colorectal cancer. Proc Natl Acad Sci USA. 1999; 96:8681–86. https://doi.org/10.1073/pnas.96.15.8681 [PubMed]
  • 5. Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, Wiemels JL, Nelson HH, Karagas MR, Padbury JF, Bueno R, Sugarbaker DJ, Yeh RF, Wiencke JK, Kelsey KT. Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet. 2009; 5:e1000602. https://doi.org/10.1371/journal.pgen.1000602 [PubMed]
  • 6. Maegawa S, Hinkal G, Kim HS, Shen L, Zhang L, Zhang J, Zhang N, Liang S, Donehower LA, Issa JP. Widespread and tissue specific age-related DNA methylation changes in mice. Genome Res. 2010; 20:332–40. https://doi.org/10.1101/gr.096826.109 [PubMed]
  • 7. Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, Heine-Suñer D, Cigudosa JC, Urioste M, Benitez J, Boix-Chornet M, Sanchez-Aguilera A, Ling C, et al. Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci USA. 2005; 102:10604–09. https://doi.org/10.1073/pnas.0500398102 [PubMed]
  • 8. Teschendorff AE, Menon U, Gentry-Maharaj A, Ramus SJ, Weisenberger DJ, Shen H, Campan M, Noushmehr H, Bell CG, Maxwell AP, Savage DA, Mueller-Holzner E, Marth C, et al. Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer. Genome Res. 2010; 20:440–46. https://doi.org/10.1101/gr.103606.109 [PubMed]
  • 9. Rakyan VK, Down TA, Maslau S, Andrew T, Yang TP, Beyan H, Whittaker P, McCann OT, Finer S, Valdes AM, Leslie RD, Deloukas P, Spector TD. Human aging-associated DNA hypermethylation occurs preferentially at bivalent chromatin domains. Genome Res. 2010; 20:434–39. https://doi.org/10.1101/gr.103101.109 [PubMed]
  • 10. Jones MJ, Goodman SJ, Kobor MS. DNA methylation and healthy human aging. Aging Cell. 2015; 14:924–32. https://doi.org/10.1111/acel.12349 [PubMed]
  • 11. Bergsma T, Rogaeva E. DNA Methylation Clocks and Their Predictive Capacity for Aging Phenotypes and Healthspan. Neurosci Insights. 2020; 15:2633105520942221. https://doi.org/10.1177/2633105520942221 [PubMed]
  • 12. Zhang W, Qu J, Liu GH, Belmonte JC. The ageing epigenome and its rejuvenation. Nat Rev Mol Cell Biol. 2020; 21:137–50. https://doi.org/10.1038/s41580-019-0204-5 [PubMed]
  • 13. Li A, Koch Z, Ideker T. Epigenetic aging: Biological age prediction and informing a mechanistic theory of aging. J Intern Med. 2022; 292:733–44. https://doi.org/10.1111/joim.13533 [PubMed]
  • 14. Lu AT, Quach A, Wilson JG, Reiner AP, Aviv A, Raj K, Hou L, Baccarelli AA, Li Y, Stewart JD, Whitsel EA, Assimes TL, Ferrucci L, Horvath S. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging (Albany NY). 2019; 11:303–27. https://doi.org/10.18632/aging.101684 [PubMed]
  • 15. Levine ME, Lu AT, Quach A, Chen BH, Assimes TL, Bandinelli S, Hou L, Baccarelli AA, Stewart JD, Li Y, Whitsel EA, Wilson JG, Reiner AP, et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging (Albany NY). 2018; 10:573–91. https://doi.org/10.18632/aging.101414 [PubMed]
  • 16. Yang Z, Wong A, Kuh D, Paul DS, Rakyan VK, Leslie RD, Zheng SC, Widschwendter M, Beck S, Teschendorff AE. Correlation of an epigenetic mitotic clock with cancer risk. Genome Biol. 2016; 17:205. https://doi.org/10.1186/s13059-016-1064-3 [PubMed]
  • 17. Vijayakumar KA, Cho GW. Pan-tissue methylation aging clock: Recalibrated and a method to analyze and interpret the selected features. Mech Ageing Dev. 2022; 204:111676. https://doi.org/10.1016/j.mad.2022.111676 [PubMed]
  • 18. de Lima Camillo LP, Lapierre LR, Singh R. A pan-tissue DNA-methylation epigenetic clock based on deep learning. NPJ Aging. 2022; 8:4.
  • 19. Zhang Q, Vallerga CL, Walker RM, Lin T, Henders AK, Montgomery GW, He J, Fan D, Fowdar J, Kennedy M, Pitcher T, Pearson J, Halliday G, et al. Improved precision of epigenetic clock estimates across tissues and its implication for biological ageing. Genome Med. 2019; 11:54. https://doi.org/10.1186/s13073-019-0667-1 [PubMed]
  • 20. Belsky DW, Caspi A, Corcoran DL, Sugden K, Poulton R, Arseneault L, Baccarelli A, Chamarti K, Gao X, Hannon E, Harrington HL, Houts R, Kothari M, et al. DunedinPACE, a DNA methylation biomarker of the pace of aging. Elife. 2022; 11:e73420. https://doi.org/10.7554/eLife.73420 [PubMed]
  • 21. Grodstein F, Lemos B, Yu L, Iatrou A, De Jager PL, Bennett DA. Characteristics of Epigenetic Clocks Across Blood and Brain Tissue in Older Women and Men. Front Neurosci. 2021; 14:555307. https://doi.org/10.3389/fnins.2020.555307 [PubMed]
  • 22. Bell CG, Lowe R, Adams PD, Baccarelli AA, Beck S, Bell JT, Christensen BC, Gladyshev VN, Heijmans BT, Horvath S, Ideker T, Issa JJ, Kelsey KT, et al. DNA methylation aging clocks: challenges and recommendations. Genome Biol. 2019; 20:249. https://doi.org/10.1186/s13059-019-1824-y [PubMed]
  • 23. GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013; 45:580–85. https://doi.org/10.1038/ng.2653 [PubMed]
  • 24. GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020; 369:1318–30. https://doi.org/10.1126/science.aaz1776 [PubMed]
  • 25. GTEx Portal [Internet]. 2022. https://www.gtexportal.org/home/.
  • 26. eGTEx Project. Enhancing GTEx by bridging the gaps between genotype, gene expression, and disease. Nat Genet. 2017; 49:1664–70. https://doi.org/10.1038/ng.3969 [PubMed]
  • 27. Oliva M, Demanelis K, Lu Y, Chernoff M, Jasmine F, Ahsan H, Kibriya MG, Chen LS, Pierce BL. DNA methylation QTL mapping across diverse human tissues provides molecular links between genetic variation and complex traits. Nat Genet. 2023; 55:112–22. https://doi.org/10.1038/s41588-022-01248-z [PubMed]
  • 28. Morris TJ, Butcher LM, Feber A, Teschendorff AE, Chakravarthy AR, Wojdacz TK, Beck S. ChAMP: 450k Chip Analysis Methylation Pipeline. Bioinformatics. 2014; 30:428–30. https://doi.org/10.1093/bioinformatics/btt684 [PubMed]
  • 29. Triche TJ Jr, Weisenberger DJ, Van Den Berg D, Laird PW, Siegmund KD. Low-level processing of Illumina Infinium DNA Methylation BeadArrays. Nucleic Acids Res. 2013; 41:e90. https://doi.org/10.1093/nar/gkt090 [PubMed]
  • 30. Fortin JP, Triche TJ Jr, Hansen KD. Preprocessing, normalization and integration of the Illumina HumanMethylationEPIC array with minfi. Bioinformatics. 2017; 33:558–60. https://doi.org/10.1093/bioinformatics/btw691 [PubMed]
  • 31. Teschendorff AE, Marabita F, Lechner M, Bartlett T, Tegner J, Gomez-Cabrero D, Beck S. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics. 2013; 29:189–96. https://doi.org/10.1093/bioinformatics/bts680 [PubMed]
  • 32. Home | DNA Methylation Age Calculator [Internet]. 2023. https://dnamage.genetics.ucla.edu/.
  • 33. Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013; 14:R115. https://doi.org/10.1186/gb-2013-14-10-r115 [PubMed]
  • 34. AltumAge [Internet]. Singh Lab @ Brown. 2023. https://github.com/rsinghlab/AltumAge.
  • 35. Hannum G, Guinney J, Zhao L, Zhang L, Hughes G, Sadda S, Klotzle B, Bibikova M, Fan JB, Gao Y, Deconde R, Chen M, Rajapakse I, et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol Cell. 2013; 49:359–67. https://doi.org/10.1016/j.molcel.2012.10.016 [PubMed]
  • 36. Aran D, Sirota M, Butte AJ. Systematic pan-cancer analysis of tumour purity. Nat Commun. 2015; 7:10707. https://doi.org/10.1038/ncomms9971 [PubMed]
  • 37. Demanelis K, Jasmine F, Chen LS, Chernoff M, Tong L, Delgado D, Zhang C, Shinkle J, Sabarinathan M, Lin H, Ramirez E, Oliva M, Kim-Hellmuth S, et al, and GTEx Consortium. Determinants of telomere length across human tissues. Science. 2020; 369:eaaz6876. https://doi.org/10.1126/science.aaz6876 [PubMed]
  • 38. Teschendorff AE. A comparison of epigenetic mitotic-like clocks for cancer risk prediction. Genome Med. 2020; 12:56. https://doi.org/10.1186/s13073-020-00752-3 [PubMed]
  • 39. Galkin F, Kochetov K, Mamoshina P, Zhavoronkov A. Adapting Blood DNA Methylation Aging Clocks for Use in Saliva Samples With Cell-type Deconvolution. Front Aging. 2021; 2:697254. https://doi.org/10.3389/fragi.2021.697254 [PubMed]