Abstract

Aging is reflected by highly reproducible DNA methylation (DNAm) changes that open new perspectives for estimation of chronological age in legal medicine. DNA can be harvested non-invasively from cells at the inside of a person's cheek using buccal swabs – but these specimens resemble heterogeneous mixtures of buccal epithelial cells and leukocytes with different epigenetic makeup. In this study, we have trained an age predictor based on three age-associated CpG sites (associated with the genes PDE4C, ASPA, and ITGA2B) for swab samples to reach a mean absolute deviation (MAD) between predicted and chronological age of 4.3 years in a training set and of 7.03 years in a validation set. Subsequently, the composition of buccal epithelial cells versus leukocytes was estimated by two additional CpGs (associated with the genes CD6 and SERPINB5). Results of this “Buccal-Cell-Signature” correlated with cell counts in cytological stains (R2 = 0.94). Combination of cell type-specific and age-associated CpGs into one multivariate model enabled age predictions with MADs of 5.09 years and 5.12 years in two independent validation sets. Our results demonstrate that the cellular composition in buccal swab samples can be determined by DNAm at two cell type-specific CpGs to improve epigenetic age predictions.