Research Paper Volume 13, Issue 7 pp 10619—10658

Identification of novel functional CpG-SNPs associated with Type 2 diabetes and birth weight

Rui-Ke Liu1,2, , Xu Lin1, , Zun Wang3, , Jonathan Greenbaum4, , Chuan Qiu4, , Chun-Ping Zeng5, , Yong-Yao Zhu2, , Jie Shen1, , Hong-Wen Deng1,4,6, ,

  • 1 Department of Endocrinology and Metabolism, The Third Affiliated Hospital of Southern Medical University, Guangzhou 510630, China
  • 2 Department of Endocrinology and Metabolism, SSL Central Hospital of Dongguan City, Dongguan 523326, China
  • 3 Xiangya Nursing School, Central South University, Changsha 410013, China
  • 4 Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA 70112, USA
  • 5 Department of Endocrinology and metabolism, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou 510330, China
  • 6 School of Basic Medical Sciences, Central South University, Changsha 410000, China

Received: November 6, 2020       Accepted: March 4, 2021       Published: April 4, 2021      

https://doi.org/10.18632/aging.202828
How to Cite

Copyright: © 2021 Liu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Genome-wide association studies (GWASs) have identified hundreds of genetic loci for type 2 diabetes (T2D) and birth weight (BW); however, a large proportion of the total trait heritability remains unexplained. The previous studies were generally focused on individual traits and largely failed to identify the majority of the variants that play key functional roles in the etiology of the disease. Here, we aim to identify novel functional loci for T2D, BW and the pleiotropic variants shared between them by performing a targeted conditional false discovery rate (cFDR) analysis that integrates two independent GWASs with summary statistics for T2D (n = 26,676 cases and 132,532 controls) and BW (n = 153,781) which entails greater statistical power than individual trait analyses. In this analysis, we considered CpG-SNPs, which are SNPs that may influence DNA methylation status, and are therefore considered to be functionally important. We identified 103 novel CpG-SNPs for T2D, 182 novel CpG-SNPs for BW (cFDR < 0.05), and 52 novel pleiotropic loci for both (conjunction cFDR [ccFDR] < 0.05). Among the identified novel CpG-SNPs, 33 were annotated as methylation quantitative trait loci (meQTLs) in whole blood, and 145 displayed at least some effects on meQTL, metabolic QTL (metaQTL), and/or expression QTL (eQTL). These findings may provide further insights into the shared biological mechanisms and functional genetic determinants that overlap between T2D and BW, thereby providing novel potential targets for treatment/intervention development.

Introduction

Type 2 diabetes (T2D) is a common chronic metabolic disorder characterized by hyperglycemia, insulin resistance, and impaired insulin secretion [1] It is estimated that in 2017 there were 451 million people suffering from diabetes worldwide [2] 90% of which were classified as T2D [3] Due to the reduced quality of life, increased mortality, and a significant burden on the healthcare system, T2D represents a severe global public health concern. Therefore, it is imperative to gain a better understanding of the pathophysiological mechanisms involved in the onset of T2D for the enhanced development of intervention/treatment strategies.

Birth weight (BW) is a clinical indicator of a variety of metabolic conditions that manifest with age. Studies have demonstrated that low BW is associated with the increased risk of developing T2D [4, 5]. The concept of "developmental programming" holds that events occurring during the early development of an individual and specifically during intrauterine life have profound consequences on future disease such as diabetes [6, 7]. Both T2D and BW are believed to be highly influenced by genetic factors with heritability estimates of over 50% and 37%, respectively [8, 9]. Additionally, the phenotypic correlation between T2D and BW suggests that these traits may share overlapping genetic determinants [10]. Several studies have also proposed the genetic correlation between BW and T2D. For instance, a Mendelian randomization study demonstrated the genetic effects on retarded fetal growth and increased diabetes risk by showing that lower birth weight was associated with increased risk of T2D and higher fasting glucose concentration [11]. Our previous study also identified several loci that associated with both T2D and BW [12]. Therefore, studying genetic relationships between BW and T2D could yield insights into the genetic regulation of T2D risk during the fetal stage.

Hundreds of single nucleotide polymorphisms (SNPs) associated with T2D or BW have been identified by genome-wide association studies (GWASs) [13, 14]. However, these SNPs can only explain a small proportion of the total heritability for T2D (~10%) [15] and BW (~25%) [16]. The large majority of the missing heritability is likely attributed to the well-documented limitations of the single-trait GWAS analysis [17]. Due to the polygenic architecture of most complex traits, many SNPs have associations too weak to be identified with the relatively small sample sizes of current GWASs [18]. Therefore, it is essential to employ statistical approaches that can increase the effective sample size by incorporating more information embedded in the existing univariate datasets. For example, by incorporating the pleiotropic effects among correlated traits, it may be possible to augment the sample sizes of standard GWAS for individual traits and identify more trait-associated loci that would otherwise be missed.

Previous studies suggest that epigenetic mechanisms, which are a crucial link between the genetic factors and environmental exposures [19], may also account for some of the missing heritability [20]. DNA methylation occurs mainly at the fifth position of the cytosine ring in CpG dinucleotides [21], and SNPs that are associated with methylation status are commonly referred to as CpG-SNPs. Although it was previously believed that methylation of the promoter region is responsible for transcriptional silencing, emerging evidence suggests that DNA methylation is closely related to the expression across all genomic regions [19]. DNA methylation is the most well-explored epigenetic mark and is also involved in T2D pathogenesis [22]. For instance, Ma et al. [23] identified 30 CpGs representing the whole blood DNA methylation signatures that are associated with cardiovascular disease risk factors and all-cause mortality. Our previous work also proposed that peripheral blood-derived meQTL loci were related to the increasing risk of diabetes and coronary artery disease [23, 24]. Focusing on these CpG-SNPs may be an effective strategy to improve the detection of novel potential functional variants associated with T2D and BW.

A genetic-pleiotropy-informed conditional false discovery rate (cFDR) method was developed to improve the gene discovery in complex traits by integrating two independent GWASs for related traits [25]. A major advantage of this approach is that it only requires the summary statistics rather than the individual level genotype data, which are usually not easily accessible, as well as improves statistical power for identifying novel polygenic effects [25]. Using this approach, our group has analyzed multiple sets of genetically associated traits and successfully identified many novel trait-associated loci [12, 26].

In this study, we applied a targeted cFDR analysis on CpG-SNPs for T2D and BW [13, 14] to identify novel functional loci that are shared between these two traits. Our findings will provide novel insights into the shared pathophysiology between T2D and BW.

Results

Assessment of pleiotropic enrichment

We observed a clear separation between each curve in the conditional Q-Q plot, and the enrichment of effects in T2D varied on different levels of association for BW (Figure 1A and 1C). We can intuitively observe and graphically assess a strong enrichment of T2D-associated SNPs conditioning on various strengths of associations with the BW. The similar separation between the different curves and similar enrichment pattern was also observed in BW conditioned on T2D (Figure 1B and 1D). This result indicates a strong pleiotropy between T2D and BW, regardless of whether T2D is conditioned on BW or BW is conditioned on T2D.

Stratified Q-Q plots and fold-enrichment plots. Stratified Q-Q plots of nominal versus empirical -log10(p) values in T2D (A) as a function of the significance of the association with BW at the level of -log10(p) > 0, -log10(p) > 1, -log10(p) > 2, -log10(p) > 3 corresponding to p ≤ 1, p ≤ 0.1, p ≤ 0.01, p ≤ 0.001, and p ≤ 0.0001, respectively. and (B) reversely BW as a function of the significance of the association with T2D. Fold-enrichment plots of enrichment versus nominal -log10p-values (corrected for inflation) corresponding to levels of p ≤ 1, p ≤ 0.1, p ≤ 0.01, p ≤ 0.001, respectively in (C) T2D as a function of significance of the association with BW; and in (D) BW as a function of significance with T2D. Dashed lines indicate the null-hypothesis.

Figure 1. Stratified Q-Q plots and fold-enrichment plots. Stratified Q-Q plots of nominal versus empirical -log10(p) values in T2D (A) as a function of the significance of the association with BW at the level of -log10(p) > 0, -log10(p) > 1, -log10(p) > 2, -log10(p) > 3 corresponding to p ≤ 1, p ≤ 0.1, p ≤ 0.01, p ≤ 0.001, and p ≤ 0.0001, respectively. and (B) reversely BW as a function of the significance of the association with T2D. Fold-enrichment plots of enrichment versus nominal -log10p-values (corrected for inflation) corresponding to levels of p ≤ 1, p ≤ 0.1, p ≤ 0.01, p ≤ 0.001, respectively in (C) T2D as a function of significance of the association with BW; and in (D) BW as a function of significance with T2D. Dashed lines indicate the null-hypothesis.

T2D loci identified by cFDR

Conditional on association with BW, we identified 127 significant CpG-SNPs (cFDR ≤ 0.05) for T2D variation, which were located on 21 different chromosomes and annotated to 110 genes (Figure 2A, Supplementary Table 1). Among these significant SNPs, 103 were not reported compared with the original T2D GWAS study [13], our earlier work [12] and other previous studies [15, 27]. Using the more conservative threshold of cFDR ≤ 0.01, 64 significant loci remained.

Conditional Manhattan plot. Conditional Manhattan plot of conditional -log10(FDR) values for (A) T2D given BW (T2D|BW), (B) BW given T2D (BW|T2D), (C) T2D and BW. The red line marks the conditional -log10(FDR) values of 1.3 corresponds to a cFDR ≤ 0.05.

Figure 2. Conditional Manhattan plot. Conditional Manhattan plot of conditional -log10(FDR) values for (A) T2D given BW (T2D|BW), (B) BW given T2D (BW|T2D), (C) T2D and BW. The red line marks the conditional -log10(FDR) values of 1.3 corresponds to a cFDR ≤ 0.05.

To explore the potential regulatory functions of these CpG-SNPs, we conducted a series of bioinformatics analyses. Our results found that 27 CpG-SNPs showed significant meQTL effects in whole blood (Supplementary Table 2). Additionally, there were 18 loci that mapped to metaQTLs. Interestingly, two novel SNPs (rs677042 and rs7816345) were associated with bile acids (i.e., ursodeoxycholate, hyodeoxycholate), and a third novel SNP (rs10774563) was related to branched chain fatty acids (i.e., ethylmalonate, methylsuccinate). Bile acid and branched-chain fatty acids are known to be involved in the mechanisms of glucose metabolism [28]. Multiple recent studies reported that short-chain fatty acids and branched short-chain fatty acids may have beneficial health effects on adipocyte lipid and glucose metabolism that can contribute to improved insulin sensitivity in individuals with disturbed metabolism [29, 30]. Furthermore, the SNP rs11659412 (Supplementary Table 3) has associations with methylamine, which has previously been shown to have connections with susceptibility for T2D [31].

We also detected four pathways associated with the metabolites that are linked to the metaQTLs. Pathway analysis suggested that these metabolites were linked mainly to the lipid metabolism (alpha-Linolenic acid metabolism and Glycerophospholipid metabolism), energy metabolism (Methane metabolism), and genetic Information Processing (aminoacyl-tRNA biosynthesis) (Supplementary Table 4). Lastly, we detected the SNPs enriched in the eQTLs, which may regulate gene expression levels. Eight CpG-SNPs showed the eQTL effect and were associated with enhancer, promoter, and DNAse elements in various tissues. Notably, two novel CpG-SNPs, rs7787720, and rs579459 showed meQTL, eQTL, and metaQTL effects simultaneously (Table 2).

Table 1. Conjunction cFDR for 55 pleiotropic CpG-SNPs in T2D and BW (ccFDR ≤ 0.05).

VariantChrPosAltGeneLocationeQTL/meQTL/metaQTLSNP TypeGene TypecFDR_T2DcFDR_BWccFDR
rs10449766142070125A/GHNRNPFP128396 upstream-novelnovel4.99E-023.73E-024.99E-02
rs3408831213972363C/TPROX1-AS1non-coding intronic-T2Dnovel7.04E-052.13E-022.13E-02
rs75538901213832562T/CPROX1-AS1non-coding-novelnovel1.89E-022.53E-031.89E-02
rs15151142226233671A/GAC062015.148300 downstreameQ-TLnovelnovel8.17E-031.82E-021.82E-02
rs15228122226132738A/GAC062015.147306 downstreameQTLnovelnovel7.43E-041.55E-021.55E-02
rs28945932226325601T/CAC062015.1140230 upstreameQTLnovelnovel9.16E-036.05E-039.16E-03
rs7605661243397939T/CTHADAintronicmeQTLnovelT2D3.21E-041.87E-031.87E-03
rs173613243123412407C/TADCY5intronicmeQTLnovelpleiotropic8.40E-091.20E-188.40E-09
rs46778873123381376T/GADCY5introniceQTLpleiotropicpleiotropic6.80E-062.91E-116.80E-06
rs46778893123424425G/AADCY5intronic-novelpleiotropic2.53E-021.02E-022.53E-02
rs5692553125207090G/ASLC12A8introniceQTLnovelnovel2.27E-021.01E-022.27E-02
rs67704203170931960G/AKLF7P120890 downstreameQTL,metaQTLnovelnovel1.75E-031.63E-021.63E-02
rs6794193347073414T/CSETD2introniceQTL,meQTLnovelnovel3.92E-023.02E-033.92E-02
rs92892183123345984C/TADCY5introniceQTLnovelpleiotropic1.60E-028.91E-041.60E-02
rs7663887417901297C/ALCORLintroniceQTLnovelBW1.89E-025.90E-071.89E-02
rs10514870559055501A/GPDE4D, AC092343.1intronic,intronic-novelnovel2.18E-024.17E-042.18E-02
rs1012635620675064A/GCDKAL1intronic-pleiotropicpleiotropic2.64E-141.00E-091.00E-09
rs12526403641676676C/TTFEB7302 downstream-novelnovel4.50E-023.83E-024.50E-02
rs2206734620694653C/TCDKAL1intronicmeQTLnovelpleiotropic9.60E-274.80E-174.80E-17
rs2745929620754530T/CCDKAL1intronic-novelpleiotropic2.10E-094.77E-094.77E-09
rs48973786130217352C/TSAMD35upstreameQTLnovelnovel4.54E-021.20E-024.54E-02
rs17689040740880714C/GSUGCT19951 upstream-novelnovel3.04E-021.31E-023.04E-02
rs6948511727939096T/CJAZF1introniceQTL, metaQTLnovelT2D4.02E-025.80E-034.02E-02
rs7723744578194G/ATMED43utreQTL, metaQTLT2Dnovel4.22E-048.40E-048.40E-04
rs7004862894864735T/GINTS8introniceQTL, metaQTLnovelT2D2.26E-039.35E-039.35E-03
rs7816345836988591C/TAC090453.1179 upstreameQTL, metaQTLnovelnovel2.68E-021.05E-022.68E-02
rs10739970994134010A/GPTPDC124154 upstream-novelnovel3.18E-023.78E-023.78E-02
rs10990568995651855A/GAL354861.2non-coding intronic-novelnovel2.46E-021.19E-022.46E-02
rs5794599133278724T/CABO3510 upstreameQTL,meQTL,metaQTLnovelT2D8.22E-063.60E-053.60E-05
rs242101910122391070C/TPLEKHA15upstreameQTLnovelpleiotropic1.17E-069.45E-049.45E-04
rs24880711092739820A/GY_RNA29212 upstreameQTL, metaQTLT2Dnovel1.96E-109.00E-089.00E-08
rs70707861092363930C/TMARCH59966 upstreameQTLnovelnovel9.54E-054.80E-034.80E-03
rs14473511192984997A/GMTNR1B3utrmetaQTLnovelpleiotropic1.47E-032.05E-032.05E-03
rs151216112659585C/TKCNQ1, KCNQ1OT1intronic,non-coding-novelpleiotropic1.33E-051.97E-041.97E-04
rs163177112817183T/CKCNQ1introniceQTL,meQTLT2Dpleiotropic2.07E-101.52E-031.52E-03
rs231354112685121T/CKCNQ1, KCNQ1OT1intronic,non-codingeQTL,meQTLpleiotropicpleiotropic2.40E-049.10E-039.10E-03
rs234857112831299T/CKCNQ1intronic-T2Dpleiotropic1.09E-064.67E-024.67E-02
rs3213225112135306G/AIGF2, INS-IGF2intronic,introniceQTL,meQTLnovelpleiotropic4.15E-039.59E-054.15E-03
rs10427251265964567C/THMGA23utr,-T2Dpleiotropic1.54E-031.90E-291.54E-03
rs10774202124168281A/GAC007207.150149 upstream-novelnovel2.42E-022.32E-022.42E-02
rs108629601277030355C/TE2F7introniceQTLnovelnovel2.14E-027.30E-032.14E-02
rs108783531265988752T/CHMGA222457 upstream-novelpleiotropic5.34E-031.04E-065.34E-03
rs493071812123428886A/GRILPL2introniceQTLnovelnovel6.05E-033.28E-046.05E-03
rs128652431340104683G/ALINC00598non-coding intronic-novelnovel2.44E-023.32E-042.44E-02
rs95324981340104306G/CLINC00598non-coding intronic-novelnovel2.05E-022.22E-032.05E-02
rs46257141655607701C/TLPCAT221031 upstream-novelnovel2.64E-021.05E-022.64E-02
rs15317981778826049A/GUSP36introniceQTLnovelnovel7.84E-033.72E-023.72E-02
rs1985421750567176G/ACACNA1Gintronic-novelnovel9.15E-036.42E-039.15E-03
rs390200177206676A/GDLG4introniceQTLnovelnovel1.88E-026.89E-101.88E-02
rs65655311781049580G/ABAIAP2introniceQTL,meQTLnovelnovel1.88E-023.61E-023.61E-02
rs8786191750555910A/GSPATA2058 upstreameQTL,meQTLnovelnovel2.23E-034.24E-042.23E-03
rs24267782058718421G/ANPEPL1non-codingeQTLnovelnovel4.76E-026.10E-034.76E-02
rs9263452041143307T/CPLCG1introniceQTL,meQTLnovelnovel3.11E-029.32E-043.11E-02
rs1378482250001867T/CIL17RELintroniceQTL,meQTLnovelnovel3.55E-042.08E-022.08E-02
rs60063932230194037T/CAC002378.1non-coding introniceQTLnovelnovel2.14E-041.53E-021.53E-02
Abbreviations: Chr, chromosome; Pos, chromosomal position (GRCh38/hg38); Alt, reference allele/alter allele; eQTL, expression quantitative trait locus; meQTL, methylation quantitative trait locus (including associated SNPs with an LD r2 ≥ 0.8, Supplementary Table 3); metaQTL, metabolic quantitative trait locus. T2D, type 2 diabetes; BW, birth weight; cFDR, conditional false discovery rate; ccFDR, conjunctional conditional false discovery rate. The allele was exhibited as reference allele/alter allele; SNP type and gene type means whether identified CpG-SNPs and genes have been reported in previous GWAS or in our previous related cFDR studies.

Table 2. Functional annotation for 17 CpG-SNPs showing significant effects in meQTL, eQTL, and metaQTL.

rsIDGENCODE genesTraitsmeQTL (p)eQTL hitsmQTL (metabolics)Promoter histone marksEnhancer histone marksDNAseProteins boundMotifs changed
rs579459ABOPleiotropic1.45E-095 hitsglycylglycineBLDGI4 tissuesNFYA, POL2Hmx, Nkx2
rs6446490PPP2R2CT2D2.81E-167 hits-LIV4 tissues4 tissues7 bound proteins-
rs7787720AC005019.2T2D2.04E-221 hitsalicyluric glucuronide-MUS, LNG, SKIN--Mef2
rs12245680TCF7L2T2D8.59E-05--15 tissues17 tissues29 tissuesFOXA1-
rs11819995ETS1T2D5.19E-05--19 tissues9 tissues15 tissuesPOL2NRSF
rs2237892KCNQ1T2D1.45E-10-#gamma-g, **N2, N2-d-5 tissuesKID, MUS-8 altered motifs
rs2334499FAM99BT2D4.05E-092 hits-LIVADRL, LIVMUS, MUS-GR, PU.1, RXRA
rs2291725GIPT2D1.62E-0629 hits-11 tissues14 tissues22 tissues16 bound proteinsSin3Ak-20
rs6687139LINC01681BW1.90E-051 hit-FAT7 tissues4 tissues--
rs863818PIK3R1BW--1-methylxanthineBLD15 tissuesHRT--
rs3750640ASB13BW-2 hitsphenylalanylserineGI7 tissuesIPSC, THYM, GI-PPAR
rs4980661AP000439.2BW1.02E-04--SPLNHRT, MUS, LIVLNG, BLD--
rs11079803PNPOBW5.25E-0957 hits-19 tissues13 tissues11 tissues-6 altered motifs
rs4647887SNHG16BW6.23E-2033 hits-5 tissues16 tissues14 tissuesGATA1-
rs2586211GNALBW--allantoinLIV9 tissues9 tissuesCTCF, CMYCCEBPB, GATA
rs2261988UHRF1BW1.83E-057 hits-21 tissues10 tissues24 tissuesPOL2, POL24H84 altered motifs
rs492602FUT2BW2.50E-0731 hits*ADp----Znf143
Abbreviations: meQTL, methylation quantitative trait locus (including associated SNPs with an LD r2 ≥ 0.8, Supplementary Table 3); eQTL, expression quantitative trait locus; metaQTL, metabolic quantitative trait locus. DNAse, deoxyribonuclease; *ADp, ADpSGEGDFXAEGGGVR; #gamma-g, gamma-glutamylvaline; **N2, N2-d, N2, N2-dimethylguanosine.

BW loci identified with cFDR

We identified a total of 188 significant CpG-SNPs (cFDR ≤ 0.05) for BW variation on their association with T2D, which were mapped to 20 different chromosomes (Figure 2B, Supplementary Table 5). Other than seven confirmed CpG-SNPs for BW [12, 14], the remaining 182 are novel CpG-SNPs. Using the more conservative threshold of conditional cFDR ≤ 0.01, 85 significant loci remained.

Likewise, we performed the analysis to explore the potential regulatory functions of these identified CpG-SNPs. We found 34 CpG-SNPs that showed significant meQTL effects in whole blood (Supplementary Table 2), and 20 CpG-SNPs that showed metaQTL effects. Among these SNPs identified by the metaQTL analysis, three were associated with bile acids, and one was associated with fatty acids. For the metabolites associated with these CpG-SNPs, two significant pathways were detected, including aminoacyl-tRNA biosynthesis and glycerophospholipid metabolism (Supplementary Table 3, Supplementary Table 4). Lastly, nine novel CpG-SNPs (i.e., rs6687139, rs863818, rs579459, rs3750640, rs4980661, rs11079803, rs4647887, rs2586211, and rs2261988) were identified as eQTL SNPs and were associated with enhancer, promoter, and DNAse elements in various tissues. There was one novel CpG-SNP, rs492602, which showed meQTL, eQTL, and metaQTL effects simultaneously (Table 2). To further verify the eQTL effects of identified loci, we applied LocusCompare method (see methods). Using this method, we identified 11 eQTL loci in T2D and 15 eQTL loci in BW (Supplementary Table 6). Two of these loci are overlapped with our results: JAZF1 (p = 8.7 × 10−9), and PLEKHA1 (p = 5.2 × 10−8), indicating that these two genes are more likely to be the eQTL loci.

Pleiotropic loci in T2D and BW identified with ccFDR

We computed ccFDR and constructed a ccFDR Manhattan plot to investigate whether any of the CpG-SNPs were associated with both T2D and BW. We found 55 independent pleiotropic SNPs detected by our analysis (The detailed annotations are listed in Supplementary Table 1 and 5), which were located on 16 chromosomes that reached a significance level of ccFDR ≤ 0.05 (Figure 2C, Table 1). With the more stringent significance threshold of ccFDR ≤ 0.01, 23 pleiotropic CpG-SNPs remained. Among the identified loci, 52 CpG-SNPs were suggested to be novel. In total, five SNPs have been associated with T2D, while none of these SNPs has previously been identified for BW. All the identified CpG-SNPs annotated to 45 different genes, and 35 of these were not detected as pleiotropic genes in previous related research [12]. Finally, we found 35 SNPs have at least one eQTL, meQTL, or metaQTL effect. One pleiotropic CpG-SNP, rs579459 (ABO), showed eQTL, meQTL, and metaQTL effects simultaneously (Table 2).

GO enrichment analysis and protein-protein interaction analysis.

We conducted GO enrichment analysis for the T2D- and BW- associated genes that were annotated to the identified CpG-SNPs to explore the potential regulatory functions. The identified SNPs were enriched in biological processes related to "response to oxygen-containing compound" and a molecular function of "scaffold protein binding". We also found that genes associated with T2D were significantly enriched in the pathway of "regulation of hormone levels" (FDR = 3.04 × 10−3) and "regulation of insulin secretion" (FDR = 4.19 × 10−3) (Supplementary Table 7). Using STRING 11.0 database, we performed protein-protein interaction analysis to further investigated the functional partnership among identified T2D- and BW- associated genes (Figure 3), respectively. PPI results showed several genes were well-connected in the interaction network in both traits, such as ADCY5, KCNQ1, IGF2, and CDKAL1, suggesting these genes are essential in the genetic network that coupling of both traits.

Functional protein association network analysis. Connections are based on evidence with a STRING 11.0 summary score above 0.4. Each network nodes represent a gene; edges between nodes indicate protein-protein interactions between protein products of the corresponding genes in (A) T2D and (B) BW, edge colors indicate the types of interaction.

Figure 3. Functional protein association network analysis. Connections are based on evidence with a STRING 11.0 summary score above 0.4. Each network nodes represent a gene; edges between nodes indicate protein-protein interactions between protein products of the corresponding genes in (A) T2D and (B) BW, edge colors indicate the types of interaction.

Results of the validation study

For cFDR analysis between BW and FG, we observed similar significant separation between the different curves, which indicates a strong pleiotropy between those two traits (Supplementary Figure 1A and 1B). Conditional on association with FG, we identified 160 significant CpG-SNPs (cFDR ≤ 0.05) for BW variation, and 131 of them were replicated compared with the main cFDR analysis. We identified a total of 104 significant CpG-SNPs for FG variation on their association with BW, and 25 of them were replicated. And we replicated 18 pleiotropic CpG-SNPs for FG and BW (Supplementary Table 8).

For cFDR analysis between BW and FI, we identified clearly separation between the different curves, which indicates a strong pleiotropy between those two traits (Supplementary Figure 1C and 1D). Conditional on association with FI, we identified 140 significant CpG-SNPs for BW variation, and 125 of them were replicated compared with the main cFDR analysis. We identified a total of 13 significant CpG-SNPs for FI variation on their association with BW, and 5 of them were replicated. And we replicated 3 pleiotropic CpG-SNPs for FI and BW (Supplementary Table 9).

For cFDR analysis between BW_maternal and T2D_corBMI, significant deflection between curves suggested strong pleiotropy between those two phenotypes (Supplementary Figure 2A and 2B). Conditional on association with T2D_corBMI, we identified 90 CpG-SNPs for BW_ maternal variation, and 13 of them were replicated compared with the main cFDR analysis. We identified a total of 622 significant CpG-SNPs for T2D_corBMI variation on their association with BW_ maternal, and 79 of them were replicated. And we replicated 8 pleiotropic CpG-SNPs for BW_ maternal and T2D_corBMI (Supplementary Table 10).

For cFDR analysis between BW_fetal and T2D_corBMI, similar deflection between curves demonstrated significant pleiotropy between those two phenotypes (Supplementary Figure 2C and 2D). Conditional on association with T2D_corBMI, we identified 133 CpG-SNPs for BW_fetal variation, and 59 of them were replicated compared with the main cFDR analysis. We identified a total of 669 significant CpG-SNPs for T2D_corBMI variation on their association with BW_ fetal, and 87 of them were replicated. And we replicated 25 pleiotropic CpG-SNPs for BW_ fetal and T2D_corBMI (Supplementary Table 11).

MR results

Finally, 46 independent SNPs were left for MR analysis (Table 3). IVW results suggested causal association between BW and T2D (OR = 1.554, se = 0.207, 95% CI (1.036, 2.330), P = 0.033), MR-Egger regression demonstrated no pleiotropy among selected IVs (P = 0.450). However, other approaches did not identify any causal association (Table 4). Our bi-directional MR analysis also suggested no causal association between T2D and BW.

Table 3. Characteristics of the instrumental variables used in MR analysis.

SNPea_Eoa_Eea_Ooa_Obeta.Ebeta.Oeaf.Eeaf.Ose.Opval.Ooutcomese.Epval.Eexposure
rs10818797CTCT0.03450.09531020.140.1410.0760.3T2D0.00541.20E-10BW
rs10830963GCGC0.023200.280.2760.0390.27T2D0.00422.90E-08BW
rs10872678CTCT-0.0375-0.0408220.280.2770.040.31T2D0.00416.90E-20BW
rs10935733CTCT-0.0221-0.09531020.590.6060.0340.092001T2D0.00399.20E-09BW
rs111778406GAGA0.0492-0.02020270.0680.0720.130.89T2D0.00755.80E-11BW
rs113086489TCTC0.030700.560.5450.0340.94T2D0.00389.10E-16BW
rs11720108TCTC0.04600.230.2490.0450.61T2D0.00433.40E-26BW
rs11765649CTCT-0.026700.250.2630.0420.37T2D0.00435.80E-10BW
rs1187118TATA-0.029900.830.8330.0580.73T2D0.00513.60E-09BW
rs12543725AGAG-0.023100.410.4120.0330.41T2D0.00381.20E-09BW
rs12906125AGAG-0.0228-0.01005030.320.3260.0370.88T2D0.0041.70E-08BW
rs13266210GAGA-0.0308-0.05129330.210.2120.0510.36T2D0.00451.30E-11BW
rs134594TCTC-0.02270.01005030.650.650.0360.77T2D0.0041.00E-08BW
rs1351394CTCT-0.043600.510.5110.0320.760001T2D0.00371.90E-32BW
rs1411424AGAG0.02120.02020270.520.5240.0330.62T2D0.00382.20E-08BW
rs1415701AGAG-0.025300.260.2690.0410.9T2D0.00432.60E-09BW
rs144843919AGAG-0.066-0.01005030.0350.0350.270.98T2D0.01161.40E-08BW
rs17034876TCTC0.04710.03045920.70.6990.0390.42T2D0.00422.60E-29BW
rs1819436CTCT0.03290.01005030.870.8770.0760.93T2D0.00576.30E-09BW
rs2131354AGAG0.025900.530.5260.0330.52T2D0.00374.10E-12BW
rs2168443ATAT-0.02280.07257070.620.6210.0350.053001T2D0.00393.50E-09BW
rs2229742CGCG-0.036-0.0408220.130.1040.0840.62T2D0.0062.20E-09BW
rs2306547TCTC-0.021100.460.4670.0330.92T2D0.00371.80E-08BW
rs2473248CTCT0.032500.870.8810.0790.94T2D0.00571.00E-08BW
rs2497304TCTC-0.028200.480.4780.0330.97T2D0.00372.60E-14BW
rs28530618GAGA-0.0261-0.09531020.510.5290.0330.041T2D0.00387.70E-12BW
rs2946179CTCT0.02400.730.740.0420.27T2D0.00421.30E-08BW
rs35261542ACAC-0.0444-0.1278330.270.2630.0390.00063T2D0.00414.40E-27BW
rs3753639CTCT0.03060.09531020.240.2450.0460.25T2D0.00457.30E-12BW
rs3780573AGAG0.05550.09531020.0960.0960.0990.22T2D0.00647.00E-18BW
rs4144829TCTC-0.034100.730.7390.0430.56T2D0.00425.30E-16BW
rs6016377TCTC0.023900.430.4460.0350.53T2D0.00399.50E-10BW
rs6040076CGCG0.0231-0.05129330.490.4940.0330.1T2D0.00392.00E-09BW
rs72480273CACA0.0313-0.05129330.170.1890.0560.38T2D0.00518.00E-10BW
rs72833480AGAG0.0226-0.03045920.290.2950.0390.49T2D0.00414.60E-08BW
rs72851023TCTC0.047600.0730.0770.130.74T2D0.00752.90E-10BW
rs7402982GAGA-0.02320.01005030.570.5860.0340.67T2D0.00392.30E-09BW
rs740746AGAG0.036400.730.7340.0410.62T2D0.00423.80E-18BW
rs753381CTCT-0.020500.550.550.0330.52T2D0.00372.80E-08BW
rs7575873GAGA-0.0384-0.06187540.120.130.0710.42T2D0.00571.20E-11BW
rs7854962GCGC-0.0279-0.0408220.220.2160.0470.36T2D0.00461.90E-09BW
rs79237883CTCT0.0371-0.08338160.080.0760.10.39T2D0.00673.50E-08BW
rs7964361AGAG0.0391-0.1053610.0850.0880.0960.25T2D0.00674.70E-09BW
rs798498GTGT-0.022900.310.3070.0380.709999T2D0.0041.30E-08BW
rs7998537AGAG-0.022200.320.3210.0370.35T2D0.0043.90E-08BW
rs854037GAGA-0.02680.09531020.190.1860.0550.37T2D0.00482.20E-08BW
rs900399GAGA-0.052300.390.3930.0380.47T2D0.00392.20E-41BW
rs9368777CGCG0.02150.03045920.580.5750.0330.31T2D0.00382.20E-08BW
Notes:
E: exposure; O: outcome; ea: effect_allele; oa: other_allele.

Table 4. MR analysis results.

OutcomeExposureMethodnsnpbsepvaloror_lci95, or_uci95
T2DBWMR Egger460.9680.7230.1872.6320.639, 10.845
T2DBWWeighted median460.0000.3091.0001.0000.546, 1.831
T2DBWInverse variance weighted460.4410.2070.0331.5541.036, 2.330
T2DBWSimple mode460.0090.5270.9861.0090.359, 2.834
T2DBWWeighted mode460.0090.4310.9831.0090.434, 2.347

Discussion

In this study, by performing the cFDR on the two independent GWAS datasets from T2D and BW, we identified 103 novel loci for T2D and 182 for BW when focusing on CpG-SNPs. Meanwhile, we identified 55 pleiotropic CpG-SNPs suggesting a shared genetic mechanism among them. Interestingly, only three of these CpG SNPs were identified as pleiotropic loci in the previous studies.

Since the genetic variants located at CpG-SNPs could affect the gene expression and regulation via epigenetic mechanisms, we investigated these 103 CpG-SNPs, which were regarded as novel SNPs associated with T2D. Among those CpG-SNPs, 55 showed at least one effect on meQTL, metaQTL, and/or eQTL, and 35 of them were located at novel risk genes for T2D. For example, rs11073964 is a novel CpG-SNP which showed both eQTL and meQTL effects and mapped to VPS33B (15q26.1). The relationship between VPS33B and T2D is unknown, although VPS33B is expressed in pancreas tissue, and encodes Vascular Protein Sorting-associated protein 33B [32]. The function of VPS33B refers to intracellular protein trafficking and membrane fusion mechanisms [33]. It also plays critical roles in bile acids metabolism [34], which could contribute to the regulation of glucose homeostasis [28]. Therefore, it is conceivable that VPS33B might influence pancreatic cell function through epigenetic mechanisms.

Another novel CpG-SNP (rs12786533), also showed both meQTL and eQTL effects and was located at the gene KCNQ1DN (KCNQ1 downstream neighbor). KCNQ1DN, which imprinted and mapped between CDKN1C and KCNQ1 on chromosome 11p15.5, is usually associated with Wilms' tumor [35]. Other imprinted genes in 11p15.5, including KCNQ1 and IGF2, are candidates for involvement in T2D [36, 37]. We also detected seven CpG-SNPs (rs151216, rs163171, rs163177, rs2237892, rs231354, rs234857, and rs3852527) which were annotated to KCNQ1. Two of these loci (rs163177 and rs231354) were shown to have eQTL and meQTL effects, and another SNP (rs2237892) was shown to have metaQTL effect. It is plausible that these multiple neighboring CpG-SNPs might synergistically regulate gene expression and play some roles in T2D.

We also investigated the significant CpG-SNPs associated with BW conditioned on T2D, in which 120 CpG-SNPs were novel loci and annotated to genes that were not reported in previous study [13]. All these SNPs showed at least one effect on eQTL, meQTL, and/or metaQTL. There were two notable SNPs, rs3184504, which was located on the gene SH2B3, and rs492602, which was located on the gene FUT2. Both SNPs showed eQTL, meQTL, and metaQTL effects simultaneously. The gene SH2B3 acts as a negative regulator of cytokine signaling and cell proliferation [38], which is known to be associated with type 1 diabetes and celiac disease [39]. The gene FUT2 encodes a specific fucosyltransferase enzyme, which is crucial for the synthesis of histo-blood group antigens [40]. Whether these genes have a function on BW is unclear, but our findings imply that epigenetic alteration deserves attention and presents new insights for further exploration.

Importantly, 19 of the 55 pleiotropic variants were novel loci that showed at least one QTL effect. Notably, seven of these 19 loci (rs6770420 in KLF7P1, rs6794193 in SETD2, rs7816345 in AC090453.1, rs6565531 in BAIAP2, rs878619 in SPATA20, rs926345 in PLCG1, and rs137848 in IL17REL) showed eQTL and either metaQTL or meQTL effect. These facts suggest that these SNPs might be involved in the shared pathogenesis of both T2D and BW.

There are several advantages in the current study. First, identifying shared genetic factors between these two traits can facilitate our understanding of the genetic correlation between BW and T2D. To our knowledge, this study is the first to use a targeted cFDR method by focusing on functional CpG-SNPs that are associated with both T2D and BW. Compared with our previous work on the same two traits [12], this work was based on the updated GWAS datasets with a larger population and sophisticated study design [13, 14]. Second, in this study, we only focused on functional genetic variants-CpG-SNPs, which not only significantly reduced the multiple testing burden but also shed light on the biological interpretation of the results. Furthermore, we performed the analysis on meQTL, metaQTL, and eQTL effects, which facilitate the identification of candidate functional variants associated with T2D and/or BW. Third, our MR analysis IVW approach showed causal association between BW and T2D while other methods did not, therefore, we cannot define the direct causal relationship between BW and T2D. This result suggested that instead of acting like a direct risk factor, the BW might regulate T2D through multiple intermediate variants such as DNA methylation or regulation of metabolism, as we demonstrated in this study. Fourth, considering BW is influenced both by inherited fetus genotypes and maternal genotypes, we performed validation analysis by using GWAS datasets of direct fetal and indirect maternal genetic effects, which further support the story. Finally, multiple validations in different GWAS datasets were performed in the current study to partially support our findings, which demonstrated the credibility and significance of these findings.

There are also some limitations to our study. First, we are unable to evaluate the effect estimates of pleiotropic SNPs on the traits due to our inability to access the individual-level data. Second, it is difficult to distinguish between the pleiotropic scenarios where a SNP directly influences both BW and T2D, or the SNP affects BW, and the resulting change in phenotype influences T2D susceptibility. However, our MR analysis results may suggest that the SNPs affect T2D susceptibility via affecting BW. Thirdly, although using LD r2 ≥ 0.2 as the threshold for SNP pairwise pruning is a common practice in similar integration studies [4143], some of our findings could still be secondary to the signals, especially for SNPs identified in the same gene without special features such as eQTL (e.g., rs151216, rs234857 in KCNQ1). However, since the identified SNPs are functional CpG-loci, the CpG-SNPs identified in this study are more likely to be the leading signals, compared with the previous results. Finally, these findings are based on a bioinformatics analysis of GWAS data. Without further molecular validation, some of these results are suggestive rather than conclusive, such as suggested CpG-loci or functional genes. The aim of our study was to find more potential novel T2D associated variants, so we hope that this limitation could partially be addressed in the future by follow up with fine-mapping studies or molecular validation experiments.

In conclusion, by using cFDR method on functional CpG-SNPs, we successfully improved the identification of novel genetic variants of both T2D and BW. Our findings offer an improved understanding of the potential shared genetic mechanisms in T2D and BW, which may provide a new direction for further biological studies and clinical trials.

Materials and Methods

GWAS datasets

We obtained GWAS summary statistics for T2D and BW from publicly available sources [13, 14]. The dataset for T2D contained meta-analysis summary statistics from 18 studies performed by DIAbetes Genetics Replication and Meta-analysis (DIAGRAM) Consortium (n = 26,676 case and 132,532 control, European) [13]. The dataset for BW contained meta-analysis summary statistics from 37 studies conducted by Early Growth Genetics (EGG) Consortium (n = 153,781, predominantly European) [14]. Both of these datasets have large enough sample sizes (n > 100,000) for statistical power. Each dataset contains summary statistics for each SNP with the p values that have undergone genomic control at the individual study level. The detailed inclusion criteria and phenotype characteristics from different GWAS are described in the original publications.

Identification of potentially functional CpG-SNPs

The CpG-SNPs in the human genome were identified by interrogating the comprehensive catalog of both common and rare genetic variants from the 1000 Genomes reference panel [44], and our in-house whole-genome high-coverage deep re-sequencing study [45]. A SNP is defined as a CpG-SNP if it introduces or disrupts a CpG site. A total of 50,278,228 CpG-SNPs was identified throughout the human genome. The details of the identification of potentially functional CpG-SNPs were described in our previous study [46].

Data processing

First, we combined the 8,099,761 common SNPs included in these two datasets, then overlapped these common SNPs with the above-identified CpG-SNPs and retrieved a total of 2,478,365 common CpG-SNPs with association summary statistics for both T2D and BW. We then used HapMap 3 genotypes as a reference, and performed a linkage disequilibrium (LD) based pruning method by PLINK 1.9 to remove pairs of SNPs with substantial correlations [47]. The process begins using a window of 50 SNPs, where LD between each pair of SNPs is calculated. If pairs have an R2 > 0.2, one of that pair of SNPs is removed. Following this initial removal of SNPs, the window shifts 5 SNPs forward, and the process is repeated until there are no pairs of SNPs that are in high LD. After pruning, 96,312 independent CpG-SNPs remained to be used in the subsequent analysis.

Statistical analysis

We constructed conditional quantile-quantile plots (Q-Q plot) to evaluate the enrichment of pleiotropic effects by evaluating the increase in the number of trait-associated SNPs for the first trait (principal trait) when conditioning on SNPs with varying strengths of association in the second trait (conditional trait). We also constructed fold-enrichment plots, which quantify the pleiotropic enrichment within each conditional subset compared with the baseline group, which includes all SNPs.

By leveraging two GWASs from T2D and BW, we applied the cFDR approach to obtain the probability that a random SNP is null for association with the principal phenotype given that the p-values for the principal and conditional phenotypes are both less than observed p-values. The method was applied for both orderings of the two phenotypes, cFDR(T2D|BW) and cFDR(BW|T2D). Then, we computed the conjunction cFDR (ccFDR), taken to be the maximum of the two cFDR values, to identify pleiotropic SNPs for both T2D and BW. Finally, we present conditional Manhattan plots to visualize the localization of the SNPs associated with T2D conditional on the strength of association with BW and the reverse. We also present a conjunction Manhattan plot to visualize the locations of the variants with a pleiotropic effect on both phenotypes. We identified a SNP as novel if it has not been reported in previous GWASs [13, 14] or our previous cFDR studies [12]. The details were presented in the Supplementary Materials and Methods.

Functional annotation of the pleiotropic CpG-SNPs

To explore the biological functions of the individual trait associated CpG-SNPs and pleiotropic CpG-SNPs, we annotated each identified CPG-SNP to corresponding DNA features or regulatory elements using functional analysis tools such as HaploReg (http://www.broadinstitute.org/mammals/haploreg/haploreg.php) and SNPnexus (http://www.snp-nexus.org/). These tools provide the ENCODE [48] and RoadMap [49] annotations for the CpG-SNPs of interest as well as other SNPs in high LD (r2 ≥ 0.8).

We further determined whether the identified CpG-SNPs or other SNPs in high LD (r2 ≥ 0.8) have expression quantitative trait loci (eQTL), methylation QTL (meQTL), or metabolic QTL (metaQTL) effects. First, we obtained the eQTL hits from HaploReg based on GTEx and other eQTL results. Then, we acquired the independent cis- and trans- meQTLs in whole blood from Bonder's study (https://genenetwork.nl/biosqtlbrowser/). Last, we obtained the metaQTLs from SNiPA (https://snipa.helmholtz-muenchen.de/snipa3/), which summarized recently published metaQTL studies. We also used the metabolites associated with these metaQTLs to perform a pathway analysis using MetaboAnalyst 4.0 (https://www.metaboanalyst.ca/). The eQTL results were further confirmed using a web tool "LocusCompare" [50]. This method calculates the colocalization between GWAS and eQTL results and identifies significant loci (GWAS lead SNP p-value < 5 × 10−8 and eQTL lead SNP p-value < 1 × 10−6). We used the original GWAS summary statistics we chose in this study as GWAS input, while using eQTL from adipose tissue as eQTL input (eQTL_ Adipose_Subcutaneous_GTEx_v7) for BW and T2D are closely associated with adipocyte biology [51, 52].

The gene ontology (GO) terms database (http://omicslab.genetics.ac.cn/GOEAST/index.php) was used to perform gene enrichment analysis among the list of genes associated with pleiotropic CpG-SNPs. Among the significant genes we identified gene sets enriched in certain biological processes, cellular components, and molecular functions. To investigate the interaction and functional relationship of the identified T2D and BW genes, protein-protein interaction analyses were constructed by using the STRING 11.0 database (http://string-db.org/).

Validation study using different GWAS datasets

Considering the long duration of action between BW and T2D, we re-performed cFDR analysis between BW and diabetes-related indicators (fasting glucose (FG), and fasting insulin (FI) [53]) to validate the results of the main cFDR analysis, and QQ plots were also generated to demonstrate the pleiotropic enrichment.

Additionally, considering BMI may be highly related with both birth weight and T2D, and BW is influenced both by inherited fetus genotypes and maternal genotypes with intrauterine environment. We further conducted cFDR analysis between BW_maternal [54] and BW_fetal [54] with T2D with correction of BMI (T2D_corBMI) [55], separately.”

Bi-directional Mendelian Randomization (MR) analysis

First, we selected independent SNPs (r2 < 0.001) that achieved genome-wide significance (p < 5 × 10−8) in the BW GWAS datasets as instrumental variables (IVs), then summary statistics of those SNPs were further extracted from T2D GWAS datasets. Next, inverse variance weighted (IVW) regression [56] was performed as main MR analysis to estimate the causal relationship between BW and T2D. MR-Egger regression [57] was performed to estimate the pleiotropy effect among selected SNPs. Simple mode, weighted mode and weighted median [56] were then conducted to validate the main results. P values less than 0.05 was considered significant. Then bi-directional MR analysis was repeated using T2D as exposure, BW as outcome.

Ethics approval and patient consent

We obtained genome-wide association study (GWAS) results published online. The relevant institutional review boards or ethics committees approved the research protocol of the individual GWAS used in the current analysis, and all human participants gave written informed consent, which was demonstrated in the respective original papers.

Author Contributions

Hong-Wen Deng conceived and initiated this study, he is responsible for the general development and design of the study and contributed to critical revisions. Jie Shen gave constructive suggestions and finalization of the manuscript. Rui-ke Liu is the first author who performed data analysis and drafted the manuscript. Xu Lin and Chuan Qiu contributed to data analysis. Zun Wang, Jonathan Greenbaum, and Chun-Ping Zeng contributed to critical revisions. Yong-Yao Zhu gave constructive suggestions during the whole process. All authors have given approval to the final version of the manuscript. All authors agree to be accountable for the work and ensure that any questions relating to the accuracy and integrity of the paper are investigated and properly resolved.

Acknowledgments

None.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Funding

HWD was partially supported by grants from National Institutes of Health [R01AR059781, P20GM109036, R01MH107354, R01MH104680, R01GM109068, R01AR069055, U19AG055373, R01DK115679], the Edward G. Schlieder Endowment. Jie Shen was partially supported by grants from the Science and Technology Program of Guangzhou, China [201604020007], and the National Natural Science Foundation of China [81770878].

References

  • 1. Chaudhury A, Duvoor C, Reddy Dendi VS, Kraleti S, Chada A, Ravilla R, Marco A, Shekhawat NS, Montales MT, Kuriakose K, Sasapu A, Beebe A, Patil N, et al. Clinical Review of Antidiabetic Drugs: Implications for Type 2 Diabetes Mellitus Management. Front Endocrinol (Lausanne). 2017; 8:6. https://doi.org/10.3389/fendo.2017.00006 [PubMed]
  • 2. Cho NH, Shaw JE, Karuranga S, Huang Y, da Rocha Fernandes JD, Ohlrogge AW, Malanda B. IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Res Clin Pract. 2018; 138:271–81. https://doi.org/10.1016/j.diabres.2018.02.023 [PubMed]
  • 3. Khan MAB, Hashim MJ, King JK, Govender RD, Mustafa H, Al Kaabi J. Epidemiology of Type 2 Diabetes - Global Burden of Disease and Forecasted Trends. J Epidemiol Glob Health. 2020; 10:107–11. https://doi.org/10.2991/jegh.k.191028.001 [PubMed]
  • 4. Ronn PF, Jorgensen ME, Smith LS, Bjerregaard P, Dahl-Petersen IK, Larsen CVL, Grarup N, Andersen GS. Associations between birth weight and glucose intolerance in adulthood among Greenlandic Inuit. Diabetes Res Clin Pract. 2019; 150:129–37. https://doi.org/10.1016/j.diabres.2019.03.005 [PubMed]
  • 5. Li Y, Ley SH, Tobias DK, Chiuve SE, VanderWeele TJ, Rich-Edwards JW, Curhan GC, Willett WC, Manson JE, Hu FB, Qi L. Birth weight and later life adherence to unhealthy lifestyles in predicting type 2 diabetes: prospective cohort study. BMJ. 2015; 351:h3672. https://doi.org/10.1136/bmj.h3672 [PubMed]
  • 6. Briana DD, Malamitsi-Puchner A. Developmental origins of adult health and disease: The metabolic role of BDNF from early life to adulthood. Metabolism. 2018; 81:45–51. https://doi.org/10.1016/j.metabol.2017.11.019 [PubMed]
  • 7. Langley-Evans SC. Nutrition in early life and the programming of adult disease: a review. J Hum Nutr Diet. 2015 (Suppl 1); 28:1–14. https://doi.org/10.1111/jhn.12212 [PubMed]
  • 8. Clausson B, Lichtenstein P, Cnattingius S. Genetic influence on birthweight and gestational length determined by studies in offspring of twins. BJOG. 2000; 107:375–81. https://doi.org/10.1111/j.1471-0528.2000.tb13234.x [PubMed]
  • 9. Herder C, Roden M. Genetics of type 2 diabetes: pathophysiologic and clinical relevance. Eur J Clin Invest. 2011; 41:679–92. https://doi.org/10.1111/j.1365-2362.2010.02454.x [PubMed]
  • 10. Plotnikov D, Williams C, Guggenheim JA. Association between birth weight and refractive error in adulthood: a Mendelian randomisation study. Br J Ophthalmol. 2020; 104:214–19. https://doi.org/10.1136/bjophthalmol-2018-313640 [PubMed]
  • 11. Huang T, Wang T, Zheng Y, Ellervik C, Li X, Gao M, Fang Z, Chai JF, Ahluwalia TVS, Wang Y, Voortman T, Noordam R, Frazier-Wood A, et al, and BIRTH-GENE (BIG) Study Working Group. Association of Birth Weight With Type 2 Diabetes and Glycemic Traits: A Mendelian Randomization Study. JAMA Netw Open. 2019; 2:e1910915. https://doi.org/10.1001/jamanetworkopen.2019.10915 [PubMed]
  • 12. Zeng CP, Chen YC, Lin X, Greenbaum J, Chen YP, Peng C, Wang XF, Zhou R, Deng WM, Shen J, Deng HW. Increased identification of novel variants in type 2 diabetes, birth weight and their pleiotropic loci. J Diabetes. 2017; 9:898–907. https://doi.org/10.1111/1753-0407.12510 [PubMed]
  • 13. Scott RA, Scott LJ, Magi R, Marullo L, Gaulton KJ, Kaakinen M, Pervjakova N, Pers TH, Johnson AD, Eicher JD, Jackson AU, Ferreira T, Lee Y, et al, and DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium. An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. Diabetes. 2017; 66:2888–902. https://doi.org/10.2337/db16-1253 [PubMed]
  • 14. Horikoshi M, Beaumont RN, Day FR, Warrington NM, Kooijman MN, Fernandez-Tajes J, Feenstra B, van Zuydam NR, Gaulton KJ, Grarup N, Bradfield JP, Strachan DP, Li-Gao R, et al, and CHARGE Consortium Hematology Working Group, and Early Growth Genetics (EGG) Consortium. Genome-wide associations for birth weight and correlations with adult disease. Nature. 2016; 538:248–52. https://doi.org/10.1038/nature19806 [PubMed]
  • 15. Morris AP, Voight BF, Teslovich TM, Ferreira T, Segre AV, Steinthorsdottir V, Strawbridge RJ, Khan H, Grallert H, Mahajan A, Prokopenko I, Kang HM, Dina C, et al, and DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet. 2012; 44:981–90. https://doi.org/10.1038/ng.2383 [PubMed]
  • 16. Magnus P, Gjessing HK, Skrondal A, Skjaerven R. Paternal contribution to birth weight. J Epidemiol Community Health. 2001; 55:873–77. https://doi.org/10.1136/jech.55.12.873 [PubMed]
  • 17. Blanco-Gomez A, Castillo-Lluva S, Del Mar Saez-Freire M, Hontecillas-Prieto L, Mao JH, Castellanos-Martin A, Perez-Losada J. Missing heritability of complex diseases: Enlightenment by genetic variants from intermediate phenotypes. Bioessays. 2016; 38:664–73. https://doi.org/10.1002/bies.201600084 [PubMed]
  • 18. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, Cho JH, Guttmacher AE, Kong A, et al. Finding the missing heritability of complex diseases. Nature. 2009; 461:747–53. https://doi.org/10.1038/nature08494 [PubMed]
  • 19. Nilsson E, Ling C. DNA methylation links genetics, fetal environment, and an unhealthy lifestyle to the development of type 2 diabetes. Clin Epigenetics. 2017; 9:105. https://doi.org/10.1186/s13148-017-0399-2 [PubMed]
  • 20. Yokoi N. Epigenetic dysregulation in pancreatic islets and pathogenesis of type 2 diabetes. J Diabetes Investig. 2017; 9:475–77. https://doi.org/10.1111/jdi.12724 [PubMed]
  • 21. Kader F, Ghai M, Maharaj L. The effects of DNA methylation on human psychology. Behav Brain Res. 2018; 346:47–65. https://doi.org/10.1016/j.bbr.2017.12.004 [PubMed]
  • 22. Volkmar M, Dedeurwaerder S, Cunha DA, Ndlovu MN, Defrance M, Deplus R, Calonne E, Volkmar U, Igoillo-Esteve M, Naamane N, Del Guerra S, Masini M, Bugliani M, et al. DNA methylation profiling identifies epigenetic dysregulation in pancreatic islets from type 2 diabetic patients. EMBO J. 2012; 31:1405–26. https://doi.org/10.1038/emboj.2011.503 [PubMed]
  • 23. Ma J, Rebholz CM, Braun KVE, Reynolds LM, Aslibekyan S, Xia R, Biligowda NG, Huan T, Liu C, Mendelson MM, Joehanes R, Hu EA, Vitolins MZ, et al. Whole Blood DNA Methylation Signatures of Diet Are Associated with Cardiovascular Disease Risk Factors and All-cause Mortality. Circ Genom Precis Med. 2020; 13:e002766. https://doi.org/10.1161/circgen.119.002766 [PubMed]
  • 24. Wang Z, Qiu C, Lin X, Zhao LJ, Liu Y, Wu X, Wang Q, Liu W, Li K, Deng HW, Tang SY, Shen H. Identification of novel functional CpG-SNPs associated with type 2 diabetes and coronary artery disease. Mol Genet Genomics. 2020; 295:607–19. https://doi.org/10.1007/s00438-020-01651-3 [PubMed]
  • 25. Andreassen OA, Djurovic S, Thompson WK, Schork AJ, Kendler KS, O'Donovan MC, Rujescu D, Werge T, van de Bunt M, Morris AP, McCarthy MI, Roddey JC, McEvoy LK, et al, and International Consortium for Blood Pressure GWAS, and Diabetes Genetics Replication and Meta-analysis Consortium, and Psychiatric Genomics Consortium Schizophrenia Working Group. Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors. Am J Hum Genet. 2013; 92:197–209. https://doi.org/10.1016/j.ajhg.2013.01.001 [PubMed]
  • 26. Greenbaum J, Wu K, Zhang L, Shen H, Zhang J, Deng HW. Increased detection of genetic loci associated with risk predictors of osteoporotic fracture using a pleiotropic cFDR method. Bone. 2017; 99:62–68. https://doi.org/10.1016/j.bone.2017.03.052 [PubMed]
  • 27. Mahajan A, Go MJ, Zhang W, Below JE, Gaulton KJ, Ferreira T, Horikoshi M, Johnson AD, Ng MC, Prokopenko I, Saleheen D, Wang X, Zeggini E, et al, and DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium, and Asian Genetic Epidemiology Network Type 2 Diabetes (AGEN-T2D) Consortium, and South Asian Type 2 Diabetes (SAT2D) Consortium, and Mexican American Type 2 Diabetes (MAT2D) Consortium, and Type 2 Diabetes Genetic Exploration by Nex-generation sequencing in muylti-Ethnic Samples (T2D-GENES) Consortium. Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat Genet. 2014; 46:234–44. https://doi.org/10.1038/ng.2897 [PubMed]
  • 28. Gonzalez-Regueiro JA, Moreno-Castaneda L, Uribe M, Chavez-Tapia NC. The Role of Bile Acids in Glucose Metabolism and Their Relation with Diabetes. Ann Hepatol. 2017 (Suppl. 1: s3-105.); 16:16–21. https://doi.org/10.5604/01.3001.0010.5672 [PubMed]
  • 29. Rumberger JM, Arch JR, Green A. Butyrate and other short-chain fatty acids increase the rate of lipolysis in 3T3-L1 adipocytes. PeerJ. 2014; 2:e611. https://doi.org/10.7717/peerj.611 [PubMed]
  • 30. Heimann E, Nyman M, Degerman E. Propionic acid and butyric acid inhibit lipolysis and de novo lipogenesis and increase insulin-stimulated glucose uptake in primary rat adipocytes. Adipocyte. 2014; 4:81–88. https://doi.org/10.4161/21623945.2014.960694 [PubMed]
  • 31. Cioni L, De Siena G, Ghelardini C, Sernissi O, Alfarano C, Pirisino R, Raimondi L. Activity and expression of semicarbazide-sensitive benzylamine oxidase in a rodent model of diabetes: interactive effects with methylamine and alpha-aminoguanidine. Eur J Pharmacol. 2006; 529:179–87. https://doi.org/10.1016/j.ejphar.2005.10.052 [PubMed]
  • 32. Huizing M, Didier A, Walenta J, Anikster Y, Gahl WA, Kramer H. Molecular cloning and characterization of human VPS18, VPS 11, VPS16, and VPS33. Gene. 2001; 264:241–47. https://doi.org/10.1016/s0378-1119(01)00333-x [PubMed]
  • 33. Liu J, Wen Y, Liu Z, Liu S, Xu P, Xu Y, Deng S, Hu S, Luo R, Jiang J, Yu G. VPS33B modulates c-Myc/p53/miR-192-3p to target CCNB1 suppressing the growth of non-small cell lung cancer. Mol Ther Nucleic Acids. 2020; 23:324–35. https://doi.org/10.1016/j.omtn.2020.11.010 [PubMed]
  • 34. Fu K, Wang C, Gao Y, Fan S, Zhang H, Sun J, Jiang Y, Liu C, Guan L, Liu J, Huang M, Bi H. Metabolomics and Lipidomics Reveal the Effect of Hepatic Vps33b Deficiency on Bile Acids and Lipids Metabolism. Front Pharmacol. 2019; 10:276. https://doi.org/10.3389/fphar.2019.00276 [PubMed]
  • 35. Xin Z, Soejima H, Higashimoto K, Yatsuki H, Zhu X, Satoh Y, Masaki Z, Kaneko Y, Jinno Y, Fukuzawa R, Hata J, Mukai T. A novel imprinted gene, KCNQ1DN, within the WT2 critical region of human chromosome 11p15.5 and its reduced expression in Wilms' tumors. J Biochem. 2000; 128:847–53. https://doi.org/10.1093/oxfordjournals.jbchem.a022823 [PubMed]
  • 36. Khan IA, Jahan P, Hasan Q, Rao P. Genetic confirmation of T2DM meta-analysis variants studied in gestational diabetes mellitus in an Indian population. Diabetes Metab Syndr. 2019; 13:688–94. https://doi.org/10.1016/j.dsx.2018.11.035 [PubMed]
  • 37. Shah UJ, Xie W, Flyvbjerg A, Nolan JJ, Hojlund K, Walker M, Relton CL, Elliott HR, and RISC Consortium. Differential methylation of the type 2 diabetes susceptibility locus KCNQ1 is associated with insulin sensitivity and is predicted by CpG site specific genetic variation. Diabetes Res Clin Pract. 2019; 148:189–99. https://doi.org/10.1016/j.diabres.2019.01.008 [PubMed]
  • 38. Bersenev A, Wu C, Balcerek J, Tong W. Lnk controls mouse hematopoietic stem cell self-renewal and quiescence through direct interactions with JAK2. J Clin Invest. 2008; 118:2832–844. https://doi.org/10.1172/jci35808 [PubMed]
  • 39. Smyth DJ, Plagnol V, Walker NM, Cooper JD, Downes K, Yang JH, Howson JM, Stevens H, McManus R, Wijmenga C, Heap GA, Dubois PC, Clayton DG, et al. Shared and distinct genetic variants in type 1 diabetes and celiac disease. N Engl J Med. 2008; 359:2767–777. https://doi.org/10.1056/nejmoa0807917 [PubMed]
  • 40. Soejima M, Koda Y. Genetic variation of FUT2 in a Peruvian population: identification of a novel LTR-mediated deletion and characterization of 4 nonsynonymous single-nucleotide polymorphisms. Transfusion. 2019; 59:2415–421. https://doi.org/10.1111/trf.15298 [PubMed]
  • 41. Hu Y, Tan LJ, Chen XD, Greenbaum J, Deng HW. Identification of novel variants associated with osteoporosis, type 2 diabetes and potentially pleiotropic loci using pleiotropic cFDR method. Bone. 2018; 117:6–14. https://doi.org/10.1016/j.bone.2018.08.020 [PubMed]
  • 42. Jia X, Shi N, Feng Y, Li Y, Tan J, Xu F, Wang W, Sun C, Deng H, Yang Y, Shi X. Identification of 67 Pleiotropic Genes Associated With Seven Autoimmune/Autoinflammatory Diseases Using Multivariate Statistical Analysis. Front Immunol. 2020; 11:30. https://doi.org/10.3389/fimmu.2020.00030 [PubMed]
  • 43. Wang Z, Greenbaum J, Qiu C, Li K, Wang Q, Tang SY, Deng HW. Identification of pleiotropic genes between risk factors of stroke by multivariate metaCCA analysis. Mol Genet Genomics. 2020; 295:1173–185. https://doi.org/10.1007/s00438-020-01692-8 [PubMed]
  • 44. Genomes Project C, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA. A map of human genome variation from population-scale sequencing. Nature. 2010; 467:1061–73. https://doi.org/10.1038/nature09534 [PubMed]
  • 45. Shen H, Li J, Zhang J, Xu C, Jiang Y, Wu Z, Zhao F, Liao L, Chen J, Lin Y, Tian Q, Papasian CJ, Deng HW. Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians. PLoS One. 2013; 8:e59494. https://doi.org/10.1371/journal.pone.0059494 [PubMed]
  • 46. Qiu C, Shen H, Fu X, Xu C, Deng H. Meta-Analysis of Genome-Wide Association Studies Identifies Novel Functional CpG-SNPs Associated with Bone Mineral Density at Lumbar Spine. Int J Genomics. 2018; 2018:6407257. https://doi.org/10.1155/2018/6407257 [PubMed]
  • 47. Andreassen OA, McEvoy LK, Thompson WK, Wang Y, Reppe S, Schork AJ, Zuber V, Barrett-Connor E, Gautvik K, Aukrust P, Karlsen TH, Djurovic S, Desikan RS, Dale AM, and International Consortium for Blood Pressure Genome-Wide Association Studies, Genetic Factors for Osteoporosis Consortium. Identifying common genetic variants in blood pressure due to polygenic pleiotropy with associated phenotypes. Hypertension. 2014; 63:819–26. https://doi.org/10.1161/hypertensionaha.113.02077 [PubMed]
  • 48. Rosenbloom KR, Sloan CA, Malladi VS, Dreszer TR, Learned K, Kirkup VM, Wong MC, Maddren M, Fang R, Heitner SG, Lee BT, Barber GP, Harte RA, et al. ENCODE data in the UCSC Genome Browser: year 5 update. Nucleic Acids Res. 2013; 41:D56–63. https://doi.org/10.1093/nar/gks1172 [PubMed]
  • 49. Dayem Ullah AZ, Oscanoa J, Wang J, Nagano A, Lemoine NR, Chelala C. SNPnexus: assessing the functional relevance of genetic variation to facilitate the promise of precision medicine. Nucleic Acids Res. 2018; 46:W109–W13. https://doi.org/10.1093/nar/gky399 [PubMed]
  • 50. Liu B, Gloudemans MJ, Rao AS, Ingelsson E, Montgomery SB. Abundant associations with gene expression complicate GWAS follow-up. Nat Genet. 2019; 51:768–69. https://doi.org/10.1038/s41588-019-0404-0 [PubMed]
  • 51. Nakano Y. Adult-Onset Diseases in Low Birth Weight Infants: Association with Adipose Tissue Maldevelopment. J Atheroscler Thromb. 2020; 27:397–405. https://doi.org/10.5551/jat.rv17039 [PubMed]
  • 52. Rawshani A, Eliasson B, Rawshani A, Henninger J, Mardinoglu A, Carlsson A, Sohlin M, Ljungberg M, Hammarstedt A, Rosengren A, Smith U. Adipose tissue morphology, imaging and metabolomics predicting cardiometabolic risk and family history of type 2 diabetes in non-obese men. Sci Rep. 2020; 10:9973. https://doi.org/10.1038/s41598-020-66199-z [PubMed]
  • 53. Lagou V, Magi R, Hottenga JJ, Grallert H, Perry JRB, Bouatia-Naji N, Marullo L, Rybin D, Jansen R, Min JL, Dimas AS, Ulrich A, Zudina L, et al, and Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC). Sex-dimorphic genetic effects and novel loci for fasting glucose and insulin variability. Nat Commun. 2021; 12:24. https://doi.org/10.1038/s41467-020-19366-9 [PubMed]
  • 54. Warrington NM, Beaumont RN, Horikoshi M, Day FR, Helgeland Ø, Laurin C, Bacelis J, Peng S, Hao K, Feenstra B, Wood AR, Mahajan A, Tyrrell J, et al, and EGG Consortium. Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors. Nat Genet. 2019; 51:804–14. https://doi.org/10.1038/s41588-019-0403-1 [PubMed]
  • 55. Mahajan A, Wessel J, Willems SM, Zhao W, Robertson NR, Chu AY, Gan W, Kitajima H, Taliun D, Rayner NW, Guo X, Lu Y, Li M, et al, and ExomeBP Consortium, and MAGIC Consortium, and GIANT Consortium. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes. Nat Genet. 2018; 50:559–71. https://doi.org/10.1038/s41588-018-0084-1 [PubMed]
  • 56. Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013; 37:658–65. https://doi.org/10.1002/gepi.21758 [PubMed]
  • 57. Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet Epidemiol. 2016; 40:304–14. https://doi.org/10.1002/gepi.21965 [PubMed]