- Research
- Open access
- Published:
Methylome-wide association analyses of lipids and modifying effects of behavioral factors in diverse race and ethnicity participants
Clinical Epigenetics volume 17, Article number: 54 (2025)
Abstract
Circulating lipid concentrations are clinically associated with cardiometabolic diseases. The phenotypic variance explained by identified genetic variants remains limited, highlighting the importance of searching for additional factors beyond genetic sequence variants. DNA methylation has been linked to lipid concentrations in previous studies, although most of the studies harbored moderate sample sizes and exhibited underrepresentation of non-European ancestry populations. In addition, knowledge of nongenetic factors on lipid profiles is extremely limited. In the Population Architecture Using Genomics and Epidemiology (PAGE) Study, we performed methylome-wide association analysis on 9,561 participants from diverse race and ethnicity backgrounds for HDL-c, LDL-c, TC, and TG levels, and also tested interactions between smoking or alcohol intake and methylation in their association with lipid levels. We identified novel CpG sites at 16 loci (P < 1.18E-7) with successful replication on 3,215 participants. One additional novel locus was identified in the self-reported White participants (P = 4.66E-8). Although no additional CpG sites were identified in the genome-wide interaction analysis, 13 reported CpG sites showed significant heterogeneous association across smoking or alcohol intake strata. By mapping novel and reported CpG sites to genes, we identified enriched pathways directly linked to lipid metabolism as well as ones spanning various biological functions. These findings provide new insights into the regulation of lipid concentrations.
Introduction
Circulating lipid concentrations are clinically associated with cardiometabolic diseases [1, 2]. Genome-wide association studies (GWAS) and whole-exome sequencing (WES)/whole genome sequencing (WGS) have identified thousands of genetic loci associated with high-density lipoprotein cholesterol (HDL-c), low-density lipoprotein cholesterol (LDL-c), total cholesterol (TC), and triglycerides (TG) [3,4,5]. Similar to most complex polygenic traits, however, the phenotypic variances explained by these identified genetic loci remain limited and the underlying mechanisms remain to be fully understood [5].
More recently, blood DNA methylation has been linked to circulating lipid concentrations in several studies [6,7,8,9,10,11,12]. Over 900 unique CpG sites have been associated with HDL-c, LDL-c, TC, and TG levels, highlighting the importance of looking for additional factors contributing to lipid variations beyond genetic sequence variants. All these published studies focused solely on European populations, except one study [12]. African Americans and Hispanic/Latino participants together with European participants were included in an effort from the Cohort for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium, uncovering both population-agnostic and population-specific novel findings and relatively low correlation of effect estimates of LDL-associated CpG sites across population groups [12]. These results emphasize the need to improve diversity and inclusion in epigenetic studies.
Previous studies have reported the modifying effects of nongenetic factors on lipid-gene associations, including smoking and alcohol intake, leading to identifications of additional novel loci for lipids [13, 14]. Recent studies also demonstrated the effect of smoking and alcohol intake on DNA methylation changes [15,16,17]. However, no study has explored potential modifying effects of smoking or alcohol intake on associations between lipid levels and DNA methylation variation using interaction analysis.
Here, we first present methylome-wide association analyses for HDL-c, LDL-c, TC, and TG in over 12,000 self-reported White, African American, Hispanic/Latino, Asian American, and Native American participants from eight studies. We then performed interaction analyses taking into account smoking and alcohol intake. We aim to identify novel CpG sites associated with HDL-c, LDL-c, TC, and TG levels in a diverse race and ethnicity setting and to explore the potential effect of smoking and alcohol intake on the association between CpG sites and lipid levels.
Methods
Study populations
A total of eight studies comprising over 12,000 participants were included in the analysis. Three studies, the Atherosclerosis Risk in Communities (ARIC) study [18, 19], the Women’s Health Initiative (WHI) [20], and the Jackson Heart Study (JHS) [21], were included in the discovery stage. An additional five studies, the Amish study [22, 23], the Cardiovascular Health Study (CHS) [24], the Genetic Epidemiology Network of Arteriopathy (GENOA) [25], the Multi-ethnic Cohort (MEC) study [26], and the Multi-Ethnic Study of Atherosclerosis (MESA) study [27], were included in the replication stage. Self-reported White, African American, and Hispanic/Latino participants formed the three largest population groups. Smaller numbers of Asian American and Native American participants were also represented in our analysis. Each participating study is detailed in the Supplemental Methods. All studies obtained written informed consent from participants and were approved by local institutional review boards and ethics committees.
Lipid measurements
HDL-c, TC, and TG levels (mg/dL) in fasting blood were measured while LDL-c levels were calculated using the Friedewald equation. LDL-c levels were not calculated if the corresponding TG levels were greater than 400 mg/dL. Lipid levels were further adjusted for medication use by adding a constant based on previous publications (Supplemental Table 1) [28]. If multiple medications were used, only the largest constant was applied. Participants who were pregnant at blood draw or who had fasted less than eight hours prior to blood draw were excluded from the analysis. TG levels after adjustment for medication were natural log transformed.
Smoking and alcohol intake measurements
To explore the potential modifying effects of behavioral factors on associations between DNA methylation and lipid levels, we collected information on smoking and alcohol intake in each study at the time of blood draw (Supplemental Table 2), for exploration of potential interactions. According to smoking behavior, participants were divided into three groups: nonsmokers, past smokers, and current smokers. According to alcohol intake and sex, participants were divided into four groups: non-drinkers, light drinkers (≤ 3 servings of alcohol intake per week), moderate drinkers (3 < servings of alcohol intake per week ≤ 7 and 3 < servings of alcohol intake per week ≤ 14 for female and male, respectively), and heavy drinkers (> 7 and > 14 servings of alcohol intake per week for female and male, respectively). Amish and MEC studies were excluded from the analysis due to extremely limited sample sizes (n < 50) in heavy drinkers and/or current smoker groups.
DNA methylation measurement, quality control, and normalization
DNA methylation was quantified in each participating study independently. Levels were measured from peripheral blood leukocytes isolated from whole blood. The Illumina Infinium HumanMethylation450 BeadChip was used in the Amish, ARIC, CHS, and WHI studies, while the Illumina Infinium MethylationEPIC BeadChip was used in the GENOA, JHS, MEC, and MESA studies. Either the Beta MIxture Quantile dilation (BMIQ) [29], the normal-exponential out-of-band (Noob) [30], or the subset quantile normalization [31] approach was used to perform preprocessing and normalization in each study. A beta value was calculated for each CpG site representing the percentage of methylation at that site. Houseman’s method was implemented to estimate white blood cell proportions [32]. Any single value with a detection p value > 0.01 was set to missing. Probes with missing data in greater than 5% of samples per study were excluded. Samples with greater than 5% of missing probes were also excluded. To avoid spurious signals in DNA methylation data, we excluded CpG sites that co-hybridize to alternate genomic sequences or overlapped with genomic variations.
Methylome-wide association analysis and meta-analysis
Methylome-wide association analysis was performed in each study stratified by self-reported race and ethnicity groups, followed by population-combined and population-specific meta-analysis. In the study- and population-specific methylome-wide association analysis, we first regressed the four lipid traits on age, sex, and study-specific covariates (if applicable) and estimated the residual values. In the next step, the inverse-normalized residual values were used as the outcome for testing association with each CpG site using linear mixed models. White blood cell proportions (CD8T, CD4T, NK, BCELL, MONO, and GRAN values estimated using the Houseman’s method [30]), the first 10 principal components (PCs, except in Amish, where a mixed model controlling for the fixed effects of covariates, and the random effect of a genetic relationship matrix derived from the complete Amish pedigree structure was used [https://mmap.github.io/]), row, and column were adjusted as fixed-effect covariates and chip was adjusted as random-effect covariates. Summary statistics from each study and ancestral group were combined through fixed-effect inverse-variance weighted meta-analysis using METAL [33]. Population-specific meta-analysis was performed in African American, White, and Hispanic/Latino participants, which are the three largest population groups in our analysis. To control for bias and inflation, BACON [34] adjustment was implemented for the meta-analysis results generated by METAL. CpG sites with P < 1.18E-7 were considered as genome-wide significant. These analyses focused only on CpG sites that are available on the HumanMethylation450 BeadChip to maximize sample sizes. Due to the limited sample sizes of studies using the MethylationEPIC BeadChip, we summarized association results of CpG sites on the MethylationEPIC BeadChip by combining all available studies in our analysis.
Methylome-wide interaction analysis with smoking and alcohol intake
We performed methylome-wide interaction analysis in all eight participating studies except Amish and MEC due to limited sample sizes. Methylome-wide interaction analysis (which has two degrees of freedom for the three-level smoking variable and three degrees of freedom for the four-level alcohol variable) was performed in each study stratified by population groups, followed by population-combined and population-specific meta-analysis in the discovery stage and in the replication stage. In the methylome-wide interaction analysis, we used the inverse-normalized residual values as the outcome and adjusted for the same set of covariates as in the methylome-wide association analysis. To combine study- and population-specific results, we estimated a test statistic based on coefficients and variance–covariance matrix from each study and population group. This test statistic followed a χ2 distribution and was used to estimate P values. Genomic control was applied to the P values to account for inflation. CpG sites with P < 1.18E-7 were considered as genome-wide significant.
Stratified association analysis across smoking and alcohol intake strata
We then focused our efforts on better characterizing previously reported CpG sites and novel CpG sites identified in our methylome-wide association analysis in terms of their potential interactions with smoking and alcohol intake. Association analysis between each CpG site and each lipid level was performed in each smoking (non, past, and current) and alcohol intake stratum (none, light, moderate, and heavy) with the same adjustment implemented in the methylome-wide association analysis. P values for heterogeneity were estimated across different strata and the significant threshold was set as P < 4.01E-5 (0.05/1246 CpG sites tested). Stratified analyses were performed only in the three discovery studies (ARIC, WHI, and JHS, Supplemental Table 2).
Pathway enrichment analysis
Gene ontology (GO) for biological processes (BP) was used to analyze the differentially methylated genes for functional enrichment using the R package “clusterProfiler” [35, 36]. We combined all previously reported CpG sites with the novel ones we identified in our meta-analysis for the pathway enrichment analysis. The annotations of these CpG sites to the corresponding genes were performed using the R package “missMethyl” [37]. Associations that were P < 0.05 after FDR adjustment were considered statistically significant.
Results
Participating studies were divided into discovery (ARIC, WHI, and JHS) and replication stages (Amish, CHS, GENOA, MEC, and MESA). Mean ages across studies ranged from 56.9 (JHS African American) to 75.0 (CHS white, Table 1). Smoking and alcohol intake status for participants in each study are summarized in Supplemental Table 2.
Novel CpG sites associated with lipids
In the discovery stage, a total of 9,561 participants from ARIC, WHI, and JHS were analyzed using both population-combined and population-specific approaches. In the African American, White, and Hispanic/Latino populations, the sample sizes were 4,912, 3,764, and 712, respectively. Novel CpG sites identified in the discovery stage (P < 1.18E-7 and located > 500 kb away from any reported CpG sites) were carried forward for replication in 3,215 participants from the Amish, CHS, GENOA, MEC, and MESA studies. For this part, we focused on CpG sites available on the HumanMethylation450 BeadChip to maximize sample sizes. Across all analyses, lambda values ranged from 0.997 to 1.258 after BACON adjustment, indicating limited amount of inflation (Supplemental Table 3).
In the population-combined analysis, a total of 348 CpG sites mapped to 303 loci (based on the genomic locations of CpG sites and each locus is > 500 kb away from each other) were significantly associated with at least one of the four lipid traits in the discovery stage (Supplemental Table 4). More than half of these CpG sites also displayed consistent directions of association in the replication stage (190 CpG sites mapped to 175 loci, Supplemental Table 4). However, only eight, one, and nine loci were successfully replicated in the replication stage for HDL-c, LDL-c, and TG, respectively (P < 0.05/number of significant loci for each lipid trait, Table 2). In the African American- and White-specific meta-analysis, 12 and four additional novel CpG sites were identified, respectively (Supplemental Table 5), and one of them was successfully replicated for association with TG in self-reported White participants (P < 3.13E-3, 0.05/16 significant loci in African American- and White-specific meta-analysis, Table 2). The limited sample sizes in the replication stage compared to the discovery stage (3,215 and 9,516 participants, respectively) could have an impact on the statistical power and the failure of replication. In summary, we identified and replicated 17 novel CpG sites in the multi-population meta-analysis.
To maximize sample sizes for CpG sites only available on the MethylationEPIC BeadChip, we performed meta-analysis including all studies and summarized 96 top CpG sites that have not been previously reported in Supplemental Table 6 (P < 1.18E-7).
Replication of previously reported CpG sites
In order to assess the generalizability of previously identified CpG sites from the literature to our diverse populations in PAGE, we evaluated previously reported CpG sites in our population-combined and population-specific meta-analysis from the discovery stage (Supplemental Tables 7–10). In the population-combined analysis, 20.4% (383 out of 1,876), 21.0% (69 out of 328), 7.8% (14 out of 180), and 19.4% (351 out of 1,824) reached genome-wide significance for association with HDL-c, LDL-c, TC, and TG, respectively (P < 1.18E-7 and with consistent association directions of the reported trait, Supplemental Table 7). The percentages increased to 57.1%, 59.5%, 60.6%, and 57.6% for HDL-c, LDL-c, TC, and TG, respectively, when setting the P value cutoff to 0.05 (with consistent association directions of the reported trait, Supplemental Table 7). In the White- and African American-specific meta-analysis, the percentages of reported CpG sites showing genome-wide significance dropped to a maximum of 12.8% while the percentages of reported CpG sites with P < 0.05 stayed close to 50% (Supplemental Tables 8–9). In the Hispanic/Latino-specific meta-analysis with a considerably smaller sample size, the percentages of reported CpG sites showing genome-wide significance were all below 1% and roughly 15% reported CpG sites showed P < 0.05 (Supplemental Table 10). It is worth mentioning that the vast majority of previously reported loci (85.4%) are reported by the recent multi-ethnic meta-analysis [12] and there are overlapping samples (60.4%) between our PAGE discovery stage studies with the published multi-ethnic meta-analysis [12].
Interactions between CpG sites and behavioral factors on lipid levels
We first performed methylome-wide interaction analysis aiming to identify additional CpG sites whose associations with lipids were context specific, by allowing for interaction with smoking or alcohol intake status. We performed methylome-wide interaction analysis in the discovery studies (ARIC, WHI, and JHS, Supplemental Table 2). However, no CpG sites reached genome-wide significance (P < 1.18E-7).
We then performed stratified association analysis focusing on the novel CpG sites we identified in our PAGE analysis and those that had been previously associated in the literature. A total of 914, 316, and 16 top CpG sites from literature, the population-combined meta-analysis, and the population-specific meta-analysis, respectively, were tested for association with lipid levels in each smoking and alcohol intake stratum. In the smoking-stratified analysis, three and eight CpG sites associated with HDL-c and TG, respectively, showed significant differences across the three strata (P < 4.01E-5, Supplemental Fig. 1). For the three CpG sites displaying heterogeneous associations with HDL-c, all of them showed more significant association and larger effect sizes in nonsmokers compared to past or current smokers (Supplemental Fig. 1A-B). For the eight CpG sites showing heterogeneous associations with TG, cg16411857, which is previously reported for association with LDL-c, showed more significant association and larger effect sizes in current smokers (P = 1.02E-5) while the other seven CpG sites all showed more significant associations and larger effect sizes in nonsmokers (Supplemental Fig. 1B). These eight CpG sites included a newly identified CpG site, SLC2A9-cg02387843, which is associated with both HDL-c and TG levels in the population-combined meta-analysis and showed a more pronounced association with TG in the nonsmoking group (Supplemental Fig. 1B). In the alcohol-stratified analysis, one CpG site each in association with HDL-c (cg15804598) and TC (cg19377250), respectively, showed significant heterogeneity across strata (P < 4.01E-5, Supplemental Fig. 1). Both these two CpG sites were reported previously (cg15804598 for association with HDL-c and cg19377250 for association with TG) [12] and showed the largest effect sizes in moderate drinkers (Supplemental Fig. 1C and 1D).
Pathway enrichment analysis
To better understand the pathways enriched for genes affected by lipid-associated CpG sites, we mapped all previously reported CpG sites as well as novel CpG sites identified in the current study to corresponding genes and then identified GO pathways that were over-represented by these genes. A total of 41, 57, 17, and 50 GO pathways are enriched for the mapped genes for HDL-c, LDL-c, TC, and TG, respectively (Supplemental Table 11, FDR adjusted P < 0.05). Among these significantly enriched pathways, 4 were shared across all four lipid traits, namely cholesterol biosynthetic process, cholesterol metabolic process, sterol biosynthetic process, and secondary alcohol biosynthetic process. Notably, the genes mapped to these four shared pathways were not the same across the four lipids traits (Supplemental Table 11), indicating distinct CpG sites and genes contributing to the enrichment of the same pathways across lipid traits. It is also worth mentioning that some enriched GO pathways with entirely different descriptions harbor the same set of genes. For example, the same five genes associated with LDL-c were mapped to both sterol metabolic process (GO:0016125) and secondary alcohol metabolic process (GO:1,902,652, supplemental Table 11). These connections across enriched pathways are displayed in Supplemental Fig. 2.
Discussion
We report here a large-scale methylome-wide association analysis of blood lipids in diverse race and ethnicity populations and the first interaction analysis focusing on smoking and alcohol intake. We identified and replicated a total of 17 novel CpG sites in the meta-analysis. Although no additional novel CpG sites were identified in the interaction analysis, 13 CpG sites showed significant heterogeneity across smoking or alcohol intake groups in the stratified analysis. Pathway enrichment analysis, including both newly and previously identified CpG sites, revealed new inferences on lipid metabolism.
There are five population groups contributing to the combined meta-analysis, making it the most diverse study so far. Population-specific meta-analysis offered us an opportunity to identify potential population-specific signals and a CpG site in association with TG was revealed only in self-reported White participants. This CpG site, cg03584506, showed much larger effect sizes in White participants compared to African American participants (Table 2). It is also worth mentioning here that we included not only CpG sites measured on the Illumina Infinium HumanMethylation450 BeadChip but also ones on the Illumina Infinium MethylationEPIC BeadChip, which almost doubles the number of tested CpG sites. To maximize the sample size for the CpG sites measured on the Illumina Infinium MethylationEPIC BeadChip, we simply combined all available studies and summarized significant findings in Supplemental Table 6. Future studies are needed to validate these potential novel findings.
The overlap between genes mapped by lipid-associated CpG sites and genes identified in previous GWAS studies is limited. Among the 17 novel CpG sites identified in our analysis, 13 CpG sites mapped to a gene region while the others are located in gene desert areas. Only one of these 13 genes, TNFAIP8 which is associated with TG in our study, has been reported in a published GWAS study for association with HDL-c and TG [38]. At the same time, 11 of the 13 mapped genes overlapped with reported genes from previous methylome-wide association studies for association with cardiometabolic traits or diseases, including type 2 diabetes [39], ischemic heart disease [39], body mass index [40,41,42,43,44], waist circumference [41, 42], and fasting insulin [45] (Supplemental Table 12). These observations demonstrate the strength of methylation studies for uncovering additional genes implicated in lipid metabolism and cardiometabolic traits and diseases. These 11 genes have also been reported for association with smoking and/or alcohol intake [46,47,48,49]. The mechanisms connecting these CpG sites to lipids as well as smoking and alcohol intake need further investigation. Although C16orf38 and FLJ43663 show no overlap with genes reported for association with cardiometabolic traits or diseases in methylome-wide association studies, there is strong association of C16orf38 genetic variants with insulin-like growth factor 1 in previous GWAS studies [50, 51] and FLJ43663 is linked to lipogenesis in hepatocytes [52].
Previous genome-wide genetic variant interaction analyses on lipids with smoking or alcohol intake identified additional genetic loci [13, 14], motivating us to implement the same strategy for CpG sites in the current study. However, no CpG site reached genome-wide significance in the interaction analysis. We further performed stratified analyses focusing only on newly identified CpG sites and previously reported ones and observed 11 and two CpG sites exhibiting significant heterogeneity across smoking and alcohol intake strata, respectively. The majority of the heterogeneous associations across smoking strata showed larger effect sizes in the nonsmoker group. The two heterogeneous associations across alcohol intake strata were both driven by larger effect sizes in the moderate drinker group, supporting previous studies which suggest an important influence of alcohol consumption on lipid levels. Follow-up analyses are needed to confirm these findings and the biological mechanisms underlying these observed heterogeneous associations require further investigation.
The pathways identified using CpG sites in the current study showed limited overlap with the ones identified using genetic variants from a previously published GWAS [53]. As expected, lipid metabolism-related pathways showed enrichment in both studies. In both studies, we observed that most of the enriched pathways were lipid-specific. It is worth noting that we identified immune response related pathways using LDL-associated CpG sites and it is well known that lipids are important factors in the host defense system [54]. We also identified blood cell differentiation related pathways using HDL-associated CpG sites, whose potential mechanisms have been explored in experimental studies in the setting of atherosclerosis progression [55]. Another possible explanation for this is the inadequate adjustment for estimated blood cell proportions in the association analyses.
There are three major limitations of our current study. First, examining additional factors, such as dietary habits, physical activity levels, geographic location, and prevalence of chronic diseases, might contribute to a better understanding of the association between CpG sites and lipid metabolism. These factors have been shown to influence lipid levels [56,57,58], and future studies are needed to explore the potential impact of these factors. Insufficient adjustment of confounding factors might also lead to the failure of replication of some CpG sites identified in the discovery stage. In addition, sex-stratified analyses could potentially help identify sex-specific associations that have been missed in the sex-combined analysis [59]. Secondly, whole blood-based DNA methylation was used in the current analysis while lipid metabolism-related tissues, namely liver, adipose tissue, and muscle, were not available for evaluation. Recent studies have explored DNA methylation patterns across tissues and the correlation varies based on genes, tissues, and the studied phenotypes [60,61,62,63]. Notably, findings from an epigenetic study on lipid-related metabolites indicated that differential methylation of multiple CpG sites persists in both whole blood and adipose tissue [64]. Another study found that differentially methylated CpG sites at well-established lipid-associated genes ABCG1 and SREBF1 were also associated with insulin resistance and BMI in blood, liver, and adipose tissues [65]. Whole blood has been the most commonly used biological material in epigenetic studies due to its easy accessibility and minimal invasiveness. Studies using DNA methylation measurements in whole blood benefit from much larger sample sizes compared to ones using tissue samples. Nevertheless, future studies on lipid metabolism in related tissues and experimental studies are needed to confirm findings from whole blood and to gain a more comprehensive picture of DNA methylation patterns across tissues. Thirdly, the cross-sectional design of our current study limits inference on temporality. Future Mendelian randomization analyses in large sample sizes are needed to clarify whether the CpG variations cause lipid level changes or are a consequence of lipid variability. There are two minor limitations of the current study. The limited sample sizes of Asian American and Native American populations prevented us from performing meaningful analyses. Participants in some of the included studies were originally selected for other phenotypes, which could potentially introduce a selection bias.
In conclusion, we identified 17 novel CpG sites in the methylome-wide association analysis and confirmed a considerable number of previously reported CpG sites. We identified 13 CpG sites which exhibit heterogeneous associations across smoking or alcohol intake strata. Our analyses provided additional information on the genes and pathways associated with lipid concentrations compared to GWAS studies.
Availability of data and materials
Summary statistics of the methylome-wide association analyses and interaction analyses are available upon request.
References
Ference BA, Graham I, Tokgozoglu L, Catapano AL. Impact of lipids on cardiovascular health: JACC health promotion series. J Am Coll Cardiol. 2018. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jacc.2018.06.046.
Orešič M, et al. Prediction of non-alcoholic fatty-liver disease and liver fat content by serum molecular lipids. Diabetologia. 2013;56:2266.
Kanai M, et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat Genet. 2018;50:390.
Klarin D, et al. Genetics of blood lipids among ~300,000 multi-ethnic participants of the Million Veteran Program. Nat Genet. 2018;50:1514.
Graham SE, et al. The power of genetic diversity in genome-wide association studies of lipids. Nature. 2021;600:675.
Braun KVE, et al. Epigenome-wide association study (EWAS) on lipids: the Rotterdam study. Clin Epigenetics. 2017. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-016-0304-4.
Hedman ÅK, et al. Epigenetic patterns in blood associated with lipid traits predict incident coronary heart disease events and are enriched for results from genome-wide association studies. Circ Cardiovasc Genet. 2017. https://doiorg.publicaciones.saludcastillayleon.es/10.1161/CIRCGENETICS.116.001487.
Sayols-Baixeras S, et al. Identification and validation of seven new loci showing differential DNA methylation related to serum lipid profile: an epigenome-wide approach The REGICOR study. Hum Mol Genet. 2016;25:4556.
Pfeiffer L, et al. DNA methylation of lipid-related genes affects blood lipid levels. Circ Cardiovasc Genet. 2015;8:334.
Frazier-Wood AC, et al. Methylation at CPT1A locus is associated with lipoprotein subfraction profi les. J Lipid Res. 2014;55:1324.
Irvin MR, et al. Epigenome-wide association study of fasting blood lipids in the genetics of lipid-lowering drugs and diet network study. Circulation. 2014;130:565.
Jhun MA, et al. A multi-ethnic epigenome-wide association study of leukocyte DNA methylation and blood lipids. Nat Commun. 2021;12:3987.
Bentley AR, et al. Multi-ancestry genome-wide gene–smoking interaction study of 387,272 individuals identifies new loci associated with serum lipids. Nat Genet. 2019;51:636.
De Vries PS, et al. Multiancestry Genome-Wide Association Study of Lipid Levels Incorporating Gene-Alcohol Interactions. Am J Epidemiol. 2019;188:1033.
Liu C, et al. A DNA methylation biomarker of alcohol consumption. Mol Psychiatry. 2018;23:422.
Perrier F, et al. Association of leukocyte DNA methylation changes with dietary folate and alcohol intake in the EPIC study. Clin Epigenetics. 2019. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-019-0637-x.
Prince C, et al. Investigating the impact of cigarette smoking behaviours on DNA methylation patterns in adolescence. Hum Mol Genet. 2019;28:155.
The ARIC Investigators. The atherosclerosis risk in communities (ARIC) sudy: design and objectives. Am J Epidemiol. 1989;129:687.
Wright JD, et al. The ARIC (Atherosclerosis risk in communities) Study. J Am Coll Cardiol. 2021;77:2939.
Anderson G, et al. Design of the women’s health initiative clinical trial and observational study. Control Clin Trials. 1998;19:61.
Taylor HA, et al. Toward resolution of cardiovascular health disparities in African Americans: design and methods of the Jackson heart study. Ethn Dis. 2005;15:1.
Mitchell BD, et al. The genetic response to short-term interventions affecting cardiovascular function: rationale and design of the heredity and phenotype intervention (HAPI) heart study. Am Heart J. 2008;155:823.
Shuldiner AR, et al. Association of cytochrome P450 2C19 genotype with the antiplatelet effect and clinical efficacy of clopidogrel therapy. JAMA. 2009;302:849.
Fried LP, et al. The cardiovascular health study: design and rationale. Ann Epidemiol. 1991;1:263.
Daniels PR, et al. Familial aggregation of hypertension treatment and control in the Genetic Epidemiology Network of Arteriopathy (GENOA) study. Am J Med. 2004;116:676.
Gram IT, et al. Smoking and risk of breast cancer in a racially/ethnically diverse population of mainly women who do not drink alcohol the MEC Study. Am J Epidemiol. 2015;182:917.
Bild DE, et al. Multi-ethnic study of atherosclerosis: objectives and design. Am J Epidemiol. 2002;156:871.
Hu Y, et al. Minority-centric meta-analyses of blood lipid levels identify novel loci in the population architecture using genomics and epidemiology (PAGE) study. PLoS Genet. 2020;16:e1008684.
Teschendorff AE, et al. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics. 2013;29:189.
Fortin JP, et al. Functional normalization of 450 k methylation array data improves replication in large cancer studies. Genome Biol. 2014. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13059-014-0503-2.
Wu Z, Aryee MJ. Subset quantile normalization using negative control features. J Comput Biol. 2010;17:1385.
Houseman EA, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinform. 2012;13:16.
Willer CJ, Li Y, Abecasis GR. Metal: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190.
van Iterson M, et al. Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution. Genome Biol. 2017;18:1.
Yu G, Wang LG, Han Y, He QY. ClusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284.
Wu T, et al. Clusterprofiler 4.0: A universal enrichment tool for interpreting omics data. Innovation. 2021;2:100141.
Phipson B, Maksimovic J, Oshlack A. MissMethyl: an R package for analyzing data from Illumina’s HumanMethylation450 platform. Bioinformatics. 2016;32:286.
Hoffmann TJ, et al. A large electronic-health-record-based genome-wide study of serum lipids. Nat Genet. 2018;50:401.
Hillary, R. F. et al. Blood-based epigenome-wide analyses on the prevalence and incidence of nineteen common disease states. medRxiv Preprint at (2023).
Sun D, et al. Body mass index drives changes in DNA methylation: a longitudinal study. Circ Res. 2019;125:824.
Demerath EW, et al. Epigenome-wide association study (EWAS) of BMI, BMI change and waist circumference in African American adults identifies multiple replicated loci. Hum Mol Genet. 2015;24:4464.
Dhana K, et al. An epigenome-wide association study of obesity-related traits. Am J Epidemiol. 2018;187:1662.
Vehmeijer FOL, et al. DNA methylation and body mass index from birth to adolescence: meta-analyses of epigenome-wide association studies. Genome Med. 2020;12:1.
Mendelson MM, et al. Association of body mass index with DNA methylation and gene expression in blood cells and relations to cardiometabolic disease: a Mendelian randomization approach. PLoS Med. 2017;14:e1002215.
Liu J, et al. An integrative cross-omics analysis of DNA methylation sites of glucose and insulin homeostasis. Nat Commun. 2019;10:2581.
Witt SH, et al. Acute alcohol withdrawal and recovery in men lead to profound changes in DNA methylation profiles: a longitudinal clinical study. Addiction. 2020;115:2034.
Christiansen C, et al. Novel DNA methylation signatures of tobacco smoking with trans-ethnic effects. Clin Epigenetics. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-021-01018-4.
Dugué PA, et al. Alcohol consumption is associated with widespread changes in blood DNA methylation: analysis of cross-sectional and longitudinal data. Addict Biol. 2021;26:e12855.
Sikdar S, et al. Comparison of smoking-related DNA methylation between newborns from prenatal exposure and adults from personal smoking. Epigenomics. 2019;11:1487.
Sinnott-Armstrong N, Naqvi S, Rivas M, Pritchard JK. GWAS of three molecular traits highlights core genes and pathways alongside a highly polygenic background. Elife. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.7554/eLife.58615.
Barton AR, Sherman MA, Mukamel RE, Loh PR. Whole-exome imputation within UK Biobank powers rare coding variant association and fine-mapping analyses. Nat Genet. 2021;53:1260.
Khatun M, Sur S, Steele R, Ray R, Ray RB. Inhibition of long noncoding RNA Linc-Pint by hepatitis C virus in infected hepatocytes enhances lipogenesis. Hepatology. 2021;74:41.
Selvaraj MS, et al. Whole genome sequence analysis of blood lipid levels in >66,000 individuals. Nat Commun. 2022;13:5995.
Feingold KR, Grunfeld C. Lipids: a key player in the battle between the host and microorganisms. J Lipid Res. 2012;53:2487.
Gao M, et al. Regulation of high-density lipoprotein on hematopoietic stem/progenitor cells in atherosclerosis requires scavenger receptor type BI expression. Arterioscler Thromb Vasc Biol. 2014;34:1900.
Hu FB. Dietary pattern analysis: A new direction in nutritional epidemiology. Curr Opin Lipidol. 2002;13:3.
Arsenault BJ, Boekholdt SM, Kastelein JJP. Lipid parameters for measuring risk of cardiovascular disease. Nat Rev Cardiol. 2011;8:197.
Trejo-Gutierrez JF, Fletcher G. Impact of exercise on blood lipids and lipoproteins. J Clin Lipidol. 2007. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jacl.2007.05.006.
Govender P, Ghai M, Okpeku M. Sex-specific DNA methylation: impact on human health and development. Mol Genet Genomics. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00438-022-01935-w.
Crujeiras AB, et al. DNA methylation map in circulating leukocytes mirrors subcutaneous adipose tissue methylation pattern: a genome-wide analysis from non-obese and obese patients. Sci Rep. 2017. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/srep41903.
Hardy T, et al. Plasma DNA methylation: A potential biomarker for stratification of liver fibrosis in non-alcoholic fatty liver disease. Gut. 2017;66:1321.
Wu YL, et al. Epigenetic regulation in metabolic diseases: mechanisms and advances in clinical study. Sig Transd Target Therapy. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41392-023-01333-7.
Jones AC, Irvin MR, Claas SA, Arnett DK. Lipid phenotypes and DNA methylation: a review of the literature. Curr Atheroscler Rep. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s11883-021-00965-w.
del Gomez-Alonso MC, et al. DNA methylation and lipid metabolism: an EWAS of 226 metabolic measures. Clin Epigenetics. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-020-00957-8.
Krause C, et al. Critical evaluation of the DNA-methylation markers ABCG1 and SREBF1 for Type 2 diabetes stratification. Epigenomics. 2019;11:885.
Acknowledgements
The authors acknowledge and gratefully thank the other investigators, the staff, and the participants of all studies for their important and valuable contributions, without whom this research would not be possible. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Heart, Lung, and Blood Institute; the National Institutes of Health; or the US Department of Health and Human Services. The PAGE consortium thanks the staff and participants of the PAGE studies for their important contributions.
Funding
PAGE: The Population Architecture Using Genomics and Epidemiology (PAGE) program is supported by the National Human Genome Research Institute [R01HG010297]. The listing of PAGE senior investigators can be found at http://www.pagestudy.org. ARIC: The Atherosclerosis Risk in Communities study has been funded in whole or in part with Federal funds from the National Heart, Lung, and Blood Institute, National Institutes of Health, Department of Health and Human Services, under Contract nos. (75N92022D00001, 75N92022D00002, 75N92022D00003, 75N92022D00004, 75N92022D00005). The authors thank the staff and participants of the ARIC study for their important contributions. CHS: Infrastructure for the CHARGE Consortium is supported in part by the National Heart, Lung, and Blood Institute grant R01HL105756. The CHS research was supported by NHLBI contracts HHSN268201200036C, HHSN268200800007C, HHSN268201800001C, N01HC55222, N01HC85079, N01HC85080, N01HC85081, N01HC85082, N01HC85083, N01HC85086, R01AG023629, 75N92021D00006; and NHLBI grants U01HL080295, U01HL130114, K08HL116640, R01HL087652, R01HL092111, R01HL103612, R01HL111089, R01HL116747, and R01HL120393 with additional contribution from the National Institute of Neurological Disorders and Stroke (NINDS). Additional support was provided through R01AG023629 from the National Institute on Aging (NIA), Merck Foundation/Society of Epidemiologic Research as well as Laughlin Family, Alpha Phi Foundation, and Locke Charitable Foundation. A full list of principal CHS investigators and institutions can be found at CHS-NHLBI.org. The provision of genotyping data was supported in part by the National Center for Advancing Translational Sciences, CTSI grant UL1TR000124, and the National Institute of Diabetes and Digestive and Kidney Disease Diabetes Research Center (DRC) grant DK063491 to the Southern California Diabetes Endocrinology Research Center. Dr. Fohner is supported by K01AG071689. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. A full list of principal CHS investigators and institutions can be found at CHS-NHLBI.org. GENOA: Support for the Genetic Epidemiology Network of Arteriopathy (GENOA) was provided by the National Heart, Lung, and Blood Institute (U01HL054457, RC1HL100185, R01HL087660, R01HL119443, and R01HL133221). JHS: The Jackson Heart Study (JHS) is supported and conducted in collaboration with Jackson State University (HHSN268201800013I), Tougaloo College (HHSN268201800014I), the Mississippi State Department of Health (HHSN268201800015I), and the University of Mississippi Medical Center (HHSN268201800010I, HHSN268201800011I, and HHSN268201800012I) contracts from the National Heart, Lung, and Blood Institute (NHLBI) and the National Institute on Minority Health and Health Disparities (NIMHD). The authors also wish to thank the staffs and participants of the JHS. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the National Heart, Lung, and Blood Institute; the National Institutes of Health; or the US Department of Health and Human Services. MESA: Whole genome sequencing (WGS) for the Trans-Omics in Precision Medicine (TOPMed) program was supported by the National Heart, Lung, and Blood Institute (NHLBI). WGS for “NHLBI TOPMed: Multi-Ethnic Study of Atherosclerosis (MESA)” (phs001416.v3.p1) was performed at the Broad Institute of MIT and Harvard (3U54HG003067-13S1). Centralized read mapping and genotype calling, along with variant quality metrics and filtering, were provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1). Phenotype harmonization, data management, sample-identity QC, and general study coordination were provided by the TOPMed Data Coordinating Center (3R01HL-120393-02S1) and TOPMed MESA Multi-Omics (HHSN2682015000031/HSN26800004). The MESA projects are conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with MESA investigators. Support for the Multi-Ethnic Study of Atherosclerosis (MESA) projects are conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with MESA investigators. Support for MESA is provided by contracts 75N92020D00001, HHSN268201500003I, N01-HC-95159, 75N92020D00005, N01-HC-95160, 75N92020D00002, N01-HC-95161, 75N92020D00003, N01-HC-95162, 75N92020D00006, N01-HC-95163, 75N92020D00004, N01-HC-95164, 75N92020D00007, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, N01-HC-95169, UL1-TR-000040, UL1-TR-001079, UL1-TR-001420, UL1TR001881, DK063491, HL148610, and R01HL105756. The authors thank the other investigators, the staff, and the participants of the MESA study for their valuable contributions. A full list of participating MESA investigators and institutes can be found at http://www.mesa-nhlbi.org. MEC: The MEC study is funded through the National Cancer Institute (NCI; U01CA164973). The data generated for this analysis was funded by the NHGRI PAGE program (NIH U01HG007397) and NCI P01CA138338. WHI: The WHI program is funded by the National Heart, Lung, and Blood Institute, National Institute of Health, US Department of Health and Human Services [75N92021D00001, 75N92021D00002, 75N92021D00003, 75N92021D00004, and 75N92021D00005]. Fred Hutch computing is S10OD028685. The datasets used for the analyses described in this manuscript were obtained from dbGaP under accession phs000227. A short list of WHI investigators is available online at https://www-whi-org.s3.us-west-2.amazonaws.com/wp-content/uploads/WHI-Investigator-Short-List.pdf.
Author information
Authors and Affiliations
Contributions
Y.H. contributed to the design of the work, analyses, data interpretation, and manuscript draft. J.H. contributed to the design of the work, analyses, data interpretation, and manuscript revision. J.L. and B.D. contributed to the design of the work, interpretation of data, and manuscript revision. E.W., M.G., W.G., R.X., M.S., L.R., S.R., Y.W., X.W., A.F., M.L., and Y.P. contributed to analyses, data interpretation, and manuscript revision. L.P., H.X., B.M., J.B., N.S., J.B., B.P., G.P., M.T., S.R., J.R., J.S., S.K., A.R., L.L., M.F., and J.P. contributed to acquisition and interpretation of data as well as manuscript revision. M.G., K.N., C.K., and U.P. contributed to the design of the work, acquisition and interpretation of data, and manuscript revision.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
All studies obtained written informed consent from participants and were approved by local institutional review boards and ethics committees.
Consent for publication
All authors have approved the manuscript and agreed with its submission to Clinical Epigenetics.
Competing interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hu, Y., Haessler, J., Lundin, J.I. et al. Methylome-wide association analyses of lipids and modifying effects of behavioral factors in diverse race and ethnicity participants. Clin Epigenet 17, 54 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-025-01859-3
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-025-01859-3