Skip to main content

Protein epigenetic scores and overall mortality in the longitudinal Swedish Adoption/Twin Study of Aging (SATSA)

Abstract

Introduction

DNA methylation (DNAm) has a functional role in gene regulation, and it has been used to estimate various human characteristics. Variation in DNAm is associated with aging and variability of the proteome. Therefore, understanding the relationship between blood circulating proteins, aging, and mortality is critical to identify disease-causing pathways. We aimed to estimate the association between protein epigenetic scores (EpiScores) and overall mortality in the Swedish Adoption/Twin Study of Aging (SATSA).

Methods

We included information from 374 individuals collected between 1992 and 2014. Our exposures were 109 protein EpiScores generated using DNAm data and prediction models by the MethylDetectR shiny app. All-cause mortality was the outcome of interest. To estimate the protein EpiScores associations with all-cause mortality, we fitted Cox proportional hazard models adjusted for age, sex, education, smoking status, body mass index, and occupation. We also conducted co-twin control analyses to control for shared familial factors.

Results

The mean age of participants at the first assessment was 68.6 years. In total, nine protein EpiScores (e.g., Stanniocalcin 1) were associated with a higher risk for all-cause mortality. In contrast, five protein EpiScores (e.g., Prolyl endopeptidase) were associated with a lower risk for all-cause mortality.

Conclusion

The protein EpiScores associated with an increased mortality risk represent proteins involved in metabolic functions, immune response, and inflammation. Conversely, those associated with a lower risk represent proteins involved in neurogenesis and cellular functions. Overall, it is possible to predict protein levels from DNAm data that could have clinical relevance.

Introduction

Aging is characterized by molecular and cellular changes that may impact an individual’s functional abilities over a lifetime [1]. Among these changes, epigenetic alterations stand out as one of the hallmarks of aging [1]. Epigenetics is a process that modifies gene expression without altering the DNA sequence, yet it can be inherited across generations, e.g., DNA methylation (DNAm) [2].

DNAm is an epigenetic mechanism widely studied in epidemiological research, and it consists of a process where methyl groups are added to the cytosine bases of DNA to form 5-methylcytosine [2]. This mechanism is of particular significance since it regulates gene expression and inhibition [3]. DNAm changes are linked to both aging and environmental factors as well as aging-related diseases and mortality [4,5,6]. Over time, this biomarker has been used to study biological age acceleration through different epigenetic clocks. Some of these clocks focus solely on chronological age (e.g., Horvath’s epigenetic clock), while others incorporate clinical biomarkers of aging, DNAm-based plasma protein markers, and lifestyle characteristics alongside chronological age (e.g., DNAm PhenoAge and DNAm GrimAge) [5].

In a more recent approach, Gadd and collaborators [7] have developed an epigenetic predictor for protein levels. They generated protein epigenetic scores (EpiScores) using blood DNAm. These EpiScores were associated with the incidence of several common morbidities in aging, such as diabetes and cardiovascular diseases. Proteins serve numerous functions in human physiology to maintain homeostasis [8], acting as structural components within many of the body’s metabolic processes. In aging and aging-related diseases, many molecular and cellular alterations are related to protein abnormalities or altered functionality [1]. Notably, blood proteins have been suggested as potential proteomic aging clocks linked to extending or declining lifespan and health span in animal models [9].

If an epigenetic predictor for protein levels is directly correlated with their corresponding circulating blood proteins, this approach would enable researchers to investigate proteins related to disease-causing pathways with greater reproducibility and reduced susceptibility to transient fluctuations (e.g., acute infections) compared to the corresponding blood proteins. Researchers have examined the relationship between protein EpiScores, diseases, and inflammation [7, 10]; however, to our knowledge, associations with mortality have not been thoroughly evaluated. Therefore, we aimed to estimate the association between protein EpiScores and overall mortality in the Swedish Adoption/Twin Study of Aging (SATSA).

Methods

Study population

We used information from the Swedish Adoption/Twin Study of Aging (SATSA). SATSA is a longitudinal study to understand the factors that contribute to healthy aging and longevity. The cohort consists of a large sample of monozygotic (MZ) and dizygotic (DZ) twins (859 individuals) from Sweden. Data collection periods occurred between 1986 and 2014 (Supplementary Fig. S1) [11].

We analyzed data from up to six in-person testing (IPT) occasions (from 1992 to 2014—Supplementary Fig. S1). Each IPT was an extensive data collection that included physical tests, health information, and blood testing [11]. We considered 533 participants with available blood samples with methylation and covariates information. We excluded 159 participants who had missing covariate values (Supplementary Table S1). The final sample size for our main analysis included 374 individuals. For the co-twin control analyses, we excluded occupation as a covariate due to the high proportion of missing values, which would have reduced statistical power (Supplementary Table S1). Consequently, the sample size is larger (n = 491, including 176 MZ twins and 315 DZ twins).

Protein epigenetic scores

DNAm was measured using array technologies (Infinium Human Methylation 450 K Bead Chip [n = 385] or Infinium MethylationEPIC BeadChip [n = 150], both from Illumina). Information about the methylation process and multi-step quality control pipeline can be found elsewhere [12].

We used the MethylDetectR software [13] to generate the 109 protein EpiScores from blood DNAm data based on Gadd and collaborators work (Table 1) [7]. The authors generated epigenetic scores by training epigenetic data for 953 plasma proteins from the SOMAscan (aptamer-based) and Olink (antibody-based) platforms in two different cohorts (the German population-based study KORA and the Scottish Lothian Birth Cohort 1936) [7]. Of the 953 proteins, 109 met the robustness criteria (r > 0.1, p < 0.05) and were further validated in the Generation Scotland study. The authors used elastic net penalized regression models to generate the scores. In this model, DNAm at CpG sites, which is predictive of a given protein, was selected by the model, and each selected CpG received a weighting coefficient. The model’s weightings enable MethylDetectR to generate 109 protein EpiScores for our sample using SATSA DNAm data [7, 13].

Table 1 List of 109 protein epigenetic scores generated from blood DNA methylation data—SATSA

Overall, to generate the EpiScores using MethylDetectR software, the DNAm data, in beta values format, should be uploaded into the software. The software imputes missing beta values (mean imputation) and assigns missing CpG sites the mean beta value from Generation Scotland DNAm data. It is also possible to locally generate the protein EpiScores (using the same imputation methods) with the R codes and data files provided by MethylDetectR [13]. We chose to generate our EpiScores locally. (The data was not uploaded to MethylDetectR software.)

It is also worth mentioning that the protein EpiScores are returned without a unit/scale; however, they correlate with their target protein in the sense that those on the lower end of the scale should have lower protein values than those at the higher end [7, 13]. Hence, we standardized our scores using z-score standardization (mean value is 0, and the standard deviation is 1).

Overall mortality

All-cause mortality information (vital status and dates of death) was derived from the Swedish National Death Registry. The dates of death were updated on 01 October 2022 (date of censuring).

Time-stable and time-varying confounders

The time-stable confounders were sex, education, and occupation. Sex was treated as 1 = men and 2 = women. Education was categorized as 1 = elementary school, 2 = 0-level vocational school or folk high school, 3 = gymnasium, and 4 = university or higher. Occupation was defined as the work or profession the participants have had for most of their working life. The categories included: 1 = unskilled and semiskilled workers, 2 = skilled workers, 3 = assistant non-manual employees, 4 = intermediate non-manual employees, 5 = employed and self-employed professionals (higher civil servants and executives), 6 = self-employed (other than professionals), 7 = housewife (or male equivalent), and 8 = housewife with a temporary job. Education and occupation were considered a time-stable confounder because they are unlikely to change after 50 years of age.

The time-varying confounders included in our study were age (time scale), smoking status, and body mass index (BMI). All time-varying confounders were measured on the same date as the blood sample collection. Smoking status was treated as 1 = not currently smoking, 2 = ex-smoker, and 3 = currently smoking. BMI was numeric and calculated as kilograms divided by squared height in meters.

Statistical analyses

Means, standard deviations, proportions, and frequency were used to describe characteristics of the study population regarding the protein EpiScores, overall mortality, and confounders. Descriptive Pearson correlation plots between the protein EpiScores and age are included in Supplementary Figs. S2, S3, and S4.

The effects of the 109 protein EpiScores on overall mortality were estimated through Cox proportional hazard models. First, to create the time scale, we considered the earliest chronological age, in months, as the initial time, and from this age, the entry time of each individual was calculated. We utilized a counting process to account for time-varying confounders. In this time-varying analysis, all variables were included at each time point [14].

We performed a separate model for each of the 109 protein EpiScores. We considered hazard ratios (HR) with a 95% Confidence Interval (CI) as significant; however, we only interpreted HR that were significant after adjustments for multiple comparisons (False Discovery Rate, FDR). Model 1 refers to estimated crude HR adjusted for age (time scale) and sex. Model 2 refers to estimated HR adjusted for age (time scale), sex, education, smoking status, BMI, and occupation. In all models, we considered the relatedness of the twin pairs. The accumulated risks of overall mortality for the protein EpiScores during the entire study period are shown in Supplementary Fig. S5 and S6. Additionally, we evaluated whether the risk factor protein EpiScores were correlated with one another (Supplementary Fig. S7). A similar assessment was conducted for the protective factor protein EpiScores (Supplementary Fig. S8). Finally, we conducted residuals analysis with different purposes. We verified Schoenfeld residuals to check the assumption of proportional hazards, Deviance residuals to check outliers, and Score residuals to check influential or extreme observations. Lastly, Harrell’s C-index was described in the models to check the global assessment of discrimination. Values close to one have better discriminatory power.

To evaluate the familial and shared environmental influences, we performed stratified Cox proportional hazard models within MZ and DZ twin pairs. In these analyses, model 2 was not adjusted for occupation. We also tested the correlation of protein EpiScores values within MZ twin pairs as sensitivity analysis. All analyses were conducted using R software [15], version 4.3.1, package tidyverse, heatmaply, and survival.

Results

In the initial assessment, the mean age of participants was 68.6 years. Most of the study population consisted of women (61%), 50.5% completed elementary school, and 29.1% were unskilled and semiskilled workers. Additionally, 80.5% were non-smokers, with a mean body mass index of 26 in the initial assessment. These proportions remained consistent throughout the entire study period. The average follow-up time was 16 years and further two IPTs per individual. The number of participants who had died was 253 (Table 2). The 109 protein EpiScores exhibited a range from approximately −8.5 to 7.2 (Supplementary Table S2). Furthermore, these protein EpiScores had weak correlations with age, with values varying between −0.4 and 0.4 (Supplementary Figs. S2, S3, and S4).

Table 2 Descriptives of study variables in the first assessment and in the whole study period, SATSA

Protein EpiScore and overall mortality

In Cox proportional hazard models (after multiple comparisons correction), the increase of one standard deviation of IGFBP4, MMP9, OSM, PIGR, RARRES2, S100A12, STC1, TGFA, and THBS2 was associated with an increased risk of overall mortality: HR of 1.23, 1.23, 1.32, 1.31, 1.21, 1.21, 1.48, 1.31, and 1.18, respectively (Table 3). MMP9, OSM, PIGR, S100A12, and TGFA exhibited strong correlations with each other (r > 0.70) (Supplementary Fig. S7). Conversely, the increase of one standard deviation of LY9, NCAM1, PREP, SEMA3E, and WFIKKN2 were associated with a lower risk of overall mortality: HR of 0.80, 0.78, 0.83, 0.79, and 0.80, respectively (Table 3). LY9, NCAM1, and SEMA3E exhibited moderate correlation with each other (r > 0.50) (Supplementary Fig. S8). The estimated concordance probability of all models had a predictive value ranging from 64 to 66% (Harrell’s C-index). The hazard ratios for the 109 protein EpiScores are detailed in Supplementary Table S3. Throughout the study period, the cumulative risk of overall mortality progressively increased, particularly toward the end, for all protein EpiScores (Supplementary Figs. S5 and S6).

Table 3 Crude and adjusted associations between protein EpiScores and overall mortality in SATSA

The residual analyses revealed that NTRK3—Olink violated Schoenfeld’s assumption (Supplementary Table S4 and Supplementary Fig. S9). Hence, despite having significant estimates in the Cox proportional hazard models, its estimates were not interpreted. Additionally, we observed a few outliers (deviance residuals—Supplementary Table S5) but no influential or extreme observations (score residuals—Supplementary Table S6).

In Cox proportional hazard models within twin pairs, the HRs for the protein EpiScores were not statistically significant for DZ twin pairs. In contrast, for MZ twin pairs, we observed stronger counterintuitive associations with substantial confidence intervals (Fig. 1 and Supplementary Table S7). In our sensitivity analysis (protein EpiScores correlation within MZ twin pairs, Supplementary Fig. S10), we observed that most protein EpiScores had poor or moderate correlation within these twin pairs. This suggests a potentially lower genetic correlation. However, STC1 was an exception, showing a correlation higher than 0.7 (Supplementary Fig. S10).

Fig. 1
figure 1

Forest plot with Cox proportional hazards from co-twin control analyses, SATSA (n = 491 individuals). Note: The values represent hazard ratios (HR) with a 95% Confidence Interval (CI) from stratified Cox proportional hazard models within MZ and DZ twin pairs. Swedish Adoption/Twin Study of Aging—SATSA. The confidence interval for MMP9 ranges from 3.03 to 93.7; however, it was not represented in the figure to enhance visualization. IGFBP4: Insulin-like growth factor-binding protein 4, LY9: Lymphocyte antigen 9, MMP9: Matrix metalloproteinase 9, NCAM1: Neural cell adhesion molecule 1, OSM: Oncostatin-M, PIGR: Polymeric immunoglobulin receptor, PREP: Prolyl endopeptidase, RARRES2: Retinoic acid receptor responder protein 2, S100A12: S100 calcium binding protein A12, SEMA3E: Semaphorin 3E, STC1: Stanniocalcin 1, TGFA: Transforming growth factor alpha, THBS2: Thrombospondin 2, WFIKKN2: WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing protein 2

Discussion

In this study, we investigated the association between protein EpiScores and overall mortality. We found nine protein EpiScores associated with an increased overall mortality risk and five EpiScores associated with a decreased overall mortality risk after adjustments for age (time scale), sex, education, smoking status, BMI, and occupation.

The protein EpiScores associated with an increased mortality risk represent proteins involved in metabolic functions, immune response, and inflammation. Additionally, some represent proteins acting as growth factors/promoters. Specifically, IGFBP4, RARRES2, S100A12, and STC1 have metabolic functions, such as renal phosphate reabsorption and regulation of inflammatory processes [7, 16,17,18,19]. MMP9, OSM, PIGR, and THBS2 play crucial roles in the immune response and inflammation. These roles include, for example, facilitating cell-to-cell interactions, transporting antibodies, and regulating the growth and differentiation of various cell types [20,21,22,23]. Lastly, TGFA regulate cell growth, such as angiogenesis and tissue regeneration [24]. Most of these proteins are related to the pathogenesis of numerous diseases, such as cancers and cardiovascular diseases [16, 17, 20, 24]. In previous research, these protein EpiScores were also associated with the incidence of ischemic heart disease, chronic obstructive pulmonary disease, lung cancer, depression, and diabetes [7].

It is worth noticing that these proteins interact with each other, and a high level of a protein EpiScore might not necessarily indicate they have a role in diseases or mortality causal paths. As an example, Gude et al. describe the relationship between PAPPA, IGFBP1, IGFBP4, and STC1 in cardiovascular disease [25]. In this scenario, STC1 may be elevated as a compensatory mechanism in response to elevated levels of PAPPA, despite its association with a high risk of all-cause mortality. Hence, it is essential to conduct additional studies to explore the potential role of these proteins or protein EpiScores in disease causal pathways through stronger causal inference methods [26].

The protein EpiScores associated with a lower mortality risk represent proteins acting as growth factors and involved in inflammation, immune response, neurogenesis, and cellular functions. Specifically, LY9 and SEMA3E have significant roles in immune response, including the regulation of various immune cell activities and functions [27, 28]. NCAM1 is a protein regulating neurogenesis, neuron-neuron adhesion, neurite fasciculation, outgrowth of neurites, and cell migration [29]. PREP is a cell surface glycoprotein serine protease that participates in many cellular processes in tissue remodeling and inflammation [30]. Lastly, WFIKKN2 inhibits the biological activity of mature myostatin [7]. In previous research, PREP and WFIKKN2 EpiScores were considered protective factors to incident chronic obstructive pulmonary disease and stroke, respectively [7].

Limited research is found concerning associations between mortality and the proteins corresponding to the EpiScores identified here [25, 31,32,33,34,35,36,37,38]. The studies are often conducted with small samples or in specific settings, e.g., institutionalized participants or participants from clinics, and populations with specific conditions, e.g., participants in hemodialysis. However, our results in community-dwelling older twin adults appear to be robust and in the same direction of association as previous research. To the best of our knowledge, our study was the first to describe associations between the protein EpiScores and overall mortality.

We also conducted a co-twin control analysis to estimate the association between the protein EpiScores and overall mortality, considering the interplay between environmental factors and genetics [39]. However, we could not extract much information from the co-twin control analysis due to limited statistical power (most associations were non-significant). In a sensitivity analysis, we found that the EpiScore STC1 exhibited the highest correlation among MZ twins, suggesting a potentially stronger genetic influence despite its non-significant association in the within-pair analysis. The other EpiScores showed weaker or moderate correlations within the MZ twins. It is reasonable to conclude that the protein EpiScores may be influenced by environmental characteristics since they are created using an epigenetic mechanism (DNAm), and external factors, such as drugs, nutrition, and chemicals, may influence epigenetics alterations [40]. This influence of the environmental characteristics on DNAm is a limitation of this study.

Another limitation is our sample size. Although we had longitudinal data to estimate the associations between protein EpiScores and overall mortality over time, we could not perform stratified analysis by sex due to a lack of statistical power. Additionally, we were unable to consider important confounders in our analysis, such as race/ethnicity. It is well-established that men and women differ in many biological aging traits [41], and that ethnic differences exist. Therefore, we recommend that future research further analyze sex and ethnic differences concerning the protein EpiScores. Furthermore, as in any longitudinal study, attrition occurred in SATSA, resulting in individuals remaining with better health status. Hence, the generalizability of our study to the general population must be considered with caution. Finally, SATSA did not measure the corresponding blood protein values, preventing us from providing correlations between EpiScores and their corresponding blood proteins as a validation metric.

To conclude, we assessed the protein EpiScores developed by Gadd and collaborators in a distinct cohort and found that 14 protein EpiScores were significantly associated with overall mortality. This study discussed the functionality of the blood proteins associated with these significant EpiScores and addressed their relevance in different health outcomes. Overall, it is possible to predict protein levels from DNAm data, which could have clinical relevance.

Availability of data and materials

The data cannot be shared publicly by the authors. The Swedish Adoption/Twin Study of Aging (SATSA) data is available from the National Archive of Computerized Data on Aging (NACDA) 2015. https://www.icpsr.umich.edu/web/NACDA/studies/3843/summary.

References

  1. López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. Hallmarks of aging: an expanding universe. Cell. 2023;186(2):243–78. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.CELL.2022.11.001.

    Article  PubMed  Google Scholar 

  2. Weinhold B. Epigenetics: the science of change. Environ Health Perspect. 2006. https://doiorg.publicaciones.saludcastillayleon.es/10.1289/ehp.114-a160.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Moore LD, Le T, Fan G. DNA Methylation and its basic function. Neuropsychopharmacology. 2013;38(1):23–38. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/npp.2012.112.

    Article  CAS  PubMed  Google Scholar 

  4. Li X, Ploner A, Wang Y, et al. Longitudinal trajectories, correlations and mortality associations of nine biological ages across 20-years follow-up. Elife. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.7554/ELIFE.51507.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Duan R, Fu Q, Sun Y, Li Q. Epigenetic clock: a promising biomarker and practical tool in aging. Ageing Res Rev. 2022;81: 101743. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.arr.2022.101743.

    Article  CAS  PubMed  Google Scholar 

  6. Salameh Y, Bejaoui Y, El Hajj N. DNA methylation biomarkers in aging and age-related diseases. Front Genet. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fgene.2020.00171.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Gadd DA, Hillary RF, McCartney DL, et al. Epigenetic scores for the circulating proteome as tools for disease prediction. Elife. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.7554/eLife.71802.

    Article  PubMed  PubMed Central  Google Scholar 

  8. LaPelusa A, Kaushik R. Physiology, Proteins. StatPearls Treasure Island (FL). https://www.ncbi.nlm.nih.gov/books/NBK555990/.

  9. Johnson AA, Shokhirev MN, Wyss-Coray T, Lehallier B. Systematic review and analysis of human proteomics aging studies unveils a novel proteomic aging clock and identifies key processes that change with age. Ageing Res Rev. 2020;60: 101070. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.arr.2020.101070.

    Article  CAS  PubMed  Google Scholar 

  10. Smith HM, Moodie JE, Monterrubio-Gómez K, et al. Epigenetic scores of blood-based proteins as biomarkers of general cognitive function and brain health. Clin Epigenetics. 2024;16(1):46. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-024-01661-7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Pedersen NL, McClearn GE, Plomin R, Nesselroade JR, Berg S, DeFaire U. The Swedish Adoption Twin Study of Aging: an update. Acta Genet Med Gemellol (Roma). 1991;40(1):7–20. https://doiorg.publicaciones.saludcastillayleon.es/10.1017/S0001566000006681.

    Article  CAS  PubMed  Google Scholar 

  12. Karlsson IK, Ericsson M, Wang Y, et al. Replicating associations between DNA methylation and body mass index in a longitudinal sample of older twins. Int J Obes. 2020;44(6):1397–405. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41366-019-0498-6.

    Article  CAS  Google Scholar 

  13. Hillary RF, Marioni RE. MethylDetectR: a software for methylation-based health profiling. Wellcome Open Res. 2021;5:283. https://doiorg.publicaciones.saludcastillayleon.es/10.12688/wellcomeopenres.16458.2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Zhang Z, Reinikainen J, Adeleke KA, Pieterse ME, Groothuis-Oudshoorn CGM. Time-varying covariates and coefficients in Cox regression models. Ann Transl Med. 2018;6(7):121–121. https://doiorg.publicaciones.saludcastillayleon.es/10.21037/atm.2018.02.12.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Team R Core. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2024. https://www.R-project.org/

  16. Nur S, Ozturk A, Kavas M, et al. IGFBP-4: a promising biomarker for lung cancer. J Med Biochem. 2021;40(3):237–44. https://doiorg.publicaciones.saludcastillayleon.es/10.5937/jomb0-25629.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Zhao F, Yang G, Feng M, et al. Expression, function and clinical application of stanniocalcin-1 in cancer. J Cell Mol Med. 2020;24(14):7686–96. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/jcmm.15348.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Meijer B, Gearry RB, Day AS. The role of S100A12 as a systemic marker of inflammation. Int J Inflam. 2012;2012:1–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1155/2012/907078.

    Article  CAS  Google Scholar 

  19. https://www.proteinatlas.org/ENSG00000186340-THBS2.RARRES2. 2024. Accessed December 19, 2024. https://www.proteinatlas.org/ENSG00000106538-RARRES2

  20. National Institutes of Health. Oncostatin M. June 17, 2024. Accessed June 28, 2024. https://www.ncbi.nlm.nih.gov/gene/5008

  21. Turula H, Wobus CE. The role of the polymeric immunoglobulin receptor and secretory immunoglobulins during mucosal infection and immunity. Viruses. 2018;10(5):237. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/v10050237.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. The human protein atlas. MMP9. 2024. Accessed December 19, 2024. https://www.proteinatlas.org/ENSG00000100985-MMP9

  23. The human protein atlas. THBS2. 2024. Accessed December 19, 2024. https://www.proteinatlas.org/ENSG00000186340-THBS2

  24. National Institutes of Health. Transforming growth factor alpha. June 17, 2024. Accessed June 28, 2024. https://www.ncbi.nlm.nih.gov/gene/7039

  25. Gude MF, Hjortebjerg R, Bjerre M, et al. The STC2–PAPP-A–IGFBP4–IGF1 axis and its associations to mortality and CVD in T2D. Endocr Connect. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1530/EC-22-0451.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Hammerton G, Munafò MR. Causal inference with observational data: the need for triangulation of evidence. Psychol Med. 2021;51(4):563–78. https://doiorg.publicaciones.saludcastillayleon.es/10.1017/S0033291720005127.

    Article  PubMed  PubMed Central  Google Scholar 

  27. The human protein atlas. LY9. Published online 2024. Accessed December 19, 2024. https://www.proteinatlas.org/ENSG00000122224-LY9

  28. The human protein atlas. SEMA3E. Published online 2024. Accessed December 19, 2024. https://www.proteinatlas.org/ENSG00000170381-SEMA3E

  29. National Institutes of Health. Neural cell adhesion molecule 1. June 17, 2024. Accessed June 28, 2024. https://www.ncbi.nlm.nih.gov/gene/4684

  30. Huang CH, Suen CS, Lin CT, et al. Cleavage-site specificity of prolyl endopeptidase FAP investigated with a full-length protein substrate. The J Biochem. 2011;149(6):685–92. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/jb/mvr017.

    Article  CAS  PubMed  Google Scholar 

  31. Lisko I, Tiainen K, Stenholm S, et al. Inflammation, adiposity, and mortality in the oldest old. Rejuvenation Res. 2012;15(5):445–52. https://doiorg.publicaciones.saludcastillayleon.es/10.1089/rej.2011.1310.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Jung ES, Chung W, Kim AJ, et al. Associations between soluble receptor for advanced glycation end products (sRAGE) and S100A12 (EN-RAGE) with mortality in long-term hemodialysis patients. J Korean Med Sci. 2017;32(1):54. https://doiorg.publicaciones.saludcastillayleon.es/10.3346/jkms.2017.32.1.54.

    Article  CAS  PubMed  Google Scholar 

  33. Kalousová M, Kuběna AA, Benáková H, Dusilová-Sulková S, Tesař V, Zima T. EN-RAGE (extracellular newly identified receptor for advanced glycation end-products binding protein) and mortality of long-term hemodialysis patients: aprospective observational cohort study. Clin Biochem. 2012;45(7–8):556–60. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.clinbiochem.2012.02.014.

    Article  CAS  PubMed  Google Scholar 

  34. Santalahti K, Havulinna A, Maksimow M, et al. Plasma levels of hepatocyte growth factor and placental growth factor predict mortality in a general population: a prospective cohort study. J Intern Med. 2017;282(4):340–52. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/joim.12648.

    Article  CAS  PubMed  Google Scholar 

  35. Parker MJS, Jee AS, Hansen D, et al. Multiple serum biomarkers associate with mortality and interstitial lung disease progression in systemic sclerosis. Rheumatology. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/rheumatology/keae110.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Garcia-Navarrete R, Garcia E, Arrieta O, Sotelo J. Hepatocyte growth factor in cerebrospinal fluid is associated with mortality and recurrence of glioblastoma, and could be of prognostic value. J Neurooncol. 2010;97(3):347–51. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s11060-009-0037-8.

    Article  CAS  PubMed  Google Scholar 

  37. Czúcz J, Cervenak L, Förhécz Z, et al. Serum soluble E-selectin and NT-proBNP levels additively predict mortality in diabetic patients with chronic heart failure. Clin Res Cardiol. 2011;100(7):587–94. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00392-011-0283-6.

    Article  CAS  PubMed  Google Scholar 

  38. Noren Hooten N, Mode NA, Allotey S, Ezike N, Zonderman AB, Evans MK. Inflammatory proteins are associated with mortality in a middle-aged diverse cohort. Clin Transl Med. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/ctm2.1412.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Haworth CMA, Dale P, Plomin R. A twin study into the genetic and environmental influences on academic performance in science in nine-year-old boys and girls. Int J Sci Educ. 2008;30(8):1003–25. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/09500690701324190.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Kubota T, Miyake K, Hirasawa T. Epigenetic understanding of gene-environment interactions in psychiatric disorders: a new concept of clinical genetics. Clin Epigenetics. 2012;4(1):1. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1868-7083-4-1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Hägg S, Jylhävä J. Sex differences in biological aging with a focus on human studies. Elife. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.7554/ELIFE.63425.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors are grateful for the participants’ contributions to the SATSA study.

Funding

Open access funding provided by Karolinska Institute. This work was supported by the Swedish Research Council (grant numbers 2015-03255, 2017-00639, 2019-01272, 2020-06101, 2021-00178, 2022-01608); the National Institute on Aging (1R01AG067996); and Karolinska Institutet Foundations and Strategic Research Program in Epidemiology. The funders of this research did not contribute to study design development, analysis, interpretation of data, and in the writing of the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

T.L.O. contributed to the conceptualization of the research, data curation, formal analysis, investigation, software, visualization, writing—original draft, and writing—review & editing. A.M. contributed to data curation and writing—review & editing. J.K.L.M. contributed with writing—review & editing. N.L.P. contributed supporting methodology and writing—review & editing. S.H. contributed to the conceptualization of the research, funding acquisition, methodology, project administration, resources, supervision, and writing—review & editing.

Corresponding author

Correspondence to Thaís Lopes De Oliveira.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lopes De Oliveira, T., March, A., Mak, J.K.L. et al. Protein epigenetic scores and overall mortality in the longitudinal Swedish Adoption/Twin Study of Aging (SATSA). Clin Epigenet 17, 41 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-025-01843-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13148-025-01843-x

Keywords