Next Article in Journal
Decoding Picky Eating in Children: A Temporary Phase or a Hidden Health Concern?
Previous Article in Journal
Folate Supplementation Awareness Among Women of Reproductive Age in Poland: Focus on Active Forms and Updated National Recommendations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Lifestyle Patterns and Incidence of Cardiovascular Diseases, Cancer, Respiratory Diseases, and Type 2 Diabetes: A Large-Scale Prospective Cohort Study

Department of Epidemiology, University Medical Center Groningen, University of Groningen, 9713 AP Groningen, The Netherlands
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Nutrients 2025, 17(24), 3883; https://doi.org/10.3390/nu17243883
Submission received: 12 November 2025 / Revised: 4 December 2025 / Accepted: 6 December 2025 / Published: 12 December 2025
(This article belongs to the Section Nutritional Epidemiology)

Abstract

Background: Lifestyle factors often interact in complex ways when influencing chronic disease risk. We aimed to examine the prospective associations between empirically derived real-life lifestyle patterns (LPs) and the incidence of major chronic diseases, and to explore the linearity of the relationships between lifestyle summation scores and disease risk. Methods: We included adults free of cardiovascular diseases (CVDs), cancer, chronic respiratory diseases (CRDs), or type 2 diabetes (T2D) at baseline (2006–2013) from the Dutch Lifelines cohort. LPs and lifestyle summation scores were derived from baseline self-reported data on diet, physical activity, substance use, sleep, stress, and social connectedness, each categorised as healthy, moderately healthy, or unhealthy. Fine–Gray sub-hazard regression models assessed associations between LPs and disease incidence, with natural spline functions used to evaluate linearity in summation scores. Results: Among 114,919 T2D-free, 131,248 cancer-free, 91,777 CRD-free, and 77,645 CVD-free participants, we observed 3114 T2D, 4685 cancer, 4133 CRDs, and 2850 CVD incident cases (median follow-up time: 8 years). Compared to the “Unhealthy” pattern, both the “Healthy-in-a-balanced-way” and “Healthy-but-physically-inactive” patterns were broadly significantly protective. The “Unhealthy-but-no-substance-use” pattern was associated with increased T2D risk (Sub-Hazard Ratio (SHR) = 1.27, 95% Confidence Interval (CI): 1.11–1.47) but reduced cancer risk (SHR = 0.85, 95%CI: 0.74–0.97). The “Unhealthy-but-light-drinking-and-never-smoked” pattern was protective for T2D (SHR = 0.89, 95%CI: 0.79–0.99). Linear associations were observed between lifestyle summation scores and disease risk, except for “healthy lifestyle” scores with T2D and “unhealthy lifestyle” scores with CRDs (non-linear p-value < 0.05). Conclusions: There are potential protective effects of healthy lifestyles on T2D, cancer, CRDs, and CVDs. However, the “Unhealthy but no substance use” demonstrated increased risk on T2D, protective effect on cancer and no significant effect on CRDs or CVDs. The relationship between combined lifestyle factors and NCD risk is complex and partly non-linear, showing diminishing benefits beyond certain thresholds, especially T2D and CRDs.

1. Introduction

Non-communicable diseases (NCDs) are the leading causes of death globally [1,2]. Among them, cancer and cardiovascular diseases (CVDs) are the foremost contributors to global mortality [3,4]. Chronic respiratory diseases (CRDs) are the third leading cause of mortality globally in 2019 [5]. Type 2 diabetes (T2D), the most prevalent of diabetes, is predicted to have a 12.2% prevalence rate by 2045 [6]. Collectively, CVDs, cancer, CRDs and T2D account for 80% of all premature NCD deaths [7].
Lifestyle Medicine emphasises prevention and disease management through healthy lifestyle behaviours and avoidance of risky substances defined by six pillars- nutrition, physical (in)activity, sleep, stress, social connection, and substance use [8]. Strong evidence links these lifestyle domains to the incidence of CVDs, cancer, CRDs and T2D [9,10,11]. However, few studies have examined all six pillars simultaneously in relation to these four NCDs [12,13,14,15]. Given the co-occurrence [16,17] and revealed synergistic effect [18,19] of multiple lifestyle factors, it is critical to study their real-world combinations rather than individual effects summed together.
Lifestyle patterns (LPs), identified via latent class analysis (LCA), capture real-world combinations of multiple lifestyle factors [20]. Significant associations have been reported between LPs with cancer [21], T2D [22,23,24], cardiovascular risk markers [25], and dementia [26]. However, comparisons of LP-associated risks for CVDs, cancer, CRDs, and T2D within the same population remain unclear. Lifestyle summation scores are widely used to evaluate combined lifestyle effects [27,28], typically showing that higher scores correlate with lower disease risk [29,30,31]. Yet, few studies have examined whether these associations are linear [29], and comparisons of LPs versus lifestyle summation scores within the same population are lacking.
This study aims to: (1) investigate the relationships between real-world LPs (based on six pillars of Lifestyle Medicine) and four major NCDs within one population, and (2) assess whether associations between lifestyle summation scores and major four NCD risk are linear, as commonly assumed.

2. Methods

2.1. Study Design and Participants

Data from the Lifelines cohort wave one to wave three were used (2006–2024). Lifelines is a multi-disciplinary prospective population-based cohort study examining in a unique three-generation design the health and health-related behaviours of 167,729 persons living in the North of the Netherlands. It employs a broad range of investigative procedures in assessing the biomedical, socio-demographic, behavioural, physical and psychological factors which contribute to the health and disease of the general population, with a special focus on multi-morbidity and complex genetics [32]. More detailed information about Lifelines has been published elsewhere [33,34]. Adults (≥18 years) who were disease-free at the baseline (wave 1, 2006–2013) were included to meet the research interests. Participants who were lost to follow-up or for whom disease status was unavailable at both follow-up assessments (wave 2, 2014–2019; wave 3, 2019–2024) were excluded (see Figure 1). This study was conducted following the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline [35].

2.2. Exposures and Outcomes

LPs and lifestyle summation scores were defined as the primary exposures, derived from baseline data of the Lifelines cohort. Ten lifestyle variables were included based on the six pillars of Lifestyle Medicine [20] and each was categorised into three levels (healthy, moderately healthy, or unhealthy) based on national recommendation guidelines [36], or according to tertiles when no guideline was available: smoking status (never, former, current smoker), alcohol intake (abstainer, light to moderate (0–15 g/day), excessive (≥15 g/day), binge drinking (abstainer, not binge drinking: 1–4 glasses/ day (female) or 1–5 glasses/day (male), binge drinking: ≥4 glasses/day (female) or ≥5 glasses/ day (male)), diet quality (high, moderate, low), ultra-processed food consumption (low, moderate, high), physical activity (>300 mins/week, 120–300 mins/week, 0–120 mins/week), long-term stress level (none, moderate, high), TV watching time (≤2 h, 2–3 h, >3 h), sleep duration (recommend: 7/8/9 h/day (18–65 y) or 7/8 h/day (≥65 y), above recommended: >9 h/day (18–65 y), or >8 h/day (≥65 y), below recommended: <7 h/day (≥18y), and social connections (high, moderate, low). Smoking status and alcohol consumption were determined based on the self-report information. Diet quality was assessed using the Lifelines Diet Score (LLDS) [37], derived from the Food Frequency Questionnaire (FFQ) [38]. Ultra-processed food consumption was classified according to the NOVA framework [39] and expressed as the proportion of UPF weight relative to total weight daily food and beverage consumption, calculated from FFQ responses. Physical activity was quantified using the Short QUestionnaire to ASsess Health-Enhancing Physical Activity (SQUASH) [40], as total weekly minutes of moderate-to-vigorous leisure-time activity, including commuting and leisure. Long-term stress was measured using the Long-term Difficulties Inventory (LDI) [41]. Sedentary behaviour was evaluated by self-reported television watching time. Sleep duration was self-reported in hours and minutes and categorised according to age-specific recommendations of the American National Sleep Foundation [36]. Social connection was evaluated using the short form of the Social Production Function Instrument for the Level of Well-being (SPF-IL(s)) [42]. Further details on the derivation of LPs and lifestyle summation scores are provided elsewhere [20] or Figure 2.
Briefly, we conducted LCA based on the above ten categorical lifestyle variables. The optimal class solution is one that achieved meaningful theoretical interpretation saturation [43,44], rather than relying solely on statistical indices such as Akaike information criterion (AIC), Bayesian information criterion (BIC), Lo–Mendell–Rubin adjusted likelihood ratio test (LMR-LRT), or normalised entropy. Ultimately, a five-class model was selected as the final solution. This process resulted in the identification of two “Healthy” and three “Unhealthy” patterns: “Healthy but physically inactive”, “Healthy in a balanced way”, “Unhealthy but no substance use”, “Unhealthy but light drinking and never smoked”, and “Unhealthy” patterns. Lifestyle summation scores were calculated by summing the number of healthy or unhealthy levels across the ten lifestyle variables separately, resulting in healthy and unhealthy lifestyle scores ranging from 0 to 10.
The outcomes of interest were the incidence of T2D, cancer, CRDs, and CVDs. T2D was defined as self-reported T2D, or self-reported diabetes with missing or unknown type, and/or participants using diabetes medication and/or having a fasting plasma glucose (FG) ≥ 7.0 mmol/L or a hemoglobin A1c (HbA1c) ≥ 6.5% [45]. Individuals with Type 1 diabetes were then excluded, unless they specifically reported T2D and used medications other than insulin, to define pure T2D. Cancer assessment relied on self-reported data; cases involving primary skin cancer, pre-stage cancer, or past cancer with unknown origin were excluded, and considered low prevalence rate and weak associations with the relevant lifestyle factors [46]. CRDs were classified as either asthma or chronic obstructive pulmonary disease (COPD) based on clinical diagnosis, medication use, spirometry, and/or reported two or more symptoms (wheeze, dyspnea at rest, and nocturnal dyspnea) [47]. Eleven CVDs (myocardial infarction, heart failure, atrial fibrillation, heart valve disorders, arrhythmia, aortic aneurysm, stroke, thrombosis, atherosclerosis, narrowing of carotid arteries and balloon angioplasty or bypass surgery) were included based on the International Classification of Diseases (ICD) [48,49]. Mortality data were obtained from the Dutch municipal population registries.

2.3. Confounders

Socio-demographic variables were considered as potential confounders, as they are associated with both lifestyle factors and disease outcomes [50,51]. These variables were collected through self-administered questionnaires and included age, sex, marital status, household income, highest educational attainment, and employment status. Household income was normalised by dividing net household income (the midpoint of each participant’s income category) by the square root of the number of household members. The categorisation of these variables is presented in Table 1 and Supplementary Material File S1.

2.4. Statistical Analysis

We calculated the incidence rates for T2D, cancer, CRDs and CVDs by dividing new cases by person-years at risk and multiplying by 100,000, respectively. Person-years at risk were calculated from the date of enrolment to the date of death, onset of disease, loss to follow-up, or the end of follow-up, whichever came first. We imputed missing socio-demographic variables and LPs using multivariate imputation by chained equations (MICE) with five imputation sets [52], where LPs were imputed based on their individual lifestyle indicators rather than on the LPs alone. Details on comparisons of pre- and post-imputation datasets are provided in Supplementary Material, File S2, Figure S1 and Table S1.
Fine–Gray sub-distribution hazard models [53] were used to account for the competing risk of death when estimating the incidence of the four leading NCDs in relation to LPs and lifestyle summation scores, separately. Sub-distribution hazard proportionality was test by plotting cumulative incidence curves. All models were adjusted for socio-demographic variables, with age modelled as a continuous variable. Lifestyle summation scores were further modelled using natural spline functions to capture potential non-linear associations. Models were fitted with degree of freedom (df) ranging from 2 to 6, and the optimal non-linear models were selected based on the lowest Akaike Information Criterion (AIC). Log-likelihood ratio tests were then performed to compare the linear models (df = 1) with the optimal non-linear models, with p-values used to assess statistical significance.
Sensitivity analyses were conducted to evaluate the robustness of our findings. First, to reduce the potential bias arising from lifestyle modifications due to other health conditions, we adjusted for the presence of the three other chronic diseases (excluding the disease of interest) in LPs’ model. Second, we redefined reference groups to compare the effects across different LPs. Third, we performed complete-case analyses to evaluate the impact of missing-data handling. Fourth, we applied traditional Cox proportional hazards models without accounting for competing risks. Fifth, to examine the potential interdependence between healthy and unhealthy lifestyle scores, we included the opposite scores as a covariate in each corresponding model.
All analyses were performed using RStudio (R 4.2.2). Multiple imputations were conducted using the mice package [32]; Fine–Gray sub-distribution hazard models were implemented with the tidycmprsk package, and natural splines functions were applied using the splines package. A two-sided p-value of less than 0.05 was considered statistically significant through all the analyses. Figure 2 illustrates the methodological framework of this study.

3. Results

3.1. Study Population and Descriptive Statistics

The final analytical datasets included 114,919 participants without T2D, 131,248 participants for the cancer analysis, 91,777 for CRDs, and 77,645 for CVDs (Figure 1). Participant characteristics are summarised in Table 1. Across all datasets, approximately 60% were female, with a mean age ranging from 45.3 and 46.3 years (SD = 13). Most participants had upper secondary education (38.3–38.7%), an income between €1500 and €1900 (25.4–26.2%), were married or cohabiting (80.2–81.1%), and employed full-time (44.2–46.9%).
Table 2 presents the prevalence of LPs and lifestyle summation scores. The most common LPs were “Healthy in a balanced way” (38.0–40.4%) and “Unhealthy but light drinking and never smoked” (27.9–28.9%). Approximately 80% of participants had 2–5 healthy lifestyle scores, and 70% had 2–4 unhealthy scores.
Over a median follow-up of 9.3 years for T2D, 5.9 years for cancer, 8.5 years for CRDs, and 8.7 years for CVDs, a total of 3114, 4685, 4133, and 2850 disease incidences were recorded, respectively. CRDs had the highest incidence rate (603 per 100,000 person-years), while T2D had the lowest (321 per 100,000 person-years). Cancer and CVDs had incidence rates of 511 and 485 per 100,000 person-years, respectively (Table 1).

3.2. Associations Between LPs and Disease Risk

Table 3 summarises the associations between LPs and disease risk. Compared to the “Unhealthy” pattern, the “Healthy in a balanced way” pattern was significantly associated with reduced risks of T2D, cancer, and CRDs. The “Healthy but physically inactive” pattern was linked to significantly lower risks across all four disease outcomes. The “Unhealthy but no substance use” pattern was associated with increased risk of T2D (SHR = 1.27; 95% CI: 1.11–1.47) but decreased cancer risk (SHR = 0.85; 95% CI: 0.74–0.97). The “Unhealthy but light drinking and never smoked” pattern was linked to decreased T2D risk. File S2 Table S2 presents the associations between LPs and disease risk, using different LPs as reference categories.

3.3. Linearity of Associations Between Lifestyle Summation Scores and Disease Risk

Comparison of the optimal non-linear models with the corresponding linear models (Table 4) revealed two associations where the non-linear specification provided a significantly better fit (p< 0.05): between healthy lifestyle scores and T2D, and between unhealthy lifestyle scores and CRDs. For the remaining associations, the non-linear models did not demonstrate a significantly improved fit. These associations are illustrated in Figure 3. File S2 Table S3 presents the results of optimal non-linear model selection based on AIC.
Using the most prevalent healthy lifestyle score of 3 as the reference, we observed a significant inverse association with T2D risk (non-linear) and CRD risk (linear) (Figure 3a,c). Only a score of 10 significantly decreased cancer risk (Figure 3b), and no significant associations were found for CVDs (Figure 3d). When using the most common unhealthy lifestyle score of 2 as the reference, unhealthy scores of 0–1 were associated with a significantly reduced T2D risk, while scores of 4–10 were associated with increased risk (Figure 3e). Scores of 4–7 significantly increased CRD risk, with a plateau effect observed at scores 7–10 (Figure 3g). Only a score of 10 significantly increased CVD risk (Figure 3h), and no significant association was found between unhealthy lifestyle scores and cancer risk (Figure 3f).

3.4. Sensitivity Analyses

The models we used demonstrated significantly better performance and there was no evidence of violation of proportionality assumption (Table 4, File S2 Table S3, Table S7 and Figure S4). Findings between LPs and disease risk remained generally robust across multiple sensitivity analyses. Results were consistent after additional adjustment for other health conditions (File S2 Table S4), using complete-case analysis—although some SHR became non-significant and the association between the “Unhealthy but no substance use” pattern and CVDs became statistically significant (SHR = 1.19, 95% CI: 1.00–1.42) (File S2 Table S5) and when using Cox proportional hazard models (File S2 Table S6).
When mutually adjusting healthy and unhealthy lifestyle scores for each other, several associations became non-significant (File S2 Figure S2): between healthy lifestyle scores and T2D, healthy lifestyle scores and cancer, and unhealthy lifestyle scores and CVDs. Additionally, associations for lower lifestyle scores (healthy lifestyle scores and CRDs; unhealthy lifestyle scores and T2D) lost statistical significance, and the previously observed plateau effect for CRDs was no longer significant.

4. Discussion

In this large prospective Dutch cohort study, we investigated how LPs influence the incidence of four major NCDs and examined the linearity between lifestyle summation scores and disease risk. Our findings demonstrate that while generally two “Healthy” LPs were broadly protective, certain mixed “Unhealthy” patterns exhibited disease-specific associations. We also observed mostly linear associations between lifestyle scores and outcomes, alongside two non-linear associations, suggesting that combined lifestyle–disease relationships may be complex and involve threshold or saturation effects.

4.1. Association Between LPs and Disease Risk

Limited research has investigated LPs and the incidence of major NCDs comparatively. Consistent with prior studies, our healthy LPs showed broad protective effects. For instance, a study on LPs and T2D [22] found that both an “unhealthy lifestyle group” and a “poor diet and low physical activity group” were significantly associated with increased T2D risk compared with a “healthy lifestyle group”. Two other studies using principal component analysis (PCA) also linked LPs with disease outcomes [21,23], though results varied. One reported that a pattern characterised by “high socioeconomic status and low physical activity” was associated with elevated serum concentrations of branched-chain amino acids (BCAAs) [23], which increase T2D risk [54]. Our results differed: the “Healthy but physically inactive” pattern was associated with a lower T2D risk. Methodological differences may explain this. LPs used in our study were identified using LCA, a person-oriented approach that identifies subgroups of individuals based on shared characteristics across multiple lifestyle factors. In this context, the “Healthy but physically inactive” pattern reflects individuals with low physical activity levels who nonetheless engage in other healthy lifestyle behaviours (never smoking/drinking alcohol and healthy diet), highlighting the complex interplay of real-life behavioural patterns [20]. In contrast, PCA is a variable-oriented method that reduces dimensionality without necessarily capturing complete individual-level behavioural profiles. Therefore, the “high socioeconomic status and low physical activity” pattern identified by Manninen et al. may reflect limited information on other lifestyle behaviours. As a result, while their findings suggest a detrimental effect, our pattern—encompassing a broader set of healthy behaviours—was protective. Notably, our findings suggest that a healthy diet may partially offset the adverse impact of physical inactivity on T2D risk, highlighting the importance of investigating the combined effects of lifestyle factors. Although prior studies have shown that physical activity can eliminate the increased mortality risk associated with sedentary behaviour [55], it remains unclear whether a healthy diet can similarly compensate for the negative effects of physical inactivity on T2D causally. Our study points to a potential new direction for research in this area.
Whereas our findings demonstrate that generally two “Healthy” LPs were broadly protective, certain mixed “Unhealthy” patterns exhibited disease-specific associations. Disease-specific effects were observed in the “Unhealthy but no substance use” pattern, which was associated with an increased risk of T2D but a decreased risk of cancer. The contrasting effects of alcohol consumption on T2D and cancer risk may partly account for the disease-specific associations we observed. According to a meta-analysis, light to moderate alcohol consumption was associated with a lower risk of T2D, whereas heavy consumption showed little or no additional protective effect [56]. In contrast, alcohol consumption has been associated with an increased risk of cancer in a dose-dependent manner compared to non-consumption [57,58]. This inverse association between light to moderate alcohol consumption and T2D may also explain why the “Unhealthy but light drinking and never smoked” pattern played a protective role for T2D incidence. Notably, no significant associations were observed between the “Unhealthy but no substance use” or “Unhealthy but light drinking and never smoked” patterns and CRDs or CVDs when compared to the “Unhealthy” pattern. However, both patterns were significantly associated with increased risks of CRDs and CVDs when compared with the two “Healthy” LPs (see File S2 Table S2). The primary distinction among the “Unhealthy,” “Unhealthy but no substance use,” and “Unhealthy but light drinking and never smoked” patterns was substance use behaviour. This lack of difference in disease risk suggests that light alcohol consumption or the absence of smoking may not substantially influence the development of CRDs or CVDs in the context of an otherwise unhealthy lifestyle. Conversely, these findings indicate that alcohol consumption may play a particularly influential role in the associations observed with T2D and cancer compared with CRDs and CVDs. Further research is needed to confirm this assumption, which could provide valuable evidence to inform lifestyle intervention guidelines. These findings also suggest that the relationship between combined lifestyle factors and NCD risk is complex. This also raises the question of whether the association between lifestyle summation scores and disease risk are truly linear, especially when all lifestyle factors are equally weighted in these scores.

4.2. Linearity Between Lifestyle Summation Scores and Disease Risk

To our knowledge, this is the first Dutch study to examine the linearity between lifestyle summation scores and disease outcomes. One previous Norwegian study found non-linear associations between a healthy lifestyle index and the incidence of lung cancer and postmenopausal breast cancer, but not overall cancer incidence [29]. In our study, we did not observe non-linearity for overall cancer, possibly because grouping all cancers masked subtype-specific effects. Instead, we observed non-linear associations between healthy lifestyle scores and T2D risk, and between unhealthy scores and CRD risk. Both showed plateau effects, suggesting diminishing benefits beyond certain thresholds. Similar ceiling effects were reported in a Korean cohort of 2.4 million participants, where physical activity up to ~1000 MET-min/week reduced T2D prevalence but no further reduction occurred beyond that [59]. Such effects highlight the limitations of assuming linear dose–response relationships and suggest tailoring interventions to account for thresholds.
Several mechanisms may underlie these non-linear associations. First, behavioural mechanism—such as interactions among lifestyle factors—may contribute. Previous research has documented synergistic [18] and additive [60,61] effects, which can complicate risk estimates as behaviours accumulate and may generate non-linear risk patterns. Second, equal weighting in summation scores ignores evidence that lifestyle factors contribute unequally to disease risk [61,62]. For example, T2D risk is strongly influenced by diet, sedentary behaviour, and physical activity, but less by social connectedness [22,63]. In our study, healthier diet, limited screen time, and non-smoking were common at lower lifestyle scores, while social connection and alcohol behaviours dominated at higher scores (see File S2 Figure S3). This imbalance may explain why healthy lifestyle scores showed linear reductions in T2D risk at low–moderate levels but plateaued at higher levels. Third, biological pathways linking lifestyle behaviours to disease are often mediated through biomarkers. For example, one study found that albumin, HDL-C, triglycerides, apolipoprotein A, C-reactive protein, and HbA1c collectively explained 23% of the association between overall lifestyle behaviours and microvascular diseases [64]. Other mediators—such as inflammation, dyslipidaemia, and oxidative stress—may also contribute to the observed associations [65].

4.3. Insights for Implementation: Comparing LPs and Lifestyle Summation Scores

Addressing multiple behaviours simultaneously has been advocated since the 1970s [66], and evidence supports multiple health behaviour change (MHBC) interventions as more effective than single-behaviour approaches [62,67]. Comparing LPs and lifestyle summation scores offers insights for implementation.
Both LPs and lifestyle summation scores consistently show that healthy behaviours reduce the risk of major NCDs, with summation scores offering a straightforward framework for communication. Notably, we observed non-linear associations: reductions in T2D risk plateaued at higher summation scores, indicating that additional benefits become limited beyond a certain threshold. Nevertheless, promoting all healthy behaviours remains important, as substantial gains can be achieved at low-to-moderate adherence levels. Lifestyle summation scores are particularly useful for individual-level interventions, as they help balance achievable effort with meaningful risk reduction.
Regarding the discrepancies between LPs and lifestyle summation scores, LPs provide more nuanced, disease-specific information. Unlike summation scores, which generally show a monotonic association (more unhealthy behaviours, higher risk), LPs highlighted that certain behaviours within combined lifestyle patterns- such as alcohol consumption- are more prominent in T2D and cancer risk than for CRDs or CVDs. LPs also better capture real-life behavioural complexity and interdependence, making them more suitable for population-level strategies. Previous work showed that LPs correlate with socio-demographic factors [20], enabling identification of high-risk subgroups. Tailored lifestyle interventions that address the contextual and structural barriers of certain LPs could therefore enhance adherence and effectiveness. While the complexity of patterns may pose challenges for practical implementation, trained professionals can manage and translate these insights into actionable strategies.
Integrating insights from both approaches may enable more comprehensive and equitable strategies: summation scores for personalised, simple guidance, and LPs for structural, population-level interventions. Our finding also supports the development of disease-specific lifestyle scores that weigh factors according to their relative impact while considering biomarker effects. Such scores could better reflect nuanced relationships, improve risk prediction, and guide interventions.

4.4. Limitations and Strengths

This study has notable strengths. It is a large prospective cohort with a substantial sample size and long follow-up, enhancing statistical power and temporal inference. The use of theory-based domains from the Lifestyle Medicine framework strengthens conceptual clarity of both summation scores and LPs. Studying multiple NCDs in one population enabled cross-disease comparisons. Analyses also accounted for competing risks from death, improving disease-specific estimates.
Several limitations must be acknowledged. Lifestyle behaviours were self-reported and may be prone to recall bias or socially desirable responses. LPs were derived using LCA, which has inherent limitations such as potential class misclassification, sensitivity to model specifications, and dependence on the selected indicators. Although many covariates were adjusted for, residual confounding is possible. Lifestyle factors were measured only at baseline; changes over time were not captured, except partially for smoking. Behavioural changes due to early symptoms may have introduced reverse causation. Missing LPs were present; we imputed missing LPs using observed lifestyle indicators after deriving LPs from complete cases in the LCA. While this approach preserves real-life patterns, it may still introduce some distributional bias compared with imputing lifestyle factors prior to identifying LPs. However, sensitivity analyses showed minimal differences between the pre- and post-imputation datasets, supporting the robustness of our findings despite this limitation. In the sensitivity results, a few associations lost statistical significance. This attenuation likely reflects reduced statistical power in the complete-case datasets and the removal of information gained through imputation. Nonetheless, the direction and magnitude of the estimates generally remained stable, suggesting that the underlying relationships are not fundamentally altered. However, the loss of significance indicates that some associations may be less robust and should be interpreted with appropriate caution. Finally, grouping heterogeneous outcomes (e.g., cancers) into broad categories may obscure subtype-specific effects. Prior work shows lifestyle–disease associations vary across cancer types [57], so future studies should analyse subtypes separately.

5. Conclusions

The relationship between combined lifestyle factors and NCD risk is complex and partly non-linear, indicating diminishing benefits beyond certain thresholds, especially T2D and CRDs. While lifestyle summation scores are more straightforward for individual risk communication, LPs capture behavioural complexity and may be more effective for population-level prevention. Findings from both LPs and lifestyle summation scores consistently showed potential protective effects of healthy lifestyles on T2D, cancer, CRDs, and CVDs. However, the “Unhealthy but no substance use” demonstrated increased risk on T2D, protective effect on cancer and no significant effect on CRDs or CVDs, suggesting that the most influential lifestyle factors within combined behaviours may differ across diseases. Overall, these findings underscore the need for integrated, multidimensional lifestyle strategies in chronic disease prevention.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/nu17243883/s1, File S1: Definitions of confounders; File S2 Figure S1: Flowchart of exclusion for original datasets; Figure S2: Association curves between lifestyle summation scores and disease risk; Figure S3: Upset plot illustrating the prevalence of different combinations of healthy lifestyle factors; Figure S4: Cumulative incidence curves of diseases by LPs; Table S1: Comparisons between imputed datasets and original datasets; Table S2: Associations between LPs and disease risk using different reference groups; Table S3: AIC values for non-linear models of lifestyle summation scores and disease risk, with degrees of freedom ranging from 2 to 6; Table S4: Associations between LPs and disease risk; Table S5: Associations between LPs and disease risk, complete-case analysis; Table S6: Associations between LPs and disease risk; Table S7: Model Comparison with null models for LPs.

Author Contributions

Q.Z. and E.C. designed the research. Q.Z. cleaned and prepared most of the analytical dataset. R.O.-G. cleaned and prepared the disease variables. Q.Z. analysed the dataset. E.C. and G.L. supervised and confirmed the process of analysis. All authors discussed the results. Q.Z. and E.C. interpreted the results. Q.Z. wrote the paper. E.C. and G.L. supervised the research and reviewed the paper. All authors have read and agreed to the published version of the manuscript.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study used the data from the Lifelines Cohort Study. The Lifelines initiative has been made possible by subsidy from the Dutch Ministry of Health, Welfare and Sport, the Dutch Ministry of Economic Affairs, the University Medical Center Groningen (UMCG), Groningen University and the Provinces in the North of the Netherlands (Drenthe, Friesland, Groningen). Q.Z. acknowledges the support of a joint scholarship from the China Scholarship Council and the University of Groningen.

Institutional Review Board Statement

The Lifelines Cohort Study is conducted according to the principles of the Declaration of Helsinki and in accordance with the research code of the University Medical Center Groningen (UMCG). The Lifelines protocol was approved by the UMCG Medical Ethics Committee under number 2007/152 (approval date: 4 September 2014).

Informed Consent Statement

All participants in Lifelines cohort provided signed informed consent before entry. This study involved secondary analysis of previously collected data.

Data Availability Statement

Data may be obtained from a third party and are not publicly available. Researchers can apply to use the Lifelines data used in this study. More information about how to request Lifelines data and the conditions of use can be found on their website (https://www.lifelines-biobank.com/researchers/working-with-us (accessed on April 2023 with updating)).

Acknowledgments

The authors wish to acknowledge the services of the Lifelines Cohort Study, the contributing research centers delivering data to Lifelines, and all the study participants.

Conflicts of Interest

The authors declare that they have no competing interests.

Abbreviations

CVDsCardiovascular diseases
CRDsChronic respiratory diseases
T2DType 2 diabetes
LPsLifestyle patterns
NCDsNon-communicable diseases
COPDChronic obstructive pulmonary disease
LCALatent class analysis
FEV1Forced expiratory volume in one second
FVCForced vital capacity
MICEMultivariate imputation by chained equations
SHRSub-hazard ratios
HRHazard ratios
CIConfidence interval
SDStandard deviation
PCAPrincipal component analysis
BACCsBranched-chain amino acids
MHBCMultiple health behaviour change

References

  1. GBD 2015 Mortality and Causes of Death Collaborators. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: A systematic analysis for the Global Burden of Disease Study 2015. Lancet 2016, 388, 1459–1544. [Google Scholar] [CrossRef]
  2. World Health Organization. The Top 10 Causes of Death. Available online: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death (accessed on 19 November 2024).
  3. Mattiuzzi, C.; Lippi, G. Current Cancer Epidemiology. J. Epidemiol. Glob. Health 2019, 9, 217–222. [Google Scholar] [CrossRef] [PubMed]
  4. World Health Organization. Cardiovascular Diseases (CVDs). Available online: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) (accessed on 19 November 2024).
  5. GBD Chronic Respiratory Disease Collaborators. Prevalence and attributable health burden of chronic respiratory diseases, 1990-2017: A systematic analysis for the Global Burden of Disease Study 2017. Lancet Respir. Med. 2020, 8, 585–596. [Google Scholar] [CrossRef] [PubMed]
  6. Federation, I.D. IDF Diabetes Atlas, 10th ed. Available online: https://www.diabetesatlas.org (accessed on 11 November 2024).
  7. World Health Organizaion. Noncommunicable Diseases. Available online: https://www.who.int/news-room/fact-sheets/detail/noncommunicable-diseases?utm_source=chatgpt.com (accessed on 19 November 2024).
  8. Lifestyle Medicine Global Alliance. Lifestyle Medicine. Available online: https://lifestylemedicineglobal.org/lifestyle-medicine (accessed on 1 November 2025).
  9. Nyberg, S.T.; Batty, G.D.; Pentti, J.; Virtanen, M.; Alfredsson, L.; Fransson, E.I.; Goldberg, M.; Heikkila, K.; Jokela, M.; Knutsson, A.; et al. Obesity and loss of disease-free years owing to major non-communicable diseases: A multicohort study. Lancet Public Health 2018, 3, e490–e497. [Google Scholar] [CrossRef] [PubMed]
  10. Billingsley, M. New advice on physical activity aims to prevent chronic disease from early years. BMJ 2011, 343, d4457. [Google Scholar] [CrossRef]
  11. Budreviciute, A.; Damiati, S.; Sabir, D.K.; Onder, K.; Schuller-Goetzburg, P.; Plakys, G.; Katileviciute, A.; Khoja, S.; Kodzius, R. Management and Prevention Strategies for Non-communicable Diseases (NCDs) and Their Risk Factors. Front. Public Health 2020, 8, 574111. [Google Scholar] [CrossRef]
  12. Yang, I.A.; Jenkins, C.R.; Salvi, S.S. Chronic obstructive pulmonary disease in never-smokers: Risk factors, pathogenesis, and implications for prevention and treatment. Lancet Respir. Med. 2022, 10, 497–511. [Google Scholar] [CrossRef]
  13. Ai, S.; Zhang, J.; Zhao, G.; Wang, N.; Li, G.; So, H.C.; Liu, Y.; Chau, S.W.; Chen, J.; Tan, X.; et al. Causal associations of short and long sleep durations with 12 cardiovascular diseases: Linear and nonlinear Mendelian randomization analyses in UK Biobank. Eur. Heart J. 2021, 42, 3349–3357. [Google Scholar] [CrossRef]
  14. Ng, R.; Sutradhar, R.; Yao, Z.; Wodchis, W.P.; Rosella, L.C. Smoking, drinking, diet and physical activity-modifiable lifestyle risk factors and their associations with age to first chronic disease. Int. J. Epidemiol. 2020, 49, 113–130. [Google Scholar] [CrossRef]
  15. Tu, Z.-Z.; Lu, Q.; Zhang, Y.-B.; Shu, Z.; Lai, Y.-W.; Ma, M.-N.; Xia, P.-F.; Geng, T.-T.; Chen, J.-X.; Li, Y.; et al. Associations of Combined Healthy Lifestyle Factors with Risks of Diabetes, Cardiovascular Disease, Cancer, and Mortality Among Adults with Prediabetes: Four Prospective Cohort Studies in China, the United Kingdom, and the United States. Engineering 2023, 22, 141–148. [Google Scholar] [CrossRef]
  16. Weitzman, E.R.; Chen, Y.Y. The co-occurrence of smoking and drinking among young adults in college: National survey results from the United States. Drug Alcohol Depend. 2005, 80, 377–386. [Google Scholar] [CrossRef] [PubMed]
  17. Cockerham, W.C. Health lifestyle theory and the convergence of agency and structure. J. Health Soc. Behav. 2005, 46, 51–67. [Google Scholar] [CrossRef] [PubMed]
  18. Ding, D.; Rogers, K.; van der Ploeg, H.; Stamatakis, E.; Bauman, A.E. Traditional and Emerging Lifestyle Risk Behaviors and All-Cause Mortality in Middle-Aged and Older Adults: Evidence from a Large Population-Based Australian Cohort. PLoS Med. 2015, 12, e1001917. [Google Scholar] [CrossRef] [PubMed]
  19. Slattery, M.L.; Potter, J.D. Physical activity and colon cancer: Confounding or interaction? Med. Sci. Sports Exerc. 2002, 34, 913–919. [Google Scholar] [CrossRef]
  20. Zou, Q.; Ogaz-Gonzalez, R.; Du, Y.; Duan, M.J.; Lunter, G.; Corpeleijn, E. Non-random aggregations of healthy and unhealthy lifestyles and their population characteristics-pattern recognition in a large population-based cohort. Arch. Public Health 2025, 83, 192. [Google Scholar] [CrossRef]
  21. Navarro Silvera, S.A.; Mayne, S.T.; Risch, H.A.; Gammon, M.D.; Vaughan, T.; Chow, W.H.; Dubin, J.A.; Dubrow, R.; Schoenberg, J.; Stanford, J.L.; et al. Principal component analysis of dietary and lifestyle patterns in relation to risk of subtypes of esophageal and gastric cancer. Ann. Epidemiol. 2011, 21, 543–550. [Google Scholar] [CrossRef]
  22. Duan, M.J.; Dekker, L.H.; Carrero, J.J.; Navis, G. Lifestyle patterns and incident type 2 diabetes in the Dutch lifelines cohort study. Prev. Med. Rep. 2022, 30, 102012. [Google Scholar] [CrossRef]
  23. Manninen, S.; Tilles-Tirkkonen, T.; Aittola, K.; Mannikko, R.; Karhunen, L.; Kolehmainen, M.; Schwab, U.; Lindstrom, J.; Lakka, T.; Pihlajamaki, J.; et al. Associations of Lifestyle Patterns with Glucose and Lipid Metabolism in Finnish Adults at Increased Risk of Type 2 Diabetes. Mol. Nutr. Food Res. 2024, 68, e2300338. [Google Scholar] [CrossRef]
  24. Mourouti, N.; Mavrogianni, C.; Mouratidou, T.; Liatis, S.; Valve, P.; Rurik, I.; Torzsa, P.; Cardon, G.; Bazdarska, Y.; Iotova, V.; et al. The Association of Lifestyle Patterns with Prediabetes in Adults from Families at High Risk for Type 2 Diabetes in Europe: The Feel4Diabetes Study. Nutrients 2023, 15, 3155. [Google Scholar] [CrossRef]
  25. Schnermann, M.E.; Schulz, C.A.; Herder, C.; Alexy, U.; Nothlings, U. A lifestyle pattern during adolescence is associated with cardiovascular risk markers in young adults: Results from the DONALD cohort study. J. Nutr. Sci. 2021, 10, e92. [Google Scholar] [CrossRef]
  26. Li, R.; Li, R.; Xie, J.; Chen, J.; Liu, S.; Pan, A.; Liu, G. Associations of socioeconomic status and healthy lifestyle with incident early-onset and late-onset dementia: A prospective cohort study. Lancet Healthy Longev. 2023, 4, e693–e702. [Google Scholar] [CrossRef]
  27. Ding, J.; Fu, R.; Yuan, T.; Brenner, H.; Hoffmeister, M. Lifestyle scores and their potential to estimate the risk of multiple non-communicable disease-related endpoints: A systematic review. BMC Public Health 2025, 25, 293. [Google Scholar] [CrossRef]
  28. McAloney, K.; Graham, H.; Law, C.; Platt, L. A scoping review of statistical approaches to the analysis of multiple health-related behaviours. Prev. Med. 2013, 56, 365–371. [Google Scholar] [CrossRef] [PubMed]
  29. Chen, S.L.F.; Braaten, T.; Borch, K.B.; Ferrari, P.; Sandanger, T.M.; Nost, T.H. Combined Lifestyle Behaviors and the Incidence of Common Cancer Types in the Norwegian Women and Cancer Study (NOWAC). Clin. Epidemiol. 2021, 13, 721–734. [Google Scholar] [CrossRef] [PubMed]
  30. Aleksandrova, K.; Pischon, T.; Jenab, M.; Bueno-de-Mesquita, H.B.; Fedirko, V.; Norat, T.; Romaguera, D.; Knuppel, S.; Boutron-Ruault, M.C.; Dossus, L.; et al. Combined impact of healthy lifestyle factors on colorectal cancer: A large European cohort study. BMC Med. 2014, 12, 168. [Google Scholar] [CrossRef] [PubMed]
  31. Lelong, H.; Blacher, J.; Baudry, J.; Adriouch, S.; Galan, P.; Fezeu, L.; Hercberg, S.; Kesse-Guyot, E. Combination of Healthy Lifestyle Factors on the Risk of Hypertension in a Large Cohort of French Adults. Nutrients 2019, 11, 1687. [Google Scholar] [CrossRef] [PubMed]
  32. van Buuren, S.; Groothuis-Oudshoorn, K. mice: Multivariate Imputation by Chained Equations in R. J. Stat. Softw. 2011, 45, 1–67. [Google Scholar] [CrossRef]
  33. Klijs, B.; Scholtens, S.; Mandemakers, J.J.; Snieder, H.; Stolk, R.P.; Smidt, N. Representativeness of the LifeLines Cohort Study. PLoS ONE 2015, 10, e0137203. [Google Scholar] [CrossRef]
  34. Scholtens, S.; Smidt, N.; Swertz, M.A.; Bakker, S.J.; Dotinga, A.; Vonk, J.M.; van Dijk, F.; van Zon, S.K.; Wijmenga, C.; Wolffenbuttel, B.H.; et al. Cohort Profile: LifeLines, a three-generation cohort study and biobank. Int. J. Epidemiol. 2015, 44, 1172–1180. [Google Scholar] [CrossRef]
  35. von Elm, E.; Altman, D.G.; Egger, M.; Pocock, S.J.; Gotzsche, P.C.; Vandenbroucke, J.P.; Initiative, S. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: Guidelines for reporting observational studies. Prev. Med. 2007, 45, 247–251. [Google Scholar] [CrossRef]
  36. Hirshkowitz, M.; Whiton, K.; Albert, S.M.; Alessi, C.; Bruni, O.; DonCarlos, L.; Hazen, N.; Herman, J.; Katz, E.S.; Kheirandish-Gozal, L.; et al. National Sleep Foundation’s sleep time duration recommendations: Methodology and results summary. Sleep Health 2015, 1, 40–43. [Google Scholar] [CrossRef]
  37. Vinke, P.C.; Corpeleijn, E.; Dekker, L.H.; Jacobs, D.R., Jr.; Navis, G.; Kromhout, D. Development of the food-based Lifelines Diet Score (LLDS) and its application in 129,369 Lifelines participants. Eur. J. Clin. Nutr. 2018, 72, 1111–1119. [Google Scholar] [CrossRef]
  38. Brouwer-Brolsma, E.M.; Perenboom, C.; Sluik, D.; van de Wiel, A.; Geelen, A.; Feskens, E.J.; de Vries, J.H. Development and external validation of the ‘Flower-FFQ’: A FFQ designed for the Lifelines Cohort Study. Public Health Nutr. 2022, 25, 225–236. [Google Scholar] [CrossRef] [PubMed]
  39. Monteiro, C.; Cannon, G.; Levy, R.; Moubarac, J.-C.; Jaime, P.; Parra, A. NOVA. The star shines bright. World Nutr. 2016, 7, 28–38. [Google Scholar]
  40. Byambasukh, O. Physical Activity and Cardiometabolic Health: Focus on Domain-Specific Associations of Physical Activity over the Life Course. Ph.D. Thesis, University of Groningen, Groningen, The Netherlands, 2020. [Google Scholar]
  41. Rosmalen, J.G.; Bos, E.H.; de Jonge, P. Validation of the Long-term Difficulties Inventory (LDI) and the List of Threatening Experiences (LTE) as measures of stress in epidemiological population-based cohort studies. Psychol. Med. 2012, 42, 2599–2608. [Google Scholar] [CrossRef] [PubMed]
  42. Nieboer, A.; Lindenberg, S.; Boomsma, A.; Bruggen, A.C.V. Dimensions Of Well-Being And Their Measurement: The Spf-Il Scale. Soc. Indic. Res. 2005, 73, 313–353. [Google Scholar] [CrossRef]
  43. Pereira, S.; Katzmarzyk, P.T.; Gomes, T.N.; Borges, A.; Santos, D.; Souza, M.; dos Santos, F.K.; Chaves, R.N.; Champagne, C.M.; Barreira, T.V.; et al. Profiling physical activity, diet, screen and sleep habits in Portuguese children. Nutrients 2015, 7, 4345–4362. [Google Scholar] [CrossRef]
  44. Weller, B.E.; Bowen, N.K.; Faubert, S.J. Latent Class Analysis: A Guide to Best Practice. J. Black Psychol. 2020, 46, 287–311. [Google Scholar] [CrossRef]
  45. Amini, M.; Bashirova, D.; Prins, B.P.; Corpeleijn, E.; LifeLines Cohort, S.; Bruinenberg, M.; Franke, L.; Harst, P.V.; Navis, G.; Wolffenbuttel, B.H.; et al. Eosinophil Count Is a Common Factor for Complex Metabolic and Pulmonary Traits and Diseases: The LifeLines Cohort Study. PLoS ONE 2016, 11, e0168480. [Google Scholar] [CrossRef]
  46. Cortes-Ibanez, F.O.; van Pinxteren, B.; Sijtsma, A.; Bruggink, A.; Sidorenkov, G.; van der Vegt, B.; de Bock, G.H. The validity of self-reported cancer in a population-based cohort compared to that in formally registered sources. Cancer Epidemiol. 2022, 81, 102268. [Google Scholar] [CrossRef]
  47. Niebuur, J.; Vonk, J.M.; Du, Y.; de Bock, G.H.; Lunter, G.; Krabbe, P.F.M.; Alizadeh, B.Z.; Snieder, H.; Smidt, N.; Boezen, M.; et al. Lifestyle factors related to prevalent chronic disease multimorbidity: A population-based cross-sectional study. PLoS ONE 2023, 18, e0287263. [Google Scholar] [CrossRef] [PubMed]
  48. van der Ende, M.Y.; Hartman, M.H.; Hagemeijer, Y.; Meems, L.M.; de Vries, H.S.; Stolk, R.P.; de Boer, R.A.; Sijtsma, A.; van der Meer, P.; Rienstra, M.; et al. The LifeLines Cohort Study: Prevalence and treatment of cardiovascular disease and risk factors. Int. J. Cardiol. 2017, 228, 495–500. [Google Scholar] [CrossRef] [PubMed]
  49. International Statistical Classification of Diseases and Related Health Problems (ICD). Available online: https://www.who.int/standards/classifications/classification-of-diseases (accessed on 20 November 2024).
  50. Nummela, O.P.; Sulander, T.T.; Heinonen, H.S.; Uutela, A.K. Self-rated health and indicators of SES among the ageing in three types of communities. Scand. J. Public Health 2007, 35, 39–47. [Google Scholar] [CrossRef]
  51. Daw, J.; Margolis, R.; Wright, L. Emerging Adulthood, Emergent Health Lifestyles: Sociodemographic Determinants of Trajectories of Smoking, Binge Drinking, Obesity, and Sedentary Behavior. J. Health Soc. Behav. 2017, 58, 181–197. [Google Scholar] [CrossRef] [PubMed]
  52. van Buuren, S. Multiple imputation of discrete and continuous data by fully conditional specification. Stat. Methods Med. Res. 2007, 16, 219–242. [Google Scholar] [CrossRef]
  53. Fine, J.P.; Gray, R.J. A Proportional Hazards Model for the Subdistribution of a Competing Risk. J. Am. Stat. Assoc. 1999, 94, 496–509. [Google Scholar] [CrossRef]
  54. Moreno-Vedia, J.; Llop, D.; Rodriguez-Calvo, R.; Plana, N.; Amigo, N.; Rosales, R.; Esteban, Y.; Girona, J.; Masana, L.; Ibarretxe, D. Serum branch-chained amino acids are increased in type 2 diabetes and associated with atherosclerotic cardiovascular disease. Cardiovasc. Diabetol. 2023, 22, 249. [Google Scholar] [CrossRef]
  55. Ekelund, U.; Steene-Johannessen, J.; Brown, W.J.; Fagerland, M.W.; Owen, N.; Powell, K.E.; Bauman, A.; Lee, I.M.; Lancet Physical Activity Series 2 Executive Committee; Lancet Sedentary Behaviour Working Group. Does physical activity attenuate, or even eliminate, the detrimental association of sitting time with mortality? A harmonised meta-analysis of data from more than 1 million men and women. Lancet 2016, 388, 1302–1310. [Google Scholar] [CrossRef]
  56. Li, X.H.; Yu, F.F.; Zhou, Y.H.; He, J. Association between alcohol consumption and the risk of incident type 2 diabetes: A systematic review and dose-response meta-analysis. Am. J. Clin. Nutr. 2016, 103, 818–829. [Google Scholar] [CrossRef]
  57. Sandstrom, N.; Johansson, M.; Jekunen, A.; Andersen, H. Socioeconomic status and lifestyle patterns in the most common cancer types-community-based research. BMC Public Health 2023, 23, 1722. [Google Scholar] [CrossRef]
  58. Rehm, J.; Shield, K.D.; Weiderpass, E. Alcohol consumption. A leading risk factor for cancer. Chem. Biol. Interact. 2020, 331, 109280. [Google Scholar] [CrossRef] [PubMed]
  59. Jang, W.; Kim, S.; Son, Y.; Kim, S.; Lee, H.; Park, J.; Lee, K.; Kang, J.; Pizzol, D.; Hwang, J.; et al. Non-Linear Association Between Physical Activities and Type 2 Diabetes in 2.4 Million Korean Population, 2009–2022: A Nationwide Representative Study. J. Korean Med. Sci. 2025, 40, e42. [Google Scholar] [CrossRef] [PubMed]
  60. Viner, B.; Barberio, A.M.; Haig, T.R.; Friedenreich, C.M.; Brenner, D.R. The individual and combined effects of alcohol consumption and cigarette smoking on site-specific cancer risk in a prospective cohort of 26,607 adults: Results from Alberta’s Tomorrow Project. Cancer Causes Control 2019, 30, 1313–1326. [Google Scholar] [CrossRef] [PubMed]
  61. Lu, R.; Zhang, Y.; Chen, Y.; Lyu, J.; Li, R.; Kang, W.; Zhao, A.; Ning, Z.; Hu, Y.; Lin, X.; et al. The Joint Association of Sleep Quality and Outdoor Activity with Asthma and Allergic Rhinitis in Children: A Cross-Sectional Study in Shanghai. J. Asthma Allergy 2025, 18, 101–111. [Google Scholar] [CrossRef]
  62. Silva, C.C.; Presseau, J.; van Allen, Z.; Schenk, P.M.; Moreto, M.; Dinsmore, J.; Marques, M.M. Effectiveness of Interventions for Changing More Than One Behavior at a Time to Manage Chronic Conditions: A Systematic Review and Meta-analysis. Ann. Behav. Med. 2024, 58, 432–444. [Google Scholar] [CrossRef]
  63. Zhang, Y.; Pan, X.F.; Chen, J.; Xia, L.; Cao, A.; Zhang, Y.; Wang, J.; Li, H.; Yang, K.; Guo, K.; et al. Combined lifestyle factors and risk of incident type 2 diabetes and prognosis among individuals with type 2 diabetes: A systematic review and meta-analysis of prospective cohort studies. Diabetologia 2020, 63, 21–33. [Google Scholar] [CrossRef]
  64. Geng, T.; Zhu, K.; Lu, Q.; Wan, Z.; Chen, X.; Liu, L.; Pan, A.; Liu, G. Healthy lifestyle behaviors, mediating biomarkers, and risk of microvascular complications among individuals with type 2 diabetes: A cohort study. PLoS Med. 2023, 20, e1004135. [Google Scholar] [CrossRef]
  65. Kim, U.J.; Choi, E.J.; Park, H.; Lee, H.A.; Park, B.; Kim, H.; Hong, Y.; Jung, S.; Park, H. The Mediating Effect of Inflammation between the Dietary and Health-Related Behaviors and Metabolic Syndrome in Adolescence. Nutrients 2022, 14, 2339. [Google Scholar] [CrossRef]
  66. Belloc, N.B.; Breslow, L. Relationship of physical health status and health practices. Prev. Med. 1972, 1, 409–421. [Google Scholar] [CrossRef]
  67. Silva, C.C.; Presseau, J.; van Allen, Z.; Dinsmore, J.; Schenk, P.; Moreto, M.; Marques, M.M. Components of multiple health behaviour change interventions for patients with chronic conditions: A systematic review and meta-regression of randomized trials. Health Psychol. Rev. 2025, 19, 200–255. [Google Scholar] [CrossRef]
Figure 1. A flowchart of participants exclusion. The stacked boxes represent four separate analysis datasets specified at the bottom of the figure; We first compiled all required variables into a single dataset from the Lifelines cohort and then divided it into four disease-specific analysis datasets, each excluding participants with only one target disease at baseline. To estimate disease incidence, participants were followed from baseline until incident diagnosis or death. * Complete losses of follow-up mean lost to follow-up in wave 2 (2014–2019) and wave 3 (2019–2024).
Figure 1. A flowchart of participants exclusion. The stacked boxes represent four separate analysis datasets specified at the bottom of the figure; We first compiled all required variables into a single dataset from the Lifelines cohort and then divided it into four disease-specific analysis datasets, each excluding participants with only one target disease at baseline. To estimate disease incidence, participants were followed from baseline until incident diagnosis or death. * Complete losses of follow-up mean lost to follow-up in wave 2 (2014–2019) and wave 3 (2019–2024).
Nutrients 17 03883 g001
Figure 2. A methodological framework of this study.
Figure 2. A methodological framework of this study.
Nutrients 17 03883 g002
Figure 3. Association curves between lifestyle summation scores and disease risk. Light blue and light orange zones showed corresponding 95% confidence intervals. (a,e), risk for type 2 diseases; (b,f) risk for cancer; (c,g) risk for respiratory diseases; and (d,h) risk for cardiovascular disease. All models adjusted for age (median), sex (female), marital status (married/cohabiting), income (1500–1900), highest education (upper secondary) and employment status (full-time employed).
Figure 3. Association curves between lifestyle summation scores and disease risk. Light blue and light orange zones showed corresponding 95% confidence intervals. (a,e), risk for type 2 diseases; (b,f) risk for cancer; (c,g) risk for respiratory diseases; and (d,h) risk for cardiovascular disease. All models adjusted for age (median), sex (female), marital status (married/cohabiting), income (1500–1900), highest education (upper secondary) and employment status (full-time employed).
Nutrients 17 03883 g003
Table 1. Baseline characteristics of socio-demographic variables and follow-up information across each dataset.
Table 1. Baseline characteristics of socio-demographic variables and follow-up information across each dataset.
T2D DatasetCancer DatasetCRDs DatasetCVDs Dataset
Sample size114,919131,24891,77777,645
Disease incidences 3114468541332850
Incidence rate a321511603485
Years of follow-up9.3 (4.8, 10.8)5.9 (4.0, 10.0)8.5 (4.0, 10.3)8.7 (4.0, 10.3)
Age, year45.6 ± 12.945.4 ± 13.046.3 ± 13.045.3 ± 12.9
Sex, male47,570 (41.4)54,656 (41.6)38,164 (41.6)34,024 (43.8)
Education attainment
Elementary2799 (2.4)3622 (2.8)2245 (2.5)1842 (2.4)
Lower secondary29,805 (25.9)34,719 (26.5)23,863 (26.0)19,706 (25.4)
Upper secondary44,450 (38.7)50,670 (38.6)35,097 (38.3)30,010 (38.7)
Tertiary35,727 (31.1)39,720 (30.3)28,787 (31.4)24,720 (31.8)
Others2138 (1.9)2517 (1.9)1719 (1.9)1367 (1.8)
Net income, Euro
lower than 110019,478 (17.0)22,903 (17.5)14,842 (16.2)12,736 (16.4)
1100 to 150026,218 (22.8)29,724 (22.7)20,828 (22.7)17,715 (22.8)
1500 to 190029,814 (25.9)33,351 (25.4)23,952 (26.1)20,305 (26.2)
Higher than 190023,250 (20.2)26,396 (20.1)19,191 (20.9)16,144 (20.8)
I don’t know this 4294 (3.7)5014 (3.8)3380 (3.7)2843 (3.7)
I don’t want to tell11,865 (10.3)13,860 (10.6)9518 (10.4)7902 (10.2)
Partner relationships
Married/cohabiting92,564 (80.6)105,190 (80.2)74,416 (81.1)62,477 (80.5)
Have partner but no-cohabiting5967 (5.2)7001 (5.3)4499 (4.9)4067 (5.2)
No partner15,433 (13.4)17,896 (13.6)12,062 (13.2)10,427 (13.4)
Others955 (0.8)1161 (0.9)734 (0.8)674 (0.9)
Employment status
Full-time employment51,516 (44.8)58,766 (44.8)40,528 (44.2)36,408 (46.9)
Retired10,791 (9.4)12,663 (9.7)9646 (10.5)7125 (9.2)
Housewife/husband7636 (6.6)8791 (6.7)6300 (6.9)4889 (6.3)
Student5248 (4.6)6297 (4.8)3918 (4.3)3782 (4.9)
Not employed6023 (5.2)6994 (5.3)4483 (4.9)3622 (4.7)
Part-time employment30,201 (26.3)33,723 (25.7)23,992 (26.2)19,546 (25.2)
Less than 12 h/week3504 (3.1)4014 (3.1)2844 (3.1)2273 (2.9)
Values are mean ± SD, n (%) or median (quartile 25%, quartile 75%). a Unit: 100,000 person-year.
Table 2. Disease incidence in each LP and lifestyle summation score across each dataset.
Table 2. Disease incidence in each LP and lifestyle summation score across each dataset.
T2D DatasetCancer DatasetCRDs DatasetCVDs Dataset
New cases/n% New cases/n%New cases/n% New cases/n%
LPs
Healthy in a balanced way1074/45,03239.22171/49,855381618/37,05140.41320/30,59439.4
Healthy but physically inactive393/13,53711.8634/14,52911.1523/11,06712.1316/883111.4
Unhealthy but no substance use 329/89917.8369/11,8749.1369/70157.7201/59387.7
Unhealthy but light drinking and never smoked801/32,45728.2971/37,69228.71107/25,53927.9658/22,45528.9
Unhealthy517/14,90213540/17,29813.2514/11,03912355/982712.7
Healthy lifestyle scores
056/1182161/13831.146/8470.935/6930.9
1314/80617305/94637.2335/61766.7207/51016.6
2602/19,21816.7737/21,94016.7759/14,80916.2474/12,58116.2
3720/26,06922.71024/29,91422.8954/20,63422.5645/17,53722.6
4600/24,55521.41000/27,91121.3835/19,66821.5586/16,74221.6
5395/17,93815.6742/20,53015.6637/14,79316.1448/12,37615.9
6253/10,5289.2483/11,8939.1356/86279.4296/74219.6
7117/50314.4237/56574.3156/41724.6111/35634.6
845/18141.671/19891.544/15271.738/12401.6
912/4540.421/5000.4<10/3950.4<10/3410.4
10<10/69<0.1<10/680.05<10/63<0.1<10/50<0.1
Unhealthy lifestyle scores
0170/93378.1434/10,2167.8272/79828.7262/68038.8
1533/21,853191054/24,40918.6717/18,34720571/15,38819.8
2662/27,42423.91190/31,11123.7949/22,21824.2724/18,60224
3640/23,52220.5954/26,95820.5894/18,64520.3560/15,74320.3
4517/16,55114.4552/19,42114.8673/12,75613.9377/11,00614.2
5293/93398.1298/11,0758.4362/69677.6204/59487.7
6193/45634131/53174.1176/32493.5102/28033.6
777/17171.553/20181.574/11541.333/10071.3
824/5060.415/5920.513/3180.414/2820.4
9<10/102<0.1<10/123<0.1<10/69<0.1<10/60<0.1
10<5/5<0.1<8/8<0.1<6/6<0.1<3/3<0.1
%—the prevalence of each category of LPs/scores within each dataset. To comply with Lifelines’ data use agreements, we refrain from reporting specific case counts for categories with fewer than 10 individuals.
Table 3. Associations between LPs and disease risk.
Table 3. Associations between LPs and disease risk.
T2D IncidenceCancer IncidenceCRDs Incidence CVDs Incidence
LPs
Unhealthy1 (ref)1 (ref)1 (ref)1 (ref)
Healthy in a balanced way0.54 (0.49–0.60)0.87 (0.79–0.96)0.79 (0.71–0.88)0.91 (0.81–1.03)
Healthy but physically inactive0.60 (0.52–0.69)0.82 (0.73–0.93)0.68 (0.60–0.77)0.82 (0.70–0.96)
Unhealthy but no substance use1.27 (1.11–1.47)0.85 (0.74–0.97)1.01 (0.88–1.16)1.16 (0.97–1.38)
Unhealthy but light drinking and never smoked0.89 (0.79–0.99)0.91 (0.82–1.02)0.95 (0.86–1.06)1.05 (0.92–1.20)
All models adjusted for age, sex, education attainment, marital status, income and employment status. It reports sub-hazard ratios (SHR) (95% confidence interval). Bold means the SHR met statistical significance: p < 0.05.
Table 4. Comparison of linear and optimal non-linear models for the association between lifestyle summation scores and disease risk.
Table 4. Comparison of linear and optimal non-linear models for the association between lifestyle summation scores and disease risk.
Degree of FreedomLog-Likelihoodχ2dfp Value
Healthy lifestyle scores
T2DLinear model1−32,499.326.4010.011 *
Optimal non-linear model 2−32,496.12
Cancer Linear model1−49,359.52010.948
Optimal non-linear model 2−49,359.52
CRDsLinear model1−44,094.377.4830.058
Optimal non-linear model 4−44,090.63
CVDsLinear model1−28,666.564.5130.211
Optimal non-linear model 4−28,664.30
Unhealthy lifestyle scores
T2DLinear model1−32,456.714.6630.198
Optimal non-linear model 4−32,454.38
Cancer Linear model1−49,372.542.4420.295
Optimal non-linear model 3−49,371.32
CRDsLinear model1−44,095.227.4820.006 **
Optimal non-linear model 3−44,090.06
CVDsLinear model1−28,665.180.6310.428
Optimal non-linear model 2−28,664.86
df—degree of freedom. Statistical method was log-likelihood ratio test. * p < 0.05; ** p < 0.01. Optimal models in this table were selected from five models based on the lowest AIC, see File S2, Table S3.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zou, Q.; Ogaz-González, R.; Lunter, G.; Corpeleijn, E. Lifestyle Patterns and Incidence of Cardiovascular Diseases, Cancer, Respiratory Diseases, and Type 2 Diabetes: A Large-Scale Prospective Cohort Study. Nutrients 2025, 17, 3883. https://doi.org/10.3390/nu17243883

AMA Style

Zou Q, Ogaz-González R, Lunter G, Corpeleijn E. Lifestyle Patterns and Incidence of Cardiovascular Diseases, Cancer, Respiratory Diseases, and Type 2 Diabetes: A Large-Scale Prospective Cohort Study. Nutrients. 2025; 17(24):3883. https://doi.org/10.3390/nu17243883

Chicago/Turabian Style

Zou, Qian, Rafael Ogaz-González, Gerton Lunter, and Eva Corpeleijn. 2025. "Lifestyle Patterns and Incidence of Cardiovascular Diseases, Cancer, Respiratory Diseases, and Type 2 Diabetes: A Large-Scale Prospective Cohort Study" Nutrients 17, no. 24: 3883. https://doi.org/10.3390/nu17243883

APA Style

Zou, Q., Ogaz-González, R., Lunter, G., & Corpeleijn, E. (2025). Lifestyle Patterns and Incidence of Cardiovascular Diseases, Cancer, Respiratory Diseases, and Type 2 Diabetes: A Large-Scale Prospective Cohort Study. Nutrients, 17(24), 3883. https://doi.org/10.3390/nu17243883

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop