Next Article in Journal
Dietary Behaviors, Digestive Symptoms, and Neurovegetative Features in Disorders of Gut–Brain Interaction: A Cross-Sectional Clinical Study
Previous Article in Journal
Fermented Dairy Food Intake and Risk of Depression and Dementia in Later Life: Findings from a Prospective Cohort of Older Australians
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Ecological Correlates of Differences in Mean Age at Death Across Nearly Extinct Cohorts: The Role of Dietary Habits

by
Alessandro Menotti
1,
Paolo Emilio Puddu
1,2,*,
David R. Jacobs, Jr.
3,
Anthony Kafatos
4,
Miodrag Ostojic
5 and
Hanna Tolonen
6
1
Association for Cardiac Research, Via Voghera, 31, 00182 Rome, Italy
2
EA 4650, Signalisation, Électrophysiologie et Imagerie des Lésions d’Ischémie Reperfusion Myocardique, Université de Normandie, 14000 Caen, France
3
Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, MN 55455, USA
4
Department of Social Medicine, Preventive Medicine and Nutrition Clinic, University of Crete, 70013 Heraklion, Greece
5
Institute for Cardiovascular Diseases “Dedinje”, 11040 Belgrade, Serbia
6
Department of Public Health and Welfare, Finnish Institute for Health and Welfare, 00300 Helsinki, Finland
*
Author to whom correspondence should be addressed.
Nutrients 2026, 18(7), 1021; https://doi.org/10.3390/nu18071021
Submission received: 4 February 2026 / Revised: 3 March 2026 / Accepted: 18 March 2026 / Published: 24 March 2026
(This article belongs to the Section Nutrition and Public Health)

Abstract

Objectives. Our objective was to study the ecological relationship of many risk factors and personal characteristics with mean age at death (AD) after a 50-year follow-up of nearly extinct cohorts. Material and Methods. There were 16 cohorts totaling 12,763 middle-aged men enrolled in the Seven Countries Study (SCS), and 58 variables were measured, including traditional risk factors, dietary nutrition and anthropometric variables. A follow-up of 50 years allowed the use of AD as the end-point. Analysis included simple linear regression correlation and multivariate modelling using Principal Component Analysis and regression and Ridge regression. Results. Out of 58 variables, only 11 (10 nutrition-dietary items plus age) showed a significant linear correlation coefficient (R) ≥ 0.50 and a p value ≤ 0.05. Linear regression was computed by using as a predictor the dietary factor score derived from a Principal Component Analysis of the 11 significant variables, which were used as independent variables, whose coefficients were significantly related with AD, and the final R2 was 0.52. The Principal Component regression and Ridge regression documented the direct relationship of food groups of vegetable origin (including olive oil) with the AD and the inverse relationship for food groups of animal origin. Conclusions. A few variables, all related to diet and nutrition, were able to statistically explain about 50% of the different AD in 16 cohorts of men followed up with nearly until death. Other variables, including traditional cardiovascular disease risk factors, did not contribute in a significant way for this purpose.

1. Introduction

Age at death (AD) is a simple and old demographic metric that has recently been re-evaluated [1,2,3,4,5]. Traditionally, it is used in demography, exploiting national official mortality data to study increasing longevity, to estimate the modelling of age and its distribution in pediatric and sport medicine in relation with the occurrence of sudden and unexpected death in infants and athletes, and in forensic medicine with the purpose of estimating age at death using complex autopsy procedures.
There is a great interest in longevity that has been increasing worldwide in recent decades. Among many existing studies, some are devoted to modelling the population structure [6], others are interested in the changing social and health problems among the new elderly population [7], and others try to identify the possible origins of increasing longevity in dietary habits [8].
In this study, our interest instead deals with the attempt to explain differences in longevity, expressed by AD across cohorts of people of the same sex and age range and living in the same historical period, examined directly through multiple standardized procedures and criteria and followed up with until near death, with the purpose of identifying which entry characteristics explain differences in AD.
In field epidemiology, AD can be used only if the study population cohorts have reached death or nearly death (usually with less than 10% of survivors) since otherwise, its meaning is deformed, not interpretable and difficult to compare across cohorts. On the other hand, there are a few investigations in which participants were followed up with until the cohort’s death, like in a Japanese study, where autopsy was performed in almost all participants [9], but this was a clinical exercise of about 800 consecutive patients enrolled at hospital admission and not a population study.
The Seven Countries Study of Cardiovascular Diseases (SCS) started in 1958, enrolling 16 cohorts of middle-aged men, and has reached exceptionally long follow-up for mortality, approaching extinction, which has allowed for the study of AD from different points of view. So far, little effort has been made to explain the different levels of AD across the enrolled cohorts based on an ecological approach, and only one dedicated paper was published in this area [10]. In the present investigation, we used the largest number of possible determinants of different conceptual features to explain the relatively large difference in AD across the 16 cohorts.

2. Material and Methods

Populations and Measurements. The 16 cohorts enrolled and first examined between 1958 and 1964 were located in 7 countries: USA, Finland, the Netherlands, Italy, former Yugoslavia (separately for the federate nations of Croatia and Serbia), Greece and Japan, for a total of 12,763 men aged 40–59 years. Ten cohorts were located in rural communities, 1 in a fishing community, and 4 in defined occupational groups, and 1 was a random sample of a small town. Details can be found elsewhere [11,12]. At the baseline examination, a large number of risk factors and personal characteristics were measured in all men [11,12]. In the early phases of the study, a complex dietary survey was run in subsamples of each cohort, allowing for the recording of many food groups and the chemical measurement of some basic nutrients in portions of food taken in the homes of the participants [10]. More details and related analyses have been reported elsewhere [13,14,15].
For the purpose of this analysis, 58 variables were identified as possible determinants of AD differences: 13 traditional cardiovascular risk factors, all expressed as the average from all men measured in the entry examination of the study [11]; 20 simple food groups; 4 combined food groups; 13 nutrients; and 8 ratios of food groups or nutrients and Dietary Scores, all measured in the dietary study mentioned above (expressed as their average to represent each cohort) [10,11,12,15]. The full list of all 58 variables is reported in Table A1 of Appendix A. The list of the major variables used in this analysis, together with their units of measurement and their abbreviations mainly adopted in further tables, is reported in Table 1.
For the measurement procedures of the possible determinants, details can be found in the quoted references, but for the 11 determinants individually associated with AD, some details are reported below:
(a)
Age (in years), measured in all men, approximated to the nearest birthday and expressed in years (average for each cohort);
(b)
Energy intake, derived from the dietary study on subsamples of each cohort, using the list of food groups and chemical measurements of food samples, expressed in calories per day (average for each cohort);
(c)
Intake of liquid fat (all kind of oils), fruit, olive oil, hard fats and the sum of all vegetable food groups consumed, derived from the dietary study on subsamples of each cohort, from the list of food groups, expressed in grams per day and adjusted for 1000 calories (average for each cohort);
(d)
M/S ratio (monounsaturated/saturated fat ratio), derived from the dietary study on subsamples of each cohort and from the list of nutrients and chemical measurements (average for each cohort);
(e)
MP/ST ratio (monounsaturated + polyunsaturated fat/saturated fat + trans-fat), derived from the dietary study on subsamples of each cohort and from the list of food groups and chemical measurements (average for each cohort);
(f)
Dietary thrombogenicity index (THI) following the rules from Ulbricht and Southgate [16] involving 10 types of fatty acids, expressed in arbitrary units (average for each cohort);
(g)
Dietary inflammation index (INF), derived from the dietary study using energy, 24 nutrients and 26 simple or combined food groups as described in the pertinent reference, expressed in arbitrary units [10] (average for each cohort).
Mortality data were systematically collected and coded. For the purpose of this analysis the time horizon for mortality and consequently AD was fixed after the first 50 years of follow-up. At that time, mortality data and AD were practically complete for 11 cohorts, while they were available for shorter follow-up periods for the other 5 cohorts. Therefore, we adopted the procedure already successfully tested in other analyses [17,18], whereby incomplete 50-year mortality rates and AD for cohorts with shorter follow-up periods were estimated using regression equations derived from the 11 cohorts with complete mortality data for 50 years. The regression included as dependent variables the rates and AD of cohorts with a 50-year follow-up and as independent variables the 25- or 45-year follow-up data of the same cohorts. The 50-year estimates for the 5 cohorts with shorter follow-up periods were obtained by incorporating the 25- or 45- year data of these cohorts into the equation.
Death rates were expressed as per 1000 person/year in 50 years, and AD was expressed in years and computed only for cases of death. In this way, all-cause mortality covered about 97% of the denominator.
Statistical analysis. A descriptive analysis was run and tabulated, presenting cohort death rates and AD. A univariate analysis was computed, producing simple linear correlation coefficients (R) of all possible correlates (n = 58) versus AD. A correlation matrix was calculated across the 11 variables that were significantly associated with AD in the univariate analysis.
Multivariate analysis was performed using various types of models and using AD as the end-point (dependent variable) and adopting the following steps:
  • We computed all possible Multiple Linear Regressions (MLRs) using a maximum of 5 variables for a total of 131 MLRs.
  • The majority of the above models presented multicollinearity problems; the models were discarded, and as a consequence, we adopted two different steps.
  • Principal Component Analysis (PCA) was conducted involving the 11 significant variables in the univariate analysis. Starting from the correlation matrix of the 11 variables, this procedure produced a factor score (called Dietary Score) that was used to predict AD in a subsequent linear regression (with a single independent variable).
  • Then, we used the Principal Component Regression and the Ridge regression, which smooth the collinearity problems. A limited number of variables were used in the models, based on arbitrary a priori choice and/or convenience.

3. Results

Table 2 reports the mean levels of all-cause mortality and AD for each of the 16 cohorts where a relatively larger variation was found for cohort death rate than for AD. The highest levels of AD were found in Crete (Greece), Belgrade (Serbia, former Yugoslavia) and Rome Railroad cohorts (Italy); the lowest levels were found in Slavonia (Croatia, former Yugoslavia) and in East Finland, with a difference of over 10 years between the extremes. Death rates were inversely correlated with AD, with an R value of −0.96 (p < 0.0001), suggesting that higher death rates led to lower AD.
The first screening, consisting of a univariate analysis with the computation of simple linear regression coefficients (R) of each possible determinant versus AD, showed that out of 58 variables, only 11 had R ≥ 0.50 (p ≤ 0.05) (Table 3). Those 11 variables were age at baseline, LIF, FRU, OLI, VEG, ENE, HAF, M/S ratio, MP/ST ratio, THI and INF. Further analysis was concentrated on these variables. Among the dietary nutrition factors, the estimate of partial correlation was limited to each of the first five variables with the highest Rs, while the other four played the role of confounders. With this approach, all variables lost their significance except LIF, suggesting the existence of strong correlations across the five variables. The correlation matrix of the 11 variables (Table 4) indicates the existence of high R values, mainly those involving some variables like VEG, INF, M/S and MP/ST.
The detailed levels of the 11 predictive variables (Table 5) in each of the 16 cohorts is not easy to read and interpret. However, important information is provided by the coefficient of variation (standard deviation expressed as % of average) that is larger than the average (more than 100%) for LIF, OLI, FRU and INF, indicating substantial differences across cohorts in these variables and being a prerequisite of the possibility that these variables might play valuable roles in the association with the end-point.
To overcome the difficulties in exploiting all the significant variables in the multiple regression models, we computed a model with AD as dependent variable and as predictive variables the dietary factor scores of each cohort derived from a Principal Component Analysis (PCA) of the 11 individually significant variables. The PCA factor scores (called Dietary Score) are reported in the right-end column of Table 3. The univariate regression model predicting AD showed that lower levels of score coefficients were associated with higher AD and vice versa. The original equation was AD = 74.91 − 1.83 Dietary Score, with R = 0.72 and R2 = 0.52. Figure 1 reports the correlation between Dietary Score and AD. In Table 3, there are two groups of Rs with the opposite algebraic sign, i.e., positive and negative, and the same two groups can be found in the columns of factor scores of the same table but characterized by opposite algebraic signs in a coherent way.
The singular fact is that the two groups were identified by the Rs that had an end-point as the dependent variable, while in the PCA, the same two groups (carrying opposite algebraic signs) were created only on the basis of their inter-correlation, without reference to an end-point. The R between univariate Rs of 10 variables and the correspondent Dietary Score was −0.99. This relationship is reported in Figure 2, where two small very compacted clouds of four and six points are located far away from each other, representing variables inversely and directly related to AD, respectively.
The Principal Component regression and the Ridge regression were run in parallel after an arbitrary choice of 6 among the 11 significant variables with significant linear correlation, i.e., 3 variables directly related and, separately, 3 variables inversely related to AD. Moreover, we considered models with two variables having opposite relationships with AD. In a preliminary check of the corresponding correlation matrix, the Variance Inflation Factor was less than 10, and no collinearity problems were detected in these groups. The Principal Component regression produced reasonable findings only when one component was omitted, and the findings are reported in the upper section of Table 6.
As a consequence of this constraint, and following a systematic search, the Ridge regression findings reported in the lower section of Table 6 are those most similar to the ones of the Principal Component regression, corresponding to the choice of k = 0.00001.
The regression coefficients in all models’ approaches maintained the expected algebraic sign (positive for the variables directly related with AD and negative for the variables inversely related with AD), and this was true also in the models with two variables of opposite type. The p values, derived from the analysis of variance, were highly significant for the group comprising LIF, FRU and VEG, while the p values were only close to significant for the group comprising HAF, THI and INF. Larger p values were seen in the models with two variables of opposite type. As an example, the relationship of observed and estimated AD for the Ridge model dealing with LIF and HAF is reported in Figure 3 (R = 0.72).

4. Discussion

This analysis was based on a unique approach to the problem of longevity and its determinants that encompasses overall mortality, its components and its time of appearance using the metrics of AD. As a consequence, it cannot be applied to experimental studies. On the other hand, it is not guaranteed that the possible improvements in health care during the long follow-up period were similar across the various cohorts. However, the renowned high standard of the Finnish health care system did not prevent, mainly in the cohort of East Finland, the occurrence of low levels of AD.
Despite the exploration of many correlates of AD possibly capable of explaining cohorts’ differences in AD, the number of those producing significant correlations in the univariate analysis is rather few. Moreover, besides baseline age, 10 of 11 consisted of dietary nutrition items. The coefficient of age suggested that younger cohorts tended to reach higher AD, but this might be largely due to the relatively low entry age of some cohorts (like Belgrade and Rome Railroad) that also had very high or relatively high AD. The Belgrade cohort (university professors) was an outlier as it combined the lowest entry age with the second highest AD. In any case, in this ecological analysis, age at entry played an opposite role compared with that observed in analyses based on single individuals, where higher baseline age was associated with higher probabilities to reach high AD [19,20]. Moreover, the role of entry age could be partly explained by the “healthy worker effect” that, on the other hand, was not so clear in the US occupational Railroad cohort and absolutely absent in the Zrenjanin occupational cohort. More generally, the role of age at entry cannot be interpreted like other determinants since in multivariate models, it mainly represents an adjustment factor for the spread on age at entry examination (40 to 59 years).
ENE intake had an important predictive role. Sometimes it is interpreted as energy expenditure starting from the assumption of a metabolic balance, but in our case, its measurement was taken only based on food groups and nutrients. LIF, OLI, VEG and FRU intake plus the MP/ST ratio and the M/S ratio are among the food groups and ratios with the highest positive R values, which contrasted with the negative R values of ENE, HAF, THI, and INF (all inversely related to AD). The first group is also strongly related with the Mediterranean Diet, whose characteristics can also be seen in the positive role of the M/S and MP/ST ratios and the inverse association with THI and INF. In fact, INF was constructed on the basis of its positive associations with energy, fat, and saturated fat and its negative association with vegetables, fruit, olive oil, liquid fat, the sum of vegetable foods, and the ratio of vegetal/animal food [10]. By itself, a high inflammation index is a mediator between external causes and the initiation and development of many chronic diseases, and in this case, the food groups and nutrients directly or inversely associated with them may represent some of the external causes in different ways.
In general, the correlates of variables selected as significant in the univariate analysis should also be considered for interpretation. The most typical example is provided by the strong and significant role of LIF, which is an indicator of the Mediterranean diet but also a variable closely related with others typical for the Mediterranean diet, as seen from our correlation matrix. Incidentally, LIF has an R with AD of 0.70, and its components, i.e., olive oil and all seed oils, have R values of 0.63 and 0.17, respectively, suggesting that there is an advantage of combining them, although olive oil represents 84% and seed oils represent 16% of total liquid fat. A plausible takeaway message from these analyses is that several features of the Mediterranean diet are positively associated with longevity.
A few classic CVD risk factors never entered the list with significantly high Rs, not even blood pressure and serum cholesterol or smoking. One of the possible reasons might be the limited variance of mean blood pressure across the 16 cohorts (standard deviation equal to 4% of mean), while a larger but still small variance was found for serum cholesterol (standard deviation equal to 16% of mean). However, we know that serum cholesterol is predictive only of coronary heart disease (CHD), and this condition represents a substantial but not exceedingly high contribution to mortality, except in Finland. On the other hand, the association of serum cholesterol with 50-year CHD mortality was very large across the 16 cohorts; CHD mortality was higher than 10 per 1000 person/years in 5 of 16 cohorts, while it was definitely lower in the other 11 cohorts, and the proportion of CHD over all-cause mortality was greater than 25% only in the same 5 cohorts [18].
The coefficient of variation for smoking prevalence was also relatively low (14%), describing a limited variance at the ecologic level of this lifestyle habit across cohorts, probably obscuring its role in the association with AD. However, some findings about the role of traditional CVD risk factors have already been published from the Italian rural areas of the same study [19] and partly from the group of the 10 SCS cohorts that have reached death [20]. An inverse (although not significant) role versus AD of smoking habits and of systolic blood pressure was found at the ecological level, while the analysis on individuals indicated a direct association of vigorous physical activity and an adverse association of smoking habits, serum cholesterol, systolic blood pressure and heart rate. It is therefore clear that the same risk factors remain predictive of CVD and other conditions at the individual level and that the conclusion of this analysis cannot be directly applied to individuals.
A major problem of this analysis consisted of the difficult choice and handling of multivariate models and the need to exclude the linear multiple regression due to the small number of statistical units and due to the multicollinearity observed in the linear models. Beyond the technical limitation to comply with the small number of statistical units, each variable tends to be largely associated with the other variables either in a direct or inverse way, as clearly shown in the correlation matrix of Table 4. This means that each variable carries, indirectly, some information from the others.
The first problem was obviously impossible to solve, while for the second one, a partial solution was derived from the use, as a single covariate, of the Dietary Score derived from a PCA involving the 11 variables significant in the univariate analysis, producing a satisfactory outcome. It is of interest that the algebraic signs of factor scores of the PCA computed with the 11 major predictive variables were coherent with those found in the univariate analysis dealing with R and including the same variables. In fact, PCA is an a posteriori score procedure that does not include the end-point (that is AD) and is not influenced by the investigator opinions that often impinge upon selections if performed a priori. The relationship of Rs derived from univariate analysis and the factor scores of PCA showed their ability to segregate variables directly related to AD from those inversely so. Probably the most relevant finding of this analysis stays in the correlation between the Dietary Score (as a single determinant) and AD. Moreover, we used the Principal Component regression and the Ridge regression due to their specific characteristics and made the decision to select three groups of variables on the basis of an arbitrary a priori approach being, from this point of view, rather parsimonious. At the end, the findings were satisfactory and practically superposable with other results. Attempts to produce valuable models with six variables definitely failed.
All of these findings provide a clear lesson from ecological analyses comparing different cohorts. Possible determinants are more likely to be real predictive variables if their coefficient of variation is very high, say, more than 100%, as partly found in this analysis. Significant differences across the statistical units may not represent possible causes but only preliminary interpretations for differences. One should not infer that the same association could be found at the individual level but, if so, the fact represents a suggestion of a possible common causal role.
An ecological paradox finding was published in 1997 using the 25-year follow-up data of the SCS where the association of blood pressure with stroke mortality rates was inverse, while it was direct in analyses on individuals [21]. A possible explanation was identified in the fact that cohorts with high cholesterol and high blood pressure produced an excess of CHD deaths, while cohorts with low cholesterol and lower blood pressure produced an excess of stroke deaths, apparently due to the stronger role of serum cholesterol compared to that of high blood pressure for CHD mortality among cohorts.
An ecological analysis, partly based on the same dietary data and the same follow-up period of 50 years, was published in 2017 [17]: it showed that mortality from all causes was directly associated with energy and hard fats and inversely associated with vegetables, liquid fats, the MS ratio, and other complex food groups and ratios, partly in line with the present findings although using a different end-point (50-year all-cause death rate instead of 50-year AD). Incidentally, like in this paper, a strong component was energy intake, while the strongest food group was oils (liquid fats), with their Rs carrying opposite algebraic signs due to the different end-points.
It is clear that all of these observational findings do not solve the problem of the possible causality of eating habits that may need proper preventive trials. This accomplishment was recently reached by two Spanish trials where successes in primary and secondary prevention of major cardiovascular disease were obtained using some food components of the Mediterranean diet [22,23]. Despite this, it might be uncertain whether the same conclusions can be extended to longevity and AD, also in relation to technical issues dealing with extremely long trials.
The dispute on the role of saturated fat versus liquid fat seems never ending. Our findings, identifying LIF as the primary positive correlate of AD, align with the 2025 DGAC Scientific Report and the AHA news release [24,25]. However, they diverge from the final 2025–2030 DGA policy [26], which de-emphasizes the role of saturated fat, highlighting a critical contemporary conflict between long-term epidemiological data and current USA nutrition policy. However, a recent viewpoint of the problem is totally in line with the message of this analysis [27], and much debate is still ongoing [28]. With the limitation of having been conducted essentially within health profession settings, the controversy has recently been enriched by an additional element. The study just published online in the Journal of the American College of Cardiology [29] prospectively examined the association between diets characterized by different quantities and qualities of macronutrients, their metabolomic effects, and the risk of ischemic heart disease in 42,720 men in the Health Professionals Follow-Up Study (HPFS) (1986–2016), 64,164 women in the Nurses’ Health Study (NHS) (1986–2018), and 91,589 women in the NHSII (1991–2019), using dietary questionnaires aimed at defining the different origins and quality of foods (animal versus plant foods, whole grains versus refined carbohydrates, and other comparisons) rather than simply classifying diets as being low in saturated fat or carbohydrates. During 5,248,916 person-years, 20,033 cases of ischemic heart disease occurred, and the critical role of diet quality was highlighted, particularly when associated with fruits and vegetables rather than animal-derived foods, in reducing risk, since the “healthy” foods characteristic of such diets, much more than individual nutrients and their ratios, may exert beneficial health effects and significantly reduce coronary risk. These important individually obtained results align with the ecological ones presented in Table 6 of this study, obtained collectively using Principal Component and Ridge models in relation to LIF, FRU and VEG and determined as positively related to AD: this represents a clear allusion to the quality of a Mediterranean diet largely containing olive oil [10,17,22,23,25].
A frequently mentioned problem is bound to the possible changes in eating habits and nutrient contents of food during long follow-up periods. Re-analyses of the available food groups of each cohort after about 30 years of follow-up showed only slight changes that did not affect the ranks of the various nutrients across the cohorts [12,14,15]. A new dietary survey conducted after 25–30 years would cover no more than 30% of survivors aged 70 to 89 years, whose contribution to the association with AD would be rather limited.
We are unaware of other population studies that provided evidence on the relationship of risk factors and personal characteristics with AD in a long follow-up observation, and therefore, we are unable to quote any current references derived from previous investigations.

5. Conclusions

A group of 10 dietary nutrition variables were significantly ecologically associated with AD, and 6 of them could be tested in dedicated multivariate analyses. From the combined review of univariate and multivariate analyses, it appears that LIF (olive oils accounting in the largest proportion), FRU and VEG favour longevity, while hard fats and indicators of high intake of saturated and trans fats may play an adverse role. We can claim that around 50% of variance in cohort AD is explained by dietary habits. Similar findings were observed in an earlier publication using all-cause mortality as the end-point for the same follow-up duration [17]. On the other hand, traditional risk factors and personal characteristics did not contribute to explain the AD differences across the 16 study cohorts, likely due to their small coefficients of variation. We do not claim that traditional CVD risk factors are not associated with all-cause mortality or AD but simply that in the ecological approach, the priority in that association was found for the role of dietary nutrition measurements.
These findings underline the large contribution of eating habits in relation to longevity expressed by AD, but it should be recalled that the enrollment of the study cohorts was partly based on the hypothesized differences in habitual diets in relation to CHD occurrence [11] as a valuable basis for showing differences in outcome.

Author Contributions

Conceptualization, A.M. and P.E.P.; Methodology, A.M., P.E.P. and D.R.J.J.; Validation, D.R.J.J., A.K., M.O. and H.T.; Formal Analysis, A.M. and P.E.P.; Resources, A.K., M.O. and H.T.; Writing—Original Draft, A.M. and P.E.P.; Writing—Review and Editing, D.R.J.J., A.K., M.O. and H.T.; Supervision, A.M.; Project Administration, A.M. All authors have read and agreed to the published version of the manuscript.

Funding

The initiator of the Seven Countries Study of Cardiovascular Diseases, Prof. Ancel Keys, University of Minnesota, USA, obtained research grants from the National Heart Institute (later NHLBI) and the American Heart Association. Analysis and writing of this contribution were not covered by further funds.

Institutional Review Board Statement

Ethical approval that was not requested at the time of the Seven Countries Study, which started before the Helsinki Declaration, was provided through verbal authorization by the enrolled individuals, implicit in their participation. This was presented and accepted in other recent publications of our team, with different journals and publishers and diverse ethical requirements [10,17,20].

Informed Consent Statement

Baseline measurements were taken before the era of the initial Helsinki Declaration, and approval was implied in participation, while verbal or written consent was obtained for the collection of field examinations and follow-up data along with explicit authorizations to publish scientific results provided that the participants names were unidentifiable. In fact, a double procedure was created with a consecutive identification number in the roster wherefrom the participants were locally derived, whereas all measurements were performed only after the creation of a second identifier. An independent investigator, unaware of the data analyzing procedures and results, was responsible for allying the two identification numbers, mainly for future follow-up and event allocations. The original blank copy in English of the participants’ informed consent may be provided upon administrative request.

Data Availability Statement

The data used in this analysis are derived from the Seven Countries Study. Due to internal rules, the data and computing codes are not publicly available, although specific requests for dedicated analyses may be evaluated by the internal Coordinating Committee.

Acknowledgments

The authors are much indebted to all principal investigators of the Seven Country Study cohorts grouped in the present analysis for giving us the opportunity to use their country data. The authors gratefully thank H. Blackburn, School of Public Health, University of Minnesota, Minneapolis MN, USA, for his role of chairing the Seven Countries Study for decades.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. List of 58 baseline variables tested for the association with age at death.
Table A1. List of 58 baseline variables tested for the association with age at death.
Type of VariableVariables
13 traditional CVD risk factorsage, socio-economic status, body mass index, systolic blood pressure, diastolic blood pressure, serum cholesterol, subscapular skinfold, heart rate, cigarette per day, smoker prevalence, sedentary physical activity, moderate physical activity, vigorous physical activity
20 simple food groupsbread, cereals, potatoes, vegetables, legumes, fruit, meat, butter, milk, cheese, eggs, margarine, lard, fish, olive oil, n6-poly-oil, sugar, pastries, alcohol, wine
4 combined food groupshard fat, liquid fat, sum of vegetable food, sum of animal food,
13 nutrientsenergy, protein, sucrose, natural sugar, added sugar, mono-di-saccharides, starch, all carbohydrates, fat, saturated fat, monounsaturated fat, polyunsaturated fat, trans-fat
8 ratios of foods or nutrients,
and Dietary Scores
ratio animal food/vegetable food, monounsaturated fat/saturated fat, monounsaturated + polyunsaturated fat/saturated + trans fat, polyunsaturated fat/saturated fat, Mediterranean Adequacy Index, dietary inflammation score, dietary atherogenicity index, dietary thrombogenicity index

References

  1. Canudas-Romo, V. The modal age at death and the shifting mortality hypothesis. Demogr. Res. 2008, 19, 1179–1204. [Google Scholar] [CrossRef]
  2. Tuljapurkar, S. The final inequality variance in age at death. In Demography and the Economy, from the National Bureau of Economic Research; Shoven, J.B., Ed.; University of Chicago Press: Chicago, IL, USA, 2010; pp. 209–221. [Google Scholar]
  3. Wrycza, T.F.; Missov, T.I.; Baudisch, A. Quantifying the shape of aging. PLoS ONE 2015, 10, e0119163. [Google Scholar] [CrossRef]
  4. Robine, J.M. Modal age at death: Lifespan indicator in the era of longevity extension. Vienna Yearb. Popul. Res. 2013, 2, 37–69. [Google Scholar]
  5. Menotti, A.; Puddu, P.E. Focus on age at death in field epidemiology. Aging Clin. Exp. Res. 2023, 35, 1187–1194. [Google Scholar] [CrossRef]
  6. Kiss, C.; Nemeth, L.; Veto, B. Modelling the age distribution of longevity leaders. Sci. Rep. 2024, 12, 20592. [Google Scholar] [CrossRef] [PubMed]
  7. Falah, G.; Giller, A.; Gutman, D.; Arzmon, G. Breaking the glass ceiling. Gerontology 2020, 66, 309–314. [Google Scholar] [CrossRef] [PubMed]
  8. Hu, F.B. Diet strategies for promoting healthy aging and longevity. An epidemiological perspective. J. Intern. Med. 2024, 295, 508–531. [Google Scholar] [CrossRef] [PubMed]
  9. Konishi, M.; Iso, H.; Baba, S.; Terao, A.; Ida, M.; Shimamoto, T.; Komachi, Y. Pathologic characteristics of stroke and myocardial infarction in Japan. J. Epidemiol. 1992, 2, 137–147. [Google Scholar] [CrossRef][Green Version]
  10. Puddu, P.E.; Shivappa, N.; Menotti, A.; Hebert, J.R.; Tolonen, H.; Kafatos, A.; Adachi, H. Energy-adjusted Dietary Inflammatory Index scores predict long-term cardiovascular disease mortality and other causes of death in an ecological analysis of the Seven Countries Study. Eur. J. Prev. Cardiol. 2021, 28, 1342–1350. [Google Scholar] [CrossRef]
  11. Keys, A.; Aravanis, C.; Blackburn, H.; van Buchem, F.S.P.; Buzina, R.; Djordjevic, B.S.; Dontas, A.S.; Fidanza, F.; Karvonen, M.J.; Kimura, N.; et al. Epidemiological studies related to coronary heart disease: Characteristics of men aged 40–59 in seven countries. Acta Med. Scand. 1967, 180, 1–392. [Google Scholar] [CrossRef]
  12. Kromhout, D.; Menotti, A.; Blackburn, H. (Eds.) Prevention of Coronary Heart Disease: Diet, Lifestyle and Risk Factors in the Seven Countries Study; Kluwer Publ: Norwell, MA, USA; Dordrecht, NL, USA, 2002; pp. 1–267. [Google Scholar]
  13. Den Hartog, C.; Buzina, K.; Fidanza, F.; Keys, A.; Roine, P. (Eds.) Dietary Studies and Epidemiology of Heart Diseases; Voeding: The Hague, The Netherlands, 1968; pp. 1–157. [Google Scholar]
  14. Kromhout, D.; Keys, A.; Aravanis, C.; Buzina, R.; Fidanza, F.; Giampaoli, S.; Jansen, A.; Menotti, A.; Nedeljkovic, S.; Pekkarinen, M.; et al. Food consumption patterns in the 1960s in Seven Countries. Am. J. Clin. Nutr. 1989, 49, 889–894. [Google Scholar] [CrossRef]
  15. Kromhout, D.; Menotti, A.; Bloemberg, B.; Aravanis, C.; Blackburn, H.; Buzina, R.; Dontas, A.S.; Fidanza, F.; Giampaoli, S.; Jansen, A. Dietary saturated and trans fatty acids and cholesterol and 25-year mortality from coronary heart disease: The Seven Countries Study. Prev. Med. 1995, 24, 308–315. [Google Scholar] [CrossRef]
  16. Ulbricht, T.L.V.; Southgate, D.A.T. Coronary heart disease: Seven dietary factors. Lancet 1991, 338, 985–992. [Google Scholar] [CrossRef]
  17. Menotti, A.; Kromhout, D.; Puddu, P.E.; Alberti-Fidanza, A.; Hollman, P.; Kafatos, A.; Tolonen, H.; Adachi, H.; Jacobs, D.R., Jr. Baseline fatty acids, food groups, a diet score and 50-year all-cause mortality rates. An ecological analysis of the Seven Countries Study. Annals Med. 2017, 49, 718–727. [Google Scholar] [CrossRef]
  18. Kromhout, D.; Menotti, A.; Alberti-Fidanza, A.; Puddu, P.E.; Hollman, P.; Kafatos, A.; Tolonen, H.; Adachi, H.; Jacobs, D.R., Jr. Comparative ecologic relationships of saturated fat, sucrose, food groups and a Mediterranean food pattern score to 50-year coronary heart disease mortality rates among 16 cohorts of the Seven Countries Study. Eur. J. Clin. Nutr. 2018, 72, 1102–1110. [Google Scholar] [CrossRef]
  19. Menotti, A.; Puddu, P.E.; Catasta, G. Determinants of longevity and age at death in a practically extinct cohort of middle-aged men followed-up for 61 years. Aging Clin. Exp. Res. 2022, 34, 1247–1258. [Google Scholar] [CrossRef]
  20. Puddu, P.E.; Menotti, A.; Jacobs, D.R., Jr.; Adachi, H.; Kafatos, A.; Tolonen, H. Cardiovascular risk factors predict age at death in 60-year follow-up of the Seven Countries Study. Aging Clin. Exp. Res. 2023, 35, 193–202. [Google Scholar] [CrossRef] [PubMed]
  21. Menotti, A.; Blackburn, H.; Kromhout, D.; Nissinen, A.; Karvonen, M.; Aravanis, C.; Dontas, A.; Fidanza, F.; Giampaoli, S. The inverse relation of average population blood pressure and stroke mortality rates in the Seven Countries Study: A paradox. Eur. J. Epidemiol. 1997, 13, 379–386. [Google Scholar] [CrossRef] [PubMed]
  22. Estruch, R.; Ros, E.; Salas-Salvadó, J.; Covas, M.-I.; Corella, D.; Arós, F.; Gómez-Gracia, E.; Ruiz-Gutiérrez, V.; Fiol, M.; Lapetra, J.; et al. Primary prevention of cardiovascular disease with a Mediterranean supplement with extra-virgin olive oil and nuts. N. Engl. J. Med. 2018, 378, e34. [Google Scholar] [CrossRef]
  23. Delgado-Lista, J.; Alcala-Diaz, J.F.; Torres-Peña, J.D.; Quintana-Navarro, G.M.; Fuentes, F.; Garcia-Rios, A.; Ortiz-Morales, A.M.; I Perez-Caballero, A.; Yubero-Serrano, E.M.; A Rangel-Zuñiga, O.; et al. Long-term secondary prevention of cardiovascular disease with a Mediterranean diet and a low-fat diet (CORDIOPREV). A randomized controlled trial. Lancet 2022, 399, 1876–1885. [Google Scholar] [CrossRef] [PubMed]
  24. 2025 Dietary Guidelines Advisory Committee. Scientific Report of the 2025 Dietary Guidelines Advisory Committee: Advisory Report to the Secretary of Health and Human Services and Secretary of Agriculture; U.S. Department of Health and Human Services: Washington, DC, USA, 2024. [Google Scholar] [CrossRef]
  25. New Dietary Guidelines Underscore Importance of Healthy Eating. 7 January 2026. Available online: https://newsroom.heart.org/news/releases-20260107-6915862 (accessed on 3 March 2026).
  26. U.S. Department of Agriculture; U.S. Department of Health and Human Services. Dietary Guidelines for Americans, 2025–2030, 10th ed.; U.S. Department of Agriculture: Washington, DC, USA; U.S. Department of Health and Human Services: Washington, DC, USA, 2026. Available online: https://cdn.realfood.gov/DGA.pdf (accessed on 3 March 2026).
  27. Mozaffarian, D. The 2025–2030 Dietary Guidelines for Americans. JAMA 2026, 335, 575. [Google Scholar] [CrossRef] [PubMed]
  28. Abbasi, J. What Nutrition Experts say about the New Dietary Guidelines for Americans. JAMA 2026, 335, 927–929. [Google Scholar] [CrossRef] [PubMed]
  29. Wu, Z.; Liu, B.; Wang, X.; Alessa, H.; Zeleznik, O.A.; Eliassen, A.H.; Clish, C.; Wang, M.; Mukamal, K.J.; Rimm, E.B.; et al. Effect of low-carbohydrate and low-fat diets on metabolomic indices and coronary heart disease in U.S. individuals. J. Am. Coll. Cardiol. 2026. In Press. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Relationship of Dietary Score with AD derived from univariate linear regression.
Figure 1. Relationship of Dietary Score with AD derived from univariate linear regression.
Nutrients 18 01021 g001
Figure 2. Correlation of the R value of the age at death of 10 variables (except AGE) versus the Dietary Score of the same variables. See Table 3 for the list of these variables (column R).
Figure 2. Correlation of the R value of the age at death of 10 variables (except AGE) versus the Dietary Score of the same variables. See Table 3 for the list of these variables (column R).
Nutrients 18 01021 g002
Figure 3. Relationship of observed versus estimated AD derived from a Ridge regression model based on 2 variables associated with AD in opposite ways.
Figure 3. Relationship of observed versus estimated AD derived from a Ridge regression model based on 2 variables associated with AD in opposite ways.
Nutrients 18 01021 g003
Table 1. List of abbreviations and units of measurement.
Table 1. List of abbreviations and units of measurement.
Unit of MeasurementAbbreviation
US Railroad cohort----US
East Finland cohort----EF
West Finland cohort----WF
Zutphen, the Netherlands----ZU
Crevalcore, Italy----CR
Montegiorgio, Italy----MO
Rome Railroad, Italy----RR
Dalmatia, Croatia-former Yugoslavia----DA
Slavonia, Croatia-former Yugoslavia----SL
Velika Krsna, Serbia-former Yugoslavia----VK
Zrenjanin, Serbia-former Yugoslavia----ZR
Belgrade, Serbia-former Yogoslavia----BE
Crete, Greece----KT
Corfu, Greece----CO
Tanushimaru, Japan----TA
Ushibuka, Japan----UB
Energy intakecalories per dayENE
Liquid fat intakeg per day per 1000 caloriesLIF
Hard fat intakeg per day per 1000 caloriesHAF
Fruit intakeg per day per 1000 caloriesFRU
Olive oil intakeg per day per 1000 caloriesOLI
All vegetable food intakeg per day per 1000 caloriesVEG
Monounsaturated fat/saturated fat intakeratioM/S
Mono plus polyunsturated fat/saturated plus trans-fat intakeratioMP/ST
DietaryThrombogenicity indexarbitrary unitsTHI
Dietary inflammation Indexarbitrary unitsINF
AgeyearsAGE
Table 2. Descriptive data of 50-year death rates and mean age at death in the 16 cohorts.
Table 2. Descriptive data of 50-year death rates and mean age at death in the 16 cohorts.
CohortDeath Rates
per 1000
Person/Year
Rank of Death Rate (*)Age at Death
Years
Rank of Age at Death (**)
US Railroad37.3875.18
East Finland44.01572.115
West Finland40.41373.113
Zutphen, the Netherlands38.81074.59
Crevalcore, Italy39.11174.510
Montegiorgio, Italy36.2676.34
Rome Railroad, Italy34.4377.93
Dalmatia, Croatia, former Yugoslavia36.3775.47
Slavonia, Croatia former Yugoslavia45.31669.716
Velika Krsna, Serbia, former Yugoslavia39.91273.811
Zrenjanin, Serbia, former Yugoslavia43.11472.914
Belgrade, Serbia, former Yugoslavia31.1277.92
Crete, Greece31.1180.41
Corfu, Greece35.9476.15
Tanushimaru, Japan35.9575.46
Ushibuka, Japan38.6973.612
Coefficient of variation (***)4.1-----3.3-----
(*) From the lowest to the highest. (**) From the highest to the lowest. (***) Standard deviation as % of mean.
Table 3. Simple linear correlation coefficients between significant variables and mean age at death plus PCA score coefficients for the same variables derived from PCA procedure. For meaning of abbreviations of variables, see Table 1.
Table 3. Simple linear correlation coefficients between significant variables and mean age at death plus PCA score coefficients for the same variables derived from PCA procedure. For meaning of abbreviations of variables, see Table 1.
RankVariableRR2p ValuePartial CorrelationPCA Factor Scores
1LIF0.700.490.00240.80 (*)−0.1255
2OIL0.630.400.00880.27−0.1229
3M/S 0.620.380.01040.31−0.1320
4INF −0.620.380.0104−0.430.1178
5HAF−0.610.370.01200.400.1138
6ENE−0.610.370.0120 0.0852
7FRU0.600.360.0140 −0.1084
8MP/ST 0.590.350.0162 −0.1377
9THI−0.570.320.0212 0.1128
10VEG0.560.310.0240 −0.1323
11AGE −0.500.250.0486 0.0044
Partial correlation was computed only for each of the first 5 significant determinants versus AD, while the 4 other significant determinants play the role of confounding variables. (*) p value of partial correlation ≤ 0.05.
Table 4. Correlation matrix (for R) of 11 entry variables significantly related with mean age at death. For meaning of abbreviations of variables, see Table 1.
Table 4. Correlation matrix (for R) of 11 entry variables significantly related with mean age at death. For meaning of abbreviations of variables, see Table 1.
AGEENEFRUOLILIFHAFVEGINFM/STHIMP/ST
AGE1.000.21−0.220.04−0.12−0.020.090.110.02−0.080.08
ENE 1.00−0.40−0.19−0.230.55−0.550.74−0.300.80−0.46
FRU 1.000.690.72−0.420.75−0.600.73−0.330.66
OLI 1.000.98−0.540.72−0.610.96−0.460.86
LIF 1.00−0.600.72−0.610.96−0.470.87
HAF 1.00−0.680.62−0.630.83−0.79
VRG 1.00−0.700.82−0.750.89
INF 1.00−0.650.70−0.75
M/S 1.00−0.580.94
THI 1.00−0.77
MP/ST 1.00
Values of R ≥ 0.50 correspond to p values < 0.05 (in bold).
Table 5. Mean levels in 16 cohorts of 11 variables significantly associated with mean age at death (through R). All items are given as grams per day, except ENE (energy), which is given as calories per day. For meaning of abbreviations of variables, see Table 1.
Table 5. Mean levels in 16 cohorts of 11 variables significantly associated with mean age at death (through R). All items are given as grams per day, except ENE (energy), which is given as calories per day. For meaning of abbreviations of variables, see Table 1.
CohortLIFOLIM/SINFHAFENEFRUM/PSTTHIVEGAGE
US1.30.00.88−0.2312.92326100.21.1249.8281.649.4
EF00.00.611.4627.1357711.20.6976.6244.148.8
WF00.00.682.2521.234409.90.6864.7261.349.9
ZU00.09.30.812.7027.4292228.10.8153.4284.749.5
CR11.49.31.322.2415.8343255.71.3249.7263.149.2
MO8.68.61.600.6114.3279110.01.6029.7305.349.0
RR17.517.51.93−0.232.4245561.11.9326.1246.248.3
DA22.522.51.970.305.332011.91.9736.9311.850.4
SL2.12.11.182.9816.338160.31.1866.3245.350.4
VK1.50.00.812.177.133880.30.8146.6293.449.7
ZR3.70.01.191.7313.5325656.81.1952.8338.149.0
BE10.10.00.951.399.7287052.20.9551.8267.647.0
KT35.035.02.960.110.02712171.12.9625.0508.949.0
CO28.928.92.85−1.100.02540178.12.8519.8540.949.7
TA1.30.00.990.460.0224311.60.999.3402.650.1
UB3.10.01.230.600.0226718.51.2313.2371.949.5
Coef Var (*)113.8147.250.8102.888.428.2111.646.945.726.11.60
(*) Coefficient of variation = standard deviation as % of average.
Table 6. Principal Component regression and Ridge regression models predicting age at death with 3 or 2 different arbitrarily selected variables. For meaning of abbreviations of variables, see Table 1.
Table 6. Principal Component regression and Ridge regression models predicting age at death with 3 or 2 different arbitrarily selected variables. For meaning of abbreviations of variables, see Table 1.
Principal Component Model with 3 Variables Directly Related to AD in Univariate Analysis
VariablesCoefficientStandard errorStandardized coefficientVariance Inflation Factor
Intercept72.5990
LIF0.13190.07220.57662.4761
FRU0.00390.07230.08630.7449
VEG0.00280.00470.09710.6701
Analysis of variance with 1 component omitted, p = 0.0293R between observed and estimated AD = 0.72
Principal Component model with 3 variables inversely related to AD in univariate analysis
VariablesCoefficientStandard errorStandardized coefficientVariance Inflation Factor
Intercept77.1954
HAF−0.03030.0529−0.11040.7834
THI−0.2350.0154−0.42831.8032
INF−0.91410.6243−0.18270.3037
Analysis of variance with 1 component omitted, p = 0.0.713R between observed and estimated AD = 0.66
Principal Component model with 2 variables related to AD in opposite way
VariablesCoefficientStandard errorStandardized coefficientVariance Inflation Factor
Intercept75.1738
LIF0.08850.02550.38700.3121
HAF−0.10620.0307−0.38700.3121
Analysis of variance with 1 component omitted, p = 0.0143R between observed and estimated AD = 0.69
Ridge model with 3 variables directly related to AD in univariate analysis
VariablesCoefficientStandard errorStandardized coefficientVariance Inflation Factor
Intercept73.1870
LIF0.13220.07210.57832.4739
FRU0.00740.01470.16542.6981
VEG0.00050.00950.01632.7078
Analyis of variance: p = 0.0282R between observed and estimated AD = 0.72
Ridge model with 3 variables inversely related to AD in univariate analysis
VariablesCoefficientStandard errorStandardized coefficientVariance Inflation Factor
Intercept77.1722
HAF−0.03240.1090−0.11793.3238
THI−0.02230.0557−0.17373.9620
INF−0.91820.6520−0.43021.9672
Analysis of variance = 0.0713R between observed and estimates AD = 0.66
Ridge model with 2 variables related to AD in opposite way
VariablesCoefficientStandard errorStandardized coefficientVariance Inflation Factor
Intercept74.0749
LIF0.13980.05500.61171.5684
HAF−0.04460.06600.61231.5684
Analysis of variance with p = 0.0085R between observed and estimated AD = 0.72
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Menotti, A.; Puddu, P.E.; Jacobs, D.R., Jr.; Kafatos, A.; Ostojic, M.; Tolonen, H. Ecological Correlates of Differences in Mean Age at Death Across Nearly Extinct Cohorts: The Role of Dietary Habits. Nutrients 2026, 18, 1021. https://doi.org/10.3390/nu18071021

AMA Style

Menotti A, Puddu PE, Jacobs DR Jr., Kafatos A, Ostojic M, Tolonen H. Ecological Correlates of Differences in Mean Age at Death Across Nearly Extinct Cohorts: The Role of Dietary Habits. Nutrients. 2026; 18(7):1021. https://doi.org/10.3390/nu18071021

Chicago/Turabian Style

Menotti, Alessandro, Paolo Emilio Puddu, David R. Jacobs, Jr., Anthony Kafatos, Miodrag Ostojic, and Hanna Tolonen. 2026. "Ecological Correlates of Differences in Mean Age at Death Across Nearly Extinct Cohorts: The Role of Dietary Habits" Nutrients 18, no. 7: 1021. https://doi.org/10.3390/nu18071021

APA Style

Menotti, A., Puddu, P. E., Jacobs, D. R., Jr., Kafatos, A., Ostojic, M., & Tolonen, H. (2026). Ecological Correlates of Differences in Mean Age at Death Across Nearly Extinct Cohorts: The Role of Dietary Habits. Nutrients, 18(7), 1021. https://doi.org/10.3390/nu18071021

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop