A Machine-Learning Approach to Target Clinical and Biological Features Associated with Sarcopenia: Findings from Northern and Southern Italian Aging Populations

Epidemiological and public health resonance of sarcopenia in late life requires further research to identify better clinical markers useful for seeking proper care strategies in preventive medicine settings. Using a machine-learning approach, a search for clinical and fluid markers most associated with sarcopenia was carried out across older populations from northern and southern Italy. A dataset of adults >65 years of age (n = 1971) made up of clinical records and fluid markers from either a clinical-based subset from northern Italy (Pavia) and a population-based subset from southern Italy (Apulia) was employed (n = 1312 and n = 659, respectively). Body composition data obtained by dual-energy X-ray absorptiometry (DXA) were used for the diagnosis of sarcopenia, given by the presence of either low muscle mass (i.e., an SMI < 7.0 kg/m2 for males or <5.5 kg/m2 for females) and of low muscle strength (i.e., an HGS < 27 kg for males or <16 kg for females) or low physical performance (i.e., an SPPB ≤ 8), according to the EWGSOP2 panel guidelines. A machine-learning feature-selection approach, the random forest (RF), was used to identify the most predictive features of sarcopenia in the whole dataset, considering every possible interaction among variables and taking into account nonlinear relationships that classical models could not evaluate. Then, a logistic regression was performed for comparative purposes. Leading variables of association to sarcopenia overlapped in the two population subsets and included SMI, HGS, FFM of legs and arms, and sex. Using parametric and nonparametric whole-sample analysis to investigate the clinical variables and biological markers most associated with sarcopenia, we found that albumin, CRP, folate, and age ranked high according to RF selection, while sex, folate, and vitamin D were the most relevant according to logistics. Albumin, CRP, vitamin D, and serum folate should not be neglected in screening for sarcopenia in the aging population. Better preventive medicine settings in geriatrics are urgently needed to lessen the impact of sarcopenia on the general health, quality of life, and medical care delivery of the aging population.


Introduction
As the burden of population aging increases [1], a multidisciplinary research effort is needed to fill the knowledge gap around risk biopaths and to foster preventive strategies against sarcopenia, a multifactorial syndrome characterized by a progressive and generalized loss of skeletal muscle mass as well as poor physical endurance, which is often combined with subclinical systemic inflammation [2]. Such a decline in skeletal muscle function is a nearly inevitable part of the aging process and has a considerable impact on health care costs and quality of life, since it raises the chance of negative outcomes including falls, fractures, physical impairments, and mortality [3].
Sarcopenia is a geriatric giant triggered by adverse muscle changes commonly experienced late in life. According to the Revised European Consensus on Definition and Diagnosis (EWGSOP2) [4], the dimensions that best define sarcopenia are low levels of three parameters: muscle strength, muscle quantity/quality, and physical performance, of which the latter is an indicator of severity. Hence, poor muscle quantity and quality confirms the presence of sarcopenia, whereas low physical performance clearly rates its severity. Moreover, to detect this condition early in clinical settings, the consensus group focused on a novel algorithm path, so called FACS (Find-Assess-Confirm-Severity), that also took into account the probability of sarcopenia, with reference to the SARC-F questionnaire or the calf circumference. As for the assessment of the muscle mass, a large number of tools are detailed in the literature, but only a few are effectively applicable in the clinical setting and therefore considered in the consensus. Of these, MRI and computed tomography (CT) are the gold standard, but they cannot really be applied in a context beyond research and thus are poorly understood in the clinical setting [5]. Dual-energy X-ray absorptiometry (DXA) is the most available, reliable, and feasible approach to directly assess body composition variables [6], while bioelectrical impedance analysis (BIA) is an alternative indirect method-low-cost compared to DXA-that can be used to screen a much larger population, as DXA is not handheld and cannot be applied in specific populations (e.g., pregnant women, bedridden patients, etc.).
Understanding multimodal indicators that can successfully anticipate the onset of sarcopenia and avert the deleterious cascade of late-life multimorbidity remains an open issue from a preventative standpoint. Given sarcopenia as a multifactorial condition, a single fluid or clinical marker cannot be easily pinpointed or be helpful, and thus focus turns to the implementation of a panel that includes multidomain markers. Further, ideal markers of sarcopenia should be valid, replicable, reliable, specific, affordable, and easily available [7]. In the field of nutrition, the scientific community openly acknowledges haemoglobin, albumin, leptin, uric acid, iron, and vitamin D, amongst others, in predicting the risk of sarcopenia [8][9][10].
Of note, the heterogeneous biological, clinical, and social complexities involving the individual, as well as the different characteristics of the personnel and place where such assessments are conducted, inevitably play a role in decisions about risk variables, their cut-points, and their ranking [11,12]. For these reasons, there could be arguments that a single gold standard variable as well as the best cut-points could translate poorly from the epidemiology field and computational modelling to the real clinical practice. To address this challenge and add to literature surrounding the biopathways of sarcopenia, this research sought to generate scientific evidence by testing a convenience sample composed of two subsets, i.e., one clinical-based and one population-based, both aged over 65 years. Machine-learning-based techniques were implemented and compared in order to evaluate the clinical and fluid markers most associated with the sarcopenia condition across the two population settings. We enrolled male and female subjects aged ≥65 years, with a body mass index (BMI) between 20 and 30 kg/m 2 [13], who are outpatients at the metabolic rehabilitation unit of the Santa Margherita Institute, Department of Public Health, University of Pavia. Subjects with the following conditions were excluded from the study: severe kidney disease (glomerular filtration rate < 30 mL/min), moderate-to-severe hepatic failure (Child-Pugh Class of B or C), endocrine diseases associated with disorders of calcium metabolism (with the exception of osteoporosis), psychiatric disorders, and cancer (in the previous 5 years). The recruitment period was between January 2020 and January 2022. Informed consent was obtained from all subjects involved in the study. The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board of IRCCS S. De Bellis (protocol code n. 68, 9 April 2019).

Southern-Italy Population Subset (the Salus in Apulia Study)
Participants of the Salus in Apulia population-based study were recruited from the electoral rolls of Castellana Grotte (Bari, Apulia, Southern Italy). The recruiting and evaluation centre was the National Institute of Gastroenterology IRCCS "S. De Bellis" Research Hospital, and the initiative was supported by the Italian Ministry of Health and the Apulia Regional Government. The Salus is an ongoing longitudinal population-based study, activated in 2014, of a representative population of residents in Castellana Grotte (Apulia, southern Italy) who were 65 years of age or older at the time of initial recruitment. While the minimum age of 65 was required for enrolment in the Salus, conversely, the exclusion criteria were lack of mental capacity to express consent, having digestive tract cancers or other malignancies, including dementia and motoneuron diseases, or being under major therapies, which could affect nutritional/physical status. The study design and data collection method are detailed elsewhere [14,15]. Briefly, the entire sampling frame consisted of the 4021 elderly residents in the health registry of the Apulia Region as of 31 December 2014. The study was born as multidisciplinary, including the assessments of the cognitive, sensory, physical, and nutritional domains, as illustrated in some of our previous work [16], and aimed to search for new biological and phenotypic determinants to predict and prevent risky trajectories of aging. Specifically, the data used for the present study came from a subset of the Salus, which included 479 elders who had undergone all examinations required for the purposes of this study. The IRB approved the study of the lead institution, the National Institute of Gastroenterology and Research Hospital "Saverio de Bellis", and all subjects completed informed consent forms before their evaluation. The study met the principles of the Helsinki Declaration and adhered to the "Standards for Reporting Diagnostic Accuracy Studies" (STARD) guidelines (http://www.stard-statement. org/, accessed on 12 January 2023) and the "Strengthening the Reporting of Observational Studies in Epidemiology" (STROBE) guidelines.

Fluid Biomarker Assessment
A blood sample was collected in the morning after overnight fasting to measure the levels of fasting blood glucose (FBG), glycated haemoglobin (HbA1c), total cholesterol, high-density lipoprotein (HDL) cholesterol, low-density lipoprotein (LDL) cholesterol, and triglycerides, using standard automated enzymatic colorimetric methods (AutoMate 2550, Beckmann Coulter, Brea, CA, USA) under strict quality control. LDL cholesterol was calculated using the Friedewald equation. Plasma glucose was determined using the glucose oxidase method (Sclavus, Siena, Italy). Blood cell count was determined by a Coulter haematology analyser (Beckman-Coulter, Brea, CA, USA). Serum FT3, FT4, and TSH were measured using a competitive photometric method based on the solid-phase antigen-linked technique (LIASON FT3, LIASON FT4, LIASON TSH, Dia-Sorin, Saluggia, Italy). Serum high-sensitivity C-reactive protein (CRP) was assayed using a latex particleenhanced immunoturbidimetric assay (Kamiya Biomedical Company, Seattle, WA, USA) (reference range: 0-5.5 mg/L; inter-assay coefficient of variation: 4.5%). Serum 25(OH)D was quantified by a chemiluminescence method (Diasorin Inc., Stillwater, MN, USA), and all samples were analysed in duplicate.

Clinical and Physical Assessment
Height was measured to the nearest 0.5 cm using a wall-mounted stadiometer (Seca 711; Seca, Hamburg, Germany). Body weight was determined at the time of DXA to the nearest 0.1 kg using a calibrated balance beam scale (Seca 711; Seca, Hamburg, Germany). BMI was calculated as weight in kilograms divided by height in metres squared (kg/m 2 ). Low physical performance was assessed using the Short Physical Performance Battery (SPPB), an objective tool for measuring the physical performance status of the lower extremities [17]. The SPPB is based on three timed tasks: standing balance, walking speed, and chair sit-to-stand tests [12]. The timed results of each subtest were rescaled according to the predefined cutoff points, obtaining a score ranging from 0 (worst performance) to 12 (best performance). A cutoff value of 8 in the SPBB score was considered to indicate low physical performance, in accordance with both EWGSOP panels [4,18]. Handgrip strength (HGS) was assessed using the Jamar Plus Digital Hand Dynamometer (Patterson Medical, Cedarburg, WI, USA). Seated with arms 90 degrees to the sides, 2 trials were taken per arm in an alternating fashion with 30 s of rest between trials. The highest reading was recorded [4].
Bone mineral density (BMD) and whole-body lean mass were measured using DXA (Discovery WI, Hologic, Inc., Marlborough, MA, USA). The skeletal muscle mass index (SMI) was defined as the sum of the muscle masses of the four limbs as appendicular skeletal muscle mass divided by squared height.
Whole-body lean mass (kg) was taken as the sum of the fat-free, bone-free mass of the arms and legs as lean mass. According to the operational definition by the European Working Group on Sarcopenia in Older People (EWGSOP2) [4], the diagnosis of sarcopenia was given by the presence of both low muscle mass (that is, an SMI < 7.0 kg/m 2 for males or <5.5 kg/m 2 for females) [4], and low muscle strength, as defined by a low handgrip strength (HGS), that is, an HGS < 27 kg for males or <16 kg for females, or low a physical performance (that is, an SPPB ≤ 8).

Statistical Analysis
The entire sample was first divided according to the population setting, i.e., subjects from the Salus in Apulia population-based study and those from the clinical setting of the Santa Margherita Hospital, in order to assess the overlap of the samples (Table 1). Then, the overall population was further subdivided according to the outcome variable, that is, the sarcopenia condition (presence/absence), and groups were compared to describe the clinical and functional differences in terms of frequency and associations ( Table 2).
Normal distributions of quantitative variables were tested using the Kolmogorov-Smirnov test. Data are reported as mean ± standard deviation (M ± SD) for continuous measures and frequency and percentages (%) for all categorical variables. In order to focus on the practical differences between the groups in terms of effect size (ES) [19], differences between continuous variables, between the groups, were calculated using Wilcoxon's effect size difference between ranks, and their 95% confidence intervals (CI) to assess the magnitude of ES [20]. Prevalence differences were calculated to assess differences between categorical variables.
A machine-learning feature-selection approach, the random forest (RF), was employed to identify the most predictive features in the dataset for the sarcopenic condition considering every possible interaction between them, considering also the non-linear relationships that classical models could not assess ( Figure 1). Three RF regression models were built. The first model also included the variables used for the detection of sarcopenia in order to assess which variables were most important in the algorithm of detection using the Mean Decrease Gini. The variables SMI and SPPB were used within an ensemble learning approach in order to highlight possible relationships even in light of variables known to be associated with the condition of sarcopenia. Such an inclusion is therefore useful in order to highlight possible interactions of a nonparametric nature between predictive factors in the classification of sarcopenia. The second RF regression model was performed subdividing according to the clinical study centre in order to assess if there were differences in the association ranking due to the clinical setting. A third RF regression was performed using only socio-demographic and haematochemical variables in order to assess which variables were most associated with the sarcopenia condition using the Mean Decrease Gini ( Figure 2).       The accuracy of the logistic regression model was calculated using a first confusion matrix, as shown in Table 3. A logistic regression model was performed on the sarcopenia condition as a dependent variable and sociodemographic and blood chemistry parameters as regressors (Table 4) in order to assess differences between the parametric and nonparametric approach in the prediction of sarcopenia. The accuracy of the third RF regression model was calculated using a further confusion matrix, as shown in Table 5.

Results
A total of 1791 subjects made up the entire sample; of these, the majority (n = 1312, 73.3%) came from the clinical setting of northern Italy. Age over 65 years was a common feature of both samples, and the mean age (±standard deviation, SD) was 79.79 ± 7.18 years and 74.81 ± 5.67 years for the northern and southern Italian populations, respectively. Table 1 shows a description of the entire sample according to the population setting, i.e., northern (clinical setting) versus southern (population-based setting) Italy. Here, when analysing the between-group practical differences in terms of effect size (ES), as defined by Wilcoxon's effect size difference and respective 95% confidence intervals (CI), the age (ES: 0.32, 95%CI 0.28-0.36) and sex (ES: 19.10, 95%CI 14.01-24.18) showed significant differences, meaning that the northern Italian sample was older and included more females than the southern counterpart. There was a significantly higher prevalence of sarcopenia in the northern sample (12.6% vs. 7.3%) compared to the southern sample (ES: −5.27, 95%CI −8.21 to −2.33). Along these lines, moving toward DXA-derived body composition variables, significant differences showed up for arm free-fat mass (FFM), whole-body lean mass, and whole-body fat mass. That is, the southern sample showed Following the same analytical approach, Table 2 shows a description of the whole sample by sarcopenia condition (presence/absence). To substantiate the internal validity of the findings, sarcopenic subjects were predominantly older (ES: 0.16, 95%CI 0.12 to 0.  Figure 1 shows a plot of important variables from the RF regression model with sarcopenia condition as the dependent variable and the other variables as regressors. The following rationale guided the selection procedure. The first model was run including both the fluid markers and the functional domains of sarcopenia to evaluate which variables were the most important to be included in the algorithm of sarcopenia detection by using the Mean Decrease Gini. The graph showed that SMI (Mean Decrease Gini greater than 60), followed by handgrip strength, FFM (arms and legs), sex (all showing a 20 to 40 Mean Decrease Gini), BMI, and total lean body mass (Mean Decrease Gini over 15) were the most relevant. The second RF regression model was run separately after dividing the sample into two subsets according to the population setting (northern vs. southern Italy, i.e., clinical vs. population-based setting) to assess whether there were differences in the importance ranking due to the different settings. Here, the ranking showed almost overlapping findings, that is, SMI, handgrip strength, FFM (legs and arms), and sex being the top variables of importance in both subsets. A third RF regression was performed using only the socio-demographic and haematochemical variables to assess which variables were most associated with the condition of sarcopenia in the total sample. Here, albumin, CRP, and folate were shown to be top-ranked (Mean Decrease Gini: 27.5, 21.56, and 20.08, respectively) ( Figure 2). The reliability of the third RF regression model was evaluated using a confusion matrix ( Table 3). The findings showed an accuracy of 94.57 (95%CI 92.15 to 96.42), a sensitivity of 1.00, and a specificity of 0.94.
To explore the differences between the parametric and nonparametric approaches in predicting sarcopenia, a logistic regression model was run on sarcopenia condition as the dependent variable and sociodemographic and fluid biomarkers variables as regressors (Table 4). Male gender showed the strongest association with sarcopenia (OR: 4.384, 95%CI 3.027 to 6.351). Slightly at risk were those subjects showing lower SPPB scores (OR: 0.906, 95%CI 0.847 to 0.969), serum folate (OR: 1.022, 95%CI 1.005 to 1.038), and vitamin D (OR: 1.015, 95%CI 1.002 to 1.028). Again, the reliability of the logistic regression model was evaluated using a confounding matrix ( Table 5). The findings showed an accuracy of 89.89 (95%CI 87.70 to 90.62), a sensitivity of 14.50, and a specificity of 99.37.

Discussion
This study aimed to investigate clinical and fluid markers most associated with the condition of sarcopenia by implementing a machine-learning-based approach across two subsets (clinical and population-based) of adults over 65 years of age to provide evidence on how to better set up preventive strategies in sarcopenia settings. The research was carried out involving populations from northern and southern Italy, respectively from the Santa Margherita Clinic (Pavia, northern Italy) and the Salus in Apulia populationbased study (Apulia, southern Italy). The top-ranked association variables for sarcopenia in RF selection overlapped across the two population subsets, and included SMI, HGS, FFM of legs and arms, and sex. Given these similarities in terms of clinical features, we implemented parametric and nonparametric analysis on the whole sample to investigate those variables most associated with sarcopenia, and found albumin, CRP, folate, and age to rank high in RF selection, while sex, folate, and vitamin D were most relevant in logistic.
First, to substantiate the internal validity of our findings, it is worth noting that statistical analyses by sarcopenia condition (presence/absence) showed a higher proportion of males as well as older age, and lower levels of albumin, haemoglobin, vitamin D, and BMI in the sarcopenic population. In terms of comparison, our clustered analyses by population setting showed meaningful differences between subsets in the prevalence of sarcopenia (higher in the clinical setting population, i.e., 12% compared to 7% of the population-based counterpart) and, in turn, fluid biological markers and functional proxies. The clinical setting population showed lower (worst) values in FFM legs and arms, HGS, SPPB, and BMI, as well as lower serum levels of albumin, folate, vitamin D, haemoglobin, RBC, triglycerides, and HDL cholesterol. Functional dimensions such as HGS and SPPB scoring were also worse in the clinical subset. These data fit well within a geriatric outpatient/clinical healthcare setting, as the latter assumes that subjects come to the hospital to fix some medical issue, whereas in a population-based setting, subjects are usually recruited to participate in data collection for research purposes because they fit the purposes of the study well, and thus come to the hospital for a visit without a specific health concern to fix. In our specific case, subjects from the north were recruited in a clinical setting that included both outpatients and inpatients, and were therefore expected to be more frail and physically impaired than the general population recruited in the southern sample. This view most likely explains the poorer general health status of the clinical subset. Moreover, the physiological path of reduction in testosterone, the hormone that drives protein synthesis and muscle development, may then elucidate the higher male incidence of sarcopenia already acknowledged by the scientific community [21]. Basically, testosterone works as the fuel that powers muscle building.
The RF plot of important variables for sarcopenia algorithm showed SMI, HGS, FFM (arms and legs), sex, and BMI as the most relevant. Due to the predominance of skeletal muscle in the arms and legs, is not surprising that both lean soft tissues are key and are actually embedded in the SMI as muscle proxies in the consensus panel. For BMI, the role in relation to sarcopenia can be explained in both excess and deficit values. On the one hand, weight gain and obesity can speed up the onset and the progression of sarcopenia directly or indirectly. For example, in fat tissue of obesity phenotypes [22], the accumulation of pro-inflammatory macrophages and other immune cells, as well as the dysregulated production of various adipokines together with cytokines released by immune cells, create a prolonged local pro-inflammatory state [23,24]. In addition, overproduction and impaired lipid storage capacity is a feature of adipose tissue in obesity, which accumulates ectopically in skeletal muscle. These intramuscular lipids and their products can result in mitochondrial dysfunction and increased secretion of certain proinflammatory myokines with the potential to induce muscle dysfunction. On the other hand, reduced BMI may be equally troublesome as it would mean a reduced fat mass, that is believed to be an energy reserve in older people and helps them survive disease and chronic conditions [25]. It has also been hypothesized that individuals with higher fat mass may have higher protein intake, which is a protective factor against sarcopenia [26]. Therefore, maintaining a healthy weight is important for older adults to preserve muscle mass and strength.
In a further RF plot of significance, we found overlapping top-ranked association variables across the two population settings (i.e., SMI, HGS, FFM of legs and arms, and sex) showing how the clinical and population-based subsets were not much divergent. Therefore, an additional RF plot was built without splitting the two samples to get a selection of variables most associated with sarcopenia, and found that albumin, CRP, and folate ranked high. A further comparative logistic analysis found sex, folate, and vitamin D among the most closely associated, although the latter lacked significance. These findings are discussed below.
It is well-acknowledged that serum albumin concentration may be an indicator of individual nutritional status [27,28], with lower values indicating a decrease in protein reserve, stimulating catabolic processes that lead to muscle breakdown [29]. There is also a body of research indicating the antioxidant properties of albumin, showing that albumin is a specific modulator of cellular glutathione, one of the body's major antioxidants [30]. Oxidative damage may play a crucial role in skeletal muscle decline with aging. Furthermore, increased concentrations of free cortisol have been observed in hypoalbuminemic individuals, and this other biological pathway potentially stimulates muscle breakdown, especially in inactive people. Albumin has also been shown to activate the phosphatidyl-inositol 3-kinase pathway, thus mediating muscle breakdown.
For serum folate, our findings are in line with other studies revealing serum folate levels significantly correlated with reduced lower limb strength and grip in the elderly over 65, especially in women [31]. Findings from other research suggest that folate deficiency is associated with a decline in muscle strength, and that a reduced dietary micronutrient intake [32,33], including folate, has an important impact on muscle health, as indicated by decreased ability to generate strength and endurance as well as reduced physical activity. We therefore suggest that the effect of folate deficiency on measures of strength and physical performance involves biological mechanisms that could include specific folate activities in addition to the homocysteine pathway, such as neurotransmitter synthesis, myelination, DNA and protein synthesis, DNA methylation, and epigenetic regulation [34].
For vitamin D, epidemiological studies showed clinical correlations between vitamin D deficiency and sarcopenia, such that this lipoprotein is attracting more and more attention among the scientific community. In the general population, low serum vitamin D concentration has been significantly found to be associated with a higher prevalence of sarcopenia and loss of physical performance, such as walking speed [35,36]. Several meta-analyses and RCT studies have also demonstrated the positive effects of vitamin D supplementation, such as improved overall muscle strength (particularly of lower limb muscles) and decreased standing time [37][38][39].
Last, regarding the inflammatory pathway, it has long been hypothesized that higher levels of inflammatory markers play a role in functional decline in aging populations [40,41]. Cross-sectional and longitudinal studies consistently demonstrated associations between high levels of interleukins, especially interleukin-6 (IL-6) [42], or CRP and poor physical performance and disability. The causal pathway running from inflammation to disability is suggested to involve catabolic effects of inflammatory markers on muscles. To further substantiate our findings, there is evidence that in older men and women, high levels of CRP are associated with a 2-to 3-fold increased risk of losing more than 40% of muscle strength over 3 years [43].
When comparing sarcopenia to other studies, we can look at what has been found in other reports. For example, among elderly people aged ≥ 65 years admitted to daycare centres in Taiwan [44], calf circumference, Mini Nutritional Assessment, dementia, and BMI were factors associated with sarcopenia, diagnosed, however, using Asian Working Group for Sarcopenia (AWGS) criteria. In another report using data from elderly people in Brazil and the EWGSOP criteria for sarcopenia diagnosis [45], older age, cognitive impairment, lower income, smoking, and undernutrition and undernutrition risk were factors associated with sarcopenia. In a report that used machine learning like ours as a data analysis approach [46], the top risk factors in men were BMI, red blood cell count (RBC), blood urea nitrogen (BUN), vitamin D, ferritin, fibre intake (g/d), primary diastolic blood pressure, white blood cell count (WBC), fat intake (g/d), age, glutamic-pyruvic transaminase, niacin intake (mg/d), protein intake (g/d), fasting blood glucose, and water intake (g/d). Inevitably, the most important risk factors in women were BMI, water intake (g/d), WBC, RBC count, iron intake (mg/d), BUN, high-density lipoprotein, protein intake (g/d), fibre consumption (g/d), vitamin C (mg/d), parathyroid hormone, niacin intake (mg/d), carotene intake (µg/d), potassium intake (mg/d), calcium intake (mg/d), sodium intake (mg/d), retinol intake (µg/d), and age. The population setting, study setting, exposome, and diagnostic criteria used for diagnosis certainly create heterogeneity in the findings, and making it unfeasible to draw consistent conclusions.

Strengths and Limitations
The strengths of this study include the large sample size, the multiple anthropometric and clinico-metabolic variables collected and correlated with sarcopenia, the use of DXA as the gold standard for body composition and thus for reliability of findings, and the employment of robust statistical algorithms and evidence-based references. Further, double recruitment and data collection on two different populations lacks any previous research to compare to in Italy. However, some limitations should be acknowledged. The crosssectional nature precludes causal inference on outcomes, and although comprehensive, the database lacked a broader assessment of fluid markers. Moreover, by skimming the dataset for variables shared by the two population samples, a more comprehensive assessment of the biological markers panel was not allowed.

Conclusions
Albumin, CRP, vitamin D, and serum folate should not be overlooked when screening for sarcopenia in geriatric settings, particularly in the male population. Improving the health burden and quality of life of the aging population is urgently needed. Employing multidimensional methodology to model risk management pathways may provide a way to stratify the risk of sarcopenia in preventive medicine settings, and thus ease the identification of a deteriorating health status in the aged population. Informed Consent Statement: Informed consent was obtained from all subjects involved in the study. Written informed consent was obtained from the patients to publish this paper.

Data Availability Statement:
The data presented in this study are available on request from the corresponding author Fabio Castellana. The data are not publicly available due to privacy reasons.

Conflicts of Interest:
The authors declare no conflict of interest.