Applications of Machine Learning Algorithms in Geriatrics

Stancu, Adrian; Rosca, Cosmina-Mihaela; Iovanovici, Emilian Marian

doi:10.3390/app15158699

Open AccessReview

Applications of Machine Learning Algorithms in Geriatrics

by

Adrian Stancu

^1,*

,

Cosmina-Mihaela Rosca

²

and

Emilian Marian Iovanovici

²

¹

Department of Business Administration, Faculty of Economic Sciences, Petroleum-Gas University of Ploiesti, 39 Bucharest Avenue, 100680 Ploiesti, Romania

²

Department of Automatic Control, Computers, and Electronics, Faculty of Mechanical and Electrical Engineering, Petroleum-Gas University of Ploiesti, 39 Bucharest Avenue, 100680 Ploiesti, Romania

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(15), 8699; https://doi.org/10.3390/app15158699

Submission received: 7 July 2025 / Revised: 31 July 2025 / Accepted: 4 August 2025 / Published: 6 August 2025

(This article belongs to the Special Issue Diet, Nutrition and Human Health)

Download

Browse Figures

Versions Notes

Abstract

The increase in the elderly population globally reflects a change in the population’s mindset regarding preventive health measures and necessitates a rethinking of healthcare strategies. The integration of machine learning (ML)-type algorithms in geriatrics represents a direction for optimizing prevention, diagnosis, prediction, monitoring, and treatment. This paper presents a systematic review of the scientific literature published between 1 January 2020 and 31 May 2025. The paper is based on the applicability of ML techniques in the field of geriatrics. The study is conducted using the Web of Science database for a detailed discussion. The most studied algorithms in research articles are Random Forest, Extreme Gradient Boosting, and support vector machines. They are preferred due to their performance in processing incomplete clinical data. The performance metrics reported in the analyzed papers include the accuracy, sensitivity, F1-score, and Area under the Receiver Operating Characteristic Curve. Nine search categories are investigated through four databases: WOS, PubMed, Scopus, and IEEE. A comparative analysis shows that the field of geriatrics, through an ML approach in the context of elderly nutrition, is insufficiently explored, as evidenced by the 61 articles analyzed from the four databases. The analysis highlights gaps regarding the explainability of the models used, the transparency of cross-sectional datasets, and the validity of the data in real clinical contexts. The paper highlights the potential of ML models in transforming geriatrics within the context of personalized predictive care and outlines a series of future research directions, recommending the development of standardized databases, the integration of algorithmic explanations, the promotion of interdisciplinary collaborations, and the implementation of ethical norms of artificial intelligence in geriatric medical practice.

Keywords:

predictive models; elderly care; artificial intelligence; bibliometric analysis; diet; nutrition; algorithm; classification; prediction; performance; PubMed; WOS; Scopus; IEEE

1. Introduction

Modernization and technological advancements over the past decades have led to an improvement in the quality of life, increasing the aging population [1]. For these reasons, life expectancy is much higher than it was a few hundred years ago, when humans were exposed to a series of diseases for which there was no treatment at that time. As life expectancy increased, people became more concerned with the quality of their own lives, improving it through healthy eating, sports, alleviating daily activities, and consuming medications that lead to longevity [2,3]. In this context, the issue arises of studying modern techniques associated with artificial intelligence (AI) to explore the possibility of increasing longevity in a context where the population is becoming increasingly healthier. Thus, the emphasis is placed on longevity coupled with both mental and physical health. To achieve this goal, prevention and early diagnosis are key elements for the elderly.

In the scientific literature, geriatrics is a medical branch, while gerontology is a socio-biological branch. The two concepts are often confused. Geriatrics is a medical discipline focused on prevention, diagnosis, personalized treatment, and pain relief caused by diseases specific to the elderly. Gerontology is an interdisciplinary science of aging. In this paper, only works that fall within the field of geriatrics will be analyzed.

This study is motivated by the growing interest in the use of AI. Geriatrics involves managing patients with multiple comorbidities and personalized needs. This approach is assisted by the integration of modern technologies for optimizing medical practice. This study provides a synthesis of recent literature (1 January 2020–31 May 2025) regarding the exploration of ML algorithms in geriatrics.

The paper addresses several categories of the public, including researchers and PhD students in the field of AI and medical sciences, university faculty, medical staff interested in integrating modern technologies into practice, and the industrial sector in the field of medical technology.

The study aims to answer the following research questions (RQs):

RQ1: In which subfields of geriatrics is ML applied?
RQ2: What are the purposes of using algorithms in the context of geriatrics?
RQ3: What metrics are used to evaluate the performance of these models, and how do they relate to the practical integration of applications?

The structure of the paper is based on an introduction that contextualizes the issue of aging and the implications of AI in geriatrics. Section 2 presents the methodology, which describes the process of selecting articles from the specialized literature. Section 3 is dedicated to the ML models used in the research articles extracted from the Web of Science (WOS) database. Section 4 comprised the bibliometric analysis of the papers from the literature. Subsequently, a discussion is conducted to interpret the results in Section 5. Section 6 presents conclusions that synthesize the main ideas and outline future research directions.

2. Methodology

This study represents a systematic review of the literature, with the main aim of analyzing and synthesizing the applications of ML algorithms in the field of geriatrics. The scientific review is based on a research process structured in several stages. Initially, the ML models used in the diagnosis, monitoring, prevention, and treatment of geriatric patients are identified. Initially, the authors identify the ML algorithms used in the field of geriatrics and analyze them in their applications. The analysis targets the performance indicators identified at the level of each study. Subsequently, a search is conducted for all selected scientific articles from the WOS database, using a series of domain-specific keywords. The period analyzed spans from 1 January 2020 to 31 May 2025. In this way, the most recent contributions in the field are highlighted. Both scientific contributions and reviews were included in the analysis, as they provide an overview of the applications used in the geriatric field that are integrated with ML components. The inclusion criteria for the study were as follows: (1) the application of ML techniques for classification or prediction in the geriatric context, (2) the identification of the performance of the models used, (3) detailing the datasets used, and (4) the clinical characteristics analyzed. Theoretical works without practical involvement, as well as articles targeting age categories other than those specifically geriatric, were excluded. The selected works were analyzed both qualitatively, from the authors’ perspective, through performance indicators, and quantitatively. For the qualitative analysis, the types of algorithms used, the clinical applicability domain, the objective pursued, and the datasets utilized were identified, all to create a technical synthesis regarding the performance indicators. For the qualitative analysis, the metrics reported in research articles were compared, with a focus on accuracy, sensitivity, and F1-score.

The quantitative analysis is ensured through detailed searching in the WOS, PubMed, Scopus, and Institute of Electrical and Electronics Engineers (IEEE) Explore databases, according to the schematically structured logic represented in Figure 1. This diagram progressively illustrates the manner in which the Boolean expressions used in thematic searches were constructed. The four directions included the following:

Generic terms represented by “geriatr*”, which will extract all research with derivatives of this expression. These terms aim to identify all studies in the literature on this topic within the 1 January 2020–31 May 2025 range. The search aims to identify the total number of papers, which will later be compared to the number specifically targeting ML components. This initial stage builds a general corpus related to GERIATRICS in the current scientific literature.
Subsequently, thematic filtering is applied to subdomains. The filters are DIET, NUTRITION, ELDERLY, and ML. These filters capture studies that address the interaction between nutrition and geriatric health, the elderly, and geriatrics, as well as ML and geriatrics.
The final search combines a common Boolean expression, the intersection of these components: geriatrics, diet, nutrition, elderly, and ML. The syntax is used to identify articles that trace the intersection of these four research development directions. The methodology considers a comprehensive approach by gradually combining the terms. In this way, comprehensive coverage of the literature is ensured. The other, the second role, is thematic focus through filtering based on technological terms, which allows for the extraction of works that explicitly address the application of ML in geriatrics, eliminating generalist literature. This query strategy builds an extraction of scientific data based on the distribution of ML algorithms in the subdomains of geriatrics.

The process of building the search equation was carried out using a progressive approach, employing the Boolean operators “AND” and “OR”, and the wildcard symbol “*”. In the first stage, generic terms that define the field of geriatrics were selected. In the second stage, specific terms were added to refine the search in the thematic areas of nutrition, elderly care, and ML algorithms. In the final stage, the resulting expressions were combined into a unified logical query that extracts the common articles of these domains.

The quality assessment of the studies included in the analysis was conducted using the authors’ expertise. Each article was evaluated using the dataset characteristics, performance metrics, and clinical applicability. The studies were scored on a grid with scores from 0 to 2 for each criterion, resulting in a total score that serves as a guide in the comparative qualitative analysis.

The bibliometric analysis compares these directions to highlight the following aspects:

The distribution of publication years illustrates academic interest in a general versus technological context;
The types of publications (articles, reviews, and proceedings) demonstrate the predominant nature of the research (empirical or theoretical);
WOS categories and research areas identify the interdisciplinarity of the field;
The countries and institutions involved identify regions with intense activity in geriatric research;
Scientific publishers and open-access reflect the capacity for the broad dissemination of knowledge.

The overlap of the results from the two searches highlights the differences between traditional and modern approaches in geriatrics. The first query should reflect a mature, well-developed research area. The second highlights an emerging niche focused on the applicability of ML in the nutritional assessment of elderly individuals. These comparisons will lead to a series of recommendations in the discussion section regarding future research directions. Thus, this paper is addressed to PhD students, researchers, university faculty, and the industrial sector that wishes to invest in technological development.

In this paper, the methodology focuses on the selection of articles from the WOS database. This is recognized for its multidisciplinary nature, which filters publications using specific criteria for bibliometric analysis. Additionally, comparative bibliometric analyses were added using complementary databases such as PubMed, which specializes in biomedical literature; Scopus, which covers technical fields; and IEEE, which is associated with specialized articles. For each database, all the methodological searches presented in Figure 1 were conducted.

3. ML Models Used in the Diagnosis and Monitoring of Geriatric Patients

ML algorithms are being examined in a multitude of fields, with them being particularly valuable in the medical field due to the integration of solutions for diagnosing, monitoring, and forecasting the progression of diseases in elderly patients. In the field of geriatrics, clinical complexity is very high due to multiple comorbidities and the need for personalized interventions. For these reasons, technologies that utilize ML algorithms have been developed in the field of geriatrics through predictive and classificatory tools based on clinical, sensory, or imaging data. This paper conducts an analysis regarding the degree of integration of ML models used in the specialized literature for tasks aimed at classification or prediction in the context of geriatrics. The syntheses conducted on the selected studies identify the main categories of models used, their performances measured by specific metrics, and analyze the datasets utilized.

The diagram in Figure 2 presents the systematic structure of the geriatric subdomains extracted from the analyzed works, highlighting the interdisciplinary relationships between the domains. These are addressed in the recent literature (2020–2025) from the perspective of ML algorithms.

These subfields of geriatrics are studied through research that presents contributions at the level of prediction or classification, with the main goal being the technological advancement of the field as a whole.

3.1. The Most Studied ML Models in Recent Literature

The clinical prediction models used for ML algorithms in the geriatric field have age, sex, comorbidities, and laboratory values, as well as preoperative and postoperative medications as input variables [4]. Walking speed is an indicator that classifies the elderly using ML algorithms. This characteristic suggests whether the patient belongs to a certain group, namely, young adults, healthy older adults, or geriatric patients without cognitive deficiencies [5]. The identification of biochemical, hematological, and immunological parameters in the blood provides an overview regarding the detection of malnutrition [6,7]. Investigating these indicators provides an overview of the health status of geriatric versus non-geriatric patients [8]. In the category of these investigations, the dosage of vitamins in relation to the uniqueness of the patient is also included [9]. These investigations can be conducted using ML algorithms that personalize treatment based on the uniqueness of the patient [10].

Choudhury et al. [11] note the Support Vector Machine (SVM) algorithm was the most studied, followed by Deep Learning (DL) algorithms, Decision Tree (DT), and other models. It is important to note the emphasis placed on SVMs in this paper, which is recorded as the most studied in the geriatric field. In the paper by Chu and Kurup [12], large volumes of data, referred to as Big Data, are analyzed in relation to ML algorithms for the purpose of transforming geriatric anesthesiology tailored to the elderly population. The study was conducted on 797 patients and aimed to predict the risk of postoperative delirium. The model examined which method achieved the best accuracy, and it was Random Forest (RF) [13]. Benovic et al. [14] also investigated the prediction of postoperative delirium and identified SVMs as having the best results.

Gulcicek and Seyahi [15] compare the progression of chronic kidney disease and the associated complications between geriatric patients aged over 65 and non-geriatric patients aged under 65. In this study [15], statistical models such as the analysis of variance (ANOVA) were used to compare the study groups. These groupings can also be performed using ML algorithms specialized in unsupervised classification. Chronic kidney diseases are investigated by Heybeli et al. [16] for elderly patients using geriatric assessment parameters. These investigations can be assisted by AI tools, even in kidney transplantation, which is studied by Elihimas Júnior et al. [17] through a comparative analysis between elderly patients over 60 years old versus younger patients aged 18 to 59 years. In this paper, a predictive model for post-transplant renal function is used, achieving an accuracy of 69.44%, which suggests the need for considerable improvements. Moreover, in the field of surgery, the degree of necessity for surgical intervention is predicted in patients over the age of 75. Thus, the emergency medical triage for geriatric patients with trauma is performed using the Extreme Gradient Boosting (XGBoost) algorithm, which achieved a prediction accuracy of 90.3%. Non-invasive medical diagnosis combines pulse measurements and frequency domain analysis to detect changes in the pulse wave [18,19].

Using the statistical method ANOVA together with the U and Mann–Whitney tests, the concentrations of vitamin D25-OH in outpatient and hospitalized geriatric patients are compared [20]. The experiment does not aim to integrate ML methods, but the deficiency of vitamin D in the elderly could also be studied using them in future research. In the category of image processing for the purpose of unlabeled classification, using clustering algorithms, the work [21] was developed. This used the K-means algorithm for medical diagnosis using tongue images in various geriatric diseases. Lech et al. [22] evaluate vitamin D status in geriatrics through multivariable logistic regression (LR), achieving an area under the curve (AUC) of 62.5%. The same tests are performed to identify procalcitonin (PCT) levels in geriatric patients with a preliminary diagnosis of sepsis upon admission to intensive care [23]. The level of vitamin D3 is investigated by Zheng et al. [24], and the XGBoost algorithm is reported to have the best results in variable selection and assessing its importance on the health of the elderly.

One of the features of AI tools aims to assist medical staff in making the best decisions. Moghadam et al. [25] evaluate the impact of AI tools in geriatric medical assistance for the analysis of patients’ chronic diseases. The authors note RF as the most used for this purpose.

Chu et al. [26] predict the risk of falls in hospitalized elderly patients by combining data from electronic medical records with the geriatric assessment. In this paper, several models were analyzed, but XGBoost had the best performance, reporting an accuracy of 73.2%, a sensitivity of 91%, and a specificity of 27%. Tian et al. [27] develop ML models to predict the length of hospital stay for geriatric patients with hip fractures. Additionally, the paper aims to identify associated risk factors. Among the inspected models, DT achieved an accuracy of 92.4% and an AUC of 98.8%. Nahian et al. [28] developed a fall detection system for the elderly using an accelerometer. The paper investigates time series, and the data are trained by specific ML algorithms. The best results are reported by RF, which achieved an accuracy of 99%, as well as by LR, which also achieved 99%. Furthermore, Rosca and Stancu [29] designed a wearable bracelet that monitors the health of elderly people. In the case of hip fractures in geriatric patients, Xing et al. [30] developed a prediction model regarding mortality, which was investigated using the RF algorithm.

In the paper by Akbari et al. [31], a model for classifying the level of frailty in the elderly is developed. Among the inspected algorithms, the Support Vector Classifier (SVC) and Multilayer Perceptron achieved the best results, with a median of 95%. Moreover, on the same subject, Amjad et al. [32] analyze various frailties of the elderly using portable sensors with a focus on gait analysis. Among the models examined, the k-Nearest Neighbors (KNNs) model achieved the best results, with an accuracy of 99.4%. In the paper by Ramachandran and Karuppiah [33], the implications of medical assistance through IoT devices with wearable sensors are presented. The paper analyzes data from these devices using threshold-based algorithms and identifies the SVM and KNN algorithms as having a sensitivity between 90% and 100%. The best values in terms of accuracy are obtained for the Convolutional Neural Network (CNN) and KNN. A specificity with values between 83 and 100% was reported for the RF and SVM algorithms. Geriatric health in the community can be investigated based on factors associated with pre-frailty in middle-aged adults, using ML algorithms, for which Sajeev et al. [34] achieved a maximum AUC of 81.7% and an accuracy of up to 78.7%. Sasu et al. [35] monitor geriatric health through wearable sensors, and the data is analyzed using a multitude of algorithms, achieving an accuracy of 97.5% for RF in the best scenario.

A special class of applications that use ML models is those in the neurological field [36]. Tunthanathip et al. [37] predict the presence of intracranial hematoma in elderly patients with traumatic brain injury. The best algorithm was Naïve Bayes (NB), which achieved an Area under the Receiver Operating Characteristic Curve (AUC-ROC) of 84.6%, an F1-score of 92%, and a sensitivity of 95%. Xu et al. [38] identify operative factors in the diagnosis of hepatocellular carcinoma undergoing hepatectomy. Among the analyzed algorithms, classification algorithms are the most suitable for predicting patient survival rates. The impact of intergenerational relationships, demographic factors, lifestyle, and health on depression among Chinese elderly, with an emphasis on the link between interactions with children and depressive symptoms, is investigated by Sheng et al. [39]. The correlations are studied using the RF algorithm, which achieved an accuracy of 98%, indicating the ability to predict such events in order to combat them.

Moreover, in the field of estimating the number of hospital days, this time for patients with fractures, it is examined through a multitude of ML algorithms. The Wide and DL model proposed by Lai et al. [40] achieved the best performance with an AUC-ROC of 84%, an accuracy of 79%, a precision of 73%, and a recall of 62%. This model is followed by the Light Gradient-Boosting Machine (LightGBM) and Artificial Neural Network (ANN). Sudha et al. [41] developed an ML model for identifying the hospital stay duration. The SVM model achieved the best performance, with an accuracy of 94.53%, an F1-score of 82.10%, a sensitivity of 72.55%, and a specificity of 76.47%. Früh et al. [42] conducted a retrospective study to predict the length of hospital stay after single-segment lumbar spinal fusion. Several algorithms were investigated in this research, with the best performance achieved by Categorial Boosting (CatBoost).

Another issue for patients who belong to the geriatric population concerns gastroenterology. The paper by Sudha et al. [41] assists medical staff in evaluating the tolerability of the probiotic Bacillus coagulans in alleviating indigestion. This issue can also be modeled using various ML algorithms [43]. Wu et al. [44] focus on oncology for the classification of geriatric breast cancer. The algorithm with the best results was OneClass Detection, for which the AUC value was 71%. The prediction of survival in elderly patients with squamous esophageal cancer based on oxidative stress biomarkers is investigated by Xie et al. [45], where, again, RF achieved the best results with an accuracy of 88.5%, a recall of 94.3%, and an F1-score of 91.3%. From the perspective of cancers, they are also investigated using ML algorithms, taking into account various combinations of patient characteristics [46].

In hematology, classification algorithms can identify events associated with platelets or hemorrhagic events, as presented by Nahornyj et al. [47]. The anemia of hospitalized elderly patients is investigated by Soraci et al. [48], achieving a detection rate of 66.7%. Regarding the evaluation of the impact of prediction models for anticoagulants in geriatric patients with cranio-cerebral trauma, Fujiwara et al. [49] investigated this using the RF algorithm, which achieved an accuracy of 95% in predicting the type of anticoagulant.

Another direction that can also be explored using ML models focuses on gerontology, with an emphasis on the quality of life for the elderly. For this purpose, Mirzaeian et al. [50] identified the RF algorithm as having an exceptional classification capability, achieving an accuracy of 94%, an F1-score of 94%, a sensitivity of 95%, and a specificity of 94%. In the paper by Şenol et al. [51], geriatric patients are investigated in the detection of COVID-19 using predictive markers of mortality. The Receiver Operating Characteristic (ROC) analysis in this case achieved a sensitivity of 57.1% and a specificity of 70.5%. Zadgaonkar et al. [52] investigate the early detection of dementia using lifestyle-associated parameters for non-invasive screening. The RF model achieved the best results with an accuracy of 76%, an F1-score of 86%, and a recall of 100%.

Most studies in the field of geriatrics investigate multiple ML models, but the best results are reported by the RF algorithm [53,54,55,56], a fact highlighted in Table 1, where a comparative analysis of the models’ performances is conducted based on the specifics of the applications.

Analyzing the works in the literature, it is found that the RF algorithm is the most studied in modeling geriatric problems through AI components. In Table 1, one can observe the large volume of works disseminated in the field. The authors’ preference for this model stems from its ability to handle incomplete data that contains noise or does not conform to a standard template. This algorithm is preferred for tasks such as identifying frailty, estimating postoperative mortality, detecting falls, or making clinical decisions due to its ease of implementation, especially since many modern tools include it in their list of automatic training algorithms, as well as its ability to train with a small volume of data. The RF model is followed by XGBoost, which is predominantly used in clinical predictions that require a multivariable approach. Additionally, this model can handle situations where values are missing from the training dataset. Another model that has been extensively studied is the KNN and NB models, which are traditional models for classification using similarities in clinical features. Table 1 highlights the comparison between the models studied with a geriatric focus. The choice of an algorithm depends on the nature of the available data, the objective of the study, the extent to which the model can be explained to the medical team, and the quality of the data used in training. In Table 1, it can be observed that RF and XGBoost stand out considerably from the other models investigated. The two models are predominantly used in prediction problems.

3.2. Classification of the Studied Models Based on Objectives

ML algorithms are classified into prediction algorithms and classification algorithms. In Table 2, the classification of works in the field of geriatrics was carried out based on the objective of the ML algorithm. Thus, nine papers have classification as their main objective, while seven papers have prediction as their main objective. Prediction works aim to anticipate various scenarios, while classification algorithms label a certain scenario as belonging to a specific group. The fundamental difference between classification algorithms and clustering algorithms lies in the fact that, in the case of classification, there are labels for the training dataset, while, in the latter case, clustering elements are actually used. However, most of the works are disseminated on the two mentioned objectives.

Classification models are used in the medical field to associate a label with a patient’s diagnosis. A concrete example is the presence or absence of a bacillus or exceeding a safety threshold of a certain vitamin dose in the blood. These classification models are used in differential diagnosis or screening in the hospital or community setting. Table 3 outlines that RF is the most frequently used classification model in the specialized literature. Researchers have analyzed multiple classification models in relation to the geriatric field, but only some of these models are effective in classification. From Table 3, it can be noted that the RF and SVM models are the most used.

Regarding prediction models, they estimate a future event, such as postoperative mortality, length of stay, recurrence of falls, or progression of a specific disease. These models are used in the medical field for making anticipated clinical decisions. They are also used for the allocation of medical resources. These aspects are summarized in Table 4, where it is observed that XGBoost is the most used in prediction problems, such as osteoporosis prediction, cognitive deterioration, or postoperative delirium. Additionally, RF and LR are examined in predictions regarding postoperative mortality.

Therefore, classification models and prediction models are researched in the field of geriatric medicine with different objectives. Classification models are used to diagnose and label the clinical conditions of patients. Prediction models are used to anticipate, forecast, predict, and prevent the progression of the patient’s condition, and plan personalized geriatric treatment. In both cases, RF and XGBoost are considered the most popular models.

3.3. Metrics for Evaluating the Performance of ML Models

The comparative evaluation of algorithms is carried out through evaluation metrics. Depending on the type of algorithm, these metrics are distinctive. The most studied metrics are accuracy, sensitivity (recall), specificity (precision), F1-score, and AUC-ROC.

Accuracy is the simplest metric for evaluating a classification model. This expresses the total number of correct predictions. Regarding recall, it indicates the model’s ability to correctly identify positive cases. In the medical field, false negative rates have serious consequences, which is why the authors of this paper believe that sensitivity should be a mandatory indicator regarding the quality of the developed model. Specificity reflects the model’s ability to correctly identify negative cases. The value of this indicator suggests the need for adjustments in the classification model. The F1-score indicator is the harmonic mean between precision and recall. AUC-ROC is one of the most important metrics in evaluating classification models, for which an AUC value close to 1 indicates a model considered to be perfect.

Based on the information presented in the geriatric literature, Table 5 shows the AUC graph. This is one of the most important metrics for the comparative evaluation of a classification model’s performance. This is due to the indicator’s ability to show whether the model correctly separates positive classes from negative ones. In the field of geriatrics, AUC evaluates the effectiveness of predicting adverse events, such as mortality or postoperative survival. A considerable variation in AUC values can be observed from the presented data in Table 5. This variation reflects the diversity of problems addressed, ranging from binary classification to complex predictions. The best results have a value close to 0.98, indicating that the model is performing well.

Depending on the complexity of the presented application, most results vary between 0.8 and 0.9, which can indicate very good results, depending on the context.

Regarding accuracy, Table 6 presents the results obtained from the contributions synthesized from the literature.

Accuracy shows the proportion of predictions made correctly by the model. This metric can be considered for evaluating the quality of the model only when the dataset is balanced. From these considerations, the quality of the dataset directly influences the quality of the model. In geriatrics, accuracy should be treated as a primary metric when there are no disparities between positive and negative classes. In Table 6, the data show a wide range of values, from 57% to 99.4%, reflecting significant differences between the types of issues addressed. The best results were obtained around the value of 99.4%, indicating extremely precise models. Some studies have reported more modest values, around 69.44% or 70%. These suggest possible issues related to data variability, task complexity, dataset quality, or model integration difficulty. Most of the values are concentrated between 80 and 90%, with the most frequent being 81.1%. This is considered a decent value for problems with a high degree of classification difficulty.

Recall measures the model’s ability to correctly detect positive cases. This indicator should be a benchmark in the medical context, where a negative diagnosis, that is, a false or incorrect one, can have serious consequences. For example, incorrectly diagnosing chronic kidney failure can have devastating consequences for the patient. In geriatrics, recall should be a priority to avoid missing diagnoses for elderly patients who are at increased risk of adverse events. In Table 7, the recall-associated data identified in various studies are represented.

The values range from 68% to 97%. These differences reflect the wide range of problem types addressed, from diagnosis to prediction. The best results were close to the value of 97%, for which the models detected a large portion of the positive cases. The 68% values suggest issues related to feature selection or decision threshold adjustment. Most values are concentrated between 80% and 90%. The most popular value is 91%, a value reported by most studies.

Analyzing these graphs, it is observed that the ML models used in geriatrics have achieved good performance, with percentages over 80%. Different subfields of geriatrics, such as oncology, nephrology, and traumatology, require different approaches, which somewhat explains the significant variation in metrics. These graphs illustrate the overall performance of ML algorithms in geriatrics. They also highlight the general trends in research in the field.

4. Bibliometric Analysis

4.1. Analysis of Publication-Based Metrics

From the analysis of the distribution of publications by year, an annual increase is noted, culminating in 2024 with 162 publications in the field of geriatrics that integrate ML techniques. From Figure 3, an upward trend is observed, this increase being driven by the digitalization of medical services with the help of AI technologies in elderly care.

The analysis of the type of publication is presented in Figure 4 and shows a dominance of original contributions, given that approximately 86.69% of all papers are original articles. These values show the researchers’ involvement in discovering new techniques that provide results through the application of clinical studies within ML algorithms. Review articles accounted for 48 out of the total 819 articles, which represents a small number of syntheses highlighting future research directions. Their presence indicates a maturation of the field of geriatrics in the context of ML. The proceedings and conference abstracts indicate a low number, suggesting a greater focus on peer-reviewed publications than on conference presentations.

In the WOS categories, there are 174 articles in Geriatrics gerontology, 91 in General internal medicine, 86 in Gerontology, and 55 in Medical informatics, with the rest distributed in other categories, as shown in Figure 5.

The research areas overlap with the WOS categories, as can be seen in Figure 6, where it can be observed that 177 articles were published in Geriatrics gerontology, 102 in General internal medicine, 55 in Medical informatics, 50 in Computer science, and 46 in Engineering. These values show that the two fields, geriatrics and gerontology, represent a quarter of the total publications, demonstrating the relevance of the central theme. The publications intersect between medicine and computer science due to the way geriatrics is approached through the use of ML algorithms.

The geographical distribution shows that at the top of the countries are the USA with 177 articles, China with 142, Japan with 66, and Germany with 58. The geographical distribution by country of the 932 articles is presented in Figure 7. The fact that 40% of the total publications come from the USA and China indicates the major resources invested in medical research through a technological approach. The institutions from which these articles originate include the University of California System with 24 articles, Harvard University with 22, Harvard University Medical Affiliates with 18 articles, and the US Department of Veterans Affairs with 18 articles, as well as other research institutes and universities. Figure 8 shows the distribution of the number of articles in relation to the institutions, highlighting the interest of prestigious universities in the studied field.

The publishers that disseminated the articles are Springer Nature with 162, Elsevier with 130, MDPI with 59, and Wiley with 55. Springer Nature and Elsevier have published over half of the articles, which demonstrates their position at the top of the academic publishers (Figure 9).

In Figure 10, it can be observed that most of the papers are freely accessible, which promotes the dissemination of knowledge in the medical and research communities. The predominance of all open-access (550 papers) and gold model papers, with 379 publications, suggests a preference for direct publication in journals, while the green model is represented with 333 articles, indicating researchers’ concern for depositing articles in institutional repositories or public archives.

For the 394 results from WOS, the co-occurrence map of keywords was generated using the VosViewer 1.6.20 tool. This map shows the relationships between key terms in the field of geriatrics and ML based on a repetition of 30 times, generating a grouping of three distinct clusters (Figure 11).

The red cluster is associated with diagnosis and prediction using AI technologies. The blue cluster is associated with terms related to the management of elderly patients, while the green cluster corresponds to their health, frailty, dementia, and chronic conditions. The red cluster is the central cluster of the map and is based on advanced technologies for diagnosis and prediction, validated by results in the context of geriatrics. Terms such as dementia, older adults, and aging indicate an interest in developing systems that can anticipate early common conditions in elderly individuals. The important correlations in this cluster are ML, artificial intelligence, and validation. The blue cluster addresses the practical aspects of elderly patient care, and this cluster includes surgeries, risk management, mortality, and therapy outcomes. Terms such as hip fracture and surgery indicate the attention given to medical issues frequently encountered in elderly individuals. The important correlations in this cluster are elderly patients, mortality, and outcomes, highlighting the importance of monitoring the results of surgical therapies. The green cluster explores the overall health aspects of elderly individuals, including frailty, chronic conditions such as chronic kidney disease, and depression. The important correlations are frailty, chronic kidney disease, depression, and prevalence, suggesting that studies focus on the frequency of these diseases in the elderly. At the center of the map is the term ML, which is a central element connected with the other terms of the three clusters. This demonstrates that the notion of ML is a dominant concept and acts as a key tool in geriatric research. The map demonstrates the current research priorities, which include automated diagnosis, risk management, the impact of chronic conditions, and surgical outcomes. This information is useful for researchers in identifying unexplored areas, so they can prioritize future research directions in interdisciplinary approaches that will be disseminated in upcoming papers.

4.2. Comparative Analysis in WOS, PubMed, IEEE, and Scopus

To expand the research, a comparative analysis was conducted between the PubMed, IEEE, Scopus, and WOS databases. The nine search categories separate the field of geriatrics from the one where the field is approached from an ML perspective. Table 8 presents the comparative analysis graph across all nine search types.

The first search included a simple search for the “geriatr*” query, which yielded 116,692 articles in PubMed, 51,911 in Scopus, 31,951 in WOS, and 1381 in IEEE. Thus, PubMed has the highest number of articles associated with the field of geriatrics. This can be explained by the nature of the database, which primarily includes articles in the medical field. At the opposite end, IEEE has a relatively small number of articles because it is focused on the technological field.

The “geriatr* AND diet” query yielded 2967 articles in PubMed, 1562 in Scopus, 403 in WOS, and 5 in IEEE. Thus, PubMed continues to offer the most results, demonstrating the variety in the medical field, unlike IEEE, which is not sufficiently well represented in the technical literature.

Similarly, the “geriatr* AND nutrition” query offered the highest values in PubMed, with 10,322 articles, followed by 3346 in Scopus, 1501 in WOS, and 16 in IEEE.

Surprisingly, the “geriatr* AND elderl*” query yielded 26,806 results in Scopus, 17,397 in PubMed, 9469 in WOS, and only 427 in IEEE, and generates the largest number of results in this category. The surprising factor stems from the large volume of results associated with this database. The search strategy of including “geriatr*” and “elderl*” in the same query was to narrow the search coverage to studies that use the specific medical terminology “geriatr*” and the more general phrase “elderl*”, focusing on those that explicitly target older people.

The “diet AND geriatr* AND machine learning” query yielded 20 articles in PubMed, 15 in WOS, 9 in Scopus, and 1 in IEEE. The category combines three fields, which explains the small number of articles. Although PubMed once again yields a large number of results, it is considerably smaller and suggests the need for intensified research.

The “nutrition AND geriatr* AND machine learning” query yielded 113 results in PubMed, 34 in Scopus, 34 in WOS, and 6 in IEEE. PubMed dominates this field due to the large number of articles related to elderly nutrition addressed with ML. However, the number of articles is very small.

The “elderl* AND geriatr* AND machine learning” query generated 327 results in Scopus, 234 articles in PubMed, 282 in WOS, and 65 in IEEE. And, this time, Scopus surprises with a larger number of results compared to other databases. The “geriatr* AND machine learning” query yielded 1800 results in PubMed, 788 in WOS, 764 in Scopus, and 234 in IEEE. Thus, PubMed has the most articles related to geriatrics and ML, followed by WOS with an almost equal number. This time, Scopus and IEEE have lower results.

The “geriatr* AND elderl* AND nutrition AND machine learning” query yielded 29 studies in PubMed, 18 in Scopus, 12 in WOS, and 2 in IEEE. This category is limited, generating only a few dozen papers. PubMed has the most results, but the number is very small.

Following a systematic search process conducted across four major databases (PubMed, Scopus, WOS, and IEEE), a total of 60 articles were initially identified, using the search terms geriatrics, elderly, nutrition, and machine learning, limited to the period 2020–2025. After removing duplicates (n = 10), 50 unique articles remained and were subjected to the title and abstract screening process (Figure 12).

Of these, 15 articles were excluded because they did not meet the inclusion criteria, particularly due to the lack of an application of machine-learning techniques or explicit correlation with the field of nutrition in the elderly population. The remaining 35 articles were evaluated, and 10 of them were excluded because they did not provide sufficient data, had a poor design, had insufficiently described datasets, or did not directly address the relationship between nutrition, the elderly, and machine learning.

Ultimately, a total of 25 studies were included in the qualitative analysis. This approach represents the basis for the systematic evaluation of how ML methods are used to investigate nutrition and health-related aspects among the geriatric population.

4.3. Overview of ML Studies in Geriatrics

Table 9 presents a summary of scientific studies that use ML algorithms in the field of geriatrics. For each study, information was extracted regarding the dataset size and type, the validation method applied, the performance metrics obtained during the training and testing phases, and comments made on the possibility of overfitting. The synthesis provides an overview of how learning models are developed and validated in the context of an aging population.

The authors note that this comparison is difficult because the datasets are heterogeneous. Essentially, these include different data sources, with major variations in both volume and content. The clinical goals are also distinct because some studies aim to predict mortality, while others focus on malnutrition, sarcopenia, frailty, or depression. The fact that these studies target different predictions implies differences in the input variables and target labels. The lack of metric standardization also makes a direct comparison between studies impossible [74]. Not all studies report the same metrics, and some completely omit the intermediate results. Under these conditions, Table 9 provides a comparative overview, but interpreting the differences between the studies is not straightforward because the methodological and clinical context of each article varies.

5. Discussions

In the discussion section, the results obtained from the systematic analysis, the practical and theoretical implications of the obtained values, future research directions, a series of authors’ observations regarding the results from the literature review, and the target audience of the research will be presented.

In this paper, articles published between 1 January 2020 and 31 May 2025 were analyzed to highlight the level of research on ML algorithms in both theoretical and applied descriptive areas. The paper emphasizes a series of performances obtained in scientific contribution articles, based on the values of the performance indicators of ML algorithms.

The comparative evaluation of existing research results included documentation on the WOS, PubMed, Scopus, and IEEE Explore databases. A first search, for the “geriatr*” query, generated 116,692 papers in PubMed. Compared to this value, Scopus yielded 51,911 results, WOS 31,951 results, and IEEE 1381 results. This high value associated with the PubMed database is explained by the fact that this database predominantly contains publications from the medical field. The second search based on the “geriatr* AND diet” query yielded 2967 articles in PubMed, 1562 in Scopus, 403 in WOS, and 5 in IEEE. The third search criterion was conducted on the “geriatr* AND nutrition” query. In PubMed, this search yielded 10,332 results, in Scopus 3346, in WOS 1501, and in IEEE 16. High values are identified in PubMed Scopus, and WOS, with IEEE having the lowest values because they are associated with technical publications and include sensors, IoT technology, area-specific wearable device technologies, and technical applications that integrate ML. The fourth search on the “geriatr* AND elderl*” query yielded the following results: Scopus generated 26,806 results, PubMed 17,397, WOS 9469, and IEEE 427. These results have high values, which suggests increased interest from researchers in this field.

Furthermore, the following searches add specific ML constraints. Thus, the fifth search based on the “diet AND geriatr* AND machine learning” query provides 20 results in PubMed, 15 in WOS, 9 in Scopus, and 1 in IEEE. The sixth search on the “nutrition AND geriatr* AND machine learning” query generated 113 results in PubMed, 34 in Scopus, 34 in WOS, and 6 in IEEE. The seventh search on the “elderl* AND geriatr* AND machine learning” query yielded 327 papers in Scopus, 234 articles in PubMed, 282 in WOS, and 65 in IEEE. The eighth search on the “geriatr* AND machine learning” query showed 1800 results in PubMed, 788 in WOS, 764 in Scopus, and 234 in IEEE. The ninth search based on the “geriatr* AND elderl* AND nutrition AND machine learning” query generated 29 papers in PubMed, 18 in Scopus, 12 in WOS, and 2 in IEEE. These results suggest the need to intensify the research in the field of geriatrics, specifically regarding ML components, as the number of results remains limited regardless of the database used.

Table 10 presents a summary of the studies disseminated in the Multidisciplinary Digital Publishing Institute (MDPI). These studies focus on the field of elderly health through an approach using ML methods. These articles cover issues such as sarcopenia, frailty, delirium, mood disorders, and locomotor syndrome. These works come from open-access journals such as Sensors, Journal of Clinical Medicine, and International Journal of Environmental Research and Public Health. The fact that these articles are disseminated in such journals reflects the trend of combining sensor technology with ML models for assessment, prevention, prediction, and treatment suggestions. A common aspect of these works is the use of moderately sized datasets. Some studies use medical imaging, with an emphasis on non-invasive methods that can be easily implemented in clinical practice. The ML models are predominantly RF and LR. More complex architectures like MLP, CatBoost, or Stacking achieve accuracies exceeding 95%. The reported metrics include, in addition to accuracy, the F1-score, AUC, and specialized metrics such as Hamming Loss for classification or R² for regression.

The results presented in Table 10 show that ML algorithms are tools that enable continuous monitoring, early detection, the prevention of certain behaviors, and the personalization of investigations in geriatrics. However, the lack of detailed metrics in some studies indicates the need for standardization in evaluating these models. Moreover, the lack of dataset standardization, and, on the other hand, the need to increase the volume of research, as can be seen in Table 10, which shows a very small number of results, suggest the need to encourage researchers to explore this area. Overall, this research outlines a direction for integrating ML methods into prevention, assistance, and detection systems, as well as personalization in healthy aging.

The factors influencing the field of geriatrics are analyzed in various processes targeting environmental factors, also addressed through ML methods, such as water quality [86,87] or air quality [88]. On the other hand, digital systems that introduce data scaling methods at the sensory level [89], implemented with embedded microcontrollers [90], allow for the remote monitoring of the elderly. Thus, two research directions are defined. The first direction focuses on research related to environmental factors that influence the quality of life of the elderly, while the second direction directly impacts how the elderly are monitored.

Thus, from the analysis of the papers, the following are observed:

The most frequently studied ML models are RF, XGBoost, and SVM. These are predominantly analyzed in the geriatric context due to their ability to operate with incomplete data. This behavior provides accurate results in order to manage complex clinical scenarios.
The subdomains of geriatrics applicable in ML include multiple applicable areas such as predicting postoperative mortality, classifying the level of frailty, identifying fall risk, monitoring nutritional status, analyzing pain, the early detection of dementia, managing patients with renal or cardiovascular insufficiency, and analyzing depression among the elderly.
The performance metrics of ML algorithms, predominantly analyzed in contribution articles, are accuracy, precision, F1-score, and AUC-ROC. In the analyzed literature, the reported values exceed 80% in most applications, which confirms the utility of the models in clinical practice. Additionally, these values also suggest the possibility of improving the metrics if specialists continue to investigate other supplementary algorithms, in addition to those preferred so far.
The geographical distribution of research indicates a concentration of scientific activity in developed countries such as the USA, China, Japan, and Germany, but also a gradual openness towards interdisciplinary international collaborations.
The type of publications shows an increased interest in empirical articles, as opposed to purely theoretical ones, an aspect highlighted by the maturation of the field and the focus on concrete clinical applications.

In relation to the RQ, the study responds as follows:

RQ1: ML algorithms are applied in various subfields of geriatrics. In the specialized literature, the postoperative mortality prediction, fall risk identification, hospitalization duration estimation, frailty level classification, nutritional status assessment, early dementia detection, and analysis of depression and overall functional status in the elderly are highlighted. These directions reflect the need for personalized care in a complex clinical context.
RQ2: The purposes of using ML algorithms are divided between classification, with directions such as differential diagnosis, functional status labeling, and prediction in applications for disease progression, recurrence risk, or post-intervention survival. Out of the total number of analyzed works, 40% aimed at classification, while 60% focused on prediction, indicating a trend towards anticipatory clinical applicability.
RQ3: The most commonly used metrics for performance evaluation are accuracy, precision, sensitivity, the F1-score, and AUC-ROC. These are directly correlated with the potential for the practical integration of applications in real-world contexts. The reported performances show that the models can support clinical decision-making, provided they are used for assistance and not for automatic decision-making in existing healthcare systems.

From the analyzed articles, 10 papers were selected that the authors consider representative of the field of geriatrics. These were qualitatively evaluated based on the dataset details, performance metrics, and clinical applicability. The scores ranged from 0 to 2, with 0 representing vague, 1 partially detailed, and 2 well-detailed (Table 11).

The qualitative analysis is supported by 10 papers that evaluate the three reference indicators for ML tools based on the authors’ expertise. Table 11 summarizes this evaluation based on the total score obtained for each criterion individually. This approach helps in interpreting the results regarding the need for the current systematic review.

The differences between the performance indicator values for AUC range from 0.57 to 0.98. These variations are justified by the type and quality of datasets used, the class imbalance, the degree of preprocessing applied before model training, or the complexity of the tasks addressed. Models that handle simple tasks, such as binary classification with clinical variables, perform better. Tasks involving heterogeneous variables, incomplete data, insufficient data volume, or highly complex clinical scenarios, such as predicting rare or multidimensional events, often result in lower performance. Therefore, variations in AUC reflect the quality of the model and the specific challenges inherent to each clinical application.

The paper is addressed to researchers in the field of ML and AI, who can identify, in this work, the future directions they should focus on in the study of algorithms in the geriatric field. They can also understand the technical challenges specific to this data. The paper is addressed to healthcare professionals and medical staff, who can study how these technologies can be integrated into their practice to improve clinical decisions. University faculty and PhD students can use this study as a systematic database for developing future research papers or interdisciplinary projects. In the private sector and the medical industry, ML solutions with the potential for scaling and implementation in portable devices, mobile applications, modern devices, or integrated digital health systems can be identified.

The authors believe that ML models are used in the field of geriatrics to improve diagnosis, prognosis, prevention, and the personalization of care for aging patients. Transparency regarding how these models provide suggestions is a major challenge at the moment. In the context of clinical decision-making, integrating tools like Explainable AI (XAI) into the prediction process represents a necessary future research direction to enable the understanding of the factors influencing model predictions. Implementing such mechanisms will increase clinicians’ confidence in using ML technologies. This process will facilitate its widespread adoption in medical practice.

The gaps identified in the literature represent starting points for future research. These address the following aspects:

The lack of standardization of datasets makes it impossible to replicate studies and objectively compare the performance between the models studied.
The explainability of models refers to the fact that, in most cases, the papers predominantly address quantitative metrics without providing an insight into the interpretability of the models for medical staff. In the context of geriatrics, where decisions often involve high risks, explainability should be a benchmark for future research.
Longitudinal studies are validated on cross-sectional data sets. Few studies follow patients in the long term, limiting the models’ ability to provide predictions over time. A future research direction should address the longitudinal issues of research in relation to ML.
Integration into real clinical practice represents another future direction for research, given that, although studies report good performance on limited datasets, in practice, they may encounter situations that were not considered in the datasets used for research.
Ethics and data protection in geriatrics is an extremely sensitive topic, especially in the context of AI usage, which raises a series of ethical questions related to consent, data access, algorithmic bias, and equity.

Considering these gaps, the research directions recommended by the authors of this paper are as follows:

The creation of standardized open-access databases in which the collected and labeled data are explained, so that as many studies as possible related to ML can be provided in the geriatric field.
Studying the explainability of algorithms through methods such as SHAP or Locally Interpretable Model-Agnostic Explanations (LIME), XAI, or other ML algorithms.
The implementation of prospective studies that validate the models over time and track the impact on patients’ quality of life, thus extending the studies generated by ML algorithms over a span of decades.
Interdisciplinary collaborations between engineers, programmers, geriatricians, psychologists, and medical ethics experts.
Designing integrated AI systems with interfaces that can be used by medical staff without requiring expertise in programming or data science.

Synthesizing the results analyzed in this paper, it is found that ML algorithms impact the way health policies for the elderly are approached. Thus, ML models triage patients in emergency units in the context of healthcare system overload. With the help of these algorithms, the monitoring of patients with chronic diseases is ensured through wearable devices and sensors connected to these devices [91]. Additionally, these algorithms allocate medical resources, and, by these resources, the authors refer to beds, personnel, and equipment, with allocation being based on predictions of the patients’ health status evolution. Additionally, these algorithms personalize patient treatment by taking into account historical data, laboratory tests, lifestyle, and patient comorbidities. For these benefits to materialize, ML algorithms need to be integrated into the national electronic health systems by developing clinical guidelines that include the use of these algorithms. Additionally, medical personnel require proper training in interpreting results obtained through AI. For all these things to materialize, it is necessary that we ensure a legal and ethical framework regarding data protection.

Some ML applications in the literature reviewed have demonstrated quantifiable clinical benefits, such as reducing the hospital stay duration. For example, in the study conducted by Tian et al. [27] on geriatric patients with hip fractures, the XGBoost model predicted the length of hospital stay with an accuracy of 92.4% and an AUC of 98.8%. These results optimize the allocation of medical resources. Another example is provided by Früh et al. [42], who correlated ML scores with the probability of prolonged hospitalization. This approach supports early intervention, which prevents complications from arising. Such results as ML models impact clinical workflows at the geriatric level.

In the field of geriatrics, some patients suffer from cognitive impairments, which render them unable to provide informed consent regarding the use of AI. For these reasons, the patient or their relatives need to be informed that personal data will be processed through ML models. On the other hand, algorithms do not replace human clinical decision-making. These models amplify unintentional discrimination if they are trained on historical data with bias, such as treatment inequalities based on age or gender. The lack of transparency in the models, as the algorithms represent black boxes, reduces the trust of the doctor or the patient in the final decision. For this reason, it is important to mention that the professional is the one who must always make the final decision in the case of ML algorithms.

Regarding patient personal data, in Europe, the General Data Protection Regulation (GDPR) restricts the processing of sensitive data, especially if it is of a medical nature. ML models that use such data must adhere to the principle of data minimization, provide explanations for algorithmic decisions through the right to an explanation according to Article 22 of GDPR, and maintain their absolute confidentiality. In addition to the GDPR, the AI Act is being adopted, representing the first legal framework for AI that classifies medical applications as high-risk. According to this act, ML applications in geriatrics must be transparent, allow for auditability, and be subject to continuous clinical validation. Given these constraints, the authors’ ethical recommendations include explaining the algorithms to doctors, ensuring that decision support systems have logs and a decision history in a format accessible to the doctor, accompanying each ML model with formal ethical evaluations, such as Clinical Ethics Committees, and, last but not least, obtaining the patient’s or legal representative’s consent, who must be informed about how the ML algorithms will process their data.

GDPR-integrated ML applications intended for geriatrics should benefit from technical measures that enable developers of these algorithms to pseudonymize patient data, making it impossible to identify patients based on the analyzed data. Another provision should be the limitation of access to patient datasets. These should only be accessed by authorized researchers, as determined by permission control. The periodic auditing of algorithmic decisions should be another measure with which to identify potential forms of bias within the context of GDPR compliance. Another norm should be ensuring the right to an explanation under the GDPR. Methods like SHAP or LIME provide medical staff with access to the algorithm’s decision-making reasoning. This is associated with the transparency of the algorithm’s decision-making and should be a mandatory norm in the development of ML algorithms. Moreover, obtaining consent accompanied by a description of how the algorithms process data should be another mandatory norm in the development of these algorithms.

In this systematic review, both internal and external validity were established to ensure the importance of the conclusions. In this paper, a study selection methodology and a content analysis conducted by the authors based on their own expertise were used. In this section, an explicit discussion will be conducted on the validity dimension, highlighting the understanding of the limitations and generalizability of the results. Internal validity is associated with the logical coherence between the objectives of the paper, the inclusion criteria, the analysis method, and the conclusions drawn. In this study, internal validity is due to the use of a search strategy defined according to WOS rules by including terms, bulleted lists, and the temporal delimitations of 1 January 2020 to 31 May 2025. The selection of articles is based on eligibility criteria and dual analytical methods, qualitative and quantitative. The analysis of performance metrics ensures the objectivity of evaluations in relation to the performance of ML models used in the geriatric context. External validity refers to the extent to which conclusions generalize to external contexts; yet, it is limited by factors such as the studies’ origins in developed countries with an advanced IT infrastructure or the frequently heterogeneous data that do not allow for the reproducibility of models due to the lack of common standardization.

The implementation of ML algorithms in geriatrics faces an obstacle in the form of low trust from medical staff in black box models. Doctors are hesitant to use these tools, whose internal mechanism is not always explained in clinical terms. Another problem with these algorithms is the lack of justification to the patient. To overcome this obstacle, the authors recommend integrating explainable techniques such as SHAP or LIME. They interpret algorithmic decisions for clinicians. On the other hand, ML models should not be seen as replacements for medical decision-making. The authors emphasize that they recommend using these tools as complementary methods to assist data analysis specialists. Human–machine collaboration, where the final decision must remain with the medical professional, aids in clinical acceptance and the adherence to ethical principles. The development of explainable interfaces to support clinical validation is a prerequisite for integrating AI components into geriatric practice.

The authors highlight the gaps identified in the specialized literature through a series of research deficiencies represented by the lack of standardization of datasets, the lack of explanations for algorithmic decisions as they treat the issue like black boxes, the lack of longitudinal studies, the reduced integration into clinical practice, and the lack of regulations regarding ethics and data protection in AI-assisted geriatrics.

6. Conclusions

The paper is a systematic review of studies published between 1 January 2020 and 31 May 2025. These studies investigate ML-type algorithms in the field of geriatrics. The analysis reveals that the field is in an expansion phase. The annual increase in publications is approximately 18%. Interest in this field is particularly strong among researchers from the USA, China, Japan, and Germany.

A comparative analysis of the four databases revealed that PubMed provides the most results regarding research related to the field of geriatrics. Scopus stands out with its academic research-based approach, offering competitive results in the Elderly category. WOS has an intermediate position with lower results than PubMed or Scopus, but including a roughly constant number across all categories. IEEE is the most restricted platform, offering databases where the number of research papers is relatively small due to its focus on the technological field. In total, the comparative analysis highlighted the very small number of results from collaborations between geriatric specialists and those in the field of ML.

Among the analyzed algorithms, the most used are RF with over 52% of the total studies analyzed, followed by XGBoost with 24%, SVM with 16%, and other models such as KNN and Naïve Bayes. Algorithms are used in various areas of geriatrics, such as predicting postoperative mortality, where they achieve an accuracy of up to 92.4%, detecting falls with an accuracy of up to 99%, analyzing nutritional status with a sensitivity of 91%, and classifying the level of frailty with an accuracy of 95%.

The evaluation of algorithm performance is carried out using standard metrics represented by accuracy, precision, recall, sensitivity, F1-score, and AUC-ROC. Regarding accuracy, studies have reported values between 57% and 99.4%, with an average of 81.1%. Sensitivity is reported between 68% and 97%, with an average reported value of 91%. The F1-score is reported between 73% and 94%, while the AUC-ROC values range from 62.5% to 98.8%, suggesting the predictive capability of the models, especially in oncology and trauma. These results demonstrate the ability of ML algorithms in the early identification, prediction, and classification of diseases in the field of geriatrics.

In addition to these results, five deficient directions in the current literature are noted, namely, the lack of standardization of the datasets, the reduced explainability of the models, the absence of longitudinal studies, the low implementation in clinical practice, and the lack of explicit treatment of ethical and privacy issues. Based on these findings, the authors recommend the creation of open databases labeled with metadata that can be studied in various research contexts, the explicit integration of algorithmic explanations into clinical decision-making processes, the conduct of prospective studies with temporal validation and long-term patient follow-up, interdisciplinary collaboration between bioethics engineers and public policy specialists, and the adaptation of legal frameworks such as the GDPR and the AI Act to support the use of AI in geriatrics.

In conclusion, the paper demonstrates that ML algorithms contribute to the personalization, prediction, prevention, and adoption of tailored means of elderly care. With an average accuracy of over 80% in most clinical scenarios, these algorithms assist medical decision-making and optimize resource allocation in order to improve patients’ quality of life. For the integration of these models into health systems, technical validations and responsible approaches from ethical, legal, and social perspectives are necessary, aspects that can only be addressed through increased ongoing research in this field.

Author Contributions

Conceptualization, A.S. and C.-M.R.; methodology, A.S. and C.-M.R.; software, C.-M.R.; validation, A.S. and C.-M.R.; formal analysis, A.S., C.-M.R. and E.M.I.; investigation, A.S., C.-M.R. and E.M.I.; resources, A.S. and C.-M.R.; data curation, C.-M.R. and E.M.I.; writing—original draft preparation, A.S. and C.-M.R.; writing—review and editing, A.S., C.-M.R. and E.M.I.; visualization, A.S. and C.-M.R.; supervision, A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Petroleum-Gas University of Ploiesti, Romania.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AdaBoost	Adaptive Boosting
AHS	American Housing Survey
AI	Artificial intelligence
ANN	Artificial Neural Network
ANOVA	Analysis of variance
AUC	Area under the curve
AUC-ROC	Area under the Receiver Operating Characteristic Curve
BC	Bagging Classifier
CC	Classifier chains
CHARLS	China Health and Retirement Longitudinal Study
CNN	Convolutional Neural Network
CT	Computed tomography
DL	Deep learning
DSST	Digit Symbol Substitution Test
DT	Decision Tree
DXA	Dual-energy X-ray absorptiometry
EHCR	Electronic home care record
EWUMH	Ewha Womans University Mokdong Hospital
FCB	Frailty Care Bundl
FTSTS	Five-Time Sit-to-Stand Test
GBC	Gradient-boosting classifier
GBDT	Gradient-Boosting Decision Tree
GBM	Gradient-Boosting Machine
GDPR	General Data Protection Regulation
GLIM	Global Leadership Initiative on Malnutrition
GNB	Gaussian Naïve Bayes
GNSGCSPEN	Geriatric Nutrition Study Group of the Chinese Society for Parenteral and Enteral Nutrition
HRDepthNet	High-Resolution Depth Net
HRNet	High-Resolution Net
ICU	Intensive care unit
IEEE	Institute of Electrical and Electronics Engineers
IMU	Inertial Measurement Unit
INSCOC	Investigation on Nutrition Status and Clinical Outcome of Common Cancers
IoT	Internet of Things
KNHNES	Korea National Health and Nutrition Examination Survey.
KNN	K-Nearest Neighbor
LASSO	Least Absolute Shrinkage Selection Operator
LightGBM	Light Gradient-Boosting Machine
LIME	Locally Interpretable Model-Agnostic Explanations
LR	Logistic Regression
LRC	Logistic regression classifier
LSTM	Long Short-Term Memory
MDPI	Multidisciplinary Digital Publishing Institute
MIMIC	Medical Information Mart for Intensive Care
ML	Machine learning
MLP	Multilayer Perceptron
MNHT	Medical Norte Hospital in Tijuana
NB	Naïve Bayes
NHANES	National Health and Nutrition Examination Survey
NIGSBRH	National Institute of Gastroenterology “S. de Bellis” Research Hospital
NN	Neural Network
PCA	Principal component analysis
PCT	Procalcitonin
PIM	Potentially inappropriate medications
PLR	Penalized Logistic Regression
PR-AUC	Precision–Recall Area Under the Curve
RF	Random Forest
RFC	Random forest classifier
RMSE	Root Mean Square Error
ROC	Receiver Operating Characteristic
RQ	Research question
RUSBoost	Random under-sampling with a boosting algorithm
SGB	Stochastic Gradient Boosting
SHAP	SHapley Additive exPlanations
SMR	Standardized mortality ratio
SVC	Support Vector Classifier
SVM	Support Vector Machines
TBI	Traumatic brain injury
VC	Voting Classifier
WOS	Web of Science
XAI	Explainable AI
XGBoost	Extreme Gradient Boosting

References

Rosca, C.-M.; Carbureanu, M.; Stancu, A. Data-Driven Approaches for Predicting and Forecasting Air Quality in Urban Areas. Appl. Sci. 2025, 15, 4390. [Google Scholar] [CrossRef]
Stancu, A. The Relationship among Population Number, Food Domestic Consumption and Food Consumer Expenditure for Most Populous Countries. Procedia Econ. Financ. 2015, 22, 333–342. [Google Scholar] [CrossRef]
Anghelache, C.; Anghel, M.-G.; Iacob, Ș.V.; Panait, M.; Rădulescu, I.G.; Brezoi, A.G.; Miron, A. The Effects of Health Crisis on Economic Growth, Health and Movement of Population. Sustainability 2022, 14, 4613. [Google Scholar] [CrossRef]
Oosterhoff, J.H.F.; Savelberg, A.B.M.C.; Karhade, A.V.; Gravesteijn, B.Y.; Doornberg, J.N.; Schwab, J.H.; Heng, M. Development and internal validation of a clinical prediction model using machine learning algorithms for 90 day and 2 year mortality in femoral neck fracture patients aged 65 years or above. Eur. J. Trauma Emerg. Surg. 2022, 48, 4669–4682. [Google Scholar] [CrossRef]
Zhou, Y.; Romijnders, R.; Hansen, C.; Campen, J.V.; Maetzler, W.; Hortobágyi, T.; Lamoth, C.J.C. The detection of age groups by dynamic gait outcomes using machine learning approaches. Sci. Rep. 2020, 10, 4426. [Google Scholar] [CrossRef]
Mądra-Gackowska, K.; Szewczyk-Golec, K.; Gackowski, M.; Hołyńska-Iwan, I.; Parzych, D.; Czuczejko, J.; Graczyk, M.; Husejko, J.; Jabłoński, T.; Kędziora-Kornatowska, K. Selected Biochemical, Hematological, and Immunological Blood Parameters for the Identification of Malnutrition in Polish Senile Inpatients: A Cross-Sectional Study. J. Clin. Med. 2025, 14, 1494. [Google Scholar] [CrossRef]
Popescu, C.; El-Chaarani, H.; El-Abiad, Z.; Gigauri, I. Implementation of Health Information Systems to Improve Patient Identification. Int. J. Environ. Res. Public Health 2022, 19, 15236. [Google Scholar] [CrossRef]
Wu, E.-B.; Lin, Y.-H.; Yang, J.C.-S.; Lai, C.-W.; Chin, J.-C.; Wu, S.-C. Density Spectral Array Enables Precise Sedation Control for Supermicrosurgical Lymphaticovenous Anastomosis: A Retrospective Observational Cohort Study. Bioengineering 2023, 10, 494. [Google Scholar] [CrossRef]
Khan, U.H.; Mantoo, S.; Dhar, A.; Shabir, A.; Shah, A.; Mehfooz, N.; Shah, S. Vitamin D Toxicity Presenting as Altered Mental Status in Elderly Patients. Cureus 2022, 14, e32654. [Google Scholar] [CrossRef]
Rosca, C.-M.; Stancu, A.; Tănase, M.R. A Comparative Study of Azure Custom Vision Versus Google Vision API Integrated into AI Custom Models Using Object Classification for Residential Waste. Appl. Sci. 2025, 15, 3869. [Google Scholar] [CrossRef]
Choudhury, A.; Renjilian, E.; Asan, O. Use of machine learning in geriatric clinical care for chronic diseases: A systematic literature review. JAMIA Open 2020, 3, 459–471. [Google Scholar] [CrossRef]
Chu, L.F.; Kurup, V. Preparing for The Silver Tsunami: The Potential for use of Big Data and Artificial Intelligence in Geriatric Anesthesia. Curr. Anesthesiol. Rep. 2025, 15, 17. [Google Scholar] [CrossRef]
Das, A.; Dhillon, P. Application of machine learning in measurement of ageing and geriatric diseases: A systematic review. BMC Geriatr. 2023, 23, 841. [Google Scholar] [CrossRef] [PubMed]
Benovic, S.; Ajlani, A.H.; Leinert, C.; Fotteler, M.; Wolf, D.; Steger, F.; Kestler, H.; Dallmeier, D.; Denkinger, M.; Eschweiler, G.W.; et al. Introducing a machine learning algorithm for delirium prediction—The Supporting SURgery with GEriatric Co-Management and AI project (SURGE-Ahead). Age Ageing 2024, 53, afae101. [Google Scholar] [CrossRef] [PubMed]
Gulcicek, S.; Seyahi, N. Comparison of chronic kidney disease progression and associated complications between geriatric and non-geriatric groups. Medicine 2024, 103, e37422. [Google Scholar] [CrossRef]
Heybeli, C.; Soysal, P.; Smith, L.; Keskin, E.B.; Kazancıoğlu, R. Effects of a Change in the Definition of Chronic Kidney Disease on Geriatric Assessment Parameters. Turk. J. Nephrol. 2022, 31, 209–217. [Google Scholar] [CrossRef]
Elihimas Júnior, U.F.; Couto, J.P.; Pereira, W.; Barros De Oliveira Sá, M.P.; Tenório De França, E.E.; Aguiar, F.C.; Cabral, D.B.C.; Alencar, S.B.V.; Feitosa, S.J.D.C.; Claizoni Dos Santos, T.O.; et al. Logistic Regression Model in a Machine Learning Application to Predict Elderly Kidney Transplant Recipients with Worse Renal Function One Year after Kidney Transplant: Elderly KTbot. J. Aging Res. 2020, 2020, 7413616. [Google Scholar] [CrossRef]
Wu, L.-W.; Ouyoung, T.; Chiu, Y.-C.; Hsieh, H.-F.; Hsiu, H. Discrimination between possible sarcopenia and metabolic syndrome using the arterial pulse spectrum and machine-learning analysis. Sci. Rep. 2022, 12, 21452. [Google Scholar] [CrossRef]
Rosca, C.-M.; Bold, R.-A.; Gerea, A.-E. A Comprehensive Patient Triage Algorithm Incorporating ChatGPT API for Symptom-Based Healthcare Decision-Making. In Emerging Trends and Technologies on Intelligent Systems; ETTIS 2024; Lecture Notes in Networks and Systems; Springer: Singapore, 2025; pp. 167–178. [Google Scholar] [CrossRef]
Nieradko-Iwanicka, B. Vitamin D25-OH concentration in outpatient and hospitalized geriatric patients—Retrospective study. Curr. Issues Pharm. Med. Sci. 2024, 37, 75–78. [Google Scholar] [CrossRef]
Shi, D.; Tang, C.; Blackley, S.V.; Wang, L.; Yang, J.; He, Y.; Bennett, S.I.; Xiong, Y.; Shi, X.; Zhou, L.; et al. An annotated dataset of tongue images supporting geriatric disease diagnosis. Data Brief 2020, 32, 106153. [Google Scholar] [CrossRef]
Lech, M.A.; Warpechowski, M.; Wojszel, A.; Rentflejsz, J.; Świętek, M.; Wojszel, Z.B. Vitamin D Status among Patients Admitted to a Geriatric Ward—Are Recommendations for Preventing Its Deficiency Effective Enough? Nutrients 2024, 16, 193. [Google Scholar] [CrossRef]
Demïr, İ.; Yilmaz, İ. The Effect of Polypharmacy on Procalcitonin Levels in The Intensive Care Admission of Geriatric Patients with Sepsis. Konuralp Tıp Derg. 2020, 12, 216–222. [Google Scholar] [CrossRef]
Zheng, Z.; Xu, W.; Wang, F.; Qiu, Y.; Xue, Q. Association between vitamin D3 levels and frailty in the elderly: A large sample cross-sectional study. Front. Nutr. 2022, 9, 980908. [Google Scholar] [CrossRef] [PubMed]
Moghadam, M.P.; Moghadam, Z.A.; Qazani, M.R.C.; Pławiak, P.; Alizadehsani, R. Impact of Artificial Intelligence in Nursing for Geriatric Clinical Care for Chronic Diseases: A Systematic Literature Review. IEEE Access 2024, 12, 122557–122587. [Google Scholar] [CrossRef]
Chu, W.-M.; Kristiani, E.; Wang, Y.-C.; Lin, Y.-R.; Lin, S.-Y.; Chan, W.-C.; Yang, C.-T.; Tsan, Y.-T. A model for predicting fall risks of hospitalized elderly in Taiwan-A machine learning approach based on both electronic health records and comprehensive geriatric assessment. Front. Med. 2022, 9, 937216. [Google Scholar] [CrossRef]
Tian, C.-W.; Chen, X.-X.; Shi, L.; Zhu, H.-Y.; Dai, G.-C.; Chen, H.; Rui, Y.-F. Machine learning applications for the prediction of extended length of stay in geriatric hip fracture patients. World J. Orthop. 2023, 14, 741–754. [Google Scholar] [CrossRef]
Nahian, M.J.A.; Ghosh, T.; Banna, M.H.A.; Aseeri, M.A.; Uddin, M.N.; Ahmed, M.R.; Mahmud, M.; Kaiser, M.S. Towards an Accelerometer-Based Elderly Fall Detection System Using Cross-Disciplinary Time Series Features. IEEE Access 2021, 9, 39413–39431. [Google Scholar] [CrossRef]
Rosca, C.-M.; Stancu, A. Anomaly Detection in Elderly Health Monitoring via IoT for Timely Interventions. Appl. Sci. 2025, 15, 7272. [Google Scholar] [CrossRef]
Xing, F.; Luo, R.; Liu, M.; Zhou, Z.; Xiang, Z.; Duan, X. A New Random Forest Algorithm-Based Prediction Model of Post-operative Mortality in Geriatric Patients With Hip Fractures. Front. Med. 2022, 9, 829977. [Google Scholar] [CrossRef]
Akbari, G.; Nikkhoo, M.; Wang, L.; Chen, C.P.C.; Han, D.-S.; Lin, Y.-H.; Chen, H.-B.; Cheng, C.-H. Frailty Level Classification of the Community Elderly Using Microsoft Kinect-Based Skeleton Pose: A Machine Learning Approach. Sensors 2021, 21, 4017. [Google Scholar] [CrossRef]
Amjad, A.; Qaiser, S.; Błaszczyszyn, M.; Szczęsna, A. The evolution of frailty assessment using inertial measurement sensor-based gait parameter measurements: A detailed analysis. WIREs Data Min. Knowl. Discov. 2024, 14, e1557. [Google Scholar] [CrossRef]
Ramachandran, A.; Karuppiah, A. A Survey on Recent Advances in Wearable Fall Detection Systems. BioMed Res. Int. 2020, 2020, 2167160. [Google Scholar] [CrossRef] [PubMed]
Sajeev, S.; Champion, S.; Maeder, A.; Gordon, S. Machine learning models for identifying pre-frailty in community dwelling older adults. BMC Geriatr. 2022, 22, 794. [Google Scholar] [CrossRef]
Sasu, G.-V.; Ciubotaru, B.-I.; Goga, N.; Vasilățeanu, A. Addressing Missing Data Challenges in Geriatric Health Monitoring: A Study of Statistical and Machine Learning Imputation Methods. Sensors 2025, 25, 614. [Google Scholar] [CrossRef] [PubMed]
Rosca, C.-M.; Stancu, A. A Review of Neuro-ML Breakthroughs in Addressing Neurological Disorders. Appl. Sci. 2025, 15, 5442. [Google Scholar] [CrossRef]
Tunthanathip, T.; Phuenpathom, N.; Jongjit, A. Web-based calculator using machine learning to predict intracranial hematoma in geriatric traumatic brain injury. J. Hosp. Manag. Health Policy 2023, 7, 16. [Google Scholar] [CrossRef]
Xu, L.; Xu, Y.; Li, G.; Yang, B. Perioperative factors related to the prognosis of elderly patients with hepatocellular carcinoma. Eur. J. Med. Res. 2022, 27, 280. [Google Scholar] [CrossRef]
Sheng, B.; Zhang, S.; Gao, Y.; Xia, S.; Zhu, Y.; Yan, J. Elucidating the influence of familial interactions on geriatric depression: A comprehensive nationwide multi-center investigation leveraging machine learning. Acta Psychol. 2024, 246, 104274. [Google Scholar] [CrossRef]
Lai, C.-H.; Mok, P.K.-L.; Chau, W.-W.; Law, S.-W. Application of machine learning models on predicting the length of hospital stay in fragility fracture patients. BMC Med. Inform. Decis. Mak. 2024, 24, 26. [Google Scholar] [CrossRef]
Sudha, K.; Kumar, V.; Bennur, S.; Dhanasekar, K. A prospective, randomized, open-label, placebo-controlled comparative study of Bacillus coagulans GBI-30,6086 with digestive enzymes in improving indigestion in geriatric population. J. Fam. Med. Prim. Care 2020, 9, 1108. [Google Scholar] [CrossRef]
Früh, A.; Frey, D.; Hilbert, A.; Jelgersma, C.; Uhl, C.; Nissimov, N.; Truckenmüller, P.; Wasilewski, D.; Rallios, D.; Hoppe, M.; et al. Preoperatively-determined Red Distribution Width (RDW) predicts prolonged length of stay after single-level spinal fusion in elderly patients. Brain Spine 2024, 4, 102827. [Google Scholar] [CrossRef]
Rosca, C.-M.; Stancu, A.; Popescu, M. The Impact of Cloud Versus Local Infrastructure on Automatic IoT-Driven Hydroponic Systems. Appl. Sci. 2025, 15, 4016. [Google Scholar] [CrossRef]
Wu, X.; Chen, M.; Liu, K.; Wu, Y.; Feng, Y.; Fu, S.; Xu, H.; Zhao, Y.; Lin, F.; Lin, L.; et al. Molecular classification of geriatric breast cancer displays distinct senescent subgroups of prognostic significance. Mol. Ther. Nucleic Acids 2024, 35, 102309. [Google Scholar] [CrossRef] [PubMed]
Xie, J.-B.; Huang, S.-J.; Yang, T.-B.; Wang, W.; Chen, B.-Y.; Guo, L. Comparison of machine learning methods for Predicting 3-Year survival in elderly esophageal squamous cancer patients based on oxidative stress. BMC Cancer 2024, 24, 1432. [Google Scholar] [CrossRef] [PubMed]
Takahashi, M.; Sakamoto, Y.; Ohori, H.; Tsuji, Y.; Kuroki, M.; Kato, S.; Otsuka, K.; Komine, K.; Takahashi, M.; Takahashi, S.; et al. Phase II study of trifluridine/tipiracil (TAS-102) therapy in elderly patients with colorectal cancer (T-CORE1401): Geriatric assessment tools and plasma drug concentrations as possible predictive biomarkers. Cancer Chemother. Pharmacol. 2021, 88, 393–402. [Google Scholar] [CrossRef]
Nahornyj, E.; Goutelle, S.; Bourguignon, L.; De La Gastine, B. Évaluation des prescriptions d’anticoagulants oraux directs (AOD) en gériatrie hospitalière sur 3 ans. Therapies 2021, 76, 191–200. [Google Scholar] [CrossRef]
Soraci, L.; De Vincentis, A.; Aucella, F.; Fabbietti, P.; Corsonello, A.; Arena, E.; Aucella, F.; Gatta, G.; Incalzi, R.A. Prevalence, risk factors, and treatment of anemia in hospitalized older patients across geriatric and nephrological settings in Italy. Sci. Rep. 2024, 14, 19721. [Google Scholar] [CrossRef]
Fujiwara, G.; Okada, Y.; Suehiro, E.; Yatsushige, H.; Hirota, S.; Hasegawa, S.; Karibe, H.; Miyata, A.; Kawakita, K.; Haji, K.; et al. Development of Machine-learning Model to Predict Anticoagulant Use and Type in Geriatric Traumatic Brain Injury Using Coagulation Parameters. Neurol. Med.-Chir. 2025, 65, 61–70. [Google Scholar] [CrossRef]
Mirzaeian, R.; Nopour, R.; Asghari Varzaneh, Z.; Shafiee, M.; Shanbehzadeh, M.; Kazemi-Arpanahi, H. Which are best for successful aging prediction? Bagging, boosting, or simple machine learning algorithms? Biomed. Eng. OnLine 2023, 22, 85. [Google Scholar] [CrossRef]
Şenol, A.; Özer Balin, Ş.; Telo, S. Evaluation of Pentraxin 3 Utility in Predicting Mortality in Geriatric Patients with COVID-19: A Prospective Study. Turk. J. Geriatr. 2022. [Google Scholar] [CrossRef]
Zadgaonkar, A.; Keskar, R.; Kakde, O. Towards a Machine Learning Model for Detection of Dementia Using Lifestyle Parameters. Appl. Sci. 2023, 13, 10630. [Google Scholar] [CrossRef]
Millet, A.; Madrid, A.; Alonso-Weber, J.M.; Rodríguez-Mañas, L.; Pérez-Rodrá-Guez, R. Machine Learning Techniques Applied to the Development of a Fall Risk Index for Older Adults. IEEE Access 2023, 11, 84795–84809. [Google Scholar] [CrossRef]
Ocagli, H.; Bottigliengo, D.; Lorenzoni, G.; Azzolina, D.; Acar, A.S.; Sorgato, S.; Stivanello, L.; Degan, M.; Gregori, D. A Machine Learning Approach for Investigating Delirium as a Multifactorial Syndrome. Int. J. Environ. Res. Public Health 2021, 18, 7105. [Google Scholar] [CrossRef] [PubMed]
Cai, C.; Zhu, H.; Li, B.; Tang, C.; Ren, Y.; Guo, Y.; Li, J.; Wang, L.; Li, D.; Li, D. Prognostic Analysis of Elderly Patients with Hepatocellular Carcinoma: An Exploration and Machine Learning Model Prediction Based on Age Stratification and Surgical Approach. J. Hepatocell. Carcinoma 2025, 12, 747–764. [Google Scholar] [CrossRef]
Rosca, C.-M.; Stancu, A. Earthquake Prediction and Alert System Using IoT Infrastructure and Cloud-Based Environmental Data Analysis. Appl. Sci. 2024, 14, 10169. [Google Scholar] [CrossRef]
Chou, Y.-Y.; Wang, M.-S.; Lin, C.-F.; Lee, Y.-S.; Lee, P.-H.; Huang, S.-M.; Wu, C.-L.; Lin, S.-Y. The application of machine learning for identifying frailty in older patients during hospital admission. BMC Med. Inform. Decis. Mak. 2024, 24, 270. [Google Scholar] [CrossRef]
Shooshani, T.; Pooladzandi, O.; Nguyen, A.; Shipley, J.H.; Harris, M.H.; Hovis, G.E.A.; Barrios, C. Field Measures Are All You Need: Predicting Need for Surgery in Elderly Ground-Level Fall Patients via Machine Learning. Am. Surg.™ 2023, 89, 4095–4100. [Google Scholar] [CrossRef]
Bednorz, A.; Lach, E.; Seiffert, P. Assessment of the usefulness of statistical learning systems in drawing conclusions about the cognitive performance status of elderly patients. Neuropsychiatr. I Neuropsychol. 2022, 17, 83–94. [Google Scholar] [CrossRef]
Liu, X.; Hu, P.; Yeung, W.; Zhang, Z.; Ho, V.; Liu, C.; Dumontier, C.; Thoral, P.J.; Mao, Z.; Cao, D.; et al. Illness severity assessment of older adults in critical illness using machine learning (ELDER-ICU): An international multicentre study with subgroup bias evaluation. Lancet Digit. Health 2023, 5, e657–e667. [Google Scholar] [CrossRef]
Sakal, C.; Li, T.; Li, J.; Yang, C.; Li, X. Association Between Sleep Efficiency Variability and Cognition Among Older Adults: Cross-Sectional Accelerometer Study. JMIR Aging 2024, 7, e54353. [Google Scholar] [CrossRef]
Li, Q.; Zhao, Y.; Chen, Y.; Yue, J.; Xiong, Y. Developing a machine learning model to identify delirium risk in geriatric internal medicine inpatients. Eur. Geriatr. Med. 2022, 13, 173–183. [Google Scholar] [CrossRef] [PubMed]
Zupo, R.; Moroni, A.; Castellana, F.; Gasparri, C.; Catino, F.; Lampignano, L.; Perna, S.; Clodoveo, M.L.; Sardone, R.; Rondanelli, M. A Machine-Learning Approach to Target Clinical and Biological Features Associated with Sarcopenia: Findings from Northern and Southern Italian Aging Populations. Metabolites 2023, 13, 565. [Google Scholar] [CrossRef] [PubMed]
Zheng, Z.; Li, S.; Li, R.; Qin, S.; Wang, W.; Wu, C. NHANES-based machine learning for cognitive impairment classification and blood and hearing threshold characterization in age-related hearing loss. Geriatr. Nurs. 2025, 63, 8–14. [Google Scholar] [CrossRef] [PubMed]
He, Y.; Cui, W.; Fang, T.; Zhang, Z.; Zeng, M. Metabolites of the gut microbiota may serve as precise diagnostic markers for sarcopenia in the elderly. Front. Microbiol. 2023, 14, 1301805. [Google Scholar] [CrossRef]
Wang, X.; Yang, F.; Zhu, M.; Cui, H.; Wei, J.; Li, J.; Chen, W. Development and Assessment of Assisted Diagnosis Models Using Machine Learning for Identifying Elderly Patients With Malnutrition: Cohort Study. J. Med. Internet Res. 2023, 25, e42435. [Google Scholar] [CrossRef]
Duan, R.; Li, Q.; Yuan, Q.X.; Hu, J.; Feng, T.; Ren, T. Predictive model for assessing malnutrition in elderly hospitalized cancer patients: A machine learning approach. Geriatr. Nurs. 2024, 58, 388–398. [Google Scholar] [CrossRef]
Kim, J.H. Machine-learning classifier models for predicting sarcopenia in the elderly based on physical factors. Geriatr. Gerontol. Int. 2024, 24, 595–602. [Google Scholar] [CrossRef]
Liu, H.; Liu, Q.; Si, H.; Yu, J.; Li, Y.; Zhou, W.; Wang, C. Development and Validation of a Nutritional Frailty Phenotype for Older Adults Based on Risk Prediction Model: Results from a Population-Based Prospective Cohort Study. J. Am. Med. Dir. Assoc. 2025, 26, 105425. [Google Scholar] [CrossRef]
Witt, U.F.; Nibe, S.M.; Ole, H.; Lebech, C.S. A novel approach for predicting acute hospitalizations among elderly recipients of home care? A model development study. Int. J. Med. Inform. 2022, 160, 104715. [Google Scholar] [CrossRef]
Lee, M.J.; Kim, D.; Bian, J.; Romero, S.; Bliznyuk, N. Identifying demographic and home features influencing older adults’ home functioning by using machine learning models. Arch. Gerontol. Geriatr. 2024, 116, 105149. [Google Scholar] [CrossRef]
Shang, H.; Ji, Y.; Cao, W.; Yi, J. A predictive model for depression in elderly people with arthritis based on the TRIPOD guidelines. Geriatr. Nurs. 2025, 63, 85–93. [Google Scholar] [CrossRef] [PubMed]
Crowe, C.; Naughton, C.; De Foubert, M.; Cummins, H.; McCullagh, R.; Skelton, D.A.; Dahly, D.; Palmer, B.; O’Flynn, B.; Tedesco, S. Treatment effect analysis of the Frailty Care Bundle (FCB) in a cohort of patients in acute care settings. Aging Clin. Exp. Res. 2024, 36, 187. [Google Scholar] [CrossRef] [PubMed]
Rosca, C.-M.; Stancu, A.; Iovanovici, E.M. The New Paradigm of Deepfake Detection at the Text Level. Appl. Sci. 2025, 15, 2560. [Google Scholar] [CrossRef]
Abzaliyev, K.; Suleimenova, M.; Chen, S.; Mansurova, M.; Abzaliyeva, S.; Manapova, A.; Kurmanova, A.; Bugibayeva, A.; Sundetova, D.; Bitemirova, R.; et al. Predicting Cardiovascular Aging Risk Based on Clinical Data Through the Integration of Mathematical Modeling and Machine Learning. Appl. Sci. 2025, 15, 5077. [Google Scholar] [CrossRef]
Hosseini, I.; Ghahramani, M. Assessing Locomotive Syndrome Through Instrumented Five-Time Sit-to-Stand Test and Machine Learning. Sensors 2024, 24, 7727. [Google Scholar] [CrossRef]
Hu, Q.; Tian, F.; Jin, Z.; Lin, G.; Teng, F.; Xu, T. Developing a Warning Model of Potentially Inappropriate Medications in Older Chinese Outpatients in Tertiary Hospitals: A Machine-Learning Study. J. Clin. Med. 2023, 12, 2619. [Google Scholar] [CrossRef]
Wang, R.; Zeng, X.; Long, Y.; Zhang, J.; Bo, H.; He, M.; Xu, J. Prediction of Mortality in Geriatric Traumatic Brain Injury Patients Using Machine Learning Algorithms. Brain Sci. 2023, 13, 94. [Google Scholar] [CrossRef]
Wei, M.; Meng, D.; Guo, H.; He, S.; Tian, Z.; Wang, Z.; Yang, G.; Wang, Z. Hybrid Exercise Program for Sarcopenia in Older Adults: The Effectiveness of Explainable Artificial Intelligence-Based Clinical Assistance in Assessing Skeletal Muscle Area. Int. J. Environ. Res. Public Health 2022, 19, 9952. [Google Scholar] [CrossRef]
Lee, T.-R.; Kim, G.-H.; Choi, M.-T. Identification of Geriatric Depression and Anxiety Using Activity Tracking Data and Minimal Geriatric Assessment Scales. Appl. Sci. 2022, 12, 2488. [Google Scholar] [CrossRef]
Cheung, J.C.-W.; Tam, E.W.-C.; Mak, A.H.-Y.; Chan, T.T.-C.; Zheng, Y.-P. A Night-Time Monitoring System (eNightLog) to Prevent Elderly Wandering in Hostels: A Three-Month Field Study. Int. J. Environ. Res. Public Health 2022, 19, 2103. [Google Scholar] [CrossRef]
Das, S.; Sakoda, W.; Ramasamy, P.; Tadayon, R.; Ramirez, A.V.; Kurita, Y. Feature Selection and Validation of a Machine Learning-Based Lower Limb Risk Assessment Tool: A Feasibility Study. Sensors 2021, 21, 6459. [Google Scholar] [CrossRef] [PubMed]
Rojas-Mendizabal, V.; Castillo-Olea, C.; Gómez-Siono, A.; Zuñiga, C. Assessment of Thoracic Pain Using Machine Learning: A Case Study from Baja California, Mexico. Int. J. Environ. Res. Public Health 2021, 18, 2155. [Google Scholar] [CrossRef] [PubMed]
Büker, L.C.; Zuber, F.; Hein, A.; Fudickar, S. HRDepthNet: Depth Image-Based Marker-Less Tracking of Body Joints. Sensors 2021, 21, 1356. [Google Scholar] [CrossRef] [PubMed]
Davidson, P.; Virekunnas, H.; Sharma, D.; Piché, R.; Cronin, N. Continuous Analysis of Running Mechanics by Means of an Integrated INS/GPS Device. Sensors 2019, 19, 1480. [Google Scholar] [CrossRef]
Carbureanu, M.; Mihalache, S.F.; Zamfir, F. Machine Learning Methods Applied for Wastewater pH Neutralization Process Modeling. In Proceedings of the 14th International Conference on Electronics, Computers and Artificial Intelligence, Ploiesti, Romania, 30 June–1 July 2022; pp. 1–6. [Google Scholar] [CrossRef]
Cărbureanu, M. The Wastewater pH Control Using an Artificial Intelligence Technique. Bul. Univ. Pet.—Gaze Din Ploieşti Ser. Teh. 2012, LXIV, 83–93. Available online: http://jpgt.upg-ploiesti.ro/wp-content/uploads/2024/04/12_T_3_2012-Madalina-Carbureanu.pdf (accessed on 15 June 2025).
Oprea, M.; Cărbureanu, M.; Dragomir, E.G. AirQMAS: A Collaborative Multi-agent System for Air Quality Analysis. Ann. Univ. Craiova Ser. Autom. Comput. Electron. Mechatron. 2012, 9, 20–26. Available online: https://ace.ucv.ro/anale/2012_vol1/04_Oprea_Mihaela.pdf (accessed on 12 June 2025).
Paraschiv, N.; Pricop, E.; Fattahi, J.; Zamfir, F. Towards a reliable approach on scaling in data acquisition. In Proceedings of the 23rd International Conference on System Theory, Control and Computing, Sinaia, Romania, 9–11 October 2019; pp. 84–88. [Google Scholar] [CrossRef]
Pricop, E.; Zamfir, F.; Paraschiv, N. Feedback control system based on a remote operated PID controller implemented using mbed NXP LPC1768 development board. J. Phys. Conf. Ser. 2015, 659, 012028. [Google Scholar] [CrossRef]
Rosca, C.-M.; Stancu, A. Integration of AI in Self-Powered IoT Sensor Systems. Appl. Sci. 2025, 15, 7008. [Google Scholar] [CrossRef]

Figure 1. Topic search strategy for WOS research database in geriatrics and ML context (2020–2025).

Figure 2. Systematic structure of identified geriatric subdomains in the literature from the ML perspective.

Figure 3. Distribution of publications by year (2020–2025).

Figure 4. Distribution of publications by type (2020–2025) based on 819 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by the document types criterion.

Figure 5. Distribution of publications by WOS category (2020–2025) based on 1047 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by the WOS categories criterion.

Figure 6. Distribution of publications by research area (2020–2025) based on 972 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by the research areas criterion.

Figure 7. Geographical distribution of publications (2020–2025) based on 932 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by the countries/regions criterion.

Figure 8. Distribution of publications by authors’ institutional affiliations (2020–2025) based on 283 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by affiliations criterion.

Figure 9. Distribution of publications by publisher (2020–2025) based on 643 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by publishers criterion.

Figure 10. Distribution of publications by open access status (2020–2025) based on 1462 articles retrieved from the WOS using the query TS = (“geriatr*” AND (“Machine Learning” OR “ML”)) AND PY = (2020–2025) and filtered by open access criterion.

Figure 11. Mapping keyword co-occurrence in geriatric research integrated with ML (2020–2025).

Figure 12. PRISMA flow diagram of the study selection process.

Table 1. Distribution of ML algorithms in relation to the models investigated in the analyzed specialized literature.

Algorithm	References
RF	[4,12,13,24,25,26,28,30,32,39,40,45,50,52,53,55,57]
XGBoost	[13,26,27,39,40,50,53,57,58]
SVM	[5,11,13,14,25,27,28,32,39,53,57]
LR	[4,13,26,27,30,34,39,50,53,57,59]
KNN	[5,11,28,32,37,53,59]
DT	[11,27,28,32,45,53]
NN	[5,12,13,26,27,31,32,37,40,42,45,52,53,57]
NB	[11,28,32,37,59]
LightGBM	[26,40]
CatBoost	[4,40,42]
GBM	[4,45]
Bayesian Models	[11,32]
King Lu, Tariff	[11]
HMM	[32]
Autoencoders	[32]
LSTM/Bi-LSTM	[32]
Transfer Learning	[32]
Stochastic Gradient Descent	[26]
Bagging Classifier	[31]
Voting Classifier	[31]
Regularized Models	[4,40,42]
OneClassDetection	[44]
GradientBoostingRegressor	[44]
LinearRegressor	[44]
SupportVectorRegressor	[44]
K-means	[21]
Threshold-based	[33]
Hybrid Systems	[33]
Vision Systems	[33]
SGB	[4]
Elastic-Net PLR	[4]

Note: GBM—Gradient-Boosting Machine; HMM—Hidden Markov Model; LSTM—Long Short-Term Memory; NN—Neural Network; PLR—Penalized Logistic Regression; SGB—Stochastic Gradient Boosting.

Table 2. Classification of literature papers based on the ML algorithm objective.

ML Algorithm Objective	References
Classification	[5,11,18,28,32,44,51,55,57]
Prediction	[14,24,26,30,40,54,58]

Table 3. Classification models investigated in the literature for geriatric purposes.

Algorithm	References
RF	[32,44,51,55,57]
SVM	[5,11,28,44]
NB	[18]
KNN	[28]
DT	[44]
LR	[44]
XGBoost	[18]
MLP	[44]

Note: MLP—Multilayer Perceptron.

Table 4. Prediction models investigated in the literature for geriatric purposes.

Algorithm	References
XGBoost	[58]
RF	[30,54]
LR	[30]
LASSO Regression	[24]
Generalized Additive Model	[24]
LightGBM	[40]
CatBoost	[40]
XGBoost	[26]
DT	[26]
SVM	[14]

Note: LASSO—Least Absolute Shrinkage Selection Operator.

Table 5. Performance of ML models in terms of AUC for the analyzed papers.

AUC	Reference
0.988	[27]
0.98	[39]
0.97	[11]
0.96	[13]
0.94	[32]
0.94	[35]
0.91	[5]
0.9	[25]
0.885	[49]
0.869	[55]
0.846	[37]
0.845	[50]
0.84	[40]
0.817	[34]
0.813	[30]
0.8106	[57]
0.81	[12]
0.81	[14]
0.81	[58]
0.791	[45]
0.74	[4]
0.729	[24]
0.71	[44]
0.7	[17]
0.7	[18]
0.67	[42]
0.625	[22]
0.596	[51]
0.57	[26]
0.57	[59]

Table 6. Performance of ML models in terms of accuracy for the analyzed papers.

Accuracy (%)	Reference
99.4	[32]
99.4	[35]
99	[28]
98	[39]
97.5	[31]
96.6	[25]
94	[50]
93	[33]
92.4	[27]
90.3	[58]
89.5	[5]
84.17	[11]
81.1	[13]
81.06	[57]
80	[52]
79	[40]
76.9	[53]
73.2	[26]
70.28	[18]
69.44	[17]

Table 7. Performance of ML models in terms of recall for the analyzed papers.

Recall (%)	Reference
97	[39]
97	[28]
92	[37]
91	[26]
90	[33]
72.55	[57]
68	[14]

Table 8. Comparative analysis of published papers across PubMed, WOS, Scopus, and IEEE.

Search Query	PubMed	Scopus	WOS	IEE
geriatr*	116,692	51,911	31,951	1381
geriatr* AND diet	2967	1562	403	5
geriatr* AND nutrition	10,332	3346	1501	16
geriatr* AND elderl*	17,397	26,806	9469	427
diet AND geriatr* AND machine learning	20	9	15	1
nutrition AND geriatr* AND machine learning	113	34	34	6
elderl* AND geriatr* AND machine learning	234	327	282	65
geriatr* AND machine learning	1800	764	788	234
geriatr* AND elderl* AND nutrition AND machine learning	29	18	12	2

Table 9. Datasets, validation methods, and performance metrics comparison.

Reference	Dataset Size	Dataset Type	Validation Method	Training Metrics	Testing Metrics
[60]	50,366	ICU; illness severity (USA, The Netherlands)	Internal, external, temporal	AUC-ROC = 0.866	AUC-ROC = 0.838–0.884, SMR = 0.99
[61]	1074	NHANES; sleep efficiency variability and cognitive function (USA)	Regression analysis	Pearson r = −0.63	β = −2.01 (DSST)
[62]	740	Hospital cohort; delirium risk in geriatric (China)	5-fold cross-validation	AUC = 0.967	AUC = 0.950, F1-score = 0.81
[63]	1971	NIGSBRH; sarcopenia (Italy)	RF, LR	-	Accuracy = 95%, Sensitivity 1, Specificity = 0.94
[64]	833	NHANES; audiometric and blood data (USA)	Algorithm comparison (RF best)	-	AUC = 0.834
[65]	63	Cohort group; gut microbiome and metabolome (China)	ML classifier	-	AUC = 0.7083–0.8833 (gut bacteria), AUC = 0.9223–0.9833 (metabolites)
[66]	2660	GLIM and GNSGCSPEN; hospitalized elderly with malnutrition (China)	Train/test split, 5 ML models	AUC = 0.791–925	AUC = 0.863–0.950
[67]	450	INSCOC; malnutrition in elderly cancer patients (China)	9 ML models tested	-	AUC = 0.945 (XGBoost)
[68]	-	KNHNES, sarcopenia in the elderly (Republic of Korea)	SVM, XGBoost, LightGBM, RF, MLP	-	Accuracy = 84.8% (LightGBM)
[69]	7641 training; 8656 validation	CHARLS, nutritional frailty phenotype (China)	RF, XGBoost	AUC = 0.746–0.752	AUC = 0.681–0.683
[70]	1282	EHCR, acute hospitalizations among elderly (Denmark)	RUSBoost, LR	PR-AUC = 0.71	PR-AUC = 0.71
[71]	13,549 training; 5807 testing	AHS; demographic and housing features associated with functional difficulties (USA)	RF	-	Accuracy = 85.37–85.79%; Sensitivity = 94.33–94.67%; Specificity = 57.66–60.21%; Precision = 86.95–87.6%
[72]	4240	NHANES; depression in arthritis patients (USA)	RF, LR, XGBoost, SGNB, GBDT SHAP	AUC = 0.811 (RF)	AUC = 0.780 (RF)
[73]	120	FCBlA; mobilization patterns of patients pre- and post-intervention (Ireland)	RF, LR	Accuracy = 82.3%	Accuracy = 74.7%

Note AHS—American Housing Survey; CHARLS—China Health and Retirement Longitudinal Study; DSST—Digit Symbol Substitution Test; EHCR—Electronic home care record; FCB—Frailty Care Bundl; GBDT—Gradient-Boosting Decision Tree; GLIM—Global Leadership Initiative on Malnutrition; GNB—Gaussian Naïve Bayes; GNSGCSPEN—Geriatric Nutrition Study Group of the Chinese Society for Parenteral and Enteral Nutrition; ICU—Intensive care unit; INSCOC—Investigation on Nutrition Status and Clinical Outcome of Common Cancers; KNHNES—Korea National Health and Nutrition Examination Survey; NHANES—National Health and Nutrition Examination Survey; NIGSBRH—National Institute of Gastroenterology “S. de Bellis” Research Hospital; PR-AUC—Precision–Recall Area Under the Curve; RUSBoost—Random under-sampling with a boosting algorithm; SHAP—SHapley Additive exPlanations; SMR—Standardized mortality ratio.

Table 10. ML research in geriatric healthcare disseminated in MDPI.

Reference	Objective	Dataset	ML Model	Metrics
[75]	Predict cardiovascular aging for ≥65 year old.	800 patients, Almaty; clinical and behavioral data	k-means, RF	Accuracy = 98.8%; AUC-ROC = 0.989
[76]	Assess locomotive syndrome via FTSTS and IMU	144 features from IMU (pelvis), GLFS-25 labels	MLP, PCA (best)	Accuracy = 90%; F1-score = 91%
[63]	Identify sarcopenia markers in elderly Italians	1971 adults, +65 year old, DXA-based diagnosis	RF, LR	Accuracy = 95%, Sensitivity 1, Specificity = 0.94
[77]	Predict PIMs in geriatric outpatients	11,741 prescriptions, 41 PIM types, China	CC, CatBoost (best), XGBoost, TabNet etc.	Accuracy = 97.83%, F1-score = 90.69%
[78]	Compare ML models for TBI prognosis in elderly	MIMIC-III, adults +65 year old, TBI, training set and testing set ratio = 7:3	AdaBoost, RF, LR	AUC: 0.799 (AdaBoost); AUC: 0.795 (RF); AUC: 0.792 (LR)
[79]	Predict sarcopenia reversal after exercise	90 older adults, CT scans	Stacking (best of 11 models)	Accuracy = 85.7 ± 10.6%, F1-score = 75.3 ± 11.5%
[80]	Detect geriatric depression and anxiety via wearables	352 patients, 60–90 years old, EWUMH	RFC, GBC, SVC, LRC, Multi-Label Classification	Hamming Loss = 0.0000 for RFC (depression)—RFC (anxiety), GBC (depression)—RFC (anxiety)
[81]	Detect nocturnal wandering for individuals in elderly hostel	26 elderly, 2762 bed-exit events	eNightLog	Accuracy = 99.8%, Precision = 99.6%, Sensitivity = 100%, Specificity = 99.6%
[82]	Early detection of locomotive decline	13 kinematic features (squat features and one-leg standing features)	ANN, RF Regressor	R² up to 0.86 (RF Regressor)
[54]	Predict delirium episodes in hospitalized elderly	78 patients, 1149 observations	RF, LR	R² = 75.8%, RMSE = 3.29
[31]	Predict frailty from Kinect skeleton data	787 elderly, functional exercises (chair stands etc.)	KNN, SVC, MLP, BC, VC	Accuracy = 97.5% (SVC, MLP)
[83]	Identify cardiac chest pain predictors	258 geriatric patients (60-year-old average), MNHT Mexico, 27 variables	LR	Accuracy = 96.4%; F1-score = 91.2%
[84]	Human joint detection in depth images	Custom depth image dataset	HRDepthNet (adapted HRNet)	Precision = 92%; Recall = 88.2%; F1-score = 90.1%
[85]	Real-time human motion tracking in natural settings	Body-mounted sensor, 6 h of data, 3 subjects, >40,000 steps	NN	Accuracy ≈ 12 ms

Note: AdaBoost—Adaptive Boosting; BC—Bagging Classifier; CC—Classifier chains; CT—Computed tomography; DXA—Dual-energy X-ray absorptiometry; EWUMH—Ewha Womans University Mokdong Hospital; FTSTS—Five-Time Sit-to-Stand Test; GBC—Gradient-boosting classifier; HRDepthNet—High-Resolution Depth Net; HRNet—High-Resolution Net; IMU—Inertial Measurement Unit; LRC—Logistic regression classifier; MIMIC—Medical Information Mart for Intensive Care; MNHT—Medical Norte Hospital in Tijuana; PCA—Principal component analysis; PIM—Potentially inappropriate medications; RFC—Random forest classifier; RMSE—Root Mean Square Error; TBI—Traumatic brain injury; VC—Voting Classifier.

Table 11. Evaluating the quality of studies.

Reference	Dataset	Performance Metrics	Clinical Applicability	Total
[15]	2	0	2	4
[26]	2	2	2	6
[28]	2	2	2	6
[30]	2	2	2	6
[31]	2	2	1	5
[39]	2	2	2	6
[41]	1	1	1	3
[45]	1	2	2	5
[49]	1	2	2	5
[52]	2	2	1	5

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Stancu, A.; Rosca, C.-M.; Iovanovici, E.M. Applications of Machine Learning Algorithms in Geriatrics. Appl. Sci. 2025, 15, 8699. https://doi.org/10.3390/app15158699

AMA Style

Stancu A, Rosca C-M, Iovanovici EM. Applications of Machine Learning Algorithms in Geriatrics. Applied Sciences. 2025; 15(15):8699. https://doi.org/10.3390/app15158699

Chicago/Turabian Style

Stancu, Adrian, Cosmina-Mihaela Rosca, and Emilian Marian Iovanovici. 2025. "Applications of Machine Learning Algorithms in Geriatrics" Applied Sciences 15, no. 15: 8699. https://doi.org/10.3390/app15158699

APA Style

Stancu, A., Rosca, C.-M., & Iovanovici, E. M. (2025). Applications of Machine Learning Algorithms in Geriatrics. Applied Sciences, 15(15), 8699. https://doi.org/10.3390/app15158699

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Applications of Machine Learning Algorithms in Geriatrics

Abstract

1. Introduction

2. Methodology

3. ML Models Used in the Diagnosis and Monitoring of Geriatric Patients

3.1. The Most Studied ML Models in Recent Literature

3.2. Classification of the Studied Models Based on Objectives

3.3. Metrics for Evaluating the Performance of ML Models

4. Bibliometric Analysis

4.1. Analysis of Publication-Based Metrics

4.2. Comparative Analysis in WOS, PubMed, IEEE, and Scopus

4.3. Overview of ML Studies in Geriatrics

5. Discussions

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI