Abstract
Background: Given the importance of early-life intervention in reducing future obesity, screening models that more accurately identify infants at risk are crucial. The aim of this study was to develop models based solely on growth parameters for better predicting childhood overweight than the current WHO growth chart risk prediction. Methods: A retrospective national cohort study was conducted among children born in Israel between August 2014 and June 2016, followed for at least 18 months. Machine learning models were generated to predict childhood overweight. Three models for 0–3, 3–6, and 6–12 months were generated. These models were compared with the current WHO growth chart predictions. The outcome was defined as overweight in early childhood, based on weight-for-length (or weight-for-height) ≥97th percentile, according to WHO standards, measured at 18–36 months of age. Results: Overall, 198,503 children were included, and 150,572, 146,584, and 149,628 infants were included in Models 1, 2, and 3, respectively, with an average target age of two years. The models demonstrated high predictive performance (AUC) for the 0–3 months’ model (0.76 [95% CI: 75 to 76.9%]), for the 3–6 months’ model (0.822 [95% CI: 81.3 to 83.0%]) and for 6–12-month-old infants (0.872 [95% CI: 86.6 to 87.8%]). The first two models better predict the risk of early childhood overweight than the current WHO growth chart prediction. Conclusions: These models are unique in that they are based on growth parameters, usually screened at early childhood worldwide, and can be implemented in any system collecting growth measurements of infants, providing better risk prediction than the current WHO growth charts. A web calculator is provided.
1. Introduction
The worldwide prevalence of adult obesity has nearly tripled over the last four decades [1], and the prevalence of obesity in infants, children, and adolescents is rapidly growing [2]. Childhood and adolescent obesity are associated with several co-morbidities, including abnormalities in the endocrine, cardiovascular, gastrointestinal, pulmonary, orthopedic, neurologic, dermatologic, and psychosocial systems [3,4]. Overweight in infancy is an independent risk factor for obesity in childhood, adolescence, and adulthood [5]. There is strong evidence to support the beneficial effects of child obesity prevention and intervention programs on later obesity, and the earlier the intervention is performed, the greater the effect measured [6]. Prominent health organizations such as the World Health Organization (WHO), the Institute of Medicine (IOM) in the United States, and the American Academy of Pediatrics (AAP) call for early intervention to reduce childhood obesity [7,8].
One of the key elements of child preventive medicine programs worldwide is the use of the WHO Child Growth Standards [9] in every well-child visit. This instrument is used for early growth screening and is the main suggested screening tool for overweight and obesity. According to the WHO, the definition of overweight for children under 5 years is a weight-for-length (or weight-for-height) greater than 2 standard deviations (97th percentile) above the WHO Child Growth Standards median [1]. Children between the 85th and 97th percentile lines on the growth chart are defined as at risk for overweight [10]. As part of efforts to prevent childhood obesity, the IOM recommends that healthcare professionals consider children’s attained weight-for-length at or above the 85th percentile as a risk factor in assessing which young children are at the highest risk for later obesity [10]. Given the importance of early intervention in preventing future overweight and obesity, it is crucial to use the best screening models to identify infants at risk. The main objective of this study was to develop prediction models for early identification of overweight and obese children in a large national cohort of infants in Israel. We then compared these models to the current suggested WHO worldwide screening paradigm.
2. Methods
This retrospective cohort study included all children born in Israel between August 2014 and June 2016 who were followed for at least 18 months and had available records in our database.
2.1. The Database
Preventive care and health promotion services for pregnant women and early childhood are provided in Israel through the healthcare mother and child care centers (“Tipat Halav”). They provide free health and medical screening for pregnant women and for child growth and development for all Israeli residents [11]. The objectives of the services provided are the prevention of infectious diseases through immunization, the early detection of health problems through routine growth and development examinations, and health education. Recommended visits are at post-birth and at 1, 2, 4, 6, 9, 12, 18, 24, 36, 48, and 60 months afterward. The healthcare staff includes nurses and physicians who have been specially trained in children’s public health [12].
There are about 1000 Tipat Halav clinics across Israel, operated by various parties. The Ministry of Health (MOH) and some local municipalities cover and operate approximately 80% of the clinics and share a single, comprehensive, computerized database used in this study. Other children are also served free of charge by additional clinics, mainly operated by the four Health Maintenance Organizations (HMOs) operating in Israel, based on convenience. All the data are entered into a national electronic health record specifically designed for monitoring child growth and development. This database comprises the national database of the Tipat Halav Israeli Screening (THIS) program database. To the best of our knowledge, this is a unique, continuously updated database that contains elaborated data, including family demographics, children’s growth measures, developmental milestones, vaccinations, and age-related health measures and assessments.
2.2. Inclusion and Exclusion Criteria
Three models were developed for the following age groups: 0–3, 3–6, and 6–12 months. Separate models were built since more data were available from later visits, and later measurements may be better predictors for childhood overweight and obesity. In all models, we included children with available records in the MOH database who were born between August 2014 and June 2016 and had at least 18 months of follow-up. Included children were only those with at least one weight-for-height measurement at 18–36 months. Also, each model included children with at least one weight-for-height measurement record for the specific time period (0–3, 3–6, and 6–12 months for the first, second, and third models, respectively). We excluded preterm infants because we believe this population warrants separate, careful analysis.
2.3. Model Variables
For each of the three models, the dependent variable (target variable) was defined as an indicator variable if the last visit measurement of the weight-for-height (or weight-for-length) percentile at 18–36 months was equal to or greater than the 97th percentile.
Various independent variables (predictors) were extracted from the THIS database, including demographics (age, gender, religion), birth variables (birth weight, birth type, gestational week, APGAR score), mother’s characteristics (mother’s age, education, depression status based on the Edinburgh Postnatal Depression Scale (EPDS) [13], family status, immigration status), child growth measurements (weight, length or height, weight-for-length, and head circumference percentiles), nutrition (nursing period, vitamin-D supplementation, iron supplementation, age at starting formula, formula type, age at starting water).
Because trends in growth measurements may be crucial for predicting the risk of overweight, we created new variables based on children’s growth measurements. These variables included the linear slope of the last weight-for-length percentile measurements, linear slope of the last weight percentile measurements, difference between the last and first weight-for-length percentiles divided by the age difference, ratio between the head circumference percentile and the weight-for-length percentile, ratio between the last two weight-for-length percentiles, maximum growth measurement percentile (weight-for-length and weight percentiles) throughout the relevant model period (0–3, 3–6, 6–12 months), and the higher measurement of the last two or more weight-for-length measurements throughout the relevant model period (0–3, 3–6, 6–12 months).
Since all variables included in the model are routinely recorded in the THIS program database regardless of the child’s weight, the outcome and its predictors are assessed blindly in the current study.
2.4. Machine Learning Algorithms
Machine learning is considered a method that can augment the diagnosis of chronic diseases. The key distinction between traditional approaches and machine learning is that in machine learning, a model learns from examples rather than being programmed with rules. Using algorithms for learning from observations, computers then determine how to perform the mapping from features to labels to create a model that will generalize the information, enabling a task to be performed correctly with new, never-before-seen inputs [14]. Feature selection was performed using a Random Forest machine learning algorithm. Random Forest algorithms determine the variables’ importance by calculating the relative influence of each variable: this determination is based on whether that variable was selected to split the tree further during the tree building process, and on how much the squared error at the split nodes was reduced, aggregated over all trees [15,16].
Variables with high relative importance were included in the XGBoost algorithm. XGBoost is a supervised learning algorithm that uses a boosting technique to produce accurate models. Boosting is an ensemble learning technique that builds many models sequentially, with each new model attempting to correct for the deficiencies of the previous model. In tree boosting, each new model that is added to the ensemble is a decision tree [17]. Feature importance derived from the Random Forest model guided the selection of predictors included in the XGBoost models. The final set of variables was determined based on model performance stability and overfitting avoidance, rather than a predefined importance threshold. Hyperparameter tuning for the XGBoost models was performed using a random search with 5-fold cross-validation. The final model configuration included a tree depth of three and 200 trees, as these settings provided stable predictive performance without excessive model complexity.
Missing values were not imputed during model training; the machine-learning algorithms handled them natively. During tree building, split decisions for each node were optimized by minimizing the loss function, while allowing missing values to be treated as a separate category that could be assigned to either branch. This approach does not assume a specific clinical or socioeconomic reason for missingness, but rather allows the algorithm to determine whether patterns of missing data contribute to prediction performance, without implying causality or introducing predefined bias.
The model population was randomly divided into a training set (70%) and a test set (30%). Model construction was performed using the training set, and the test set was used for model evaluation. This approach was chosen to reflect real-world implementation scenarios. Future studies may explore additional internal validation strategies to further assess robustness.
2.5. Model Evaluation
Model evaluation was performed on the test set. The model risk scores were calculated for each individual in the test set. Then each individual was ranked by its risk score into percentiles (100 = high risk, 1 = low risk). We measured specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV) to assess the model’s performance. Lift and AUC were also calculated. The performance of each of the three models was compared to that currently used by WHO for weight-for-length prediction. For comparison, WHO-based screening was defined using weight-for-length percentiles measured during the corresponding infant age windows, with overweight defined as ≥97th percentile according to WHO standards. This approach reflects current screening practice rather than a predictive model and was used as a benchmark for evaluating model performance.
2.6. Statistical Analyses
Statistical analyses were performed using Python version 3.6.7. Random Forest and XGBoost models were constructed using the H2O package version 3.22 [18].
The study follows transparent reporting of a multivariable prediction model for individual prognosis or diagnosis, the TRIPOD statement.
2.7. Ethics Declarations
All analyses were carried out in accordance with relevant guidelines and regulations. The study protocol was approved by the Soroka University Medical Center Institutional Ethics Committee (MHC-0014-19; 5 July 2018). Informed consent was waived owing to the use of de-identified data.
3. Results
A total of 198,503 children born between August 2014 and June 2016 who had relevant records in the THIS national program database were screened (approximately 60% of the children born in Israel during this period). Altogether, 15,310 pre-term children (gestational age < 37 weeks), children lacking at least one visit (N = 25,954), or with a missing value of the weight-for-height measurement between ages 18 and 36 months (N = 1795) were excluded from the analysis. Children lacking at least one visit during the specific model period (0–3 months: N = 28, 3–6 months: N = 8039, and 6–12 months: N = 5374) and children with missing weight-for- length (or weight-for-height) measurements were also excluded (0–3 months: N = 4444, 3–6 months: N = 821, and 6–12 months: N = 442). Overall, 150,572, 146,584, and 149,628 children were included in Models 1, 2, and 3, respectively (Figure 1). Main cohort characteristics are summarized in Table 1.
Figure 1.
The models’ cohorts: Sample sizes differ across models due to age-specific inclusion criteria and the availability of measurements within each age window.
Table 1.
Cohort characteristics.
3.1. Prediction Equations
3.1.1. Model for 0–3 Months
Feature selection was performed using a Random Forest machine learning algorithm. Of all variables entered into the model, 15 were selected for further analysis using an XGBoost model (Table 2). The performance of the 0–3-month model for different cut points is presented in Table 3. This model prediction was compared to the prediction made using the WHO weight-for-length percentile. As demonstrated in Figure 2, the AUC of this model was 0.760 [95% CI: 75.4 to 77.3%]. These values were substantially higher than the AUC calculated using the current WHO model based on the weight-for-length percentile only: 0.689 [95% CI: 67.9 to 69.9%].
Table 2.
Final variables selected for each model sorted by relative importance.
Table 3.
The models’ performance at different cut points.
Figure 2.
AUC ROC Curves of the models: ROC curves compare model-based prediction to WHO weight-for-length percentile–based screening (≥97th percentile) for each age window.
3.1.2. Model for 3–6 Months
Of the variables described, 14 were selected for further analysis using the XGBoost model (Table 2). The performance of the 3–6-month model across different cut points is shown in Table 3. The AUC of this model was 0.822 [95% CI: 81.3 to 83.0%]. Again, these values were significantly higher than the AUC calculated using the current WHO model based on the weight-for-length percentile: 0.79 [95% CI: 78.0 to 79.9%]. (p < 0.05). The AUC curves are presented in Figure 2.
3.1.3. Model for 6–12 Months
Of all variables entered into the model, 9 were selected for further analysis using the XGBoost model (Table 2). Unlike previous models, the difference between the AUC values of the two models was not significantly different; the AUC of our model was 0.872 [95% CI: 86.6 to 87.8%], and the AUC using the current WHO model was 0.863 [95% CI: 85.6 to 87%] (Figure 2). The sensitivity, specificity, PPV, and NPV of each model are shown in Table 3. Interestingly, although many different variables were entered in each of the models, including demographic variables, birth variables, mothers’ characteristics, nutrition, and child growth measurements, all models were eventually based solely on growth measurement parameters. The only exception was the age when formula was added to or replaced breast milk (“age at starting formula”) for the first 0–3 months’ model, which made a minimal contribution. The performance of the first model, based solely on growth parameters excluding this variable, was examined. The AUC was 0.76 [95% CI: 75 to 76.9%], a reduction of less than 0.5% compared with the model that included this variable. In all other models, growth measurements were the only final variables included.
3.1.4. Validity of the Models for Different Target Ages
Further analyses were conducted for each model when the target weight-for-height percentile was measured at age 18–24 months or 24–36 months. According to the analysis, the three models show similar trends for both earlier and later target ages (Table 1, Appendix A).
3.1.5. Validity of the Models in Different Populations
To examine whether this model’s performance was valid across populations of different ethnic backgrounds, performance was evaluated separately for Muslim (27% of the cohort) and Jewish (66% of the cohort) infants. No differences were found between these populations. For example, using the 3–6-month model, the AUC of Muslim infants was 0.817 [95% CI: 80.1 to 83.5%], and for Jewish infants, 0.826 [95% CI: 81.6 to 83.7%], with no significant difference between these two subpopulations.
4. Discussion
Three different models were built for 0–3-, 3–6-, and 6–12-month-old infants. Our models demonstrate higher AUC performance than the WHO Child Growth Standards for the 0–3- (0.760 vs. 0.689; p < 0.05) and 3–6-month (0.822 vs. 0.79; p < 0.05) models. However, for 6–12 months, we could not establish a model with significantly better performance than the WHO model (0.872 vs. 0.863). The model’s AUC was higher for older infants (6–12 months > 3–6 months > 0–3 months). This might, in part, be due to additional measurements being available for the older infants. It is also possible that the higher performance observed in older infants reflects the shorter interval between the age at prediction and the age at which overweight was assessed. To address this, we evaluated model performance separately for target measurements at 18–24 months and 24–36 months. Model performance remained stable across these age ranges, with no significant differences in AUC, suggesting robustness across the toddler age window.
In this study, we assessed the contribution of multiple variables, including demographics, pregnancy and birth variables, mothers’ characteristics, and nutrition. However, the most significant predictors were based on growth measurements commonly conducted during infant surveillance visits. We assume that growth measurements at birth and throughout development most accurately reflect the final effects of all relevant factors that contribute to a child’s risk of overweight and obesity. Although there are many genetic, pregnancy, birth-related, or later early-years’ risk factors for overweight and obesity, they will have a major effect on the child’s growth measurements used for predicting overweight. Since growth measurements are recommended to be routinely measured at well-baby or vaccination visits, these data should be available for parents and healthcare providers [19,20].
This study builds a machine learning clinical prediction model for early childhood overweight using a comprehensive national cohort, including a multivariable database of healthy children. This model can be easily implemented in systems that collect growth measurements, providing better prediction of the risk of early childhood overweight than the current WHO prediction, using the same commonly collected measurements at no additional effort. Link for THIS early childhood overweight risk calculator is provided here (https://www.yeled-dev.org/risk-of-overweight-calculator/ (accessed on 8 January 2026)).
Although clinical prediction models are increasingly prevalent [21,22], few studies have focused on the prediction of overweight or obesity in early childhood [23,24,25,26,27,28,29,30]. Most models are based on small to medium samples rather than a national cohort, and none provide easily implemented tools based solely on growth measurements.
Using these suggested models may improve infant screening for overweight. According to the WHO assessments, if current trends continue, the number of overweight or obese infants and young children under five years of age globally will increase to 70 million by 2025 [1]. Using our 0–3-month model (threshold selected—5% with the highest score), out of 1,000,000 newborns per year, it will be possible to detect 4500–5000 more overweight or obese children than using WHO prediction (over 97th percentile) (based on the sensitivity difference), while 4500–5000 more children will be accurately classified as being at no risk for overweight (based on the specificity difference). Thus, this model is important when targeted interventions are considered for infants at risk for overweight and obesity, given the need for directing resources to the actual populations in need. Different thresholds are provided to ensure an appropriate match between the intervention’s resources and costs and the relevant target population. For infants aged 6–12 months, the predictive performance of the proposed model was comparable to that of the WHO growth chart–based approach. While this limits the incremental statistical advantage of the model in this age group, its potential value lies in other aspects, including automation, consistency across age-specific models, and ease of integration into electronic medical record–based surveillance systems. Importantly, the main clinical added value of the proposed approach is observed during the first six months of life, when the models demonstrate clear superiority over the WHO growth charts and when early preventive interventions may have the greatest impact.
As with any screening tool, false-positive and false-negative classifications must be considered in a clinical context. In particular, identifying an infant as being at increased risk for overweight at an early age may raise concerns regarding potential parental anxiety. It is important to emphasize that the proposed models are intended for risk stratification rather than diagnosis, and should be used to guide supportive, non-stigmatizing nutritional counseling and follow-up. When applied appropriately, the benefits of early identification and targeted prevention may outweigh the potential psychosocial risks associated with false-positive results.
There are some limitations to our study. The research was conducted only with children who attended Tipat Halav clinics. Children of parents not compliant with follow-ups were excluded. The Israeli immunization coverage in Tipat Halav centers ranges between 94% and 99%, reflecting the high compliance with this service [31].
Also, the study was conducted only with children from Israel. However, the Israeli population is multicultural, and although a wide range of demographic, maternal, pregnancy, and delivery-related variables were initially included in the models, none were retained in the final models. The final prediction models were based exclusively on routinely collected growth parameters, which are universally measured in early childhood surveillance programs worldwide. This suggests that model performance relies primarily on growth trajectories rather than population-specific characteristics, supporting the potential external validity of the proposed models. Nevertheless, external validation in other countries and healthcare systems is required to confirm their generalizability.
Also, there are further variables omitted in this study that might further improve the model’s prediction, such as mother’s Body Mass Index (BMI) prior to pregnancy, maternal smoking status during pregnancy and paternal BMI [30]; however, these variables are not commonly available in any electronic medical records (EMRs), and thus, models with such variables are not feasible for easy implementation within these systems. Finally, although discrimination performance was strong, further evaluation of model calibration and clinical utility metrics will be important for future implementation studies.
Overweight is examined in the current study for children up to three years of age. Although this is known to be a crucial period in determining the chances for obesity as adults [5], it is important to further evaluate the model performance for older children and adults. The current models excluded infants with a gestational age of less than 37 weeks. Further specific models should be developed for this population.
5. Conclusions
This study introduces innovative models for predicting early childhood overweight and obesity, leveraging commonly screened growth parameters in early childhood. These models outperform the current WHO growth charts in predicting the risk of overweight in young children. Their universal applicability, owing to the routine collection of growth measurements globally, makes them a significant tool in early obesity intervention. Additionally, the study provides a web calculator, enhancing the models’ practical utility and ease of implementation in diverse healthcare settings. From a practical perspective, these models enable early risk stratification using data that are already available during routine well-child visits. This allows healthcare providers to identify infants at increased risk for overweight at a very early age, supporting targeted nutritional counseling, closer growth monitoring, and early preventive interventions. Such an approach may improve decision-making in both clinical and public health settings by focusing resources on infants most likely to benefit from early intervention.
Author Contributions
I.L.S. conceptualized and designed the study, drafted the initial manuscript, and reviewed and revised the manuscript. D.G., I.G. and Y.S. conceptualized and designed the study, drafted the initial manuscript, and reviewed and revised the manuscript. R.E., A.P. designed the web calculator. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
All analyses were carried out in accordance with relevant guidelines and regulations. The study protocol was approved by the Soroka University Medical Center Institutional Ethics Committee and conducted in accordance with the principles of the Declaration of Helsinki (MHC-0014-19; 5 July 2018). The need for informed consent was waived owing to the use of de-identified data.
Data Availability Statement
The deidentified database contains the medical records of Israeli children that support the findings of this study, which were used under license from the Israeli Ministry of Health for the current study, and may be available for other under a similar license.
Acknowledgments
We would like to thank the Timna Initiative team of the Israeli Ministry of Health for their important contribution to the data analysis in this study. Timna is a national platform that supports the collection, storage and analysis of large-scale data using advanced research tools for analyzing big data.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
WHO—World Health Organization; AUC—Area Under the ROC Curve; CI—Confidence Interval; IOM—Institute of Medicine; AAP—American Academy of Pediatrics; MOH—Ministry of Health; HMO—Health Maintenance Organization; THIS program—Tipat Halav Israeli Screening program; EPDS—Edinburgh Postnatal Depression Scale; PPV—Positive Predictive Value; NPV—Negative Predictive Value; BMI—Body Mass Index; EMR—Electronic Medical Record.
Appendix A
Table A1.
AUC of the three models for children using the last measurements at 18–24 months and 24–36 months separately.
Table A1.
AUC of the three models for children using the last measurements at 18–24 months and 24–36 months separately.
| Model | Target: 18–24 Months | Target: 24–36 Months | ||
|---|---|---|---|---|
| ML Model | WHO | ML Model | WHO | |
| 0–3 months | 77.12% | 69.43% | 75.81% | 68.61% |
| 3–6 months | 83.91% | 81% | 80.96% | 77.50% |
| 6–12 months | 89.27% | 88.43% | 85.55% | 84.64% |
Footnote: Values are presented separately for each model according to age-specific inclusion criteria (0–3, 3–6, and 6–12 months). Differences in sample size reflect the availability of growth measurements within each age window and model-specific eligibility requirements.
References
- Obesity and Overweight. Available online: https://www.who.int/news-room/fact-sheets/detail/obesity-and-overweight (accessed on 21 December 2019).
- Ogden, C.L.; Carroll, M.D.; Lawman, H.G.; Fryar, C.D.; Kruszon-Moran, D.; Kit, B.K.; Flegal, K.M. Trends in Obesity Prevalence Among Children and Adolescents in the United States, 1988–1994 Through 2013–2014. JAMA 2016, 315, 2292–2299. [Google Scholar] [CrossRef]
- GBD 2015 Obesity Collaborators. Health Effects of Overweight and Obesity in 195 Countries over 25 Years. N. Engl. J. Med. 2017, 377, 13–27. [CrossRef]
- Comorbidities and Complications of Obesity in Children and Adolescents—UpToDate|Prediabetes|Metabolic Syndrome. Scribd. Available online: https://www.scribd.com/document/373770796/Comorbidities-and-Complications-of-Obesity-in-Children-and-Adolescents-UpToDate (accessed on 8 January 2026).
- Geserick, M.; Vogel, M.; Gausche, R.; Lipek, T.; Spielau, U.; Keller, E.; Pfäffle, R.; Kiess, W.; Körner, A. Acceleration of BMI in Early Childhood and Risk of Sustained Obesity. N. Engl. J. Med. 2018, 379, 1303–1312. [Google Scholar] [CrossRef] [PubMed]
- Wen, L.M.; Baur, L.A.; Simpson, J.M.; Rissel, C.; Wardle, K.; Flood, V.M. Effectiveness of home based early intervention on children’s BMI at age 2: Randomised controlled trial. BMJ 2012, 344, e3732. [Google Scholar] [CrossRef] [PubMed]
- WHO. Report of the Commission on Ending Childhood Obesity: Implementation Plan: Executive Summary. Available online: https://iris.who.int/server/api/core/bitstreams/34dc9459-151a-4dc2-abfd-1d8f30ccaef5/content (accessed on 8 January 2026).
- Daniels, S.R.; Hassink, S.G.; Committee on Nutrition. The Role of the Pediatrician in Primary Prevention of Obesity. Pediatrics 2015, 136, e275–e292. [Google Scholar] [CrossRef] [PubMed]
- World Health Organization. WHO Child Growth Standards: Length/Height-for-Age, Weight-for-Age, Weight-for-Length, Weight-for-Height and Body Mass Index-for-Age: Methods and Development; World Health Organization: Geneva, Switzerland, 2006; ISBN 924154693x. [Google Scholar]
- Committee on Obesity Prevention Policies for Young Children; Institute of Medicine. Early Childhood Obesity Prevention Policies; National Academies Press: Washington, DC, USA, 2011; ISBN 978-0-309-21024-9. [Google Scholar]
- Israel: Health System Review. PubMed—NCBI. Available online: https://www.ncbi.nlm.nih.gov/pubmed/?term=israel+health+system+review+2015+rosen (accessed on 17 January 2020).
- Sudry, T.; Zimmerman, D.R.; Yardeni, H.; Joseph, A.; Baruch, R.; Grotto, I.; Greenberg, D.; Eilenberg, R.; Amit, G.; Akiva, P.; et al. Standardization of a Developmental Milestone Scale Using Data from Children in Israel. JAMA Netw. Open 2022, 5, e222184. [Google Scholar] [CrossRef] [PubMed]
- Mother and Child Clinics. Available online: https://www.mevaker.gov.il/sites/DigitalLibrary/Pages/Reports/768-17.aspx?AspxAutoDetectCookieSupport=1 (accessed on 21 December 2019).
- Rajkomar, A.; Dean, J.; Kohane, I. Machine Learning in Medicine. N. Engl. J. Med. 2019, 380, 1347–1358. [Google Scholar] [CrossRef] [PubMed]
- The H2O Python Module—H2O Documentation. Available online: http://docs.h2o.ai/h2o/latest-stable/h2o-py/docs/intro.html (accessed on 21 December 2019).
- General—H2O 3.28.0.1 Documentation. Available online: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/faq/general.html (accessed on 18 January 2020).
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining—KDD 16, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Aiello, S.; Click, C.; Roark, H.; Rehak, L. Machine Learning with Python and H2O; H₂O.ai Inc.: Mountain View, CA, USA, 2015. [Google Scholar]
- American Academy of Pediatrics. Recommendations for Preventive Pediatric Health Care. 2015. Available online: https://pediatrics.aappublications.org/content/136/3/e727 (accessed on 18 January 2020).
- Using the New UK-World Health Organization 0–4 Years Growth Charts—GOV.UK. Available online: https://www.gov.uk/government/publications/using-the-new-uk-world-health-organization-0-4-years-growth-charts (accessed on 18 January 2020).
- Alonzo, T.A. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating: By Ewout W. Steyerberg. Am. J. Epidemiol. 2009, 170, 528. [Google Scholar] [CrossRef]
- Machine Learning and Prediction in Medicine—Beyond the Peak of Inflated Expectations. Available online: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5953825/ (accessed on 17 January 2020).
- Levine, R.S.; Dahly, D.L.; Rudolf, M.C.J. Identifying infants at risk of becoming obese: Can we and should we? Public Health 2012, 126, 123–128. [Google Scholar] [CrossRef] [PubMed]
- Lee, I.; Bang, K.S.; Moon, H.; Kim, J. Risk Factors for Obesity Among Children Aged 24 to 80 months in Korea: A Decision Tree Analysis. J. Pediatr. Nurs. 2019, 46, e15–e23. [Google Scholar] [CrossRef] [PubMed]
- Colmenarejo, G. Machine Learning Models to Predict Childhood and Adolescent Obesity: A Review. Nutrients 2020, 12, 2466. [Google Scholar] [CrossRef] [PubMed]
- Pang, X.; Forrest, C.B.; Lê-Scherban, F.; Masino, A.J. Prediction of early childhood obesity with machine learning and electronic health record data. Int. J. Med. Inf. 2021, 150, 104454. [Google Scholar] [CrossRef]
- Butler, É.M.; Derraik, J.G.B.; Taylor, R.W.; Cutfield, W.S. Prediction Models for Early Childhood Obesity: Applicability and Existing Issues. Horm. Res. Paediatr. 2018, 90, 358–367. [Google Scholar] [CrossRef] [PubMed]
- Singh, B.; Tawfik, H. Machine Learning Approach for the Early Prediction of the Risk of Overweight and Obesity in Young People. In Computational Science—ICCS 2020; Krzhizhanovskaya, V.V., Závodszky, G., Lees, M.H., Dongarra, J.J., Sloot, P.M., Brissos, S., Teixeira, J., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 523–535. [Google Scholar]
- Redsell, S.A.; Rose, J.; Weng, S.; Ablewhite, J.; Swift, J.A.; Siriwardena, A.N.; Nathan, D.; Wharrad, H.J.; Atkinson, P.; Watson, V.; et al. Digital technology to facilitate Proactive Assessment of Obesity Risk during Infancy (ProAsk): A feasibility study. BMJ Open 2017, 7, e017694. [Google Scholar] [CrossRef] [PubMed]
- Rotevatn, T.A.; Mortensen, R.N.; Ullits, L.R.; Torp-Pedersen, C.; Overgaard, C.; Høstgaard, A.M.B.; Bøggild, H. Early-life childhood obesity risk prediction: A Danish register-based cohort study exploring the predictive value of infancy weight gain. Pediatr. Obes. 2021, 16, e12790. [Google Scholar] [CrossRef] [PubMed]
- Ministry of Health (Israel). Vaccination Coverage for Routine Vaccines Provided at Tipat Halav (Well-Baby Clinics), 2017–2022. Available online: https://www.gov.il/he/pages/vaccination-coverage-june-2023?utm_source=chatgpt.com (accessed on 8 January 2026).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.

