You are currently viewing a new version of our website. To view the old version click .
Healthcare
  • Article
  • Open Access

16 December 2025

Artificial Intelligence in Sub-Elite Youth Football Players: Predicting Recovery Through Machine Learning Integration of Physical, Technical, Tactical and Maturational Data

,
,
,
,
and
1
Biosciences Higher School of Elvas, Polytechnic Institute of Portalegre, 7300-110 Portalegre, Portugal
2
Department of Sport, Exercise and Health Sciences, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
3
Research Centre for Active Living and Wellbeing (Livewell), Instituto Politécnico de Bragança, 5300-253 Bragança, Portugal
4
Department of Sports, Higher Institute of Educational Sciences of the Douro, 4560-708 Penafiel, Portugal
This article belongs to the Special Issue From Prevention to Recovery in Sports Injury Management

Abstract

Background: Monitoring training load and recovery is essential for performance optimization and injury prevention in youth football. However, predicting subjective recovery in preadolescent athletes remains challenging due to biological variability and the multidimensional nature of training responses. This exploratory study examined whether supervised machine learning (ML) models could predict Total Quality of Recovery (TQR) using integrated external load, internal load, anthropometric and maturational variables collected over one competitive microcycle. Methods: Forty male sub-elite U11 and U13 football players (age 10.3 ± 0.7 years; height 1.43 ± 0.08 m; body mass 38.6 ± 6.2 kg; BMI 18.7 ± 2.1 kg/m2) completed a microcycle comprising four training sessions (MD-4 to MD-1) and one official match (MD). A total of 158 performance-related variables were extracted, including external load (GPS-derived metrics), internal load (RPE and sRPE), heart rate indicators (U13 only), anthropometric and maturational measures, and tactical–cognitive indices (FUT-SAT). After preprocessing and aggregation at the player level, five supervised ML algorithms—K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and Gradient Boosting (GB)—were trained using a 70/30 train–test split and 5-fold cross-validation to classify TQR into Low, Moderate, and High categories. Results: Tree-based models (DT, GB) demonstrated the highest predictive performance, whereas linear and distance-based approaches (SVM, KNN) showed lower discriminative ability. Anthropometric and maturational factors emerged as the most influential predictors of TQR, with external and internal load contributing modestly. Predictive accuracy was moderate, reflecting the developmental variability characteristics of this age group. Conclusions: Using combined physiological, mechanical, and maturational data, these ML-based monitoring systems can simulate subjective recovery in young football players, offering potential as decision-support tools in youth sub-elite football and encouraging a more holistic and individualized approach to training and recovery management.

1. Introduction

The continuous monitoring of training load and recovery is fundamental for optimizing performance and preventing injuries in youth football athletes, who exhibit high variability in physiological, mechanical, and cognitive responses due to differences in maturation, anthropometry, and technical-tactical understanding [1,2]. Individualized monitoring strategies that account for these developmental differences are essential, incorporating tools to assess both physiological and psychological parameters [1] and implementing age-appropriate, engaging injury prevention protocols [3,4]. Effective programs should integrate coach education and athlete feedback to support physical and emotional well-being [5]. Comprehensive monitoring must include external loads, perceived exertion, and recovery data to provide a holistic understanding of athlete readiness and injury risk, enabling evidence-based, adaptive training interventions [6,7,8]. Moreover, contextual and interpersonal factors such as coaching style and social support play critical roles in athlete safety and development [9]. Ultimately, the interplay between individualized load monitoring, age-related physiological responses, and systematic injury prevention highlights the complexity of managing youth football athletes, underscoring the need for continued research addressing maturation-related differences to sustain performance optimization and injury reduction [7,10].
Integrated load monitoring frameworks have recently emerged to unify diverse data types, combining external loads (e.g., GPS-derived mechanical metrics) with internal loads (e.g., heart rate and rating of perceived exertion, RPE) and tactical-cognitive performance indicators. This allows the generation of comprehensive datasets supporting athlete development [11,12]. However, interpreting such multidimensional information in preadolescent athletes is challenging due to non-linear interactions between growth, training workload, and recovery [13,14]. Accurate interpretation is essential, as youth maturation variability can obscure load–response relationships, and excessive workloads during growth spurts may heighten injury risk [12,14,15]. Therefore, tailored and individualized monitoring strategies are required to adapt training regimens, reducing overload and accounting for athletes’ unique physiological and recovery responses [16,17,18,19]. Future research should explore the complex interplay among load components, physiological responses, and psychosocial factors, potentially leveraging machine learning to predict performance and injury outcomes [20,21,22]. Such advancements would enable more effective and individualized coaching strategies, optimizing athlete welfare and long-term development in demanding youth sports contexts like football [23,24,25,26].
The Total Quality of Recovery (TQR) scale has emerged as a reliable subjective indicator of an athlete’s psychophysiological readiness to perform, showing correlations with training load measures such as session-RPE (sRPE) and heart rate variability (HRV) in both adolescent and adult athletes [8,27]. By integrating psychological and physiological markers, TQR offers a multidimensional perspective on recovery, encompassing factors such as nutrition, hydration, sleep, rest, and physical activity [28]. However, in younger athletes, these relationships appear less consistent due to developmental and perceptual differences that influence fatigue perception and recovery appraisal [29,30]. Traditional physiological metrics may not fully capture the complex interplay of growth, maturation, and psychological stress in youth athletes, which can lead to incomplete recovery and increased injury risk under high training loads [31,32]. Therefore, a deeper understanding of TQR’s applicability to younger populations is necessary to design age-appropriate recovery and training protocols that support safe and sustainable athletic development [33].
Recent advances in machine learning (ML) have introduced powerful tools for analyzing complex, non-linear relationships in sports performance data, enabling the identification of hidden patterns across physiological and behavioral variables to predict readiness, recovery, and performance outcomes [34]. These techniques have proven effective in forecasting match outcomes, injury risk, and workload responses in senior athletes [35,36], yet their application in youth football, particularly regarding subjective recovery measures, remains limited. Challenges arise from developmental differences that affect how young athletes perceive and report recovery, as well as from the psychological and physiological variability inherent to this group [37,38,39]. Youth athletes’ lower developmental maturity may further hinder accurate self-assessment, complicating ML models built primarily on adult data [40,41]. Given the non-linear responses of young athletes to training loads, ML systems must be specifically adapted to account for growth-related variability and perceptual differences [35,39,42]. Overall, while ML holds great potential for improving recovery and performance monitoring, its effective implementation in youth sports demands tailored, developmentally sensitive approaches supported by continued research [43]. These developmental and perceptual instabilities represent a fundamental limitation for ML applications in youth sports, as they reduce signal consistency, increase inter-individual noise, and challenge the reliability of predictive models built on small or heterogeneous samples [44]. As a result, ML approaches validated in adult or elite populations cannot be directly transferred to pre-adolescents without substantial adaptation. This limitation further reinforces the need for age-specific modeling frameworks and highlights the scientific relevance of investigating ML-based recovery prediction in U11 and U13 players.
In parallel with these challenges, the recent literature on load monitoring for young athletes highlights the need for methodologies capable of integrating physical, technical, tactical, and maturational variables to predict subjective recovery states in pre-adolescent players. However, a clear literature gap remains for younger competitive categories, particularly U11 and U13, where developmental heterogeneity is greatest. These ages encompass the initial phases of rapid biological and cognitive maturation, during which perceptual abilities, coordination, and tactical understanding are still consolidating [45,46,47]. While most existing studies focus on adolescent groups (≥U14), early developmental categories remain understudied despite presenting unique challenges: highly variable maturation rates, greater sensitivity to training stress, and less stable subjective perception of recovery [46,48,49]. Investigating U11 and U13 athletes is thus crucial to understanding how growth-related changes influence training tolerance and recovery dynamics. These age groups also represent a stage in which technical and tactical skills are in accelerated development, potentially altering the interaction between load, decision-making, and fatigue [50,51]. Gaining insight into these processes can improve age-appropriate training strategies, support injury reduction, and optimize developmental pathways during the formative years of football participation [52,53]. The need to explore these dynamics reinforces the relevance of targeted research involving U11 and U13 athletes, particularly using integrated and data-driven approaches.
This study aimed to apply ML models to predict TQR based on physiological, mechanical, and maturational variables in sub-elite U11 football players. The specific objectives were to examine correlations between recovery and selected internal, external, and tactical performance metrics, evaluate the predictive accuracy of five supervised ML algorithms, and identify the most influential predictors of perceived recovery. By integrating anthropometric, training load, and cognitive–tactical dimensions, this exploratory analysis sought to enhance the understanding of recovery mechanisms in developing athletes and assess the feasibility of data-driven modeling for monitoring recovery in youth football. This study hypothesized that supervised machine learning models could predict the TQR in sub-elite under-11 football players using integrated physiological, mechanical, and maturational variables. It was further expected that non-linear, tree-based algorithms would achieve higher predictive accuracy than linear models and that maturational and anthropometric factors would emerge as the most influential predictors of perceived recovery.

2. Materials and Methods

2.1. Study Design

This observational and longitudinal study was carried out over the 2024/2025 season and focused on monitoring training and match performance among under-11 (U11) and under-13 (U13) sub-elite football players. The primary aim was to quantify, integrate, and model physiological, mechanical, and tactical variables gathered during training sessions and official matches. These variables provided a multidimensional framework that combined external load (GPS-derived metrics), internal load (heart rate and perceived exertion), and tactical–cognitive indicators (decision-making and positioning indices).
The monitored microcycle comprised four consecutive training sessions (MD-4, MD-3, MD-2, MD-1) and one official match (MD). Each 90 min session was conducted on outdoor synthetic turf pitches compliant with FIFA standards (100 × 70 m). Training loads were individualized according to players’ developmental stages to ensure age-appropriate progression and recovery. Environmental conditions were stable throughout the study, with temperatures ranging from 14 to 20 °C and relative humidity between 52 and 66%. All sessions were held between 4:00 p.m. and 8:00 p.m., minimizing heat stress and maintaining consistent conditions.
Training microcycles were collaboratively designed with the coaching and performance staff to align with the team’s developmental objectives. Each session began with a standardized warm-up protocol comprising (1) 5–7 min of low-intensity running, (2) dynamic stretching for the lower limbs, (3) technical passing and ball control drills, and (4) small-sided games and possession games. This consistent structure ensured comparability across microcycles and accurately reflected the athletes’ habitual training demands within a realistic competitive environment.
An overview of the study design, data collection procedures, preprocessing steps, and machine learning workflow is presented in Figure 1.
Figure 1. Flowchart about participant recruitment, monitoring procedures, data preprocessing, feature extraction, and machine learning implementation used to predict TQR model evaluation. Note: Global Navigation Satellite System (GNSS) and Global Positioning System (GPS); Heart Rate (HR); Rating Perception Effort (RPE); Session Rating Perception Effort (sRPE). Bold terms are the evaluated dimensions.

2.2. Participants

The sample consisted of 40 male sub-elite youth football players competing in the U11 and U13 category of a Portuguese regional football club. The players’ anthropometric characteristics were as follows (mean ± SD): age 10.3 ± 0.7 years, height 1.43 ± 0.08 m, body mass 38.6 ± 6.2 kg, and body mass index (BMI) 18.7 ± 2.1 kg/m2. All participants had a minimum of four years of structured football experience and trained regularly four times per week, in addition to participating in one official match per week. Eligibility criteria included: (i) participation in a competitive week including one official match; (ii) completion of all four training sessions and the match; and (iii) absence of medical conditions or injuries that could interfere with data collection. Exclusion criteria comprised: (i) missing any training session; (ii) unavailability for the match; (iii) incomplete GPS, heart rate (HR), RPE, or TQR data; and (iv) injury during the observation period.
All procedures complied with the Declaration of Helsinki (2013 revision). Prior to participation, players and their legal guardians received detailed information about the study’s aims, procedures, and potential risks. Participation was voluntary, with written informed consent obtained from parents or legal guardians and verbal assent from the players. Ethical approval was granted by the Ethics Committee of the University of Trás-os-Montes e Alto Douro (Doc104-CE-UTAD-2025). Data confidentiality was ensured through anonymization using player identification codes, and only aggregated results were reported to protect individual privacy.

2.3. Data Collection

The study combined technological monitoring systems with subjective self-report instruments to capture a comprehensive view of each player’s physical and cognitive performance.

2.3.1. External Load Monitoring

The quantification of external load was conducted using STATSports Apex® GNSS/GPS units (v. 2.0, Newry, Northern Ireland, UK), which tracked players’ positional and kinematic data throughout all training sessions and matches. Each device collected positional information at a sampling rate of 18 Hz, complemented by accelerometer (100 Hz), magnetometer (10 Hz), and gyroscope (100 Hz) sensors to capture multidimensional motion data. Players wore the sensors in a customized vest that positioned the device between the scapulae to ensure stable signal reception. Before each session, devices were activated at least 30 min in advance to guarantee accurate satellite synchronization, requiring a minimum of eight satellite connections for optimal tracking reliability [54].
The following external load variables were analyzed: total distance covered (TD, m), high-speed running distance (HSR, >18 km·h−1), high-intensity distance (HID, >21 km·h−1), relative distance (m·min−1), number of sprints (NSPR), maximum running speed (MRS, m·s−1), accelerations (ACC, >2 m·s−2), and decelerations (DEC, <−2 m·s−2). These metrics provide a comprehensive overview of both the volume and intensity components of physical performance during football-specific activities. The STATSports Apex® technology used in this study has previously been validated for team sports, demonstrating excellent accuracy and reliability, with measurement errors typically below 2% for 10–18 Hz models [55,56,57]. Accordingly, these devices are considered an industry standard for capturing external load indicators in youth and professional football settings.

2.3.2. Internal Load Monitoring

The internal load was assessed using both objective physiological and subjective perceptual indicators, allowing for a comprehensive evaluation of the players’ internal responses to training and match demands. For the objective component, HR data were collected exclusively from the U13 cohort using the Polar Team Pro® system (Polar Electro, Kempele, Finland). This system continuously recorded HR throughout all sessions and official matches, enabling the analysis of the average heart rate, peak heart rate, and the distribution of time spent in five individualized intensity zones (Z1–Z5), expressed as a percentage of each player’s maximum heart rate (%HRmax). The %HRmax reference was determined from the highest HR value observed during the monitoring period for each athlete. These metrics were subsequently processed for each session and summarized across the microcycle to depict intensity fluctuations throughout the week. The U11 group was excluded from HR monitoring due to methodological limitations and comfort considerations related to chest strap fitting and data consistency in younger players.
The subjective internal load was determined through the Borg CR10 scale (0–10 points) of RPE [58]. Approximately 30 min after every training session and match, players were interviewed individually by the investigator using a digital form administered on a tablet (Google Forms® interface, Google, Mountain View, CA, USA). The scale featured both numerical anchors and expressive emojis to facilitate understanding among younger athletes. Participants verbally communicated their perceived effort, which was immediately logged into the form. All players had previously undergone familiarization sessions with the RPE protocol to ensure comprehension and reliability.
Sessional training load (sRPE) was calculated using the formula sRPE = RPE × session duration (min), and the resulting values were expressed in arbitrary units (AU) [59]. The compiled data were exported to Microsoft Excel® (Version 2510, Microssoft, Redmond, WA, USA) for storage, organization, and subsequent statistical analysis. The RPE and sRPE methods have been validated in youth and elite football populations as effective, non-invasive tools to monitor psychophysiological load [60,61].

2.3.3. Recovery Status

Recovery status was assessed with the TQR scale (1–10), an instrument validated for youth football populations [8]. TQR was assessed 30 min before sessions to gauge readiness and recovery, allowing players to consider their physical and psychological states comprehensively [62]. This combined use of RPE (as internal load monitoring) and TQR provides an effective, low-cost framework for monitoring the interaction between training load and recovery, aligning with evidence that supports the inclusion of subjective metrics to complement objective monitoring in youth sports [63,64]. By systematically applying these psychophysiological tools, coaches can tailor training stimuli to the individual recovery profiles of young athletes, fostering improved performance, reduced injury risk, and enhanced overall well-being [65].

2.3.4. Tactical-Cognitive Performance

The assessment of technical–tactical performance was carried out using the Football Tactical Assessment System (FUT-SAT), a validated and widely used instrument designed to evaluate players’ understanding and execution of tactical principles in football [66,67]. This protocol enables systematic observation and quantification of both offensive and defensive game principles in representative playing contexts, providing a holistic understanding of the players’ decision-making and motor execution capacities. The inclusion of FUT-SAT in this study provides a robust framework for linking technical–tactical behavior with training load and recovery dynamics. By analyzing decision quality (DMI) and execution efficiency (MEI), the study captures how players’ cognitive and motor processes interact under varying physiological demands. This multidimensional approach aligns with contemporary research emphasizing the integration of physical, psychological, and tactical domains in football performance analysis [68,69].
The FUT-SAT protocol was implemented through small-sided games (SSGs) with the format GK + 3 vs. 3 + GK, which provides an ecologically valid and developmentally appropriate environment for tactical assessment in youth football [70]. Each SSG was played on a 36 × 27 m pitch, maintaining proportional field dimensions that ensure representativeness of official play conditions for young athletes [71]. Every game lasted 4 min, during which players were encouraged to maintain competitive intensity and apply game principles naturally.
The SSGs were video recorded using a GoPro HERO Action Camera (4K, 12 MP, Wi-Fi, Bluetooth) positioned at an elevated angle to capture all playing actions. Recorded footage was subsequently analyzed using LongoMatch® software (LongoMatch® version 1.5.9, Barcelona, Spain), which allows frame-by-frame annotation of game events and precise tagging of tactical actions for subsequent coding and quantification [72].
Player actions were analyzed and categorized based on the core tactical principles defined in the FUT-SAT model, which encompasses both offensive and defensive dimensions of play. The offensive principles included penetration, offensive coverage, mobility, and the effective use of space, while the defensive principles comprised delay, defensive coverage, balance, and concentration. Each action performed by a player was independently evaluated according to two analytical dimensions: decision-making, which assessed whether the player’s tactical decision was appropriate or inappropriate within the specific game context, and motor execution, which determined whether the player’s technical action was effective or ineffective in carrying out the intended decision. From these assessments, two main performance indicators were derived. The DMI represented the proportion of correct tactical decisions relative to the total number of decisions made, and the MEI quantified the proportion of technically successful executions relative to all attempted actions. Both indexes ranged from 0 to 1, with higher values reflecting greater tactical intelligence and execution quality. Additionally, an Overall Performance Indicator (OPI) was computed to integrate these dimensions, providing a comprehensive representation of each player’s tactical behavior by combining their decision-making and execution efficiency across all offensive and defensive principles [67], as presented in equation
O P I = ( D M I + M E I ) n
where n represents the number of analyzed tactical principles. This composite indicator offers a global assessment of tactical efficiency during game performance, facilitating comparisons between individuals or groups (U11 vs. U13).
To ensure objectivity and consistency, all video analyses were performed by two independent researchers previously trained in the FUT-SAT coding system. A familiarization phase was conducted using pilot videos not included in the study sample. Inter- and intra-observer reliability were assessed through Cohen’s kappa (κ) coefficients, yielding values above 0.85 for both decision-making and motor execution variables, which is considered excellent agreement [73].

2.4. Data Preprocessing and Normalization

Data processing and analysis were conducted using Python™ (version 3.10.4, Centrum Wiskunde & Informatica (CWI) in Amsterdam, The Netherlands), employing widely recognized libraries such as pandas, numpy, matplotlib, and scikit-learn for efficient data manipulation, visualization, and model preparation. Prior to analysis, data quality was ensured through several preprocessing steps. Missing values, which represented less than 10% of the dataset, were imputed using the mean of each respective variable to preserve data integrity while minimizing bias. Outliers exceeding ±3 standard deviations from the mean were identified and removed to prevent distortion of model performance. Categorical variables were transformed into numerical representations using one-hot encoding, allowing for their inclusion in machine learning algorithms that require numerical inputs. Continuous variables were then standardized using the StandardScaler function from sklearn, preprocessing, normalizing each feature to have a mean of zero and a standard deviation of one. This step was crucial for ensuring feature comparability and preventing scale-related bias, thereby improving the stability and interpretability of the subsequent machine learning models.
A total of 158 performance-related variables were computed across all monitoring domains (Table 1). These included: (i) external load metrics derived from GPS data (e.g., total distance, high-speed running, accelerations, decelerations, and peak speed); (ii) internal load indicators obtained from RPE, sRPE and HR-derived metrics; (iii) anthropometric and maturational variables (e.g., height, leg length, body mass, maturity offset); and (iv) tactical–cognitive indices derived from FUT-SAT observations. The aim of generating such a comprehensive dataset was to capture the full multidimensional profile of each player, allowing machine learning algorithms to explore non-linear interactions among physical, physiological, perceptual, and tactical factors. For the purposes of machine learning modeling, data from one competitive microcycle (four training sessions: MD-4, MD-3, MD-2, MD-1, and one official match: MD) were aggregated at the player level. Thus, the final dataset consisted of 40 complete observations (one per player), each observation including 158 performance-related variables describing that player’s external load, internal load, anthropometric/maturational profile, and tactical–cognitive performance across the microcycle.
Table 1. Categorisation of the 158 extracted performance-related variables across monitoring domains.

2.5. Machine Learning Implementation

A supervised machine learning analysis was conducted to examine relationships between physical, physiological, and cognitive variables. The dataset was split into training (70%) and testing (30%) subsets using a random seed of 42 for reproducibility. Five classification algorithms were applied: K-Nearest Neighbors (KNN), based on distance between data points; Support Vector Machine (SVM), which separates classes using hyperplanes; Random Forest (RF), an ensemble of decision trees designed to reduce overfitting; Decision Tree (DT), a rule-based hierarchical classifier; and Gradient Boosting (GB), an ensemble boosting method that builds additive decision-tree models to improve predictive accuracy by sequentially reducing residual errors. Model training was performed using scikit-learn v1.3 (Python™, version 3.10.4, Centrum Wiskunde & Informatica (CWI) in Amsterdam, The Netherlands), with a 5-fold cross-validation procedure applied to obtain stable performance estimates and minimize variance. All algorithms were trained using their standard default hyperparameter configurations, following established guidelines for exploratory modeling in small datasets. This approach ensured methodological consistency across models while avoiding overfitting that may arise from excessive tuning in limited samples.
Model performance was assessed using accuracy, precision, recall, and F1-score metrics computed with the classification_report function. This exploratory analysis aimed to identify potential predictors of internal load and tactical indices rather than to establish definitive predictive systems.

2.6. Statistical Analysis

Descriptive statistics (mean ± SD) were computed for all quantitative variables. Normality was assessed with the Shapiro–Wilk test, and homogeneity of variances was tested using Levene’s test. Differences between training days (MD-4, MD-3, MD-2, MD-1) and match day (MD) were analyzed using one-way repeated-measures ANOVA, with Bonferroni post hoc correction when necessary. For non-normal data, the Friedman test was applied. Effect sizes were calculated using Cohen’s d and interpreted as small (0.2), moderate (0.5), or large (0.8). Correlations between internal and external load indicators, as well as between physical and tactical indices, were evaluated using Pearson’s or Spearman’s correlation coefficients depending on data distribution. All statistical analyses were performed using Python 3.10.4 and IBM SPSS Statistics v.29 (Armonk, NY, USA), with the level of significance set at p < 0.05.

3. Results

3.1. Variable Selection for the ML Algorithm

To ensure that the model effectively predicts TQR, a variable selection procedure was performed combining domain knowledge, correlation analysis, and statistical significance testing. Although 158 variables were initially extracted, only 17 were retained for modeling. The reduction was performed through a multi-step procedure combining: (i) theoretical relevance to recovery monitoring; (ii) correlation analysis to remove redundant highly collinear features; (iii) data completeness across all sessions; and (iv) preliminary feature screening using tree-based importance metrics. This process ensured that only the most informative and non-redundant predictors were entered in the final ML models. The selected variables were grouped into four main categories:
  • Technical and tactical performance—MEI_Total_Index, DMI_Total_Index, and Performance_MEI_DMI (sum of MEI and DMI);
  • Internal training load—Mean HR%, Max HR%, % Time Zone 1–5, sRPE_1–4 and sRPE_MD;
  • External training load—ACC, DEC, HSR, Distance, and Number of Sprints;
  • Anthropometric and maturational variables—Height, Weight, Leg Length, and Mirwald Maturity Offset.
No statistically significant correlations were identified (p > 0.05) with TQR. However, all features were retained in the models, given their theoretical and physiological relevance to recovery and fatigue monitoring. This approach allowed the algorithms to capture potential non-linear and multivariate dependencies that are not detectable through linear correlation analysis.

3.2. Algorithm Performance, Feature Importance, and Predictive Accuracy

Five supervised learning algorithms were implemented to classify TQR into three recovery categories (Low, Medium, and High): (1) KNN, (2) SVM, (3) DT Classifier, (4) RF Classifier, and (5) GB Classifier.
These models were selected because they represent a diverse range of linear vs. non-linear, parametric vs. non-parametric, and ensemble vs. single-tree approaches. Tree-based models (RF and GB) were included for their robustness in capturing hierarchical and interaction effects between multiple training and anthropometric variables.
All models were trained using 70% of the dataset and evaluated on the remaining 30%, with 5-fold stratified cross-validation applied to the training set. The metrics used for evaluation were Accuracy, Precision, Recall, F1-score, and the Average Metric (mean of the four). The results are shown in Table 2.
Table 2. Performance of machine learning algorithms in predicting TQR categories.
Among all models, the DT Classifier achieved the highest overall performance (Accuracy = 70.0%; Average Metric = 71.5%), followed closely by the GB Classifier. Tree-based models displayed superior performance compared to linear or distance-based methods, indicating that recovery perception is governed by non-linear relationships involving multiple performance and maturational factors. The feature importance extracted from the best-performing model (i.e., DT) revealed that anthropometric and maturational features contributed most to model accuracy, followed by internal training load and locomotor variables. The ten most influential predictors are presented in Table 3.
Table 3. Top 10 predictors influencing TQR classification (DT model).
These variables collectively explain approximately 40% of the model’s variance in predicting recovery status. Thus, the DT model relies primarily on maturational and anthropometric indicators, complemented by session-based internal and external load metrics, to estimate recovery perception.

3.3. Model ROC/AUC

The discriminative performance of each model was assessed using Receiver Operating Characteristic (ROC) analysis (Table 4). The Decision Tree Classifier demonstrated the best discriminative ability with an Area Under the Curve (AUC) of 0.40, followed by the SVM (0.38) and GB (0.35). The RF and KNN models achieved lower AUC values (0.32 and 0.30, respectively).
Table 4. Comparative AUC values for all ML algorithms.
Figure 2 presents the ROC curves for the Decision Tree Classifier, showing limited discrimination among recovery classes (Low, Medium, High) with both macro- and micro-average AUC = 0.40. Figure 3 compares the ROC curves across all algorithms, confirming the Decision Tree’s relative advantage.
Figure 2. ROC curves for the DT Classifier.
Figure 3. ROC curves across all algorithms.
Although all models showed modest AUC values (<0.5), their consistent ranking supports the robustness of the findings. The results indicate that recovery perception (TQR) in this sample is only partially predictable from biomechanical and physiological features, likely due to the small sample size (n = 40) and the subjective nature of perceived recovery. Nevertheless, the non-linear tree-based methods were better able to extract underlying patterns than linear approaches.

4. Discussion

The aim of this study was to apply supervised ML techniques to predict the TQR in sub-elite U11 and U13 football players based on physiological, mechanical, and maturational variables. It was hypothesized that ML models would be able to predict TQR with moderate accuracy, that non-linear, tree-based algorithms (such as DT and GB) would outperform linear approaches, and that maturational and anthropometric factors would be the most influential predictors of recovery perception. The findings partially verified this hypothesis: the DT model achieved the highest predictive accuracy (70%), confirming the superiority of non-linear algorithms, and maturational variables were the strongest contributors to model performance. However, the relatively low discriminative power (AUC ≈ 0.40) indicated that TQR was only moderately predictable, suggesting that subjective recovery in young athletes is influenced by additional physiological and psychological factors not captured in the current dataset.
The results demonstrated that the DT Classifier and GB Classifier achieved the best overall performance (Accuracy = 70%, Average Metric ≈ 71%), outperforming the SVM, RF, and KNN models. This indicates that non-linear, rule-based models are more effective in capturing the complex relationships underpinning recovery perception in young players. The ten most influential predictors identified by the Decision Tree included leg length, maturity offset, height, sRPE, and high-intensity distance, collectively explaining about 40% of model variance. These findings suggest that recovery perception is influenced not only by workload but also by biological maturation and anthropometric differences, consistent with previous research linking growth-related factors to variations in physical performance, fatigue, and recovery kinetics in preadolescent athletes. The DT and GB Classifiers achieved the highest predictive accuracy (≈70–71%), outperforming other models and highlighting the effectiveness of non-linear, rule-based algorithms in capturing complex recovery patterns in young athletes [74]. Key predictors—leg length, maturity offset, height, sRPE, and high-intensity distance—explained about 40% of model variance, showing that recovery perception depends on both training load and biological maturation [75,76,77]. These findings underscore the value of machine learning for identifying multifactorial influences on youth recovery and advocate for individualized, developmentally informed recovery strategies [78].
Interestingly, the inclusion of tactical indices (MEI, DMI) and locomotor variables (ACC, DEC, TD) contributed less to predictive accuracy than maturational features. This pattern implies that, at this developmental stage, biological readiness may play a greater role in perceived recovery than tactical or mechanical complexity. The weak linear correlations between TQR and sRPE further support this interpretation, indicating that subjective recovery may not directly scale with session intensity but rather with the player’s developmental capacity to tolerate and interpret training stress. Tactical indices such as the MEI and DMI, along with locomotor variables (ACC, DEC, TD), contributed less to recovery prediction than maturational features, suggesting that biological readiness plays a more decisive role in perceived recovery at this developmental stage [79,80]. Biological maturation factors like maturity offset and anthropometry strongly influence performance and recovery during growth phases [81,82]. The weak correlation between TQR and sRPE further indicates that subjective recovery is shaped more by developmental capacity than by training intensity [83,84,85]. Thus, incorporating biological indicators into performance assessment models can improve the understanding of youth athletes’ responses, enabling more individualized and developmentally informed recovery and coaching strategies [86].
ROC/AUC analyses showed limited discriminative ability across all models (AUC = 0.30–0.40), with the Decision Tree performing best (0.40), followed by SVM (0.38) and Gradient Boosting (0.35). Although these results indicate low classification accuracy, they are consistent with previous findings in youth sports ML research reporting similarly constrained predictive performance [87,88,89]. The results also reinforce that subjective recovery is a multifactorial construct shaped by both physiological and psychological factors [90], requiring consideration of variables such as sleep quality, nutrition, and psychological well-being [91,92]. The weak association between TQR and sRPE further suggests that subjective recovery reflects developmental capacity more than objective training intensity [93,94]. Overall, these findings highlight the need to improve predictive accuracy by integrating physiological, psychological, and contextual factors into future models of youth athlete recovery [95,96].
From a practical perspective, this study demonstrates the feasibility of ML approaches to youth football monitoring systems, highlighting their potential to support evidence-based decision-making in player management. Despite the relatively small dataset, the algorithms were capable of identifying meaningful predictors of recovery perception, suggesting that maturational factors—such as leg length, maturity offset, and height—should be monitored alongside traditional workload indicators to enhance understanding of players’ recovery dynamics and readiness to train. These findings emphasize the importance of individualized load management strategies that account for the physiological and developmental variability characteristic of preadolescent athletes. ML-driven monitoring tools could eventually aid coaches and sport scientists in the early detection of maladaptation, under-recovery, or excessive fatigue, contributing to more precise and age-appropriate training interventions. However, this potential must be interpreted within the limitations of the current study, particularly the small sample size (n = 40) and the subjective nature of the TQR measure, which may be affected by players’ cognitive and motivational maturity. Moreover, the absence of variables related to sleep quality, nutrition, and psychological stress may have limited the model’s predictive capacity, given the multifactorial nature of recovery. Future studies should therefore aim to collect larger, multicentric datasets encompassing a broader range of psychophysiological, contextual, and behavioral variables. Incorporating wearable-based sleep and recovery markers, mood profiling, and training context descriptors could improve the granularity and accuracy of predictive models. Additionally, the exploration of ensemble and deep learning methods, such as convolutional or recurrent neural networks, may provide superior pattern recognition capabilities, enabling the development of more robust and generalizable recovery prediction systems for youth athletes. Expanding this perspective, future research may also integrate technical–motor assessments and long-term developmental indicators, which were beyond the scope and feasibility of the present applied setting but could contribute additional predictive relevance in larger and more controlled cohorts.

5. Conclusions

This exploratory study provides preliminary evidence that machine learning techniques can model subjective recovery in young football players using integrated physiological, mechanical, and maturational data. Although predictive accuracy was moderate and AUC values low, the results emphasize the role of biological maturation as a key determinant of recovery perception in preadolescent athletes. The study demonstrates the potential of ML-based monitoring systems as decision-support tools in youth sport, promoting a more holistic and individualized approach to training and recovery management.

Limitations

This study has some limitations that should be acknowledged. First, the relatively modest sample size (n = 40), although typical of applied research in single-club youth cohorts, limits the generalizability of the findings and may contribute to model variability. Second, data were collected over a single competitive microcycle, which limits the ability to capture week-to-week variability in training load, recovery status, and tactical–technical behavior. Third, the HR data were only available for the U13 players due to methodological and practical constraints, preventing complete physiological comparison across age groups. Fourth, while the machine learning models showed moderate predictive capacity, the absence of external validation limits the extrapolation of the results and highlights the need for replication in independent samples. Finally, natural inter-individual differences in biological maturation among preadolescent athletes may have influenced perceived recovery and contributed to model variability. Future research should incorporate larger multi-club cohorts, multi-week monitoring, full HR datasets across age groups, and external validation to strengthen model reliability and predictive robustness.

Author Contributions

Conceptualization, P.A., P.F. and J.E.T.; methodology, P.A. and L.B.; validation, P.F., R.F., N.D.G. and J.E.T.; formal analysis, P.A., P.F. and J.E.T.; investigation, P.A.; resources, P.F., N.D.G. and J.E.T.; data curation, P.F., R.F., L.B. and N.D.G.; writing—original draft preparation, P.A., P.F. and R.F.; writing—review and editing, L.B., N.D.G. and J.E.T.; supervision, P.F., N.D.G. and J.E.T.; project administration, P.F., N.D.G. and J.E.T. All authors have read and agreed to the published version of the manuscript.

Funding

The authors declare that financial support was received for the research and/or publication of this article. This project was supported by the National Funds through the FCT Portuguese Foundation for Science and Technology (projects UID/CED/04748/2025 and UIDB04045/2021), Life Quality Research Center (LQRC-CIEQV), Santarém, Portugal; Research Center in Sports Sciences, Health Sciences and Human Development, Vila Real, Portugal; SPRINT—Sport Physical Activity and Health Research and Innovation Center, Portugal; and This work is supported by national funds from FCT—Fundação para a Ciência e a Tecnologia, I.P., under the project/support UID/6157/2025.

Data Availability Statement

The data supporting the findings of this study are not publicly available due to privacy or ethical restrictions but can be provided by the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ACCAccelerations
AUCArea Under the Curve
AUArbitrary Units
BMIBody Mass Index
CVCross-Validation
DECDecelerations
DMIDecision-Making Index
DTDecision Tree Classifier
GBGradient Boosting Classifier
GNSS/GPSGlobal Navigation Satellite System/Global Positioning System
HRHeart Rate
HRmax/%HRmaxMaximum Heart Rate/Percentage of Maximum Heart Rate
HSRHigh-Speed Running Distance
HIDHigh-Intensity Distance
KNNK-Nearest Neighbors Classifier
MAEMean Absolute Error
MDMatch Day
MEIMotor Effectiveness Index
MLMachine Learning
MRSMaximum Running Speed
OPIOverall Performance Indicator
R2Coefficient of Determination
RFRandom Forest Classifier
RMSERoot Mean Square Error
ROCReceiver Operating Characteristic Curve
RPERating of Perceived Exertion
sRPESession Rating of Perceived Exertion
SDStandard Deviation
SVMSupport Vector Machine
SSGSmall-Sided Games
TQRTotal Quality Recovery
U11/U13Under-11/Under-13 age categories
Z1–Z5Heart rate intensity zones (Zones 1 to 5)

References

  1. Palmer, B.L.; van der Ploeg, G.E.; Bourdon, P.C.; Butler, S.R.; Crowther, R.G. Evaluation of Athlete Monitoring Tools across 10 Weeks of Elite Youth Basketball Training: An Explorative Study. Sports 2023, 11, 26. [Google Scholar] [CrossRef]
  2. Temm, D.A.; Standing, R.J.; Best, R. Training, Wellbeing and Recovery Load Monitoring in Female Youth Athletes. Int. J. Environ. Res. Public Health 2022, 19, 11463. [Google Scholar] [CrossRef]
  3. Emery, C.A.; Roy, T.-O.; Whittaker, J.L.; Nettel-Aguirre, A.; van Mechelen, W. Neuromuscular training injury prevention strategies in youth sport: A systematic review and meta-analysis. Br. J. Sports Med. 2015, 49, 865–870. [Google Scholar] [CrossRef]
  4. Staes, F.F.; Vereecken, S.; Timmerman, W.P.; Tooth, C.; Gard, S.; Houtmeyers, K.C.; Jaspers, A. Challenges and Opportunities for Injury Reduction and Performance Development in Elite Youth Team Sport Schools: A Practice-Based Opinion. Int. J. Sports Phys. Ther. 2025, 20, 1547–1556. [Google Scholar] [CrossRef]
  5. Larsen, T.; Van Hoye, A.; Tjomsland, H.E.; Holsen, I.; Wold, B.; Heuzé, J.-P.; Samdal, O.; Sarrazin, P. Creating a supportive environment among youth football players: A qualitative study of French and Norwegian youth grassroots football coaches. Health Educ. 2015, 115, 570–586. [Google Scholar] [CrossRef]
  6. Bergeron, M.F.; Mountjoy, M.; Armstrong, N.; Chia, M.; Côté, J.; Emery, C.A.; Faigenbaum, A.; Hall, G.; Kriemler, S.; Léglise, M.; et al. International Olympic Committee consensus statement on youth athletic development. Br. J. Sports Med. 2015, 49, 843–851. [Google Scholar] [CrossRef] [PubMed]
  7. Jayanthi, N.; Schley, S.; Cumming, S.P.; Myer, G.D.; Saffel, H.; Hartwig, T.; Gabbett, T.J. Developmental Training Model for the Sport Specialized Youth Athlete: A Dynamic Strategy for Individualizing Load-Response During Maturation. Sports Health 2022, 14, 142–153. [Google Scholar] [CrossRef] [PubMed]
  8. Teixeira, J.E.; Forte, P.; Ferraz, R.; Leal, M.; Ribeiro, J.; Silva, A.J.; Barbosa, T.M.; Monteiro, A.M. The Association between External Training Load, Perceived Exertion and Total Quality Recovery in Sub-Elite Youth Football. Open Sports Sci. J. 2022, 15, e1875399X2207220. [Google Scholar] [CrossRef]
  9. Bjørndal, C.T.; Hausken-Sutter, S.; Møller, M.; Myklebust, G.; Grindem, H. Exploring the interplay of interpersonal and contextual dynamics in youth sports injuries: A comprehensive narrative review. BMJ Open Sport Exerc. Med. 2024, 10, e001964. [Google Scholar] [CrossRef]
  10. Cular, D.; Babic, M.; Katovic, D.; Beslija, T.; Kezic, A. How to Compare Relative Age Effect in Different Sports? A New Methodological Approach—Example of Youth Olympic Games. Sports 2024, 12, 215. [Google Scholar] [CrossRef]
  11. Jiang, Z.; Hao, Y.; Jin, N.; Li, Y. A Systematic Review of the Relationship between Workload and Injury Risk of Professional Male Soccer Players. Int. J. Environ. Res. Public Health 2022, 19, 13237. [Google Scholar] [CrossRef] [PubMed]
  12. Ribeiro, N.; Martinho, D.V.; Pereira, J.R.; Rebelo, A.; Monasterio, X.; Gonzalo-Skok, O.; Valente-dos-Santos, J.; Tavares, F. Injury Risk in Elite Young Male Soccer Players: A Review on the Impact of Growth, Maturation, and Workload. J. Strength Cond. Res. 2024, 38, 1834–1848. [Google Scholar] [CrossRef] [PubMed]
  13. Long, J.; Brown, D.; Farrell, J.; Gonzalez, M.; Cheever, K.M. Relationship between Workload, Psychological State and Sleep in Female Soccer Athletes. Int. J. Sports Med. 2024, 45, 829–836. [Google Scholar] [CrossRef] [PubMed]
  14. Parry, G.N.; Williams, S.; McKay, C.D.; Johnson, D.J.; Bergeron, M.F.; Cumming, S.P. Associations between growth, maturation and injury in youth athletes engaged in elite pathways: A scoping review. Br. J. Sports Med. 2024, 58, 1001–1010. [Google Scholar] [CrossRef]
  15. Ferreira, A.B.d.M.; Ribeiro, B.L.L.; Batista, E.d.S.; Dantas, M.P.; Mortatti, A.L. The Influence of Different Training Load Magnitudes on Sleep Pattern, Perceived Recovery, and Stress Tolerance in Young Soccer Players. J. Strength Cond. Res. 2023, 37, 351–357. [Google Scholar] [CrossRef]
  16. Allan, R.; Akin, B.; Sinclair, J.; Hurst, H.; Alexander, J.; Malone, J.J.; Naylor, A.; Mawhinney, C.; Gregson, W.; Ihsan, M. Athlete, coach and practitioner knowledge and perceptions of post-exercise cold-water immersion for recovery: A qualitative and quantitative exploration. Sport Sci. Health 2022, 18, 699–713. [Google Scholar] [CrossRef]
  17. Cluitmans, M.; Lluch, É.; Morales, H.; Heijman, J.; Volders, P. Personalized Computational Framework to Study Arrhythmia Mechanisms on Top of ECG Image-Detected Substrate. Em 2018. In Proceedings of the Computing in Cardiology 2018, Maastricht, The Netherlands, 23–26 September 2018; Available online: http://www.cinc.org/archives/2018/pdf/CinC2018-325.pdf (accessed on 8 November 2025).
  18. Mason, L.; Connolly, J.; Devenney, L.E.; Lacey, K.; O’Donovan, J.; Doherty, R. Sleep, Nutrition, and Injury Risk in Adolescent Athletes: A Narrative Review. Nutrients 2023, 15, 5101. [Google Scholar] [CrossRef]
  19. Puce, L.; Ceylan, H.İ.; Żmijewski, P.; Ruggieri, P.; Biz, C.; Sciarretta, G.; Rossin, A.; Giorgio, A.D.; Trompetto, C.; Bragazzi, N.L.; et al. Socceromics: The Integration of Omics Technologies in Soccer to Enhance Performance and Health. A Comprehensive, Critical Review of the Literature [Internet]. Preprints. 2024. Available online: https://www.preprints.org/manuscript/202410.1705 (accessed on 8 November 2025).
  20. Edwards, C.D. Mental health considerations for athlete removal from play and return to play planning. Sports Psychiatry 2023, 3, 125–139. [Google Scholar] [CrossRef]
  21. Ren, X.; Boisbluche, S.; Philippe, K.; Demy, M.; Äyrämö, S.; Rautiainen, I.; Ding, S.; Prioux, J. Machine Learning in Rugby Union: Predicting and Identifying Key Performance Indicators for Professional Rugby Union Players in Match Play Based Workload. Eur. J. Sport Sci. 2025, 25, e70042. [Google Scholar] [CrossRef]
  22. Solaro, N.; Pagani, M.; Spataro, A.; Lucini, D. Assessing the cardiac autonomic response to bicycle exercise in Olympic athletes with different loads of endurance training: New insights from statistical indicators based on multilevel exploratory factor analysis. Front. Physiol. 2023, 14, 1245310. [Google Scholar] [CrossRef]
  23. Ahrend, M.-D.; Schneeweiss, P.; Martus, P.; Niess, A.M.; Krauss, I. Predictive ability of a comprehensive incremental test in mountain bike marathon. BMJ Open Sport Exerc. Med. 2018, 4, e000293. [Google Scholar] [CrossRef]
  24. Cheng, L.; Phillips, K.; He, X. Retention Intention of Chinese Urban Preschool Teachers Predicted by Workload and Work Value. Sage Open 2025, 15, 21582440251358301. [Google Scholar] [CrossRef]
  25. McFadden, B.A.; Walker, A.J.; Arent, M.A.; Bozzini, B.N.; Sanders, D.J.; Cintineo, H.P.; Bello, M.L.; Arent, S.M. Biomarkers Correlate with Body Composition and Performance Changes Throughout the Season in Women’s Division I Collegiate Soccer Players. Front. Sports Act. Living 2020, 2, 74. [Google Scholar] [CrossRef]
  26. Vacher, P.; Martinent, G.; Mourot, L.; Nicolas, M. Elite swimmers’ internal markers trajectories in ecological training conditions. Scand. J. Med. Sci. Sports 2018, 28, 1866–1877. [Google Scholar] [CrossRef]
  27. Miranda-Mendoza, J.; Hernández-Cruz, G.; Reynoso-Sánchez, L.F.; González-Fimbres, R.A.; Cejas-Hernández, B.A. Control of recovery using the Total Quality Recovery (TQR) scale during four accumulation microcycles and its relationship to physiological factors. Retos 2023, 50, 1155–1162. [Google Scholar] [CrossRef]
  28. Coope, O.C.; Reales Salguero, A.; Spurr, T.; Páez Calvente, A.; Domenech Farre, A.; Jordán Fisas, E.; Lloyd, B.; Gooderick, J.; Abad Sangrà, M.; Roman-Viñas, B. Effects of Root Extract of Ashwagandha (Withania somnifera) on Perception of Recovery and Muscle Strength in Female Athletes. Eur. J. Sport Sci. 2025, 25, e12265. [Google Scholar] [CrossRef] [PubMed]
  29. Hauer, R.; Tessitore, A.; Knaus, R.; Tschan, H. Lacrosse Athletes Load and Recovery Monitoring: Comparison between Objective and Subjective Methods. Int. J. Environ. Res. Public Health 2020, 17, 3329. [Google Scholar] [CrossRef] [PubMed]
  30. Havenetidis, K. Exercise Performance and Recovery of Muslim Endurance Athletes during Ramadan Fasting. Int. J. Sports Sci. Coach. 2015, 10, 51–68. [Google Scholar] [CrossRef]
  31. Chaves, S.F.N.; Cerqueira, M.S.; Tucher, G.; Miloski, B.; Coelho, D.B.; Borba, D.d.A.; Pimenta, P.M.Q.; Ferreira-Júnior, J.B. Efeito de uma competição escolar de futebol com jogos em dias consecutivos no estado de recuperação de jogadores sub-19. Rev. Bras. Cineantropometria Desempenho Hum. 2018, 20, 402–411. [Google Scholar] [CrossRef]
  32. Mascherini, G.; Petri, C.; Stefani, L.; Toncelli, L.; Bini, V.; Calà, P.; Galanti, G. The Impact of the Weight Status on Cardiovascular Parameters Related to Physical Effort in Young Athletes. Sustainability 2020, 12, 3964. [Google Scholar] [CrossRef]
  33. Liang, Z.; Liang, C. Design and implementation of load intensity monitoring platform supported by big data technology in stage training for women’s sitting volleyball. Sci. Rep. 2023, 13, 22382. [Google Scholar] [CrossRef] [PubMed]
  34. Greenberg, E.; Greenberg, E.; Lawrence, J.T.; Ganley, T. Understanding Youth Athlete Motivation, Training, and Activity Progression During and After the COVID-19 Sports Interruption. Int. J. Sports Phys. Ther. 2022, 17, 1396–1403. [Google Scholar] [CrossRef]
  35. Doherty, R.; Madigan, S.M.; Nevill, A.; Warrington, G.; Ellis, J.G. The Sleep and Recovery Practices of Athletes. Nutrients 2021, 13, 1330. [Google Scholar] [CrossRef]
  36. Latief, G.R.G.; Bahri, S.; Syafiarini, R.; Hasan, M.F. The Effect of Watermelon Juice Supplementation on Perceived Recovery and Anaerobic Power Recovery in Young Karate Athletes. J. Pendidik. Jasm. Dan Olahraga 2022, 7, 17–22. [Google Scholar] [CrossRef]
  37. Ayala, E.E.; Bromback, M.E.A.; Kaufman, J.; Nelson, L.J. Mental Health and Developmental Needs of Youth Athletes: A Mixed Methods Study. J. Prev. Health Promot. 2024, 5, 93–120. [Google Scholar] [CrossRef]
  38. Gwyther, K.; Pilkington, V.; Bailey, A.P.; Mountjoy, M.; Bergeron, M.F.; Rice, S.M.; Purcell, R. Mental health and well-being of elite youth athletes: A scoping review. Br. J. Sports Med. 2024, 58, 1011–1019. [Google Scholar] [CrossRef]
  39. Suppiah, H.T.; Swinbourne, R.; Wee, J.; He, Q.; Pion, J.; Driller, M.W.; Gastin, P.B.; Carey, D.L. Predicting Youth Athlete Sleep Quality and the Development of a Translational Tool to Inform Practitioner Decision Making. Sports Health 2022, 14, 77–83. [Google Scholar] [CrossRef]
  40. Feiss, R.S.; Lutz, M.; Moody, J.R.; Pangelinan, M.M. A systematic review of coach and parent knowledge of concussion. J. Concussion 2020, 4, 2059700219900053. [Google Scholar] [CrossRef]
  41. Murdaugh, D.L.; Ono, K.E.; Morris, S.O.; Burns, T.G. Effects of Developmental Age on Symptom Reporting and Neurocognitive Performance in Youth After Sports-Related Concussion Compared to Control Athletes. J. Child Neurol. 2018, 33, 474–481. [Google Scholar] [CrossRef]
  42. Woelfel, S.L.; Haan, D.D.; Jensen, C.B.; Martens, K.T.; Rosen, A.B.; Wilkins, S.J.; Knarr, B.A. A Comprehensive Biomechanical and Clinical Analysis of a Youth Ambidextrous Baseball Pitcher: A Case Report. Int. J. Sports Phys. Ther. 2025, 20, 1060–1073. [Google Scholar] [CrossRef] [PubMed]
  43. Spetz, L.; Rogestedt, J.; Nilsson, R.; Mattsson, C.M.; Larsen, F.J. Validating Subjective Ratings with Wearable Data for a Nuanced Understanding of Load-Recovery Status in Elite Endurance Athletes [Internet]. Research Square. 2025. Available online: https://www.researchsquare.com/article/rs-6527207/v1 (accessed on 8 November 2025).
  44. Rommers, N.; Rössler, R.; Verhagen, E.; Vandecasteele, F.; Verstockt, S.; Vaeyens, R.; Lenoir, M.; D’Hondt, E.; Witvrouw, E. A Machine Learning Approach to Assess Injury Risk in Elite Youth Football Players. Med. Sci. Sports Exerc. 2020, 52, 1745–1751. [Google Scholar] [CrossRef]
  45. Hamlin, M.J.; Wilkes, D.; Elliot, C.; Lizamore, C.A.; Kathiravel, Y. Monitoring Training Loads and Perceived Stress in Young Elite University Athletes. Front. Physiol. 2019, 10, 34. [Google Scholar] [CrossRef]
  46. Malina, R.M.; Kozieł, S.; Králík, M.; Chrzanowska, M.; Suder, A. Prediction of maturity offset and age at peak height velocity in a longitudinal series of boys and girls. Am. J. Hum. Biol. 2020, 33, e23551. [Google Scholar] [CrossRef]
  47. Monasterio, X.; Gil, S.M.; Bidaurrazaga-Letona, I.; Lekue, J.A.; Santisteban, J.M.; Diaz-Beitia, G.; Lee, D.-J.; Zumeta-Olaskoaga, L.; Martin-Garetxana, I.; Bikandi, E.; et al. The burden of injuries according to maturity status and timing: A two-decade study with 110 growth curves in an elite football academy. Eur. J. Sport Sci. 2021, 23, 267–277. [Google Scholar] [CrossRef]
  48. Saw, A.E.; Main, L.C.; Gastin, P.B. Monitoring the athlete training response: Subjective self-reported measures trump commonly used objective measures: A systematic review. Br. J. Sports Med. 2016, 50, 281–291. [Google Scholar] [CrossRef]
  49. Anderegg, J.M.; Brefin, S.L.; Nigg, C.R.; Koschnick, D.; Paul, C.; Ketelhut, S. Load and recovery monitoring in Swiss top-level youth soccer players: Exploring the associations of a new web application-based score with recognised load measures. Curr. Issues Sport Sci. CISS 2024, 9, 020. [Google Scholar] [CrossRef]
  50. Arede, J.; Cumming, S.P.; Johnson, D.M.; Leite, N. The effects of maturity matched and un-matched opposition on physical performance and spatial exploration behavior during youth basketball matches. PLoS ONE 2021, 16, e0249739. [Google Scholar] [CrossRef] [PubMed]
  51. Heidari, J.; Beckmann, J.; Bertollo, M.; Brink, M.; Kallus, K.W.; Robazza, C.; Kellmann, M. Multidimensional Monitoring of Recovery Status and Implications for Performance. Int. J. Sports Physiol. Perform. 2019, 14, 2–8. [Google Scholar] [CrossRef] [PubMed]
  52. Kellmann, M.; Bertollo, M.; Bosquet, L.; Brink, M.; Coutts, A.J.; Duffield, R.; Erlacher, D.; Halson, S.L.; Hecksteden, A.; Heidari, J.; et al. Recovery and Performance in Sport: Consensus Statement. Int. J. Sports Physiol. Perform. 2017, 13, 240–245. [Google Scholar] [CrossRef] [PubMed]
  53. Spörri, J.; Stöggl, T.; Aminian, K. Editorial: Health and Performance Assessment in Winter Sports. Front. Sports Act. Living 2021, 3, 628574. [Google Scholar] [CrossRef] [PubMed]
  54. Jennings, D.; Cormack, S.; Coutts, A.J.; Boyd, L.J.; Aughey, R.J. Variability of GPS Units for Measuring Distance in Team Sport Movements. Int. J. Sports Physiol. Perform. 2010, 5, 565–569. [Google Scholar] [CrossRef]
  55. Beato, M.; Coratella, G.; Stiff, A.; Iacono, A.D. The Validity and Between-Unit Variability of GNSS Units (STATSports Apex 10 and 18 Hz) for Measuring Distance and Peak Speed in Team Sports. Front. Physiol. 2018, 9, 1288. [Google Scholar] [CrossRef]
  56. Nikolaidis, P.T.; Clemente, F.M.; van der Linden, C.M.I.; Rosemann, T.; Knechtle, B. Validity and Reliability of 10-Hz Global Positioning System to Assess In-line Movement and Change of Direction. Front. Physiol. 2018, 9, 228. [Google Scholar] [CrossRef]
  57. Whitehead, S.; Till, K.; Weaving, D.; Jones, B. The Use of Microtechnology to Quantify the Peak Match Demands of the Football Codes: A Systematic Review. Sports Med. 2018, 48, 2549–2575. [Google Scholar] [CrossRef] [PubMed]
  58. Borg, G. Borg’s Perceived Exertion and Pain Scales; Human Kinetics: Champaign, IL, USA, 1998; Volume viii, 104p. [Google Scholar]
  59. Foster, C.; Florhaug, J.A.; Franklin, J.; Gottschall, L.; Hrovatin, L.A.; Parker, S.; Doleshal, P.; Dodge, C. A new approach to monitoring exercise training. J. Strength Cond. Res. 2001, 15, 109–115. [Google Scholar] [PubMed]
  60. Kelly, D.M.; Strudwick, A.J.; Atkinson, G.; Drust, B.; Gregson, W. The within-participant correlation between perception of effort and heart rate-based estimations of training load in elite soccer players. J. Sports Sci. 2016, 34, 1328–1332. [Google Scholar] [CrossRef]
  61. Teixeira, J.E.; Forte, P.; Ferraz, R.; Leal, M.; Ribeiro, J.; Silva, A.J.; Barbosa, T.M.; Monteiro, A.M. Quantifying Sub-Elite Youth Football Weekly Training Load and Recovery Variation. Appl. Sci. 2021, 11, 4871. [Google Scholar] [CrossRef]
  62. Teixeira, J.E.; Encarnação, S.; Branquinho, L.; Ferraz, R.; Portella, D.; Monteiro, D.; Morgans, R.; Barbosa, T.; Monteiro, A.; Forte, P. Classification of recovery states in U15, U17, and U19 sub-elite football players: A machine learning approach. Front. Psychol. 2024, 15, 447968. [Google Scholar] [CrossRef]
  63. Nobari, H.; Eken, Ö.; Kamiş, O.; Oliveira, R.; González, P.P.; Aquino, R. Relationships between training load, peak height velocity, muscle soreness and fatigue status in elite-level young soccer players: A competition season study. BMC Pediatr. 2023, 23, 55. [Google Scholar] [CrossRef]
  64. Schreiber, S.; Faude, O.; Gärtner, B.; Meyer, T.; Egger, F. Risk of SARS-CoV-2 transmission from on-field player contacts in amateur, youth and professional football (soccer). Br. J. Sports Med. 2022, 56, 158–164. [Google Scholar] [CrossRef]
  65. Coutinho, P.; Ramos, A.; Afonso, J.; Bessa, C.; Ribeiro, J.; Davids, K.; Fonseca, A.M.; Mesquita, I. To Sample or to Specialise? Sport Participation Patterns of Youth Team Sport Male Players. Children 2023, 10, 729. [Google Scholar] [CrossRef]
  66. da Costa, I.T.; Garganta, J.; Greco, P.J.; Mesquita, I.; Maia, J. Sistema de avaliação táctica no Futebol (FUT-SAT): Desenvolvimento e validação preliminar. Motricidade 2011, 7, 69–84. [Google Scholar] [CrossRef]
  67. Matos, R.; Moreira, C.; Alves, E.; Teixeira, J.E.; Rodrigues, F.; Monteiro, D.; Antunes, R.; Forte, P. Tactical Knowledge by Decision Making and Motor Efficiency of Young Football Players in Different Playing Positions during a Three-a-Side Small-Sided Game. Behav. Sci. 2023, 13, 310. [Google Scholar] [CrossRef]
  68. Low, B.; Coutinho, D.; Gonçalves, B.; Rein, R.; Memmert, D.; Sampaio, J. A Systematic Review of Collective Tactical Behaviours in Football Using Positional Data. Sports Med. 2020, 50, 343–385. [Google Scholar] [CrossRef]
  69. Sánchez-López, R.; Echeazarra, I.; Castellano, J. Systematic review of declarative tactical knowledge evaluation tools based on game-play scenarios in soccer. Qual. Quant. 2022, 56, 2157–2176. [Google Scholar] [CrossRef]
  70. Praça, G.M.; Soares, V.V.; Matias, C.J.A.d.S.; da Costa, I.T.; Greco, P.J. Relationship between tactical and technical performance in youth soccer players. Rev. Bras. Cineantropometria Desempenho Hum. 2015, 17, 136–144. [Google Scholar] [CrossRef]
  71. Coutinho, D.; Kelly, A.L.; Santos, S.; Figueiredo, P.; Pizarro, D.; Travassos, B. Exploring the Effects of Tasks with Different Decision-Making Levels on Ball Control, Passing Performance, and External Load in Youth Football. Children 2023, 10, 220. [Google Scholar] [CrossRef] [PubMed]
  72. Junior, J.A.S.; Aquino, R.; Machado, J.C. Tactical performance in soccer: The importance of choosing and properly applying tactical assessment tools. Braz. J. Mot. Behav. 2020, 14, 171–179. [Google Scholar] [CrossRef]
  73. Quina, J.d.N.; Camões, M.; Graça, A. Desenvolvimento e Validação de um Instrumento de Avaliação da Competência Tática em Futebol. 3° Congr Int Jogos Desport [Internet]. 2011. Available online: https://bibliotecadigital.ipb.pt/handle/10198/6996 (accessed on 2 May 2024).
  74. Huang, Y.; Li, C.; Bai, Z.; Wang, Y.; Ye, X.; Gui, Y.; Lu, Q. The impact of sport-specific physical fitness change patterns on lower limb non-contact injury risk in youth female basketball players: A pilot study based on field testing and machine learning. Front. Physiol. 2023, 14, 1182755. [Google Scholar] [CrossRef] [PubMed]
  75. Frydrychová, Z.; Bartošová, K.; Hutečková, S. The role of parent-initiated motivational climate in the development of overtraining syndrome in adolescent elite athletes: A review study. Stud. Sport. 2017, 11, 269–284. [Google Scholar] [CrossRef]
  76. Pol, R.; Hristovski, R.; Medina, D.; Balague, N. From microscopic to macroscopic sports injuries. Applying the complex dynamic systems approach to sports medicine: A narrative review. Br. J. Sports Med. 2019, 53, 1214–1220. [Google Scholar] [CrossRef]
  77. Stiffler-Joachim, M.R.; Wille, C.M.; Kliethermes, S.A.; Johnston, W.; Heiderscheit, B.C. Foot Angle and Loading Rate during Running Demonstrate a Nonlinear Relationship. Med. Sci. Sports Exerc. 2019, 51, 2067–2072. [Google Scholar] [CrossRef] [PubMed]
  78. Wang, S.; Chen, W.; Zhou, X.; Chang, S.-Y.; Ji, M. Addressing Skewness in Iterative ML Jobs with Parameter Partition. In Proceedings of the IEEE INFOCOM 2019—IEEE Conference on Computer Communications, Paris, France, 29 April–2 May 2019; IEEE Press: Paris, France, 2019; pp. 1261–1269. [Google Scholar]
  79. Horst, F.; Janssen, D.; Beckmann, H.; Schöllhorn, W.I. Can Individual Movement Characteristics Across Different Throwing Disciplines Be Identified in High-Performance Decathletes? Front. Psychol. 2020, 11, 2262. [Google Scholar] [CrossRef]
  80. Ren, P. Theory of physical education ecosystem based on SFIC model. Appl. Math. Nonlinear Sci. 2023, 8, 2971–2980. [Google Scholar] [CrossRef]
  81. Farias, C.; Mesquita, I.; Hastie, P.A. Student game-play performance in invasion games following three consecutive hybrid Sport Education seasons. Eur. Phys. Educ. Rev. 2019, 25, 691–712. [Google Scholar] [CrossRef]
  82. Ivosevic, L.; Stöckel, T. Anticipation in soccer: Skilled players benefit from early pattern recognition in corner kick situations. Front. Psychol. 2025, 16, 1631208. [Google Scholar] [CrossRef]
  83. Powless, M.D.; Steinfeldt, J.A.; Fisher, S.E.; McFadden, P.; Kennedy, K.W.; Bellini, S. Utilizing Video-Based Trainings to Improve Decision Making in High School Quarterbacks. Sports 2020, 8, 18. [Google Scholar] [CrossRef]
  84. Silvino, V.; Da Silva Sousa, L.; Ferreira, C.; Santos, L.; Apaza, H.; Soares Almeida, S.; Santos, M. The Use of Machine Learning in Sports Performance: A Systematic Review. Transl. J. Am. Coll. Sports Med. 2025, 10, 1–10. [Google Scholar] [CrossRef]
  85. Yang, W.; Wang, J.; Zhao, Z.; Cui, Y. Accuracy of an Ultra-Wideband-Based Tracking System for Time–Motion Analysis in Tennis. Sensors 2025, 25, 1031. [Google Scholar] [CrossRef] [PubMed]
  86. Yeh, C.-C.; Peng, H.-T.; Lin, W.-B. Achievement Prediction and Performance Assessment System for Nations in the Asian Games. Appl. Sci. 2024, 14, 789. [Google Scholar] [CrossRef]
  87. Duncan, M.J.; Eyre, E.L.J.; Clarke, N.; Hamid, A.; Jing, Y. Importance of fundamental movement skills to predict technical skills in youth grassroots soccer: A machine learning approach. Int. J. Sports Sci. Coach. 2024, 19, 1042–1049. [Google Scholar] [CrossRef]
  88. Lim, B.; Song, W. Exploring CrossFit performance prediction and analysis via extensive data and machine learning. J. Sports Med. Phys. Fit. 2024, 64, 640–649. [Google Scholar] [CrossRef]
  89. Weirich, C.; Kim, J.W.; Yoon, Y.; Jeong, S. Advancing NFL win prediction: From Pythagorean formulas to machine learning algorithms. Front. Sports Act. Living 2025, 7, 1638446. [Google Scholar] [CrossRef]
  90. Zhan, T.; Fang, L.; Xu, Y. Prediction of thermal boundary resistance by the machine learning method. Sci. Rep. 2017, 7, 7109. [Google Scholar] [CrossRef]
  91. Fu, X.; Zhang, B.; Wang, L.; Wei, Y.; Leng, Y.; Dang, J. Stability prediction for soil-rock mixture slopes based on a novel ensemble learning model. Front. Earth Sci. 2023, 10, 1102802. [Google Scholar] [CrossRef]
  92. Huang, M.-L.; Li, Y.-Z. Use of Machine Learning and Deep Learning to Predict the Outcomes of Major League Baseball Matches. Appl. Sci. 2021, 11, 4499. [Google Scholar] [CrossRef]
  93. Kim, J.W.; Magnusen, M.; Jeong, S. March Madness prediction: Different machine learning approaches with non-box score statistics. Manag. Decis. Econ. 2023, 44, 2223–2236. [Google Scholar] [CrossRef]
  94. Prince, M.A.; Conner, B.T.; Davis, S.R.; Swaim, R.C.; Stanley, L.R. Risk and protective factors of current opioid use among youth living on or near American Indian reservations: An application of machine learning. Transl. Issues Psychol. Sci. 2021, 7, 130–140. [Google Scholar] [CrossRef]
  95. Lee, Y.-H.; Chang, J.; Lee, J.-E.; Jung, Y.-S.; Lee, D.; Lee, H.-S. Essential elements of physical fitness analysis in male adolescent athletes using machine learning. PLoS ONE 2024, 19, e0298870. [Google Scholar] [CrossRef]
  96. Rakesh, D.; Sadikova, E.; McLaughlin, K.A. Beyond the income-achievement gap: The role of individual, family, and environmental factors in cognitive resilience among low-income youth. JCPP Adv. 2024, 5, e12297. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Article metric data becomes available approximately 24 hours after publication online.