1. Introduction
Vision and associated visual skills are fundamentally important in most sports [
1,
2]. In gymnastics, visual abilities significantly contribute to athletes’ capacity to execute highly complex skills on various apparatuses, each presenting distinct visual demands [
3]. Key visual skills utilized in gymnastics include static and dynamic visual acuity, gaze control (fixations, saccades, and smooth pursuit), depth perception, peripheral vision, accommodation (focus flexibility), reaction time, and hand–eye coordination [
1].
Gymnasts must precisely control their gaze, fixating on specific locations while performing intricate movements. Rhythmic gymnastics, involving the manipulation of apparatuses like ribbons, hoops, and balls, places extreme demands on visual control for anticipating, tracking, and executing rapid, precise actions. Success hinges not only on physical prowess but also on the ability to accurately stabilize and direct one’s gaze. Gaze behavior can vary based on task constraints and the performer’s level of expertise [
3].
Gaze control involves two main functions: stabilizing the gaze to maintain focus using reflexes triggered by inputs from the vestibular system, visual cues, or neck movements; and orienting the gaze towards objects of interest using a combination of quick eye movements (saccades) and smooth tracking movements (smooth pursuit). During this process, the visual and oculomotor systems collaborate to ensure that the object of interest remains centered on the retina [
4].
Focus flexibility, or accommodation, refers to the skill that enables athletes to swiftly shift their focus from one point to another in space without excess effort. Difficulties in this area can hinder the ability to track incoming or outgoing objects quickly and accurately [
5].
Reaction time refers to the duration between sensing a stimulus and initiating the appropriate response. Specifically, visual reaction time measures the time it takes to perceive and react to visual stimuli, which may also involve auditory cues. Response time is defined as the total time necessary to process visual information and complete the motor response sequence [
1].
In our research, we employed the Devices for an Integral Visual Examination (DIVE) System, a comprehensive tool integrating eye-tracking technology with various utilities to assess visual function across different domains. The DIVE system features a high-resolution touchscreen display for presenting visual stimuli and facilitating patient interaction. Enhanced by an advanced eye tracker, the DIVE captures patient responses to these stimuli [
4].
Furthermore, while the present study focuses on the application of machine learning models, it is rooted in biomedical optics using the DIVE system—a platform based on light-based eye-tracking and precision visual stimulus presentation. This optical infrastructure is central to capturing high-resolution oculomotor data and aligns with the technological scope of photonics-related research.
In this research study, we investigate the predictive capabilities of three distinct algorithms utilized to forecast visual skills among gymnasts. The algorithms employed are the k-nearest neighbors (KNN), decision tree (DT), and support vector machine (SVM) using the hold-out method.
KNN is an intuitive supervised learning algorithm that employs a proximity-based method for pattern recognition [
4].
DT is a supervised learning algorithm used for classification and prediction tasks [
6].
SVM is a kernel-based machine learning algorithm that can categorize and input data into specific classes or categories [
4].
In the broader field of optical systems, artificial intelligence—particularly artificial neural networks (ANNs)—have been increasingly employed to model complex photonic phenomena. Hamedi and Jahromi [
7] used ANN architectures to analyze the performance of all-optical logic gates, demonstrating the capability of neural models to simulate non-linear optical interactions. Similarly, Hamedi et al. [
8] applied AI techniques to the modeling of nanoplasmonic biosensors, highlighting the potential of data-driven methods to predict optical system responses with high precision. In another study, Jahromi and Hamedi [
9] developed an ANN-based model to estimate the electronic and optical properties of nanocomposites, achieving excellent correlation between experimental and predicted values. These applications reinforce the versatility of ANN approaches in optical domains and suggest promising avenues for their integration into complex vision-based sports performance modeling.
The hold-out method is a technique used in machine learning and statistics to evaluate the performance of a predictive model. It involves dividing the dataset into two parts: one used to train the model (training set, typically 70% of the data) and the other reserved for testing (test set, typically 30%). This approach helps in assessing the model’s ability to generalize to unseen data.
Our objective is to discern the efficacy of these algorithms in forecasting visual abilities based on a battery of visual tests administered to 383 gymnasts aged between 4 and 27 years. To elucidate the visual skills contributing to gymnastics and enable greater predictive precision, we employed a variety of algorithms in this research. Through the utilization of these algorithms, we aim to determine their effectiveness in predicting visual skills crucial for performance in rhythmic gymnastics.
2. Materials and Methods
Permission was obtained from the Sports Department of the Madrid City Council to conduct visual assessments of rhythmic gymnasts. The study was conducted in accordance with the principles of the Declaration of Helsinki and was approved by the Institutional Review Board of Hospital Clínico San Carlos, Madrid, Spain (protocol no. 21/766-E; approval date: 20 December 2021). Participation was voluntary, and written informed consent was obtained from all participants or their legal guardians in the case of minors. All tests were administered by trained clinicians under standardized conditions.
Assessments were conducted in various sports centers across Madrid where rhythmic gymnastics is regularly practiced by affiliated clubs. In each center, two dedicated testing areas were set up: a darkened room for DIVE assessments and an adjacent space with tables and chairs for clinical procedures. The clinical tests included near convergence point, cover test, monocular accommodative facility, visual reaction time, and hand–eye coordination. The order of test administration was randomized, and the total duration of testing per participant was approximately 15 min.
Figure 1 provides a schematic overview of the visual evaluation process carried out with rhythmic gymnasts, including the sequence of procedures, the settings used for data collection, and the specific tests performed.
2.1. Equipment and Visual Assessment Protocol
The DIVE system, equipped with a 12-inch screen providing a visual angle of 22.11° horizontally and 14.81° vertically, was used to conduct the evaluations. Its eye-tracking technology operates with a maximum temporal resolution of 120 Hz, offering high precision in tracking eye movements, a key feature for assessing rapid and subtle visual responses in gymnasts.
During the visual assessment sessions, gymnasts sat in front of the DIVE system, which features a high-resolution 12-inch screen and integrated eye-tracking technology (
Figure 2). Testing was conducted in a dimly lit environment to ensure optimal tracking accuracy and participant comfort. The system recorded eye movements and visual responses during a series of standardized tasks, providing objective data across multiple visual domains.
The selection of the DIVE system (Device for an Integral Visual Examination) for visual assessment in this study is supported by its prior validation in clinical and multicenter settings. Pérez Roche et al. [
10] utilized DIVE to evaluate visual acuity and contrast sensitivity in a sample of 2208 children aged between 6 months and 14 years, including both full-term and preterm births, across five countries. The findings indicated that both visual functions improved with age, particularly during the first five years of life. This study provided normative reference values and endorsed DIVE’s utility as an objective and effective tool for measuring basic visual functions in pediatric populations.
Furthermore, Pueyo et al. [
11] employed DIVE to characterize the development of oculomotor control in 802 healthy children aged between 5 months and 15 years, observing significant improvements in fixation stability and saccadic movements with age, especially during the first two years of their lives.
Lastly, Altemir et al. [
12] assessed fixational behavior during long and short fixation tasks in 259 participants aged between 5 months and 77 years using DIVE. They found that gaze stability improved with age up to 30 years and then progressively declined from the fifth decade of life onwards.
The DIVE system facilitated various assessment protocols, each selected for its relevance to the visual demands of rhythmic gymnastics. These protocols include:
Long Saccades DIVE: It assesses the gymnasts’ ability to perform rapid and extended eye movements, which are crucial for tracking moving apparatus.
Short Saccades DIVE: It measures the precision of shorter eye movements, which are necessary for detailed tasks such as hand–eye coordination with apparatus.
Eye Tracker Fixation Test DIVE: It assesses the stability of the gymnasts’ visual fixation, which is essential for maintaining focus during routines.
Color Perception DIVE: Detects possible anomalies in color perception that could affect interaction with colored apparatus.
Visual Acuity and Single Binocular Field (Av y FSC DIVE): Essential for spatial awareness and precision in positioning relative to the apparatus.
In addition to the DIVE protocols, two complementary clinical tests were conducted:
Reaction time and hand–eye coordination were assessed using the Reaction Lights system to simulate dynamic visual–motor demands.
Monocular accommodative facility was measured with ±2.00 D flippers to evaluate the gymnasts’ focusing facility.
2.2. Variable Categorization and Class Balance
For the machine learning models, each visual variable was converted into a categorical output. In particular, REAF (right eye accommodative facility) and LEAF (left eye accommodative facility) were labeled as “normal” or “reduced” based on clinical optometric reference values adapted to pediatric populations. Participants who achieved ≥6 cycles per minute were classified as “normal”, and those below this threshold were classified as “reduced”.
Additionally, to mitigate potential class imbalance, a stratified 5-fold cross-validation approach was implemented during model training and validation. This ensured that the distribution of class labels was preserved across all data folds, maintaining representativeness and improving model robustness.
2.3. Model Training and Preprocessing
All models were developed and evaluated using the Scikit-learn and XGBoost libraries in Python 3.11.4. The dataset was divided into 70% for training and 30% for testing. This single hold-out split was used to evaluate the final model performance on unseen data.
Although no cross-validation was applied for model evaluation, a stratified 5-fold cross-validation framework was used during hyperparameter tuning to optimize each algorithm’s configuration.
A univariate feature selection method based on ANOVA F-values was used to retain the most relevant features. For each algorithm, hyperparameter tuning was performed using grid search within the cross-validation framework.
Although the hyperparameter search was not exhaustive, we conducted empirical exploration for each algorithm to identify configurations that maximized predictive performance. For KNN, several values of k (e.g., 3, 5, 7) were tested. For SVM, we experimented with different kernels (linear and RBF) and regularization parameters (C).
The best-performing configuration for each model was selected based on macro F1-score on the validation folds. Tree-based models (decision tree, random forest, and XGBoost) did not require feature scaling, while standardization (z-score normalization) was applied to SVM and KNN to ensure optimal performance.
The macro F1-score provides an unweighted average of F1-scores across all classes and is particularly useful when class distributions are imbalanced. It is defined as
where
C is the number of classes. For our binary classification tasks (e.g., “normal” vs. “reduced”), the macro F1-score corresponds to the average of F1-scores for each class, treating both classes equally regardless of size. In future applications involving multi-class or ordinal targets, this metric can similarly assess overall performance by computing per-class F1 values and averaging them.
3. Results
A total of 383 rhythmic gymnasts aged 4–26 years participated in this study. The dataset conformed to the assumption of normality, allowing the application of parametric statistical methods. Descriptive statistics, including means, standard deviations, minimum and maximum values, were calculated to characterize the global variables of the sample. To analyze differences and patterns in the variables of interest, the sample was divided into age groups. Statistical comparisons were made to identify significant trends and variations between these groups, providing insight into the developmental trajectory of visual variables within rhythmic gymnastics.
These global values were presented in
Table 1, which provides an overview of the entire cohort’s performance across all assessed variables. A closer examination of the table reveals considerable variability in several visual function parameters within the sample of rhythmic gymnasts. Notably, accommodative facility (REAF and LEAF) and eye–hand coordination (EHC) display wide ranges and relatively high standard deviations, indicating heterogeneity in neurosensory performance across participants. Visual reaction time (VRT) also shows substantial dispersion (mean: 1066 ms; SD: 241 ms), likely reflecting developmental differences across the broad age spectrum. Notably, the near convergence point (NCP) ranged from 0 to 16 cm, suggesting that while many gymnasts demonstrate effective convergence, a subset exhibits marked limitations. The relatively symmetrical means and variability in fixation stability metrics (FLTBREFS/FLTBLEFS and FSTBREFS/FSTBLEFS) indicate balanced binocular control between eyes.
To analyze differences and patterns in the variables of interest, the sample was divided into nine age groups: 6–6.9, 7–7.9, 8–8.9, 9–10.9, 11–12.9, 13–13.9, 13–14.9, 15–18, and 19–27 years. Statistical comparisons were made to identify significant trends and variations between these groups, providing insight into the developmental trajectory of visual variables within rhythmic gymnastics.
Table 2 presents the age-stratified means and standard deviations for each assessed variable, highlighting how specific visual performance metrics evolve with age. Overall, an age-related improvement is evident in key parameters such as accommodative facility, visual reaction time, and eye–hand coordination, consistent with the progressive maturation of oculomotor and neurosensory pathways. Conversely, convergence point, and smooth pursuit performance show greater variability, potentially reflecting more complex or non-linear developmental profiles influenced by training history or individual visual demands.
3.1. Differences Between Groups
Figure 3,
Figure 4 and
Figure 5 provide a graphical representation of the age-related differences observed in three key visual variables: near convergence point (NCP), reaction time, and hand–eye coordination, respectively. The data reveal a general trend of improvement with increasing age, particularly noticeable between the younger and older participant groups.
As shown in
Figure 3, NCP values tend to decrease with age, although this trend is not strictly linear. Notably, the 19–27 age group showed a slight increase, which may be due to greater individual variability or differing visual demands at older ages.
Figure 4 shows a gradual reduction in reaction time, reflecting faster visual–motor responses as age increases. Similarly,
Figure 5 illustrates improvements in hand–eye coordination, with older groups achieving higher scores. These visual patterns support the statistical analyses and suggest a developmental maturation of visual and sensorimotor functions relevant to rhythmic gymnastics performance.
A series of significant differences are evident across age groups in relation to near convergence point, reaction time, and hand–eye coordination, as depicted in the preceding three figures.
To further investigate whether age alone could explain the variance observed in visual performance, we conducted a correlational analysis between age and key visual parameters. As shown in
Figure 6, the correlation between age and near convergence point (NCP) was weak (R
2 = 0.075), with a shallow regression slope. This suggests that the relationship between age and NCP is modest and non-linear. Additionally, machine learning models trained without age as a feature retained robust predictive performance, indicating that the visual variables provide relevant information beyond chronological age.
3.2. Machine Learning Models Performance
In addition to the descriptive group comparisons, the predictive capabilities of three supervised machine learning models—decision tree, support vector machine (SVM), and k-nearest neighbors (KNN)—were evaluated. These models were applied to classify performance on specific visual functions, such as fixation stability and accommodative facility, using eye-tracking and clinical data. The results for each model and task are presented below.
This study aims to evaluate the performance of three machine learning models, namely decision tree, support vector machine (SVM), and k-nearest neighbors (KNN), in predicting fixation stability in short tasks. The results of this study will provide insights into the most suitable model for this task and highlight the strengths and weaknesses of each model.
The class distributions for the binary classification of accommodative facility (normal vs. reduced) were balanced by a median split. To further illustrate the model’s predictive ability,
Figure 7 presents the ROC curve for the decision tree model in predicting accommodative facility. The area under the curve (AUC = 0.98) demonstrates excellent discriminative performance.
Figure 8 shows the structure of the final decision tree model, illustrating the most relevant features used for classification. Notably, fixation stability and accommodative facility emerged as dominant predictors, consistent with their established clinical relevance in visual performance assessment.
Results for the right eye:
Results for the left eye:
In general, the decision tree consistently demonstrated excellent performance in both eyes, with high accuracy and a macro F1 score close to 1. The SVM and KNN showed lower performance, exhibiting a higher rate of misclassifications. Notably, the SVM struggled to correctly classify positive cases in both eyes, whereas the KNN showed a higher number of false positives and negatives in the left eye.
In summary, the decision tree is the most suitable model for predicting fixation stability in short tasks in both eyes, followed by the KNN and SVM. It is important to acknowledge the limitations of each model and consider appropriate adjustments to enhance their predictive performance.
Combined results for both eyes:
The combined results show that the decision tree is still the best-performing model, with an average accuracy of 97.20% and an average macro F1 score of 0.973. The KNN and SVM have lower performance, with average accuracy and macro F1 scores of 70.00% and 74.77%, respectively.
Short-term fixation denotes the ability to maintain visual attention on an object for a brief period, typically spanning seconds to minutes. This generally demands less sustained concentration and attention compared to long-term fixation.
The study found that the decision tree model also exhibits optimal predictive performance for short-term fixation, albeit with a slightly lower precision compared to long-term fixation. This indicates that the model can accurately predict an individual’s ability to sustain visual attention on an object for a brief period.
In summary, three machine learning models (support vector machine (SVM), k-nearest neighbors (KNN), and decision tree) were tested for two classification tasks: accommodative facility of the right eye (REAF) and accommodative facility of the left eye (LEAF), for both eyes.
In general, the models had difficulties in correctly classifying the positive class in both classification tasks.
The SVM had high accuracy in some tests, but its low F1 macro revealed a significant imbalance in the classification of the positive class.
The KNN had a better balance between classes in some tests, but its accuracy was lower compared to the SVM.
The decision tree performed worst in some tests, with both low accuracy and macro F1 score, indicating substantial misclassification in both classes.
For the accommodative facility of the right eye (REAF), the SVM had the highest accuracy (79.28%), but its F1 macro was low (0.4422) due to its inability to correctly classify the positive class. The KNN had a better F1 macro (0.5547) compared to the SVM, but its accuracy was lower (70.27%). The DT exhibited the lowest performance, both in accuracy (65.77%) and F1 macro (0.5229). Moreover, random forest and XGBoost had an accuracy of 56.76% and 61.26%, respectively, and a macro F1 0.4127 and 0.5037.
For the accommodative facility of the left eye (LEAF), the KNN seemed to be the most balanced model, while the SVM suffered from a severe imbalance and the DT had a low overall performance.
4. Discussion
The present study explored the application of supervised machine learning algorithms to predict key visual functions in rhythmic gymnasts, focusing specifically on fixation stability and accommodative facility. Among the three models evaluated—decision tree (DT), support vector machine (SVM), and k-nearest neighbors (KNN), the DT algorithm exhibited the highest predictive performance, with an accuracy of 92.79% and a macro F1-score of 0.9276. These findings highlight the potential of decision trees as a robust and interpretable approach for modeling complex, non-linear relationships between visual variables and functional outcomes in high-performance sports.
The superior performance of the DT model may be attributed to its inherent ability to manage multidimensional data and capture subtle interactions between input features.
Although age was correlated with some visual variables in descriptive analyses, our findings suggest that the predictive performance of the model was not driven primarily by chronological age. As shown in the correlation analysis between age and near convergence point (
Figure 6), the association was weak (R
2 = 0.075), and models trained without age as an input retained high accuracy. This supports the notion that machine learning algorithms captured meaningful visual performance patterns that extend beyond age-related maturation.
In dynamic disciplines like rhythmic gymnastics, visual skills such as fixation and accommodation are essential for responding to rapidly changing stimuli with precision and stability. Our results support the notion that visual–motor abilities can be effectively predicted using AI-based models, offering practical implications for athlete monitoring and individualized training design.
Visual skills such as fixation stability and accommodative facility, both of which are essential for athletes to maintain visual focus and adjust rapidly to changing visual stimuli, were predicted with high accuracy. This is consistent with the findings of previous studies, which emphasize the importance of visual acuity, saccades, and reaction time in sports performance [
13,
14].
Our results strongly suggest that the decision tree (DT) algorithm is the most robust, consistent, and clinically interpretable choice for classification problems in this context. Its exceptionally high accuracy and macro F1-score across most datasets make it a highly reliable and effective tool when the goal is to maximize both predictive performance and clarity. While other algorithms, such as the SVM and KNN, may offer advantages in specific scenarios, the DT model consistently outperformed them. This reinforces its value as the most suitable, practical, and accessible solution for modeling complex visual performance patterns in rhythmic gymnasts.
The final decision tree model identified fixation stability and accommodative facility as the most influential features in classifying visual performance categories. This aligns with clinical expectations, as both variables are closely linked to visual efficiency and oculomotor control, which are fundamental in sports like rhythmic gymnastics. The prominence of these features reinforces the interpretability of the model and its potential applicability in practical settings.
The binary categorization into “normal” and “reduced” was a methodological choice to enhance interpretability and ensure class balance using the median as cutoff. While this simplification may reduce granularity, it enabled robust and clinically meaningful predictions in this exploratory phase. Additional decision tree structures and ROC curves for the remaining visual variables are available in the
Supplementary Materials (
Figure S1–S10). Future studies will explore multi-class and continuous models for greater precision.
Additionally, a comparative analysis of the algorithms used (decision tree, support vector machine, and k-nearest neighbors) has been incorporated, focusing on their technical characteristics. The performance of the SVM model may have been limited by its reliance on linear class separability, especially for variables such as accommodative facility. The KNN algorithm, on the other hand, showed sensitivity to the choice of the k parameter and the number of samples within local neighborhoods, which may affect its stability in heterogeneous datasets. Although the decision tree model performed well overall, its simple structure may carry a higher risk of overfitting in small or highly variable datasets. This comparison underscores the importance of selecting models not only based on overall performance but also on their suitability for the type of variable and data structure.
Moreover, the three algorithms differ significantly in their bias–variance trade-offs. The decision tree tends to have low bias but high variance, making it prone to overfitting, especially when no pruning is applied. SVMs, depending on the kernel, typically have a more balanced bias–variance profile and are robust to overfitting, but can be sensitive to the selection of hyperparameters like the regularization term and kernel type. KNN is characterized by high variance and low bias, especially with small values of k, and is highly sensitive to outliers and noise in the data. These characteristics influence model stability and generalizability, particularly in datasets with heterogeneous distributions or noisy measurements.
While the decision tree model exhibited strong performance across most visual variables, its predictive accuracy was notably lower for accommodative facility (REAF/LEAF). This discrepancy may be attributed to several factors. First, a potential class imbalance—where most participants demonstrated normal accommodative function—may have hindered the model’s ability to learn minority class patterns. Second, accommodative facility tests, conducted with ±2.00 D flippers, are subject to variability due to examiner influence and participant cooperation, which can introduce noise into the labels. Finally, the relatively simple structure of the decision tree may lead to overfitting when learning from noisy or unbalanced data, especially without extensive regularization or pruning.
Additionally, the lower performance of the KNN model does not appear to be related to class imbalance, as outcome variables were binarized using the sample median to ensure balanced class distribution. Rather, KNN’s limitations may stem from its sensitivity to high-dimensional input spaces and the absence of dimensionality reduction or feature engineering strategies. Future work may address this by applying principal component analysis (PCA) or automated feature selection to improve performance.
Furthermore, we assessed feature distribution and multicollinearity to ensure the reliability of the input variables. Principal component analysis (PCA) was tested as a dimensionality reduction technique but showed minimal improvement in model performance. Considering the relatively small number of predictors (14 variables) and the importance of interpretability in clinical–sport settings, we decided to retain the original feature set. Multicollinearity was evaluated using variance inflation factors (VIF), and all values were below two, indicating no significant redundancy among variables.
Although artificial neural networks (ANNs) are increasingly used in biomedical and sports-related predictive modeling, we deliberately excluded them from the current study due to the heightened risk of overfitting associated with our relatively small dataset (n = 383). Nevertheless, prior studies have demonstrated that ANN architectures can yield reliable predictions even with similar sample sizes. For example, Hamedi et al. and Jahromi et al. applied ANN models to predict optical and nanophotonic properties in experimental contexts with limited data availability [
7,
8,
9]. These precedents support the potential future application of ANN techniques in sports vision research, particularly when larger, more heterogeneous, or multimodal datasets become available.
The findings of our study align with the growing body of literature supporting the integration of artificial intelligence (AI) and machine learning (ML) techniques in sports science. Reis et al. [
15] emphasizes the utility of decision tree (DT) algorithms and other supervised learning methods for injury risk prediction and performance optimization, especially in disciplines that require dynamic and multidimensional analysis. Our work adds to this evidence by showing that DT algorithms can also be highly effective in predicting key visual variables in rhythmic gymnasts, highlighting the potential of AI models to support tailored interventions and performance monitoring in youth sports.
A relevant comparison can be made with the study by Liu et al. [
16], who applied various machine learning algorithms—including decision trees, KNN, and SVM—to predict physical activity behavior among university students, based on psychological constructs such as sports learning interest and autonomy support. Although their study focused on behavioral and motivational variables rather than visual or physiological abilities, both investigations share the common goal of using supervised learning models to forecast performance-related traits in sports populations. In their findings, logistic regression achieved the highest overall accuracy (72.88%), while decision trees and SVM yielded moderate results (F1 scores of 0.6672 and 0.6845, respectively). In contrast, our study found decision trees to be the most effective model for predicting visual function, particularly in tasks involving fixation and accommodative facility. These differences may be attributed to the nature of the target variables—subjective behavioral intentions versus objective visual performance—as well as the structure of the datasets. Nonetheless, both studies highlight the utility of machine learning as a powerful tool for modeling complex relationships in sports-related domains.
Additional support for the use of machine learning in predicting physical performance variables comes from Zhang et al. [
17], who applied optimized algorithms such as decision trees and SVM to gain recognition and prediction. Their study demonstrated high precision in modeling human posture changes, with a root mean square error (RMSE) as low as 0.018 on flat terrain. Although focused on movement patterns rather than visual variables, their findings align with our results in highlighting the strength of decision trees in capturing complex, non-linear relationships in human performance prediction.
Machine learning (ML) refers to the development of systems capable of learning from experience and adapting autonomously to generate predictive analytics, without requiring explicit instructions [
13].
Machine learning has been applied in various areas of sports—for example, for sports monitoring data [
14], for activity recognition [
14], for making performance predictions [
4,
14,
18,
19,
20], and to investigate whether sports skills, physical performance, or general cognitive functions differ between players of different competition levels [
6].
Several studies have used machine learning algorithms to predict performance in sports contexts. For example, using KNN in the running discipline of marathon [
21] or to make injury predictions [
13,
19].
According to the authors, no previous study has been found that examines the visual skills of rhythmic gymnasts using machine learning to make predictions.
In this context, the objective of our study was to predict the visual variables utilized in gymnasts using three distinct algorithms: k-nearest neighbors (KNN), decision tree, and support vector machine (SVM). The visual skills assessed in gymnasts included visual acuity, saccades, smooth pursuits, fixations, reaction time, contrast sensitivity, accommodative facility, and color vision. Among these, machine learning algorithms were applied to predict two key functions: fixation stability and accommodative facility.
The optometric assessments conducted on athletes add value to the study, as they evaluate aspects crucial for sports performance. Predicting specific optometric values further enriches the scope of the study.
Regarding the predictive modeling, notable percentages exceeding 85% were observed for all variables. indicating high reliability even with only 60% of the data.
In the context of visual tests conducted on rhythmic gymnasts, the k-nearest neighbors (KNN) model has been effectively trained and performs well on a representative test set, suggesting the model has learned useful patterns and can generalize to similar situations with new rhythmic gymnasts. However, additional regular evaluations are recommended to ensure relevance and accuracy in evolving problem conditions.
From a practical perspective, the predictive models developed in this study offer valuable tools for the early detection of visual performance deficits in rhythmic gymnasts. By integrating eye-tracking assessments and algorithmic classification, coaches and clinicians could identify athletes with suboptimal fixation stability or accommodative facility—both critical for spatial orientation and rapid motor response during performance. This approach enables personalized training interventions aimed at strengthening specific visual skills, optimizing sensorimotor coordination, and potentially reducing injury risk.
Moreover, the integration of predictive models like the one presented in this study could be highly valuable for applied contexts beyond assessment. In training programs, early identification of gymnasts with reduced fixation stability or accommodative facility would allow coaches to tailor visual and sensorimotor exercises aimed at improving those specific skills. Similarly, systematic screening with AI-based tools may support talent identification by detecting athletes with exceptional visual abilities early in their development. Finally, incorporating such models into injury prevention protocols could help identify visual deficits associated with increased risk of misjudging distances, apparatus timing, or coordination under pressure, all of which are relevant to rhythmic gymnastics performance and safety.
Longitudinal implementation of these models could support visual monitoring throughout the athletic development cycle, offering objective data to inform selection processes, guide rehabilitation strategies, and adjust visual–cognitive load during training sessions.