The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review

Rajšp, Alen; Rek, Patrik; Kokol, Peter; Fister, Iztok

doi:10.3390/app151810158

Open AccessSystematic Review

The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review

by

Alen Rajšp

^*

,

Patrik Rek

,

Peter Kokol

and

Iztok Fister, Jr.

Faculty of Electrical Engineering and Computer Science, University of Maribor, SI-2000 Maribor, Slovenia

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(18), 10158; https://doi.org/10.3390/app151810158

Submission received: 4 March 2025 / Revised: 8 July 2025 / Accepted: 30 August 2025 / Published: 17 September 2025

Download

Browse Figures

Versions Notes

Abstract

In endurance sports, athletes and coaches shift increasingly from intuition-based decision-making to data-driven approaches powered by modern technology and analytics. Since 2018, the field has experienced significant advances, influencing endurance sports disciplines. This systematic literature review identified 75 peer-reviewed studies on intelligent data analysis in endurance sports training. Each study was categorized by its intelligent method (e.g., machine learning, deep learning, computational intelligence), the types of sensors and wearables used, and the specific training application and approach. Our synthesis reveals that machine learning and deep learning are among the most used approaches, with running and cycling identified as the most extensively studied sports. Physiological and environmental data, such as heart rate, biomechanical signals, and GPS, are often used to aid in generating personalized training plans, predicting injuries, and increasing athletes’ long-term performance. Despite these advancements, challenges remain, related to data quality and the small participant sample sizes.

Keywords:

smart sports training; endurance sports; intelligent data analysis; machine learning; artificial intelligence; computational intelligence; systematic literature review

1. Introduction

The range of intelligent data analysis applications has increased steadily in recent years as more domains have been impacted. Intelligent data analysis uses advanced statistical, machine learning, and pattern recognition techniques to probe deeper into the data structure and extract meaningful, non-obvious insights beyond simple summarization [1]. Various sports are not outliers, and the approach has also been utilized heavily in sports where competition results are a combination of different factors that may not be apparent by non-assisted observation. In sports, these approaches have been used in fan engagement [2], officiating competitions [3], sports betting [4], and sports training [5].

Moreover, artificial intelligence (AI) approaches are already being used in practice for selecting youth prospects in professional soccer leagues (e.g., the Premier League, La Liga, Bundesliga, etc.) [6]. The market of intelligent data analytics in sports is expected to grow from USD 0.9 billion in 2023 to USD 15.6 billion in 2033 [7] with some estimates [8] ranging up to USD 20.5 billion. Another one of the next steps in the evolution of sports could also be digital sports coaches, which may make sports more accessible by offering a digital service that may allow athlete users to pursue their wellness and fitness goals in a more efficient manner [9]. The refereeing of sports has also changed by it being supported more and more by information technology; this has happened in soccer with Video Assistant Referees [10] and in the 2024 Paris Olympics by improving decision-making in various events, from biomechanics analysis in swimming to real-time performance tracking in athletics [11].

The consequence of methods for intelligent data analysis being used to evaluate athletes objectively is that these methods are also being incorporated into their training routines to exploit every opportunity for improvement fully.

These trends have been observed in practice, and 109 cases of Smart Sports Training (SST) were identified between 2006 and 2020. SST approaches represent various forms of athletic training that utilize wearables, sensors, IoT devices, and intelligent data analysis methods and tools to enhance training efficiency and outcomes. SST aims to improve performance or reduce effort, all while maintaining or exceeding the current performance levels [5].

Sports cover a diverse variety of activities with different demands on athletes. These demands can be mental and/or physical. In sports, this can range from short bursts of high intensity to prolonged, sustained efforts. This review relates to endurance, defined as the ability to keep engaged in something difficult, unpleasant, or painful for a long time [12]. Given that endurance sports often require a balance of physical and mental endurance, making them a unique challenge for athletes, it is crucial to develop a deep understanding of how intelligent data analysis can address these challenges.

Several similar systematic literature reviews (SLRs) already exist, addressing either sports in general [5,13,14] or team sports [15]. They highlight the transformative potential of intelligent data analysis in sports. They do not address our topic directly or cover endurance sports sufficiently, which was the focus of the proposed review.

The authors in Rajšp et al. [5] proposed the term Smart Sports Training and covered 108 science papers. The drawback is that the SLR was conducted in 2020 and is over four years old. The approach also does not identify the devices used in training or the benefits of incorporating modern approaches in individual cases or endurance sports, which were not investigated specifically.

Furthermore, Bonidia et al. [13] presented an analysis of data mining techniques and algorithms in sports from 2010 to 2018. They identified 31 relevant studies, highlighting the interest in applying computational intelligence to improve sports performance, predict outcomes, and optimize training. Compared to our proposed study, the SLR did not identify the devices used in training and covered only three studies from the category of endurance sports training, which deserves an investigation of its own.

The authors in Krstić et al. [14] presented a systematic literature review on applying artificial intelligence, machine learning, and deep learning in improving sports performance. The areas of improvement of this SLR were that only one scientific database was queried and that only eleven studies were related to endurance sports, which is the focus of our review.

Despite the three systematic literature reviews providing valuable insights into the topic, they fell short of addressing the specifics of the endurance sports field. For this reason a more extensive study was proposed and conducted, with the goal of addressing the existing gaps in knowledge and provide an overview of intelligent data analysis in endurance sports training.

Through a systematic literature review, this paper identifies and presents the latest advancements in the domain of endurance sports in the domain of Smart Sports Training as a systematic literature review.

The objectives of this review were determined to investigate (1) how intelligent data analysis methods are used in endurance sports training, (2) which endurance sports are the most supported by such applications of Smart Sports Training, and (3) what types of performance improvements and training goals are targeted, reported, or investigated through the application of intelligent data analysis methods in endurance sports. Based on the study objectives, the following research questions (RQ) were formed:

RQ 1:

Which intelligent data-analysis methods are used in endurance sports, what performance parameters and outcomes are they focused on, and in which disciplines are these methods implemented most frequently?

RQ 1.1:: Which intelligent data analysis approaches were used in endurance sports training?
RQ 1.2:: What was the focus of studies utilizing intelligent data analysis approaches?
RQ 1.3:: Which endurance sports disciplines are supported most frequently by the implementation of intelligent data analysis methods?

RQ 2:

What are the most common IoT (Internet of Things) devices, wearables, and sensors combined in endurance sports training combined with intelligent data analysis approaches, and how do they contribute to the training process and data collection and analysis?

The main unique contributions of this review paper are as follows:

Comprehensive Taxonomy of Approaches: A taxonomy for intelligent data analysis methods in endurance sports, based on categorization in machine learning, deep learning, computational intelligence, and other data-driven techniques.
Holistic Overview of Sensors and Devices: A catalog of the standard IoT devices, wearables, and sensors adopted in endurance sports.
Focused Endurance Sports Insights: Unlike previous papers of the broad sports reviews, our study is focused on endurance sports.

The remainder of this paper is structured as follows. Section 2 presents endurance sports and their training. Section 3 presents the systematic literature review methodology followed. Section 4 presents the literature review results. Section 5 discusses the findings, and the paper is concluded in Section 6, where key insights and future research directions are presented.

2. Endurance Sports Training

Endurance sports contain continuous isotonic contractions of large skeletal muscle groups [16]. The performance of athletes competing in endurance sports is determined mainly by their maximal aerobic power, lactate threshold, and economy. Three types of training are usually performed in endurance sports training: (1) extended-duration sessions at moderate intensity, (2) medium-duration sessions at high intensity, (3) short-duration sessions at very high intensity [17]. While all endurance sports share the common challenge of sustaining high aerobic performance, each discipline exhibits unique profiles in terms of muscle usage, physiological demands, and the psychological and social factors that contribute to success.

Endurance sports encompass a diverse array of activities that challenge the human body and mind in unique ways, which, among others, include (long-distance) running, cycling, cross-country skiing, (long-distance) speed skating, and triathlon [16,18].

These sports each require different compositions of proficiencies and abilities, e.g., triathlon combines swimming, cycling, and running to test the overall aerobic capacity and muscular endurance [19,20]; rowing and long-distance swimming emphasize upper-body endurance and technical precision [21]; ultra-marathons test the athletes by demanding specific adaptations, not only in physical performance but also in pacing and mental resilience [22].

The nature of endurance sports demands that both physiological and technical factors converge to push human limits, which is their essence.

3. Research Methodology

We followed the systematic literature review protocol described in detail below to answer the research questions introduced in Section 1. This review was conducted per the Systematic Literature Review Guidelines in Software Engineering outlined in [23], for reporting clarity, we broadly aligned with PRISMA [24], adopting selected elements rather than the complete checklist. The study protocol [25] has been registered on Open Society Foundations.

The primary database literature search was performed on the following four scientific literature databases: IEEE Xplore [26], ScienceDirect [27], Scopus [28], Web of Science [29]. Although no explicit publication date restrictions were applied during the search, all the included studies meeting the eligibility criteria were published between 2007 and 2024.

The search string was determined as shown in Table 1 below. It was determined to be used in the title, abstract, and keywords search space to limit the number of unrelated results.

3.1. Inclusion Criteria

The following inclusion criteria were selected and followed for the selection of research papers:

IC 1:

The research materials were peer-reviewed journal articles or conference papers.

IC 2:

The research was published in the English language.

IC 3:

The research addressed at least one of the endurance sports.

(a): Only traditional endurance sports were selected, notably cycling, (long-distance) skiing, running, biathlon, and triathlon.

IC 4:

The research addressed sports training.

IC 5:

The research used at least one intelligent method (e.g., machine learning, artificial intelligence, computational intelligence, big data, data mining, deep learning).

3.2. Exclusion Criteria

The exclusion criteria for rejecting the research material were determined as follows:

EC 1:: Full text was not available.
EC 2:: The research was retracted.
EC 3:: Not relevant to the set of research questions.
EC 4:: The study addressed the non-endurance variant of the sport or discipline, e.g., a running study about sprinting.

3.3. Limitations

Due to differences in how the databases process search queries, several database-specific limitations had to be acknowledged and imposed for the ScienceDirect and IEEE Xplore library.

3.3.1. ScienceDirect

Due to a limit of 8 boolean operators per field and no wildcard support, the search query was split between title/abstract/keywords and full text, replacing wildcard terms (e.g., “athlete*” with “athlete”). The modified search string used in this library is shown below in Table 2.

3.3.2. IEEE Xplore

No English filter is available, so non-English results must be screened manually. Wildcards were used, and the search was limited to abstracts because there is no support for a combined search of title, keywords abstract.

3.3.3. Other Databases (Scopus, Web of Science)

No limitations were imposed.

3.4. Data Extraction

From the included literature, an Extraction Table was devised to aid in answering the research questions. The attributes shown in Table 3 were identified for each study.

4. Results

The literature search was performed on 22 October 2024. Before duplicate removal and screening, an overall 1305 results were identified, with most of the results (78.5% of the total) originating from Scopus (580) and Web of Science (445), as shown in Table 4.

The investigation of duplicates was based on the Digital Object Identifiers (DOIs), authors, and titles of individual studies. Table 5 shows the identified duplicates between the different databases.

It is interesting to note that duplicated results are sometimes even identified inside the same database (e.g., Scopus 2 and Web of Science 2) because these databases aggregate the results across multiple smaller databases. Most of the duplicates were connected to cross references with the Scopus (328 duplicates) and Web of Science (316 duplicates) databases, which was to be expected since the majority of studies were identified from them. The number of unique papers per database, which were not duplicated in other databases, was 18 in IEEE Xplore, 193 in ScienceDirect, 306 in Scopus, and 162 in Web of Science.

After duplicate removal, 943 papers were identified. The screening was performed in a three-step procedure: (a) the retracted papers were identified and eliminated, (b) the papers were screened based on abstract and title, and (c) the papers were screened based on their full-text contents. The papers accepted in the last step (c) were entered into the Data Extraction Table.

Retractions due to papers of questionable origins and practices have increased steadily, reaching over 10,000 papers in 2023 [30]. We used Retraction Watch [31] published by CrossRef to compare our results with the retracted papers. In our review, 33 papers were identified as retracted and eliminated from the scope of our study, decreasing the total number of papers to 910.

These papers (910) were then screened by abstract and title, and a total of 726 papers were eliminated, which brought the number of papers to 184. Each study was screened by the first two authors of this paper for relevancy; if a consensus was not achieved, the study was screened additionally by one of the remaining authors. After this, these papers were analyzed thoroughly, and the full text was analyzed. A total of 104 papers were eliminated because they were irrelevant after close inspection, 5 papers with full texts were unavailable, and 75 papers were selected for inclusion in the systematic literature review. The PRISMA flow diagram [32] in the Figure 1 below summarizes the whole selection process and shows the number of studies in each selection process step.

The first paper identified was from the year 2008, and the field has become increasingly relevant and studied through the years, as seen in the Figure 2, with the highest increase in published studies happening between 2017 (two studies) and 2018 (eight studies). The field has stayed relevant after this, and between 6 and 12 studies were published each year.

The papers were also analyzed by where they were published to identify if any journals, book series, or conferences emerged as the best fit for the inspected theme. It was found that the majority of papers (42–56%) were published in unique publications. Only some publications had more than one paper related to the inspected field, as seen in Table 6, most notably the journals MDPI Sensors (eight papers), IEEE Access (five papers), Springer Journal of Sports Sciences (four papers). This displays the high multidisciplinarity of the domain, which impacts multiple fields.

Numerous approaches were utilized in the case of intelligent endurance sports training, and a diverse set of methods was identified. The variety of approaches, ranging from traditional machine learning and statistical methods to deep learning techniques, demonstrates the complexity and richness of the field.

4.1. Taxonomy of Intelligent Methods

Motivated by a taxonomy presented in [5], a novel taxonomy was developed to represent the intelligent approaches used in endurance sports training better. The developed taxonomy used and applied existing taxonomies to the approaches identified in endurance sports training. The main categories of computational intelligence, data mining, machine learning, deep learning, and other methods were adopted from [33]. The computational intelligence methods were divided and grouped based on Engelbrecht’s taxonomy presented in [34]. The machine learning subcategories were categorized according to [35,36]. The taxonomy of techniques is shown in Figure 3 below. It should be noted that this is not an exhaustive taxonomy of all the intelligent methods; the categories are shown only for methods that were actually identified to be used in the studies.

A total of 63 different techniques were identified. The techniques identified are categorized and presented below by their primary category.

4.1.1. Computational Intelligence

Computational Intelligence is an extensive and diverse collection of nature-inspired computational techniques and methods used to model and solve complex problems in which the conventional approaches based on strict and well-defined techniques are either not viable or not efficient [37]. Computational intelligence-based approaches were used 13 times, ranging from evolutionary algorithms to swarm intelligence algorithms and fuzzy systems, as shown below.

Computational intelligence

Evolutionary algorithms
- Differential Evolution (DE) [38] in [39,40];
- Evolutionary algorithm [41] in [42];
- Multi-gradient evolutionary computation [43] in [44];
- Genetic algorithms [45] in [33].
Swarm intelligence algorithms
- Particle Swarm Optimization (PSO) [46] in [47,48,49];
- Improved Grey Wolf Optimization [50] in [44];
- Modified Bat Algorithm in [51];
- Bat Algorithm [52] in [53].
Fuzzy systems
- Adaptive neuro-fuzzy inference system [54] in [55];
- Fuzzy logic [56] in [57].

4.1.2. Data Mining

Data Mining is the process of identifying and extracting information from large datasets to identify previously unknown and potentially useful patterns and relationships [58]. Data mining techniques were applied three times in the identified literature, as seen below.

Data mining

Association Rule Mining (ARM) [59] in [40];
Numerical Association Rule Mining [59] in [40];
Data mining (general) [60] in [61].

4.1.3. Machine Learning

Machine Learning is concerned with developing algorithms that enable computers to learn from data and improve their performance on a given task over time without being programmed explicitly [62]; methods from supervised learning (58), unsupervised learning (6), and reinforcement learning (1) were identified in our case.

Machine learning

Supervised Learning
(a)
Regression
Linear regression [63] in [64,65,66,67];
Polynomial regression [68] in [69];
Logistic regression [70] in [71,72,73];
LASSO regression [74] in [71,75];
Support vector regression [76] in [77];
Bivariate regression [78] in [79];
Multivariate regression [80] in [79,81];
Regression trees [82] in [83];
Regression models [84] in [85,86].
(b)
Classification
Decision trees [87] in [69,88,89];
Random forest [90] in [71,73,91,92,93];
KNN (k-nearest neighbors) [94] in [67,71,75,93,95,96,97,98];
Naive Bayes [99] in [71,100];
Relevance vector machines [101] in [102];
Support vector machine (SVM) [103] in [57,71,75,88,104,105,106];
Bayesian networks [107] in [102,108];
Boosted trees [109] in [72].
(c)
Ensemble Learning
Bagging [110] in [88,111];
Gradient boosting [112] in [66,113];
XGBoost [114] in [75,81,95,111,115,116];
CatBoost [117] in [73,115].
Unsupervised Learning
- Clustering
  –
  K-means clustering [118] in [69,85,100]
- Dimensionality Reduction
  –
  Singular Value Decomposition (SVD) [119] in [67,120]
- Local Matrix Completion (LMC) [121] in [120]
Reinforcement Learning
- Reinforcement learning [122] in [123]

4.1.4. Deep Learning

Deep learning is a subdomain of machine learning that utilizes multilayered artificial neural networks to simulate the complex decision-making of the human brain [124,125]. It is a subset of machine learning but was included in the taxonomy as a separate unit since it is characterized by a unique architecture that enables the model to learn multiple levels of abstraction, capturing both simple and complex data features. The most frequently used specific methods were Long Short-Term Memory (LSTM) networks (9), and Convolutional Neural Networks (CNNs) (7).

Deep learning

Convolutional Neural Network (CNN) [126] in [106,127,128,129,130,131,132];
Long Short-Term Memory (LSTM) Networks [133] in [67,88,127,129,131,134,135,136,137];
Recurrent Neural Networks (RNNs) [138] in [88,139];
Feed-Forward Artificial Neural Networks [140] in [47,141];
Temporal Convolutional Networks (TCNs) [142] in [143];
Deep Recurrent Q-Learning Network (DRQN) [144] in [137];
Region-Based Convolutional Neural Network (R-CNN) [145] in [146];
Deep Convolutional Neural Networks [147] in [106,148];
Back-Propagation Neural Network [149] in [150];
Adaptive Neural Network [151] in [152];
Bidirectional LSTMs [153] in [131];
Artificial Neural Networks (General) [154] in [55,64,71,72,75,77,97,155,156,157];
Hybrid Long Short-Term Memory + Convolutional Neural Network in [132];
Cascaded Pyramid Network [158] in [146].

4.1.5. Other Methods

In the other methods section, methods were listed deemed not to fit a specific field or were the only representatives of their field.

Other methods

Markov Decision Process [159] in [160];
You Only Look Once (YOLO—real-time object detection) [161] in [106];
Change-Point Segmentation Algorithm in [91];
Large Language Model (Chat GPT v3) [162] in [163];
Custom Supervised Learning Method in [164];
Custom Correlation-Based Algorithm in [165];
Customized Predictive Algorithm in [166];
Collaborative Filtering [167] in [95];
Case-Based Reasoning (CBR) [168] in [98];
Tabu Search [169] in [108];
Subgroup Discovery [170] in [85].

The most popular general category was machine learning (used 65 times), followed by deep learning (40), computational intelligence (13), and data mining (3), with other methods being used 11 times. When investigating individual approaches, the most popular approaches, which occurred six or more times, were Artificial Neural Networks (10), Long-Short Term Memory (9), support vector machines (8), k-nearest neighbors (8), Convolutional Neural Networks (7), and XGBoost (6).

4.2. Sports

A total of nine endurance sports were identified, where intelligent approaches were used, as shown in Table 7. The most studied sports were running (35) and cycling (21). A single study could address multiple sports.

4.3. Study Focuses

Each study had a specific goal, which was fulfilled by incorporating an intelligent method in the process. We have identified seven general topics that were the focuses of the inspected studies, with them being the following:

Fatigue and injury management aims to predict when an athlete is at risk of injuries and adjust their training loads or techniques accordingly.
Pacing/effort strategies and (training) path optimization are related to choosing the optimum effort the athlete should expend.
Performance prediction and evaluation are related to evaluating the training performance and predicting the athlete’s future results.
Physiological metrics and biomechanics focused on quantifying the internal and external factors that impact performance.
Technique analysis and classification mostly related to analyzing the athletes movement.
Training planning and adaptation address the design and continuous adjustment of training plans.
Other includes studies whose goals do not fit into identified categories.

The diverse research topics identified in the literature demonstrate the broad applicability of intelligent data analysis in endurance sports training. Together, these research papers illustrate how intelligent data analysis enhances our understanding of endurance sports and offers practical tools for optimizing athletic performance and safety. The categorization of individual papers into previously presented categories is shown in Table 8.

Computational intelligence techniques were used primarily in training planning and adaptation, pacing/effort strategies and path optimization, as well as performance prediction and evaluation, reflecting their strength in optimizing training strategies and modeling performance outcomes. Data mining approaches were applied in performance prediction and evaluation, where they played a critical role in extracting insights from large datasets of previous training or competition results. Deep learning methods were most used in technique analysis and classification. Deep learning was also used frequently in performance prediction and evaluation and physiological metrics and biomechanics, showcasing the increasing reliance on neural networks for processing complex physiological and biomechanical data. The distribution of different techniques demonstrates the applicability of intelligent techniques in endurance sports training research and is shown in Table 9.

Studies were also examined and categorized according to the number of participants for the applied approaches, as shown in Table 10 below. The not applicable or 0 refers to studies that were not tested on individuals or the number of individuals was not described. When combining these results, we can see that 37 (~49%) studies were applied on 10 or fewer individuals, or the number of individuals was not stated clearly.

The use of smart devices, sensors, and wearables has become an essential component in modern endurance sports training. These devices enable precise data collection, real-time feedback, and advanced performance analysis, providing athletes and coaches with valuable insights that help optimize training, prevent injuries, and improve overall efficiency. The data gathered using these devices allow for individualized training programs, ensuring athletes can push their natural limits while minimizing injury risks.

The research identified six categories of devices used in endurance sports training, each capturing specific types of data.

Wearable Devices and On-Body Sensors: Smartwatches, wristbands, and rings track metrics like heart rate, sleep patterns, step count, and movement efficiency. These are essential for monitoring daily training loads, energy expenditure, and recovery states [173].
Inertial and Motion Sensors: Devices such as accelerometers, gyroscopes, and IMUs (inertial measurement units) provide crucial information about movement patterns, stride mechanics, cadence, and stability, which are key factors in optimizing running, cycling, and skiing techniques [174].
Physiological and Biometric Sensors: Heart rate monitors, pulse oximeters, lactate analyzers, and electromyography (sEMG) sensors measure internal physiological responses to training. These devices help assess cardiovascular efficiency, muscular activation, and metabolic thresholds, aiding in performance prediction and fatigue management [175].
Performance and Exercise Equipment: Power meters, ergometers, cadence sensors, and cycling computers measure real-time output, ensuring that athletes train at the correct intensity levels [176].
Location and Environmental Sensors: GPS devices track distance, speed, elevation, and route optimization. Environmental sensors, such as barometers and weather API integrations, provide real-time information on external conditions affecting performance [177].
Imaging and Motion Capture Systems: Video cameras, motion capture systems, and force plates analyze biomechanics, posture, and gait efficiency, helping refine technique and minimize injury risk [178].

By leveraging data from these devices, endurance athletes can fine-tune their training plans based on objective metrics rather than subjective perception alone. Real-time feedback allows for immediate adjustments in intensity, form, or pacing. Furthermore, by monitoring biometrics and external conditions continuously, athletes can prevent overtraining and reduce injury risks. With machine learning and artificial intelligence applications, these data can be analyzed further to detect patterns, optimize training loads, and even predict future performance outcomes [179]. The categorization of research papers by the sensors and devices used is shown in Table 11. The most popular category of devices was Physiological and Biometric Sensors, which were used in 33 papers, followed by Inertial and Motion Sensors (20), and Location and Environmental Sensors (17). The least used were Wearable Devices and On-Body Sensors, which were used nine times. These devices are general and include several sensors, which were described more thoroughly in other categories. This may lead to a lower number in the first category.

The device used the most times was the heart rate monitor, with 28 occurrences, followed by GPS, with 16 occurrences.

The use of the three most popular devices in each sport is presented in Table 12. Race walking and Ironman are not specified because no sensors were used in the studies due to the approaches being based on the competition results. As shown in the Table, heart rate monitor is in the top three devices, used in four out of seven sports, since it is one of the most accurate sensors for obtaining measurable data in endurance sports. The only sports where a heart rate monitor is not used are cross-country skiing, speed skating, and biathlon, which use more specific sensors. Additionally, less research has focused on these sports, leading to fewer cases of sensors being used.

5. Discussion

This section presents the findings of our systematic literature review on the role of intelligent data analysis in endurance sports training. The review examines the trends resulting from the analysis of 75 primary sources identified in our systematic literature review. This discussion aims to contextualize the results within intelligent data analysis in the endurance sports training domain and assess the strengths and limitations of current practices critically. The discussion is organized according to the research questions outlined in the methodology.

RQ 1.: Which intelligent data-analysis methods are used in endurance sports, what performance parameters and outcomes are they focused on, and in which disciplines are they implemented most frequently?

The purpose of our first research question aimed to map the current research space of intelligent data analysis in endurance sports training. To construct this overview, we investigated three core components: the specific analytical methods being used (RQ 1.1), the primary goals and applications of these methods (RQ 1.2), and the endurance disciplines receiving the most research attention (RQ 1.3). The following subsections discuss the findings for each of these components in detail.

RQ 1.1: Which intelligent data analysis approaches were used in endurance sports training?

A wide range of data analysis methods were used in endurance sports training, but machine learning (ML) methods were by far the most common, used in ~49% papers. Deep learning (DL) approaches followed, accounting for roughly ~41%. This trend is a consequence of the fact that the main goals in sports analytics often focus on prediction and classification (e.g., [65,95,113,115,120]). The most popular machine learning techniques included SVM, KNN, linear regression, random forests, and XGBoost. These methods are well-established in machine learning and have been in use for over 20 years, except for XGBoost, which is newer.

Deep learning is used increasingly to analyze complex, high-dimensional data. For example, Long Short-Term Memory (LSTM) networks are particularly suited for the time-series nature of training data. Studies have shown their effectiveness in estimating physiological states during running [134] and cycling [129]. Convolutional Neural Networks (CNNs) excel at predicting from visual and sensor data. They are being used for technique analysis, such as identifying cross-country skiing techniques [127,131].

In contrast, computational intelligence (CI) methods were used less frequently, appearing 12 times, but they played a crucial niche role in optimization. These methods were applied mainly to complex planning problems, where a single correct answer might not exist. This includes generating personalized training routes through use of genetic algorithms [33], or adapting plans with Differential Evolution [39]. This indicates a precise task-specific application of methods in the existing literature.

Overall, 63 different techniques were used, which shows that numerous challenges the researchers are trying to tackle require specific niche approaches in many cases.

RQ 1.2: What was the focus of studies utilizing intelligent data analysis approaches?

The application of intelligent data analysis in endurance sports covers diverse categories of training themes. The most prominent focus areas were performance prediction and evaluation, technique analysis and classification, and training planning and adaptation, underscoring a primary demand for data-driven tools to forecast athletic potential and prescribe optimized training regimes.

For performance prediction, researchers relied heavily on traditional machine learning models to analyze historical data and predict future outcomes [65,97,115]. In contrast, while machine learning still dominated the field in training planning and adaptaion, computational intelligence methods were often involved, whose optimization capabilities are well-suited for creating personalized and adaptive plans [48,51,72].

The next tier of research interests included technique analysis and classification (14 studies), followed by fatigue and injury management (10 studies), and pacing and effort strategies (9 studies). Technique analysis, crucial for improving efficiency and preventing strain, was dominated overwhelmingly by deep learning methods. The use of models like CNNs and LSTMs is a natural fit for interpreting complex, high-dimensional data from IMUs and video feeds to classify movements or identify flaws in form [127,146,156]. The focus on fatigue and injury management addresses the critical need to maintain athlete health and longevity. Here, studies typically employed ML classifiers to predict injury risk based on training load, biomechanical markers, and physiological responses [66,92,111]. Similarly, developing optimal pacing and effort strategies, another optimization-oriented task benefited from both ML and CI approaches to model race dynamics and suggest ideal energy expenditure [33,57].

Finally, a smaller but foundational group of studies focused on modeling physiological metrics and biomechanics (seven studies). This research aims to translate raw sensor data into meaningful biological insights, such as estimating metabolic thresholds or biomechanical forces [134,139]. Deep learning was particularly prevalent in this area because it captures intricate patterns in physiological time-series data.

Overall, the distribution of studies‘ focuses reveals a methodologically complex landscape. There is a clear alignment between the nature of the problem and the choice of intelligent method: established ML models are the choice of many for predictive tasks, CI algorithms are deployed for complex optimization challenges, and DL excels at perception and pattern recognition from raw sensor data. This task-specific application of technology demonstrates a maturing field focused on solving tangible problems in endurance sports.

RQ 1.3: Which endurance sports disciplines are most frequently supported by the implementation of intelligent data analysis methods?

Among the endurance sports studied, running was the most popular sport to be studied, with 35 of the studies that were analyzed studying it. This is because running is accessible and ubiquitous worldwide. It is simple to obtain data using commercially available wearables, and it is of interest at both the recreational and elite levels. The running studies used mostly smart data analysis to monitor training loads, predict injuries, and optimize pacing strategies. For instance, Rothschild et al. [75] and Martinez-Gramage et al. [92] utilized wearable-based data and machine learning to predict performance outcomes and determine biomechanical inefficiencies.

Cycling ranked as the second most reported sport, being featured in 21 studies. Cyclists benefit from high-grade datasets generated by power meters, cadence sensors, and GPS-equipped cycling computers. These data sources allow for many applications, from workload monitoring to power output modeling and predictive performance forecasting.

Fister et al. [42] present an evolutionary-algorithm approach that generates cycling training sessions on a topology graph using a TSS-inspired objective, while Sagi et al. [73] develop a classification-based recommendation framework for race team selection, where CatBoost achieves the best performance.

Rowing, the focus of seven studies, is technically inclined, with intelligent methods often being employed to study movement skills and stroke techniques. Zhang et al. [146] employed convolutional neural networks and video-based analysis to extract biomechanical parameters, and Wang et al. [150] employed AI to construct customized training (nutrition) programs from rowing ergometer data.

Cross-country skiing (seven cases), demonstrated a strong prevalence of inertial measurement units (IMUs) to determine types of skiing and detect movement inefficiencies (e.g., Jang et al. [127] and Seeberg et al. [165] are studies that show the ability of sensor fusion and neural networks in sports with complex movement patterns).

Triathlon, a data-rich endurance sport by nature, was researched in only two studies. The lack of studies may reflect, either its lower relative popularity compared to individual disciplines or increasing complexity in combining data from multiple sport disciplines at the same time.

Other sports studied less often included biathlon (two studies), speed skating (two), and Ironman competitions (one). These sports might entail specialized equipment or be challenging to standardize data on, possibly limiting their appearance in intelligent data analysis research. Nevertheless, the existing studies (e.g., Maier et al. [72] biathlon and Krumm et al. [64] speed skating), yielded interesting results with the incorporation of shooting accuracy indicators and biomechanical analysis.

Endurance sports like running and cycling benefit from wide commercial sensor support and large user bases, while specialized endurance sports trail behind, suggesting where future research could be directed.

RQ 2.: What are the most common IoT (Internet of Things) devices, wearables, and sensors combined in endurance sports training combined with intelligent data analysis approaches, and how do they contribute to the training process and data collection and analysis?

The deployment of IoT sensors and wearable sensors is at the heart of applying smart data analysis for endurance training. Out of 75 reviewed studies, heart rate monitors were the most frequently applied devices (23 studies), given that they provide priceless information regarding the internal load, training intensity, and cardiovascular adaptation. Their frequent use in consumer markets and clinical settings makes them highly congruent, with data-driven performance monitoring and modeling.

GPS sensors were the next most common (14 studies), employed mainly in outdoor sports such as running, cycling, and triathlon. GPS tracking allows monitoring of distance, altitude, speed, and route planning, which are used in machine learning models to identify pacing habits or overtraining, such as demonstrated by Lovdal et al. [111] and Berndsen et al. [116].

Inertial measurement units (IMUs) were applied extensively in biomechanical examinations, especially in cross-country skiing, running, and rowing. IMUs, which combine accelerometers, gyroscopes, and magnetometers, provide abundant movement data for classifying techniques and detecting inefficiency. For example, Jang et al. [127] applied CNN and LSTM models based on IMU data to classify skiing styles accurately.

Smartwatches, wristbands, and rings were used in four studies, which were multi-sensor wearables that can collect a range of data, including heart rate, number of steps, sleep, and stress level. Oura Ring and Garmin watches contain cloud integration that facilitates large-scale data integration for predictive modeling, as illustrated in the study by Rothschild et al. [75].

Power meters and cycling computers, though popular in the context of cycling research, were used in eight studies. These devices allow for the direct measurement during the exercise of external workload (e.g., watts), cadence, and torque. Sagi et al. [73] and Fister et al. [42] used power output measurements for determining performance trends and prediction of fatigue through ensemble machine learning methods.

Technology hardware, such as force plates, video cameras, and movement capture equipment, was applied primarily to sports requiring accurate technical assessment. For instance, Krumm et al. [64] used ground reaction forces in speed skating, while Zhang et al. [146] applied R-CNNs to extract posture information from rowing videos.

Less often, studies utilized lactate analyzers, pulse oximeters, and surface EMG sensors to quantify internal physiological responses, muscle activation, and aerobic fitness. Such sensors are particularly beneficial in clinical or laboratory-based measurements in building more precise training stress and adaptation models.

More broadly, the results indicate a trend toward multimodal sensor fusion, whereby multiple data streams—physiological, biomechanical, and environmental—are integrated into smart systems. This enables holistic training analysis, personalized feedback, and the creation of adaptive training systems. As the price of devices declines and interoperability rises, the role of IoT and wearables in endurance sports can be anticipated to grow, supported by smart data analysis architectures.

Limitations to Validity

Despite conducting this systematic literature review carefully, several limitations still have to be acknowledged. We acknowledge and recognize the following limitations of our study findings: time frame limitations, the quality and validity of the primary studies, and scope and selection bias.

Time frame limitations: The review contains literature up to October 2024. Because of the development of the field, newer methods and studies have been published since our search concluded.
Quality and validity of primary studies: The validity of our synthesis is fundamentally dependent on the quality of the included studies. A major challenge we identified is the prevalence of research based on small, participant samples (as shown in Table 10, almost 50% of studies had 10 or fewer participants). This limits the generalizability of findings and highlights a need for larger, more diverse validation studies.
Scope and selection bias: Our review protocol focused exclusively on peer-reviewed, English-language articles from four major scientific databases and limited coverage from additional sources identified from relevant SLRs. This decision introduces potential publication bias (as studies with null or negative results are less likely to be published) and language bias. This means that relevant work published in languages other than English, as well as any findings in the gray literature, has not been identified.

6. Conclusions

This paper reviewed the role of intelligent data analysis methods in endurance sports training and how they can enhance it. Endurance sports cover many highly competitive activities where sustained performance over extended durations is vital. Given the complexity of factors involved in athletic performance, intelligent data analysis has emerged as a powerful tool for refining the training approaches.

The field has risen in popularity from 2018 onwards and has received constant attention ever since. Overall, 75 papers were identified.

The approaches have been used in all areas of endurance sports training, with the most popular addressed areas of performance prediction and evaluation (19), technique analysis and classification (14), and training planning and adaptation (13); these areas demonstrate a broad need among athletes and coaches for accurate forecasting of training outcomes and individualized, data-driven program design. Additional study areas included pacing strategies and training path optimization (nine), reflecting the growing interest in real-time, adaptive coaching solutions.

The most often studied sports were running (35) and cycling (21), which could be attributed to their widespread popularity.

Of the algorithms identified, neural networks (general ANN, LSTM, CNN), support vector machines, k-nearest neighbors, and XGBoost were applied the most frequently. Machine learning was shown to be the most popular approach when looking at results in general. Our review shows that the use of devices in approaches is dominated by heart rate monitors and GPS devices, which provide crucial data on internal physiological load and external performance. While these are standard, a growing number of studies are leveraging more advanced sensors like IMUs for biomechanical analysis and multimodal wearables for more comprehensive athlete monitoring. Despite the number of studies reviewed, this review is subject to potential publication bias, as English-language and peer-reviewed sources were included predominantly. Additionally, many identified studies involved small participant samples, limiting the generalizability of the results.

As costs decline and devices become more user-friendly, these tools could benefit elite competitors and everyday sports enthusiasts, changing how endurance training is conducted. In conclusion, intelligent data analysis provides substantial interventions for enhancing endurance sports training and raising the bar of athletic achievement. Given the competitiveness of endurance sports, it will continue to do so in the coming years.

The future directions of research on the domain could include application of methods used in running and cycling (currently the most studied) to underrepresented less popular endurance sports (e.g., biathlon, cross-country skiing, triathlon), developing artificial sports trainers (as proposed in [135] for cycling) for all types of endurance sports, integrating multimodal sensor data, developing robust frameworks that integrate multiple data sources and adapt based on available data, and verification of approaches tested on a small number of individuals.

Author Contributions

Conceptualization, A.R. and I.F.J.; methodology, A.R., I.F.J., P.K., and P.R.; formal analysis, A.R. and P.R.; investigation, A.R. and P.R.; writing—original draft preparation, A.R. and P.R.; writing—review and editing, A.R., I.F.J., P.R., and P.K.; visualization, A.R.; supervision, I.F.J. and P.K.; project administration, I.F.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Slovenian Research Agency, Research Core Funding No. P2-0057.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AI	Artificial Intelligence
ANFIS	Adaptive Neuro-Fuzzy Inference System
ANN	Artificial Neural Network
ARM	Association Rule Mining
CatBoost	Categorical Boosting
CI	Computational Intelligence
CNN	Convolutional Neural Network
CBR	Case-Based Reasoning
CPET	Cardiopulmonary Exercise Testing
DL	Deep Learning
DRQN	Deep Recurrent Q-Learning Network
GPS	Global Positioning System
IMU	Inertial Measurement Unit
IoT	Internet of Things
KNN	k-Nearest Neighbors
LASSO	Least Absolute Shrinkage and Selection Operator
LMC	Local Matrix Completion
LSTM	Long Short-Term Memory
ML	Machine Learning
PRISMA	Preferred Reporting Items for Systematic reviews and Meta-Analyses
PSO	Particle Swarm Optimization
R-CNN	Region-Based Convolutional Neural Network
RNN	Recurrent Neural Network
RVM	Relevance Vector Machine
sEMG	Surface Electromyography
SLR	Systematic Literature Review
SST	Smart Sports Training
SVM	Support Vector Machine
TCN	Temporal Convolutional Network
YOLO	You Only Look Once
XGBoost	eXtreme Gradient Boosting

References

Berthold, M.R.; Hand, D.J. Intelligent Data Analysis; Chapter an Introduction; Springer: Berlin/Heidelberg, Germany, 2003; pp. 1–15. [Google Scholar]
El-Maghrabi, Y.; Sharif, M. Game Changers or Game Predictors? Big Data Analytics in Sports for Performance Enhancement and Fan Engagement. J. Contemp. Healthc. Anal. 2022, 6, 19–39. [Google Scholar]
Mallen, C. Artificial Intelligence and Implications for Sport Officiating. In Emerging Technologies in Sport: Implications for Sport Management; Mallen, C., Ed.; Routledge: Abingdon, Oxon; New York, NY, USA, 2019; pp. 87–104. [Google Scholar] [CrossRef]
Hubáček, O.; Šourek, G.; Železný, F. Exploiting sports-betting market using machine learning. Int. J. Forecast. 2019, 35, 783–796. [Google Scholar] [CrossRef]
Rajšp, A.; Fister, I., Jr. A systematic literature review of intelligent data analysis methods for smart sport training. Appl. Sci. 2020, 10, 3013. [Google Scholar] [CrossRef]
Booth, R. Football Coaches Could Soon Be Calling on AI to Scout the Next Superstar. 2025. Available online: https://www.theguardian.com/technology/2025/jan/04/football-coaches-could-soon-be-calling-on-ai-to-scout-the-next-superstar (accessed on 17 February 2025).
Market U.S. Global AI in Sports Analytics Market. 2024. Available online: https://market.us/report/ai-in-sports-analytics-market/ (accessed on 17 February 2025).
Insider, S. Sports Analytics Market Size Worth US$ 20.48 Billion By 2032 with a CAGR of 22.51%, Boosted by Surged Usage of Wearable Technology | Research by SNS Insider. 2024. Available online: https://www.globenewswire.com/news-release/2024/07/12/2912556/0/en/Sports-Analytics-Market-Size-Worth-US-20-48-Billion-By-2032-With-a-CAGR-of-22-51-Boosted-by-Surged-Usage-of-Wearable-Technology-Research-by-SNS-Insider.html (accessed on 17 February 2025).
Jud, M.; Thalmann, S. AI in digital sports coaching—A systematic review. Manag. Sport Leis. 2025, 1–17. [Google Scholar] [CrossRef]
FIFA. Video Assistant Referee. 2024. Available online: https://inside.fifa.com/innovation/standards/video-assistant-referee (accessed on 17 February 2025).
Times, T.N.Y. 2024 Paris Olympics: How AI and Omega Are Shaping the Future of Sports. 2024. Available online: https://www.nytimes.com/athletic/5646415/2024/07/25/2024-paris-olympics-ai-omega/ (accessed on 17 February 2025).
Cambridge University. Cambridge Online Dictionary: Endurance. 2024. Available online: https://dictionary.cambridge.org/dictionary/english/endurance (accessed on 30 September 2024).
Bonidia, R.P.; Rodrigues, L.A.L.; Avila-Santos, A.P.; Sanches, D.S.; Brancher, J.D. Computational Intelligence in Sports: A Systematic Literature Review. Adv.-Hum.-Comput. Interact. 2018, 2018, 1–21. [Google Scholar] [CrossRef]
Krstić, D.; Vučković, T.; Dakić, D.; Ristić, S.; Stefanović, D. The Application and Impact of Artificial Intelligence on Sports Performance Improvement: A Systematic Literature Review. In Proceedings of the 2023 4th International Conference on Communications, Information, Electronic and Energy Systems (CIEES), Plovdiv, Bulgaria, 23–25 November 2023; pp. 1–8. [Google Scholar] [CrossRef]
Beal, R.; Norman, T.J.; Ramchurn, S.D. Artificial intelligence for team sports: A survey. Knowl. Eng. Rev. 2019, 34, e28. [Google Scholar] [CrossRef]
Morici, G.; Gruttad’Auria, C.I.; Baiamonte, P.; Mazzuca, E.; Castrogiovanni, A.; Bonsignore, M.R. Endurance training: Is it bad for you? Breathe 2016, 12, 140–147. [Google Scholar] [CrossRef]
Pate, R.R.; Branch, J.D. Training for endurance sport. Med. Sci. Sport. Exerc. 1992, 24, S365. [Google Scholar] [CrossRef]
Tønnessen, E.; Sandbakk, Ø.; Sandbakk, S.B.; Seiler, S.; Haugen, T. Training Session Models in Endurance Sports: A Norwegian Perspective on Best Practice Recommendations. Sport. Med. 2024, 54, 2935–2953. [Google Scholar] [CrossRef]
Aoyagi, A.; Ishikura, K.; Nabekura, Y. Exercise Intensity during Olympic-Distance Triathlon in Well-Trained Age-Group Athletes: An Observational Study. Sports 2021, 9, 18. [Google Scholar] [CrossRef]
OʼToole, M.L.; Douglas, P.S. Applied Physiology of Triathlon. Sport. Med. 1995, 19, 251–267. [Google Scholar] [CrossRef]
Ruth, W. Upper Body Training for Rowing. 2019. Available online: https://rowingstronger.com/2019/05/13/upper-body-training-for-rowing/ (accessed on 17 February 2025).
Bossi, A.; Matta, G.; Millet, G.; Lima, P.; Pertence, L.; Lima, J.; Hopker, J. Pacing Strategy During 24-Hour Ultramarathon-Distance Running. Int. J. Sport. Physiol. Perform. 2016, 12, 1–25. [Google Scholar] [CrossRef] [PubMed]
Kitchenham, B.; Charters, S. Guidelines for performing systematic literature reviews in software engineering version 2.3. Engineering 2007, 45, 1051. [Google Scholar]
Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
Rajšp, A.; Fister, I., Jr.; Rek, P.; Kokol, P. The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review. OSF Regist. 2025. [Google Scholar] [CrossRef]
IEEE. IEEE Xplore. 2024. Available online: https://ieeexplore.ieee.org (accessed on 22 October 2024).
Elsevier. ScienceDirect. 2024. Available online: https://www.sciencedirect.com (accessed on 22 October 2024).
Elsevier. Scopus. 2024. Available online: https://www.scopus.com (accessed on 22 October 2024).
Clarivate. Web of Science. 2024. Available online: https://www.webofscience.com (accessed on 22 October 2024).
Van Noorden, R. More than 10,000 research papers were retracted in 2023—A new record. Nature 2023, 624, 479–481. [Google Scholar] [CrossRef]
Rosa-Clark. Retraction Watch—Crossref—Crossref.org. 2025. Available online: https://www.crossref.org/documentation/retrieve-metadata/retraction-watch/ (accessed on 13 June 2025).
Haddaway, N.R.; Page, M.J.; Pritchard, C.C.; McGuinness, L.A. PRISMA2020: An R package and Shiny app for producing PRISMA 2020-compliant flow diagrams, with interactivity for optimised digital transparency and Open Synthesis. Campbell Syst. Rev. 2022, 18, e1230. [Google Scholar] [CrossRef]
Rajsp, A.; Fister, I. A modified evolutionary algorithm for generating the cycling training routes. IEEE Access 2022, 10, 109743–109759. [Google Scholar] [CrossRef]
Engelbrecht, A.P. Computational Intelligence: An Introduction; Wiley: Hoboken, NJ, USA, 2007. [Google Scholar] [CrossRef]
Zhou, Z.H. Machine Learning; Springer: Singapore, 2021. [Google Scholar] [CrossRef]
Hossain, E. Machine Learning Crash Course for Engineers; Springer International Publishing: Berlin/Heidelberg, Germany, 2024. [Google Scholar] [CrossRef]
Kacprzyk, J.; Pedrycz, W. Introduction. In Springer Handbook of Computational Intelligence; Springer: Berlin/Heidelberg, Germany, 2015; pp. 1–4. [Google Scholar] [CrossRef][Green Version]
Feoktistov, V. Differential Evolution. In Differential Evolution: In Search of Solutions; Springer: Boston, MA, USA, 2006; pp. 1–24. [Google Scholar] [CrossRef]
Fister, I.; Fister, D.; Deb, S.; Mlakar, U.; Brest, J.; Fister, I., Jr. Post hoc analysis of sport performance with differential evolution. Neural Comput. Appl. 2020, 32, 10799–10808. [Google Scholar] [CrossRef]
Fister, I., Jr.; Fister, D.; Iglesias, A.; Galvez, A.; Osaba, E.; Del Ser, J.; Fister, I. Visualization of numerical association rules by hill slopes. In Lecture Notes in Computer Science; Lecture notes in computer science; Springer International Publishing: Cham, Switzerland, 2020; pp. 101–111. [Google Scholar]
Eiben, A.E.; Smith, J.E. What Is an Evolutionary Algorithm? In Introduction to Evolutionary Computing; Springer: Berlin/Heidelberg, Germany, 2015; pp. 25–48. [Google Scholar] [CrossRef]
Fister, I.; Fister, D.; Fister, I. Topology-based generation of sport training sessions. J. Ambient Intell. Humaniz. Comput. 2021, 12, 667–678. [Google Scholar] [CrossRef]
Goh, C.K.; Ong, Y.S.; Tan, K.C.; Teoh, E.J. An investigation on evolutionary gradient search for multi-objective optimization. In Proceedings of the 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–6 June 2008; IEEE: Piscataway, NJ, USA, 2008; pp. 3741–3746. [Google Scholar] [CrossRef]
Deepak, V.; Anguraj, D.K.; Mantha, S.S. An efficient recommendation system for athletic performance optimization by enriched grey wolf optimization. Pers. Ubiquitous Comput. 2023, 27, 1015–1026. [Google Scholar] [CrossRef]
Lambora, A.; Gupta, K.; Chopra, K. Genetic Algorithm—A Literature Review. In Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India, 14–16 February 2019; pp. 380–384. [Google Scholar] [CrossRef]
Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the ICNN’95—International Conference on Neural Networks, Perth, Australia, 27 November–1 December 1995; IEEE: Piscataway, NJ, USA, 1995; Volume 4, ICNN-95. pp. 1942–1948. [Google Scholar] [CrossRef]
Pogson, M.; Verheul, J.; Robinson, M.A.; Vanrenterghem, J.; Lisboa, P. A neural network method to predict task- and step-specific ground reaction force magnitudes from trunk accelerations during running activities. Med. Eng. Phys. 2020, 78, 82–89. [Google Scholar] [CrossRef]
Fister, I., Jr.; Iglesias, A.; Osaba, E.; Mlakar, U.; Brest, J.; Fister, I. Adaptation of sport training plans by swarm intelligence. In Recent Advances in Soft Computing; Advances in intelligent systems and computing; Springer International Publishing: Cham, Switzerland, 2019; pp. 56–67. [Google Scholar]
Fister, I.; Brest, J.; Iglesias, A.; Fister, I., Jr. Framework for planning the training sessions in triathlon. In Proceedings of the Genetic and Evolutionary Computation Conference Companion, Kyoto, Japan, 15–19 July 2018. [Google Scholar]
Nadimi-Shahraki, M.H.; Taghian, S.; Mirjalili, S. An improved grey wolf optimizer for solving engineering problems. Expert Syst. Appl. 2021, 166, 113917. [Google Scholar] [CrossRef]
Fister, I.; Rauter, S.; Yang, X.S.; Ljubivc, K.; Fister, I., Jr. Planning the sports training sessions with the bat algorithm. Neurocomputing 2015, 149, 993–1002. [Google Scholar] [CrossRef]
Yang, X.S. Bat algorithm for multi-objective optimisation. Int. J. Bio-Inspired Comput. 2011, 3, 267. [Google Scholar] [CrossRef]
Fister, I., Jr.; Fister, D.; Iglesias, A.; Galvez, A.; Rauter, S.; Fister, I. Population-based metaheuristics for planning interval training sessions in mountain biking. In Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2019; pp. 70–79. [Google Scholar]
Jang, J.S. ANFIS: Adaptive-network-based fuzzy inference system. IEEE Trans. Syst. Man Cybern. 1993, 23, 665–685. [Google Scholar] [CrossRef]
Sudin, S.; Md Shakaff, A.Y.; Zakaria, A.; Salleh, A.F.; Kamarudin, L.M.; Azmi, N.; Ahmad Saad, F.S. Real-time track cycling performance prediction using ANFIS system. Int. J. Perform. Anal. Sport 2018, 18, 806–822. [Google Scholar] [CrossRef]
Zadeh, L. Fuzzy logic. Computer 1988, 21, 83–93. [Google Scholar] [CrossRef]
Dziomdziora, A.; Taibi, D. Running pace adjustment and training distance fitting with fuzzy logic and machine learning. In Proceedings of the 2022 21st International Symposium on Communications and Information Technologies (ISCIT), Xi’an, China, 27–30 September 2022; IEEE: Piscataway, NJ, USA, 2022. [Google Scholar]
Fayyad, U.M.; Piatetsky-Shapiro, G.; Smyth, P. From Data Mining to Knowledge Discovery in Databases. AI Mag. 1996, 17, 37–54. [Google Scholar]
Piatetsky-Shapiro, G. Discovery, Analysis, and Presentation of Strong Rules. In Knowledge Discovery in Databases; MIT Press: Cambridge, MA, USA, 1991. [Google Scholar]
Hand, D.J.; Adams, N.M. Data Mining, In Wiley StatsRef: Statistics Reference Online; John Wiley & Sons, Ltd.: Chichester, UK, 2014; pp. 1–7. [Google Scholar] [CrossRef]
Malkinson, T.J. Male and Female Age-Division VO2 Max: Marathon and Ironman® Triathlon Performance. J. Exerc. Physiol. Online 2022, 25, 3. [Google Scholar]
Mahesh, B. Machine Learning Algorithms—A Review. Int. J. Sci. Res. (IJSR) 2020, 9, 381–386. [Google Scholar] [CrossRef]
Su, X.; Yan, X.; Tsai, C. Linear regression. WIREs Comput. Stat. 2012, 4, 275–294. [Google Scholar] [CrossRef]
Krumm, D.; Kuske, N.; Neubert, M.; Buder, J.; Hamker, F.; Odenwald, S. Determining push-off forces in speed skating imitation drills. Sports Eng. 2021, 24, 25. [Google Scholar] [CrossRef]
Epanchintseva, A.; Bakaev, M. Predicting performance and functional reserves of athletes based on their pulse indicators in different trainings. In Springer Geography; Springer Nature: Cham, Switzerland, 2024; pp. 237–245. [Google Scholar]
Derie, R.; Robberechts, P.; Van den Berghe, P.; Gerlo, J.; De Clercq, D.; Segers, V.; Davis, J. Tibial acceleration-based prediction of maximal vertical loading rate during overground running: A machine learning approach. Front. Bioeng. Biotechnol. 2020, 8, 33. [Google Scholar] [CrossRef] [PubMed]
Song, B.; Paolieri, M.; Stewart, H.E.; Golubchik, L.; McNitt-Gray, J.L.; Misra, V.; Shah, D. Estimating ground reaction forces from inertial sensors. IEEE Trans. Biomed. Eng. 2024, 72, 595–608. [Google Scholar] [CrossRef] [PubMed]
Bradley, R.A.; Srivastava, S.S. Correlation in Polynomial Regression. Am. Stat. 1979, 33, 11–14. [Google Scholar] [CrossRef]
Kholkine, L.; Latré, S.; Verdonck, T.; de Leeuw, A.W. Age of peak performance in professional road cycling. J. Sports Sci. 2023, 41, 298–306. [Google Scholar] [CrossRef]
Berkson, J. Application of the Logistic Function to Bio-Assay. J. Am. Stat. Assoc. 1944, 39, 357–365. [Google Scholar] [CrossRef]
Liu, M.; Chen, Y.; Guo, Z.; Zhou, K.; Zhou, L.; Liu, H.; Bao, D.; Zhou, J. Construction of women’s all-around speed skating event performance prediction model and competition strategy analysis based on machine learning algorithms. Front. Psychol. 2022, 13, 915108. [Google Scholar] [CrossRef]
Maier, T.; Meister, D.; Trösch, S.; Wehrlin, J.P. Predicting biathlon shooting performance using machine learning. J. Sports Sci. 2018, 36, 2333–2339. [Google Scholar] [CrossRef]
Sagi, M.; Saldanha, P.; Shani, G.; Moskovitch, R. Pro-cycling team cyclist assignment for an upcoming race. PLoS ONE 2024, 19, e0297270. [Google Scholar] [CrossRef] [PubMed]
Ranstam, J.; Cook, J.A. LASSO regression. Br. J. Surg. 2018, 105, 1348. [Google Scholar] [CrossRef]
Rothschild, J.A.; Stewart, T.; Kilding, A.E.; Plews, D.J. Predicting daily recovery during long-term endurance training using machine learning analysis. Eur. J. Appl. Physiol. 2024, 124, 3279–3290. [Google Scholar] [CrossRef] [PubMed]
Smola, A.J.; Schölkopf, B. A tutorial on support vector regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef]
Füller, M.; Meenakshi Sundaram, A.; Ludwig, M.; Asteroth, A.; Prassler, E. Modeling and predicting the human heart rate during running exercise. In Communications in Computer and Information Science; Communications in computer and information science; Springer International Publishing: Cham, Switzerland, 2015; pp. 106–125. [Google Scholar]
Weisburd, D.; Britt, C.; Wilson, D.B.; Wooditch, A. An Introduction to Bivariate Regression. In Basic Statistics in Criminology and Criminal Justice; Springer International Publishing: Berlin/Heidelberg, Germany, 2020; pp. 531–580. [Google Scholar] [CrossRef]
O’Loughlin, E.; Nikolaidis, P.T.; Rosemann, T.; Knechtle, B. Different predictor variables for women and men in ultra-marathon running-the Wellington Urban Ultramarathon 2018. Int. J. Environ. Res. Public Health 2019, 16, 1844. [Google Scholar] [CrossRef]
Liang, K.Y.; Zeger, S.L.; Qaqish, B. Multivariate Regression Analyses for Categorical Data. J. R. Stat. Soc. Ser. B Stat. Methodol. 1992, 54, 3–24. [Google Scholar] [CrossRef]
Wiecha, S.; Kasiak, P.S.; Cieśliński, I.; Maciejczyk, M.; Mamcarz, A.; Śliż, D. Modeling physiological predictors of running velocity for endurance athletes. J. Clin. Med. 2022, 11, 6688. [Google Scholar] [CrossRef]
Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Routledge: London, UK, 2017. [Google Scholar] [CrossRef]
Brodani, J.; Toth, M.; Spisiak, M.; Siska, L. Training indicators as predictors of the sport performance of the race walker Matej Tóth in YTC 2013/2014 to YTC 2015/2016. Phys. Act. Rev. 2018, 6, 161–170. [Google Scholar] [CrossRef]
Braak, C.J.F.t.; Looman, C.W.N.; Jongman, R.H.G.; Braak, C.J.F.T.; Tongeren, O.F.R.v. Regression. In Data Analysis in Community and Landscape Ecology; Cambridge University Press: Cambridge, UK, 1995; pp. 29–77. [Google Scholar]
de Leeuw, A.W.; Meerhoff, L.A.; Knobbe, A. Effects of pacing properties on performance in long-distance running. Big Data 2018, 6, 248–261. [Google Scholar] [CrossRef]
de Leeuw, A.W.; Heijboer, M.; Verdonck, T.; Knobbe, A.; Latré, S. Exploiting sensor data in professional road cycling: Personalized data-driven approach for frequent fitness monitoring. Data Min. Knowl. Discov. 2023, 37, 1125–1153. [Google Scholar] [CrossRef]
de Ville, B. Decision trees. WIREs Comput. Stat. 2013, 5, 448–455. [Google Scholar] [CrossRef]
Padmanandam, K.; Akhila, T.; Divya Sri, A.; Sunidhi, K.; Amulya, B. Athletic runner injury prediction system. In Proceedings of the 2024 International Conference on Data Science and Network Security (ICDSNS), Tiptur, India, 26–27 July 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–7. [Google Scholar]
Priymak, S.; Krutsevich, T.; Pangelova, N.; Trachuk, S.; Kravchenko, T.; Stepanenko, V.; Ruban, V. Modeling of functional support of sports activities of biathletes of different qualifications. J. Hum. Sport Exerc. 2020, 16, 136–146. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Khan, T.; Lundgren, L.E.; Järpe, E.; Olsson, M.C.; Viberg, P. A novel method for classification of running fatigue using change-point segmentation. Sensors 2019, 19, 4729. [Google Scholar] [CrossRef]
Martínez-Gramage, J.; Albiach, J.P.; Moltó, I.N.; Amer-Cuenca, J.J.; Huesa Moreno, V.; Segura-Ortí, E. A random forest machine learning framework to reduce running injuries in young triathletes. Sensors 2020, 20, 6388. [Google Scholar] [CrossRef]
Baldassarri, S.; García de Quirós, J.; Beltrán, J.R.; Álvarez, P. Wearables and machine learning for improving runners’ motivation from an affective perspective. Sensors 2023, 23, 1608. [Google Scholar] [CrossRef]
Kramer, O. K-Nearest Neighbors. In Dimensionality Reduction with Unsupervised Nearest Neighbors; Springer: Berlin/Heidelberg, Germany, 2013; pp. 13–23. [Google Scholar] [CrossRef]
Berndsen, J.; Smyth, B.; Lawlor, A. A collaborative filtering approach to successfully completing the marathon. In Proceedings of the 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA, 14–17 December 2020; IEEE: Piscataway, NJ, USA, 2020. [Google Scholar]
Lopez-Matenci, P.; Alonso, J.V.; Gonzalez-Castano, F.J.; Sieiro, J.L.; Alcaraz, J.J. Ambient intelligence assistant for running sports based on k-NN classifiers. In Proceedings of the 3rd International Conference on Human System Interaction, Rzeszow, Poland, 13–15 May 2010; IEEE: Piscataway, NJ, USA, 2010. [Google Scholar]
Lerebourg, L.; Saboul, D.; Clémençon, M.; Coquart, J.B. Prediction of Marathon Performance using Artificial Intelligence. Int. J. Sports Med. 2023, 44, 352–360. [Google Scholar] [CrossRef]
Smyth, B.; Cunningham, P. Running with cases: A CBR approach to running your best marathon. In Case-Based Reasoning Research and Development; Lecture notes in computer science; Springer International Publishing: Cham, Switzerland, 2017; pp. 360–374. [Google Scholar]
Lowd, D.; Domingos, P. Naive Bayes models for probability estimation. In Proceedings of the 22nd International Conference on Machine learning—ICML ’05, Bonn, Germany, 7–11 August 2005; ACM Press: New York, NY, USA, 2005; pp. 529–536. [Google Scholar] [CrossRef]
Ofoghi, B.; Zeleznikow, J.; Dwyer, D.; Macmahon, C. Modelling and analysing track cycling Omnium performances using statistical and machine learning techniques. J. Sports Sci. 2013, 31, 954–962. [Google Scholar] [CrossRef][Green Version]
Tipping, M.E. The Relevance Vector Machine. Adv. Neural Inf. Process. Syst. 1999, 12, 652–658. [Google Scholar][Green Version]
Cenedese, A.; Susto, G.A.; Terzi, M. A parsimonious approach for activity recognition with wearable devices: An application to cross-country skiing. In Proceedings of the 2016 European Control Conference (ECC), Aalborg, Denmark, 29 June–1 July 2016; IEEE: Piscataway, NJ, USA, 2016. [Google Scholar][Green Version]
Hearst, M.; Dumais, S.; Osuna, E.; Platt, J.; Scholkopf, B. Support vector machines. IEEE Intell. Syst. Their Appl. 1998, 13, 18–28. [Google Scholar] [CrossRef]
Liu, Q.; Chen, H.; Thirupathi, A.; Yang, M.; Baker, J.S.; Gu, Y. A pilot study of plantar mechanics distributions and fatigue profiles after running on a treadmill: Using a support vector machine algorithm. J. Healthc. Eng. 2023, 2023, 7461729. [Google Scholar] [CrossRef]
Hsu, P.Y.; Hsu, Y.C.; Liu, H.L.; Fong Kao, W.; Lin, K.Y. An acute kidney injury prediction model for 24-hour ultramarathon runners. J. Hum. Kinet. 2022, 84, 103–111. [Google Scholar] [CrossRef] [PubMed]
Qi, J.; Li, D.; He, J.; Wang, Y. Optically non-contact cross-country skiing action recognition based on key-point collaborative estimation and motion feature extraction. Sensors 2023, 23, 3639. [Google Scholar] [CrossRef] [PubMed]
Lam, W.; Bacchus, F. Learning Bayesian belief networks: An approach based on the MDL principle. Comput. Intell. 1994, 10, 269–293. [Google Scholar] [CrossRef]
Ofoghi, B.; Zeleznikow, J.; Macmahon, C. A hybrid probabilistic and combinatorial optimization approach to analyzing rowing performance measures. In Proceedings of the PACIS 2011—15th Pacific Asia Conference on Information Systems: Quality Research in Pacific, Brisbane, Australia, 7–11 July 2011. [Google Scholar]
Coadou, Y. Boosted Decision Trees and Applications. EPJ Web Conf. 2013, 55, 02004. [Google Scholar] [CrossRef]
Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef]
Lövdal, S.S.; Den Hartigh, R.J.R.; Azzopardi, G. Injury prediction in competitive runners with machine learning. Int. J. Sports Physiol. Perform. 2021, 16, 1522–1531. [Google Scholar] [CrossRef]
Bentéjac, C.; Csörgő, A.; Martínez-Muñoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2020, 54, 1937–1967. [Google Scholar] [CrossRef]
Cau, F.M.; Mancosu, M.S.; Mulas, F.; Pilloni, P.; Spano, L.D. An intelligent interface for supporting coaches in providing running feedback. In Proceedings of the 13th Biannual Conference of the Italian SIGCHI Chapter: Designing the Next Interaction, Padua, Italy, 23–25 September 2019. [Google Scholar]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ’16, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
Karetnikov, A.; Nuijten, W.; Hassani, M. Data-driven Support of Coaches in Professional Cycling using Race Performance Prediction. In Proceedings of the International Conference on Sport Sciences Research and Technology Support, icSPORTS—Proceedings, Porto, Portugal, 21–22 November 2021; Volume 2021-October, pp. 43–53. [Google Scholar]
Berndsen, J.; Smyth, B.; Lawlor, A. Mining marathon training data to generate useful user profiles. In Communications in Computer and Information Science; Springer International Publishing: Cham, Switzerland, 2020; pp. 113–125. [Google Scholar]
Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased boosting with categorical features. arXiv 2017, arXiv:1706.09516. [Google Scholar] [CrossRef]
Hartigan, J.A.; Wong, M.A. Algorithm AS 136: A K-Means Clustering Algorithm. Appl. Stat. 1979, 28, 100. [Google Scholar] [CrossRef]
Klema, V.; Laub, A. The singular value decomposition: Its computation and some applications. IEEE Trans. Autom. Control 1980, 25, 164–176. [Google Scholar] [CrossRef]
Blythe, D.A.J.; Király, F.J. Prediction and quantification of individual athletic performance of runners. PLoS ONE 2016, 11, e0157257. [Google Scholar] [CrossRef]
Cravo, G. Matrix Completion Problems. Linear Algebra Its Appl. 2009, 430, 2511–2540. [Google Scholar] [CrossRef][Green Version]
Kaelbling, L.P.; Littman, M.L.; Moore, A.W. Reinforcement Learning: A Survey. J. Artif. Intell. Res. 1996, 4, 237–285. [Google Scholar] [CrossRef]
Silacci, A.; Taiar, R.; Caon, M. Towards an AI-based tailored training planning for road cyclists: A case study. Appl. Sci. 2020, 11, 313. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Holdsworth, J. What Is Deep Learning? 2024. Available online: https://www.ibm.com/think/topics/deep-learning (accessed on 18 February 2025).
O’Shea, K.; Nash, R. An Introduction to Convolutional Neural Networks. arXiv 2015, arXiv:1511.08458. [Google Scholar] [CrossRef]
Jang, J.; Ankit, A.; Kim, J.; Jang, Y.J.; Kim, H.Y.; Kim, J.H.; Xiong, S. A unified deep-learning model for classifying the cross-country skiing techniques using wearable gyroscope sensors. Sensors 2018, 18, 3819. [Google Scholar] [CrossRef]
Chow, D.H.K.; Tremblay, L.; Lam, C.Y.; Yeung, A.W.Y.; Cheng, W.H.W.; Tse, P.T.W. Comparison between accelerometer and gyroscope in predicting level-ground running kinematics by treadmill running kinematics using a single wearable sensor. Sensors 2021, 21, 4633. [Google Scholar] [CrossRef]
Han, C.; Liu, P. Effect of deep learning algorithm incorporating attention module optimization on assisted training for youth running sports. IEEE Access 2024, 12, 113960–113971. [Google Scholar] [CrossRef]
Seo, C.; Sabanai, M.; Goto, Y.; Tagami, K.; Ogata, H.; Kanosue, K.; Ohya, J. Extracting and interpreting unknown factors with classifier for foot strike types in running. In Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 10–15 January 2021; IEEE: Piscataway, NJ, USA, 2021. [Google Scholar]
Johansson, M.; Korneliusson, M.; Lawrence, N.L. Identifying cross country skiing techniques using power meters in ski poles. In Communications in Computer and Information Science; Springer International Publishing: Cham, Switzerland, 2019; pp. 52–57. [Google Scholar]
Chang, P.; Wang, C.; Chen, Y.; Wang, G.; Lu, A. Identification of runner fatigue stages based on inertial sensors and deep learning. Front. Bioeng. Biotechnol. 2023, 11, 1302911. [Google Scholar] [CrossRef]
Graves, A. Long Short-Term Memory. In Supervised Sequence Labelling with Recurrent Neural Networks; Springer: Berlin/Heidelberg, Germany, 2012; pp. 37–45. [Google Scholar] [CrossRef]
Uddin, M.Z.; Seeberg, T.M.; Kocbach, J.; Liverud, A.E.; Gonzalez, V.; Sandbakk, Ø.; Meyer, F. Estimation of mechanical power output employing deep learning on inertial measurement data in roller ski skating. Sensors 2021, 21, 6500. [Google Scholar] [CrossRef] [PubMed]
Fister, I., Jr.; Salcedo-Sanz, S.; Iglesias, A.; Fister, D.; Gálvez, A.; Fister, I. New perspectives in the development of the artificial sport trainer. Appl. Sci. 2021, 11, 11452. [Google Scholar] [CrossRef]
Shao, Y.; Li, R.D.; Luo, Y.J.; Zhu, M. Research on running data analysis method based on attention-LSTM. In Proceedings of the 2021 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS), Xi’an, China, 27–28 March 2021; IEEE: Piscataway, NJ, USA, 2021. [Google Scholar]
Silacci, A.; Khaled, O.A.; Mugellini, E.; Caon, M. Designing an e-coach to tailor training plans for road cyclists. In Human Systems Engineering and Design II; Advances in intelligent systems and computing; Springer International Publishing: Cham, Switzerland, 2020; pp. 671–677. [Google Scholar]
Kanagachidambaresan, G.R.; Ruwali, A.; Banerjee, D.; Prakash, K.B. Recurrent Neural Network. In Programming with TensorFlow; Springer International Publishing: Berlin/Heidelberg, Germany, 2021; pp. 53–61. [Google Scholar] [CrossRef]
Etxegarai, U.; Portillo, E.; Irazusta, J.; Arriandiaga, A.; Cabanes, I. Estimation of lactate threshold with machine learning techniques in recreational runners. Appl. Soft Comput. 2018, 63, 181–196. [Google Scholar] [CrossRef]
Bebis, G.; Georgiopoulos, M. Feed-forward neural networks. IEEE Potentials 1994, 13, 27–31. [Google Scholar] [CrossRef]
BenSiSaid, K.; Ababou, N.; Ababou, A.; Roth, D.; von Mammen, S. Tracking rower motion without on-body sensors using an instrumented machine and an artificial neural network. Proc. Inst. Mech. Eng. P. J. Sport. Eng. Technol. 2022, 236, 238–252. [Google Scholar] [CrossRef]
Lea, C.; Vidal, R.; Reiter, A.; Hager, G.D. Temporal Convolutional Networks: A Unified Approach to Action Segmentation. In European Conference on Computer Vision; Springer International Publishing: Cham, Switzerland, 2016. [Google Scholar] [CrossRef]
Hedge, E.T.; Amelard, R.; Hughson, R.L. Prediction of oxygen uptake kinetics during heavy-intensity cycling exercise by machine learning analysis. J. Appl. Physiol. 2023, 134, 1530–1536. [Google Scholar] [CrossRef]
Hausknecht, M.J.; Stone, P. Deep Recurrent Q-Learning for Partially Observable MDPs. arXiv 2015, arXiv:1507.06527. [Google Scholar]
Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Region-Based Convolutional Networks for Accurate Object Detection and Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 142–158. [Google Scholar] [CrossRef]
Zhang, Y.; Zhou, Y.; Wang, P.; He, L.; Zhang, Z.; Ren, J. Intelligent Pose Recognition and Evaluation System for Rowing Sports. In Proceedings of the 2024 5th International Conference on Electronic Communication and Artificial Intelligence (ICECAI), Guangzhou, China, 31 May–2 June 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 620–625. [Google Scholar]
Hinton, G.E.; Osindero, S.; Teh, Y.W. A Fast Learning Algorithm for Deep Belief Nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
Johnson, W.R.; Mian, A.; Robinson, M.A.; Verheul, J.; Lloyd, D.G.; Alderson, J.A. Multidimensional ground reaction forces and moments from wearable sensor accelerations via deep learning. IEEE Trans. Biomed. Eng. 2021, 68, 289–297. [Google Scholar] [CrossRef] [PubMed]
Hecht-Nielsen, R. Theory of the Backpropagation Neural Network. In Neural Networks for Perception; Elsevier: Amsterdam, The Netherlands, 1992; pp. 65–93. [Google Scholar] [CrossRef]
Wang, X.; Li, Z.; Wu, H. Personalized recommendation method of “carbohydrate-protein” supplement based on machine learning and enumeration method. IEEE Access 2023, 11, 100573–100586. [Google Scholar] [CrossRef]
Armstrong, W.W.; Dwelly, A.; Dong Liang, J.; Lin, D.; Reynolds, S. Learning and Generalization in Adaptive Logic Networks. In Artificial Neural Networks; Elsevier: Amsterdam, The Netherlands, 1991; pp. 1173–1176. [Google Scholar] [CrossRef]
Nguyen, T.N.; Su, S.; Celler, B.; Nguyen, H. Advanced portable remote monitoring system for the regulation of treadmill running exercises. Artif. Intell. Med. 2014, 61, 119–126. [Google Scholar] [CrossRef] [PubMed]
da Silva, D.G.; Meneses, A.A.d.M. Comparing Long Short-Term Memory (LSTM) and bidirectional LSTM deep neural networks for power consumption prediction. Energy Rep. 2023, 10, 3315–3334. [Google Scholar] [CrossRef]
Rosenblatt, F. The perceptron: A probabilistic model for information storage and organization in the brain. Psychol. Rev. 1958, 65, 386–408. [Google Scholar] [CrossRef]
Li, N.; Hu, W.; Ma, Y.; Xiang, H. Machine learning prediction of pulmonary oxygen uptake from muscle oxygen in cycling. J. Sports Sci. 2024, 42, 1299–1307. [Google Scholar] [CrossRef]
Rindal, O.; Seeberg, T.; Tjønnås, J.; Haugnes, P.; Sandbakk, Ø. Automatic classification of sub-techniques in classical cross-country skiing using a machine learning algorithm on micro-sensor data. Sensors 2017, 18, 75. [Google Scholar] [CrossRef]
Roczniok, R.; Rygula, I.; Kwasniewska, A. The use of Kohonen’s neural networks in the recruitment process for sport swimming. J. Hum. Kinet. 2007, 17, 75–88. [Google Scholar]
Chen, Y.; Wang, Z.; Peng, Y.; Zhang, Z.; Yu, G.; Sun, J. Cascaded Pyramid Network for Multi-Person Pose Estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018. [Google Scholar]
Bellman, R. A Markovian Decision Process. J. Math. Mech. 1957, 6, 679–684. [Google Scholar] [CrossRef]
Vales-Alonso, J.; López-Matencio, P.; Alcaraz, J.J.; Sieiro-Lomba, J.L.; Costa-Montenegro, E.; González-Castaño, F.J. A dynamic programming approach for ambient intelligence platforms in running sports based on Markov decision processes. In Advances in Intelligent and Soft Computing; Springer: Berlin/Heidelberg, Germany, 2012; pp. 165–181. [Google Scholar]
Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. arXiv 2015, arXiv:1506.02640. [Google Scholar] [CrossRef]
Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I. Language models are unsupervised multitask learners. OpenAI Blog 2019, 1, 9. [Google Scholar]
Düking, P.; Sperlich, B.; Voigt, L.; Van Hooren, B.; Zanini, M.; Zinner, C. ChatGPT generated training plans for runners are not rated optimal by coaching experts, but increase in quality with additional input information. J. Sports Sci. Med. 2024, 23, 56–72. [Google Scholar] [CrossRef] [PubMed]
Fothergill, S.; Harle, R.; Holden, S. Modeling the Model Athlete: Automatic Coaching of Rowing Technique. In Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, Orlando, FL, USA, 4–6 December 2008; DaVitoria Lobo, N., Kasparis, T., Roli, F., Kwok, J., Georgiopoulos, M., Anagnostopoulos, G., Loog, M., Eds.; Int Assoc Pattern Recognit, Tech Committee, Lecture Notes in Computer Science. Volume 5342, pp. 372–381. [Google Scholar]
Seeberg, T.M.; Tjønnås, J.; Rindal, O.M.H.; Haugnes, P.; Dalgard, S.; Sandbakk, Ø. A multi-sensor system for automatic analysis of classical cross-country skiing techniques. Sports Eng. 2017, 20, 313–327. [Google Scholar] [CrossRef]
Lukač, L.; Fister, I., Jr.; Fister, I. Digital twin in sport: From an idea to realization. Appl. Sci. 2022, 12, 12741. [Google Scholar] [CrossRef]
Su, X.; Khoshgoftaar, T.M. A Survey of Collaborative Filtering Techniques. Adv. Artif. Intell. 2009, 2009, 421425. [Google Scholar] [CrossRef]
Kolodner, J.L. An introduction to case-based reasoning. Artif. Intell. Rev. 1992, 6, 3–34. [Google Scholar] [CrossRef]
Gendreau, M.; Potvin, J.Y. Tabu Search. In Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques; Burke, E.K., Kendall, G., Eds.; Springer: Boston, MA, USA, 2005; pp. 165–186. [Google Scholar] [CrossRef]
Atzmueller, M. Subgroup discovery. WIREs Data Min. Knowl. Discov. 2015, 5, 35–49. [Google Scholar] [CrossRef]
Charvatova, H.; Prochazka, A.; Vysata, O.; Suarez-Araujo, C.P.; Smith, J.H. Evaluation of accelerometric and cycling Cadence data for motion monitoring. IEEE Access 2021, 9, 129256–129263. [Google Scholar] [CrossRef]
Chang, R.; Yang, Z.; Ning, J. Inaccurate action detection algorithm for rowing machine exercise based on attention-CNN. IEEE Access 2024, 12, 114961–114973. [Google Scholar] [CrossRef]
Li, R.; Kling, S.; Salata, M.; Cupp, S.; Sheehan, J.; Voos, J. Wearable Performance Devices in Sports Medicine. Sport. Health Multidiscip. Approach 2015, 8, 74–78. [Google Scholar] [CrossRef]
Lapinski, M.; Brum Medeiros, C.; Moxley Scarborough, D.; Berkson, E.; Gill, T.J.; Kepple, T.; Paradiso, J.A. A Wide-Range, Wireless Wearable Inertial Motion Sensing System for Capturing Fast Athletic Biomechanics in Overhead Pitching. Sensors 2019, 19, 3637. [Google Scholar] [CrossRef] [PubMed]
Aguilar-Torán, J.; Rabost-Garcia, G.; Toinga-Villafuerte, S.; Álvarez Carulla, A.; Colmena-Rubil, V.; Fajardo-Garcia, A.; Cardona-Bonet, A.; Casals-Terré, J.; Muñoz-Pascual, X.; Miribel-Català, P.; et al. Novel Sweat-Based Wearable Device for Advanced Monitoring of Athletic Physiological Biometrics. Sensors 2023, 23, 9473. [Google Scholar] [CrossRef] [PubMed]
Rodger, S.M.; Plews, D.J.; McQuillan, J.; Driller, M.W. Evaluation of the Cyclus cycle ergometer and the Stages power meter for measurement of power output in cycling. J. Sci. Cycl. 2016, 5, 16–22. [Google Scholar]
Manivannan, A.; Chin, W.C.B.; Barrat, A.; Bouffanais, R. On the challenges and potential of using barometric sensors to track human activity. Sensors 2020, 20, 6786. [Google Scholar] [CrossRef]
Suo, X.; Tang, W.; Li, Z. Motion capture technology in sports scenarios: A survey. Sensors 2024, 24, 2947. [Google Scholar] [CrossRef]
Mateus, N.; Abade, E.; Coutinho, D.; Gómez, M.Á.; Peñas, C.L.; Sampaio, J. Empowering the Sports Scientist with Artificial Intelligence in Training, Performance, and Health Management. Sensors 2024, 25, 139. [Google Scholar] [CrossRef]

Figure 1. Systematic literature review selection process.

Figure 2. Year of publication of included studies. (*) The year 2024 is shown in a different color because some studies may still be published between the date the SLR was performed and the end of the year.

Figure 3. Taxonomy of intelligent methods used in endurance sports training.

Table 1. Search string for literature review.

(triathlon OR biathlon OR rowing OR marathon OR cycling OR running OR “cross-country skiing” OR “speed skating” OR endurance) AND (sport* OR athlete*) AND training AND (“machine learning” OR “big data” OR “data mining” OR “deep learning” OR intellige* OR algorithm*)

Table 2. Search queries for Science Direct.

Field	Search Query
Title, Abstract, Keywords	(“machine learning” OR “big data” OR “data mining” OR “deep learning” OR intelligence OR algorithm) AND (sport OR athlete) AND (“training”)
Full text	(triathlon OR biathlon OR rowing OR marathon OR cycling OR running OR “cross-country skiing” OR “speed skating” OR endurance)

Table 3. Data Extraction Table attributes.

#	Attribute	Description
A1	Title	The title of the study.
A2	Authors	The authors who conducted and published the study.
A3	Year of Publication	The year the study was published.
A4	Intelligent Methods	The methods of intelligent data analysis applied in the study, such as machine learning or artificial intelligence.
A5	Endurance Sports Discipline	The specific endurance sports discipline that was the focus of the study, such as running or cycling.
A6	Devices	The wearables, IoT devices, or sensors used to collect data for intelligent sports training.
A7	Published in	The journal’s name or conference where the study was published.
A8	Results	The key findings or outcomes of the study.
A9	Sample Size	The number of participants involved in the study and their relevant demographics, such as age, gender, or fitness level.
A10	Future Research	The recommendations made by the authors for future research based on the study’s findings.

Table 4. Literature search results.

Database	Results
Science Direct	193
IEEE Xplore	87
Scopus	580
Web of Science	445
Total	1305

Table 5. Duplicate entry counts across databases.

Database	IEEE Xplore	Science Direct	Scopus	Web of Science
IEEE Xplore	-	-	55	43
Science Direct	-	-	-	-
Scopus	55	-	2	271
Web of Science	43	-	271	2

Table 6. Publication distribution.

Type	Publication	Count
Journal	MDPI Sensors	8
Journal	IEEE Access	5
Journal	Springer Journal of Sports Sciences	4
Journal	MDPI Applied Sciences	3
Journal	Frontiers in Bioengineering and Biotechnology	2
Journal	PLoS ONE	2
Journal	IEEE Transactions on Biomedical Engineering	2
Journal	Springer Sports Engineering	2
Book series	Springer Lecture Notes in Computer Science	3
Book series	Springer Advances in Intelligent Systems and Computing	2
Other	Other	42

Table 7. Studies categorized by sport (* Ironman is a type of triathlon but was classified separately due to the extreme endurance challenge it presents).

Sport	Studies	Count
Biathlon	[72,89]	2
Cross-country skiing	[102,106,127,131,134,156,165]	7
Cycling	[33,40,42,48,51,53,55,69,73,75,86,100,115,123,129,135,137,143,155,166,171]	21
Ironman *	[61]	1
Race walking/speed walking	[83]	1
Rowing	[75,108,141,146,150,164,172]	7
Running	[39,44,47,48,57,61,65,66,67,75,77,79,85,88,91,92,93,95,97,98,104,105,111,113,116,120,128,129,130,132,136,139,148,152,163]	35
Speed skating	[64,71]	2
Triathlon	[49,75]	2

Table 8. Categorization of research papers by their topic of application.

Category	Papers	Count
Fatigue and injury management	[66,88,91,92,104,105,111,132,148,172]	10
Pacing/effort strategies and (training) path optimization	[33,39,57,85,95,98,108,116,160]	9
Performance prediction and evaluation	[40,44,55,61,65,71,79,83,89,95,97,98,113,115,116,120,136,143,155]	19
Physiological metrics and biomechanics	[47,67,77,86,134,139,152]	7
Technique analysis and classification	[64,102,106,127,128,129,130,131,141,146,156,164,165,171]	14
Training planning and adaptation	[42,48,49,51,53,69,72,75,81,96,123,137,163]	13
Other	[73,93,100,150,166]	5

Table 9. Distribution of methods across study focuses.

Category	Comp. Intell.	Data Mining	Deep Learning	Machine Learn.	Other
Fatigue and injury management	-	-	3	7	1
Pacing/effort strategies and (training) path optimization	3	-	-	5	5
Performance prediction and evaluation	3	2	6	12	2
Physiological metrics and biomechanics	1	-	6	3	-
Technique analysis and classification	-	-	10	3	3
Training planning and adaptation	5	-	3	6	1
Other	-	-	1	3	1

Table 10. Studies by participant numbers (with N/A or 0 merged into one category).

Participants	Studies	Count
Not applicable or 0	[33,49,71,72,85,108,135,146,160,171]	10
1–10	[39,40,42,48,51,53,57,64,77,83,86,96,102,106,115,123,127,128,129,131,137,148,152,156,163,164,166]	27
11–50	[47,55,65,67,75,89,91,92,93,104,105,113,130,132,134,136,141,143,155,165]	20
51–100	[66,79,88,111,172]	5
101–1000	[73,97,100,139,150]	5
1001–5000	[69,81]	2
5001–10,000	[95]	1
10,001–50,000	[61,116]	2
>50,000	[44,98,120]	3

Table 11. Distribution of devices used across studies.

Category	Device	Papers	Count
Wearable Devices and On-Body Sensors			6
	Smart Wristband	[75,93,150]	3
	Smart Ring	[75]	1
	Smart Watch	[75,171]	2
	Other Wearables	[49,143]	2
Inertial and Motion Sensors			17
	Accelerometer	[47,64,66,129,130,148,165,171]	8
	Gyroscope	[129,165]	2
	IMU	[67,92,102,106,127,128,132,134,156,165]	10
	Magnetometer	[165]	1
Physiological and Biometric Sensors			26
	Pulse Oximeter	[89,95,96]	3
	Heart Rate Monitor	[39,42,51,53,55,57,65,73,77,86,89,91,92,104,111,135,136,139,152,155,165,166,171]	23
	Surface Electromyography Sensor (sEMG)	[91,92]	2
	Lactate Analyzer	[81,139]	2
	Near-Infrared Spectroscopy Sensor	[155]	1
	Body Temperature Sensor	[55,165]	2
	CPET	[81]	1
	Bioimpedance Body Composition Analyzer	[81]	1
Performance and Exercise Equipment			15
	Ergometer	[89,143,150,155,164]	5
	Power Meter	[42,73,86,131,135,141]	6
	Cadence Sensor	[73,136]	2
	Timing lights	[104]	1
	Cycling Computer	[55,115]	2
Location and Environmental Sensors			15
	GPS	[39,42,51,53,57,77,86,92,111,116,135,136,165,166]	14
	Displacement Sensor	[141]	1
	Barometer	[165]	1
Imaging and Motion Capture Systems			12
	Motion Capture System	[141,148]	2
	Force Plates	[47,64,104,148]	4
	Shooting Target System	[72]	1
	Video Camera	[92,106,130,146,164,172]	6

Table 12. Top three devices for each sport (with counts).

	Most Popular Devices
Sport	1st	2nd	3rd
Running	Heart Rate Monitor (11) [39,57,65,77,91,92,104,111,136,139,152]	GPS (7) [39,57,77,92,111,116,136]	Accelerometer (5) [47,66,129,130,148]
Cycling	Heart Rate Monitor (10) [42,51,53,55,73,86,135,155,166,171]	GPS (6) [42,51,53,86,135,166]	Power Meter (4) [42,73,86,135]
Rowing	Ergometer (3) [89,150,164]	Video Camera (3) [146,164,172]	Heart Rate Monitor (2) [75,89]
Cross-Country Skiing	IMU (6) [102,106,127,134,156,165]	Accelerometer (1) [165]	Gyroscope (1) [165]
Triathlon	Smart Watch (1) [75]	Smart Ring (1) [75]	–
Speed Skating	Accelerometer (1) [64]	Force Plates (1) [64]	–
Biathlon	Shooting Target Sys. (1) [72]	Ergometer (1) [89]	Pulse Oximeter (1) [89]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rajšp, A.; Rek, P.; Kokol, P.; Fister, I., Jr. The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review. Appl. Sci. 2025, 15, 10158. https://doi.org/10.3390/app151810158

AMA Style

Rajšp A, Rek P, Kokol P, Fister I Jr. The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review. Applied Sciences. 2025; 15(18):10158. https://doi.org/10.3390/app151810158

Chicago/Turabian Style

Rajšp, Alen, Patrik Rek, Peter Kokol, and Iztok Fister, Jr. 2025. "The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review" Applied Sciences 15, no. 18: 10158. https://doi.org/10.3390/app151810158

APA Style

Rajšp, A., Rek, P., Kokol, P., & Fister, I., Jr. (2025). The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review. Applied Sciences, 15(18), 10158. https://doi.org/10.3390/app151810158

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Role of Intelligent Data Analysis in Selected Endurance Sports: A Systematic Literature Review

Abstract

1. Introduction

2. Endurance Sports Training

3. Research Methodology

3.1. Inclusion Criteria

3.2. Exclusion Criteria

3.3. Limitations

3.3.1. ScienceDirect

3.3.2. IEEE Xplore

3.3.3. Other Databases (Scopus, Web of Science)

3.4. Data Extraction

4. Results

4.1. Taxonomy of Intelligent Methods

4.1.1. Computational Intelligence

4.1.2. Data Mining

4.1.3. Machine Learning

4.1.4. Deep Learning

4.1.5. Other Methods

4.2. Sports

4.3. Study Focuses

5. Discussion

Limitations to Validity

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI