Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability

Gutiérrez, Héctor; Piedrafita, Eduardo; Bascuas, Pablo Jesús; Arbonés, Irela; Berzosa, César; Bataller-Cervero, Ana Vanessa

doi:10.3390/sports13110385

Open AccessArticle

Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability

by

Héctor Gutiérrez

¹

,

Eduardo Piedrafita

^1,*

,

Pablo Jesús Bascuas

¹

,

Irela Arbonés

²

,

César Berzosa

¹

and

Ana Vanessa Bataller-Cervero

¹

Facultad de Ciencias de la Salud, Universidad San Jorge, Autovía A-23 Zaragoza-Huesca, km 299, Villanueva de Gállego, 50830 Zaragoza, Spain

²

Instituto Nacional de Educación Física de Cataluña (INEFC), Universidad de Lleida, Partida la Caparrella s/n, 25192 Lérida, Spain

^*

Author to whom correspondence should be addressed.

Sports 2025, 13(11), 385; https://doi.org/10.3390/sports13110385

Submission received: 15 September 2025 / Revised: 14 October 2025 / Accepted: 20 October 2025 / Published: 4 November 2025

(This article belongs to the Special Issue Endurance Sports Performance: Training, Recovery and Injury-Prevention Strategies)

Download

Browse Figures

Versions Notes

Abstract

Trail running is a demanding endurance sport where performance prediction models often rely on laboratory testing or pre-race data, limiting their practical application. This study presents a real-time predictive model for marathon and ultra-trail races, based on variables recorded during the race, including uphill/downhill pace-times, terrain difficulty coefficients, and partial rankings. A total of 947 runners from the ‘Trail Valle de Tena’ event (Spain) were analyzed to develop equations that estimate total race time using only the first third of the race. The model incorporates weighted time (WT_n), pacing variability (WTV_n,n+2), and checkpoint percentile rank (CPR_n), showing strong predictive power (adjusted R² > 0.95) across sexes and race modalities. These variables reflect the runner’s ability to both overcome elevation and maintain consistent pacing, offering insights into fatigue management and performance optimization. The model enables coaches and athletes to monitor race progression, adjust strategies in real time, and potentially reduce injury risk through better control of effort intensity. Unlike laboratory-based models, this approach is fully applicable in field conditions and does not require prior testing. Further validation in similar endurance events is recommended to confirm its utility as a practical tool for training and competition planning.

Keywords:

trail running; endurance sports; pacing strategy; performance monitoring; fatigue management

1. Introduction

In trail running, multiple factors influence performance; however, there is still no clear consensus on which are the most decisive [1,2]. Some classical models have proposed different variables as possible factors that could be related to the final performance in a single test or competition in this sports discipline. These include: physiological variables, such as maximal oxygen consumption (VO₂max), ventilatory threshold (VT), running economy or muscle strength; sex; training load, indicated by average distance or average speed; and terrain characteristics, such as elevation gain, altitude or technical difficulty [3,4,5,6,7]. Nevertheless, these factors have not yet yielded models with the sufficient predictive power to completely and reliably predict performance in trail running races. Alternatively, some studies have suggested adapting predictive models from other contexts by conducting laboratory tests or collecting pre-competition data, although these approaches are often complex and impractical in real-world settings [8,9]. Currently, validated models have not been identified in scientific literature that can accurately predict trail runner performance.

To date, performance prediction models in long-distance races have primarily aimed to estimate total race time (TT) through equations applied to endurance continuous races across different sports disciplines. Keogh et al. identified 114 equations designed to explain the relationship between road-marathon TT and a range of independent variables [8]. Of these, 61 equations used variables collected outside the laboratory (such as anthropometric measurements, previous tests results, and training history), while the remaining 53 included variables measured directly in the laboratory, including VO₂max and skinfolds thickness. 50 of the variables collected were independent, being the most frequently used: time in the previous race (35.0% of equations), average weekly training distance (24.6%), age (17.5%), VO₂max (16.7%), number of previously completed marathon races (13.2%), average training race pace (13.2%), longest training run distance (11.4%), and total distance covered in the previous 8–9 weeks (10.5%). The predictive power of these models was assessed using the coefficient of determination (R² = [0.10, 0.99]) and the standard error of estimate (SEE = [0.27, 27.40 min]). According to the authors, the main limitation of these models was not considering variables such as elevation gain, sex, or expected weather conditions on the race day to predict running performance. In the context of trail running, external variables (such as terrain variability or race-specific characteristics) appear to play a more critical role in determining performance outcomes, and may be essential for accurately predicting TT. However, a key limitation of these studies is their reliance on laboratory-based data, which may restrict their applicability in real-world trail running contexts.

In recent years, various models have been proposed to predict performance in trail running. Ehrström et al. developed a model to explain performance in a 27 km trail race, including variables such as VO₂max, the percentage of VO₂max corresponding to the VT, fatigue index, and running economy at both 0% and 10% slopes (R² = 0.98) [3]. In line with these results, Scheer et al. conducted three laboratory tests to generate a predictive model in which the most remarkable variable to explain performance was running speed at VO₂max, obtained through an inclined incremental running test (R² = 0.68) [9]. Furthermore, this model’s explanatory power increased substantially when the TT from the same test in the previous year was included (R² = 0.99). However, conducting laboratory tests to implement these predictive models is often challenging due to the limited availability of appropriate facilities. Moreover, these models often depend on data collected prior to the race, which limits their capacity to provide real-time insights during competition. Models that rely solely on data collected during the race may overcome the limitations associated with requiring pre-race or laboratory-based variables.

For this reason, alternative approaches to predict trail running performance without relying on laboratory tests have been developed. Fogliato et al. proposed a model with 26 variables and their interactions to estimate partial time at each kilometer point, TT, and the probability of withdrawal during the race [7]. These variables included runner’s sociodemographic data (e.g., age, sex, nationality) and information on previous races participation (e.g., number of races, difficulty level, withdrawals). Therefore, this model requires intermediate times during the race to predict runner’s TT, which offers a greater accuracy than models considering intermediate times as the only predictive variable. However, it still requires data collection on the same test day to make the prediction. A key limitation of this model is its dependence on a specific runners-database of tests conducted under standardized quality conditions. Consequently, athletes without prior participations in these tests or with insufficient race history cannot be effectively analyzed.

To sum up, creating a predictive model for trail running performance is a complex task, and a multivariate combination of variables is required. Therefore, to develop a model that is more applicable in real-world sports contexts (one that avoids the need for laboratory tests or pre-race data collection from the runner) would be more practical and efficient. In this study, publicly available data from two types of races within a trail running event were used as a reference to predict TT in trail races with similar characteristics. just considering the runners pace-partial times and their rankings at specific checkpoints. Thus, the aims of this study were: (1) to weight the runners pace-partial times in a specific race sector based on its relative difficulty compared to the entire race; (2) to examine the relationship between partial times and TT; (3) to analyze differences in these weighted times throughout the race and across groups defined by race modality, sex, and relative ranking in the sector; (4) to create a predictive model based on the analyzed variables and check its predictive capacity for TT in these races.

2. Materials and Methods

The ‘Trail Valle de Tena’ is an annual trail running event held in the Tena Valley (Panticosa, Huesca, Spain), featuring multiple race modalities. For the present study, a total of 947 official race times were analyzed. These times correspond to trail runners’ performances recorded in two race modalities (4K and 8K), who completed their respective races across three consecutive editions (2017–2019), and do not necessarily represent 947 unique participants, as some runners may competed in multiple editions and/or modalities. This period corresponds to the last three consecutive years in which the event was held without interruption prior to the COVID-19 pandemic. The 2020 edition was cancelled due to public health restrictions, and the immediately subsequent editions experienced certain changes as a consequence of this situation (e.g., race organization, track alteration, runners’ profiles and participation, or environmental conditions). Therefore, the 2017–2019 period was chosen to ensure consistency in race format and conditions, allowing for a more reliable longitudinal analysis.

The 4K race consisted of a total track distance (d) of 42 km, an accumulated positive slope (S⁺) of 3500 m, and an International Trail Running Association (ITRA) points of 77. In contrast, the 8K race covered 78 km with a S⁺ of 6900 m and an ITRA points of 147. The raw data were obtained directly from the event’s official website (https://trailvalledetena.com/ [last accessed on 29 March 2025]), where the information is publicly available. All data were anonymized prior to analysis by removing participants’ names and using only bib number as unique identifiers for statistical processing. As the data does not contain any sensitive information, approval from an ethics committee was not required for the conduct of this research.

Although the use of open-access data offers practical advantages, it may also present certain limitations depending on the type of study conducted (for example, the lack of demographic details about participants or the possibility of repeated participation across different race editions).

2.1. Total and Relative Race Difficulty

The variables total difficulty and relative difficulty were analyzed for both race modalities. The ITRA assigns points to each race to quantify the physical effort or difficulty involved in completing a specific trail running track. This is based on ITRA difficulty factor (IDF⁺), which incorporates both the total distance and the accumulated uphill-elevation gain (or positive slope), as indicated in Equation (1) [7].

{I D F}^{+} = d + \frac{S^{+}}{100}

(1)

Equation (1). IDF⁺: ITRA difficulty factor for accumulated uphill-elevation gain; d: total track distance (km); S⁺: accumulated positive slope (m).

As shown, ITRA currently evaluates race difficulty based on S⁺. Nevertheless, given that trail running tracks are typically non-circular, this study proposes also incorporating the downhill-elevation loss or accumulated negative slope (S⁻) into the assessment. Including both ascent and descent would provide a more comprehensive measure of overall race difficulty. To this end, a dual-difficulty coefficient is proposed, reflecting the IDF⁺ structure but also considering S⁻, as presented in Equation (2).

{I D F}^{-} = d + \frac{S^{-}}{100}

(2)

Equation (2). IDF⁻: ITRA difficulty factor for accumulated downhill-elevation loss; d: total track distance (km); S⁻: accumulated negative slope (m).

Considering both difficulty coefficients and using the previously described event database, the characteristics of each race analyzed are as follows:

4K: d = 42 km, S⁺ = 3500 m, S⁻ = 4000 m, IDF⁺ = 77, IDF⁻ = 82.
8K: d = 78 km, S⁺ = 6900 m, S⁻ = 6950 m, IDF⁺ = 147, IDF⁻ = 148.

Following this proposed model, the ITRA relative difficulty coefficient (IRDC_n) would be defined as the difficulty of a specific section normalized by the total difficulty of the race. In a trail race, a section is understood as the partial segment of the total track between two specific checkpoints. For a section with a net positive elevation gain (i.e., more ascent than descent meters), Equation (3) would be applied. Conversely, for a section with a net negative elevation (i.e., more descent than ascent meters), Equation (4) would be applied.

I {R D C}_{n} = \frac{{I D F}^{+ n}}{{I D F}^{+}}

(3)

Equation (3). IRDC_n: ITRA relative difficulty coefficient; IDF⁺ⁿ: ITRA difficulty factor calculated for a certain “n” section with a net positive slope; IDF⁺: ITRA difficulty factor for accumulated positive slope.

I {R D C}_{n} = \frac{{I D F}^{- n}}{{I D F}^{-}}

(4)

Equation (4). IRDC_n: ITRA relative difficulty coefficient; IDF⁻ⁿ: ITRA difficulty factor calculated for a certain “n” section with a net negative slope; IDF⁻: ITRA difficulty factor for accumulated negative slope.

Table 1 presents the overlapping sections of the 4K and 8K race modalities considered in this study. To distinguish them in the analysis, the sections were identified as follows: section 1 (s₁), from the start line (considered as checkpoint 0 [cp₀]) to Garmo Negro, the first shared checkpoint (cp₁); section 2 (s₂), from cp₁ to Bachimaña, the second shared checkpoint (cp₂); section 3 (s₃), from cp₂ to Tebarray, the third shared checkpoint (cp₃); section 4 (s₄), from cp₃ to Respomuso, the fourth shared checkpoint (cp₄); and section 5 (s₅), from cp₄ to Musales, the fifth shared checkpoint (cp₅).

For each section, the table reports the corresponding difficulty factors (IDF⁺, IDF⁻) and relative difficulty coefficients (IRDC_n), allowing for a comparative analysis of effort distribution across shared segments.

2.2. Weighted Time, Weighted Time-Variability and Relative Ranking

Due to the variable intensity inherent in trail running races, final performance may depend on two factors: the ability to traverse sections at the highest possible speed, and the capacity to maintain this intensity throughout the entire track.

In this study, the weighted time for a section n (WT_n) is introduced as an indicator of a runner’s ability to complete a race partial-section at maximal speed. WT_n for a given section is defined as the time spent in completing that section (T_n) divided by its corresponding IRDC_n value, as shown in Equation (5). T_n is recorded when the runner passes the second checkpoint delimiting that section. In this proposal, hour (h) was chosen as the time unit for expressing both T_n and WT_n variables.

{W T}_{n} = \frac{T_{n}}{I {R D C}_{n}}

(5)

Equation (5). WT_n: weighted time for a “n” section (h); T_n: time spent in completing a “n” section (h); IRDC_n: ITRA relative difficulty coefficient for a “n” section.

If effort intensity were constant and the race just consisted of uphill (or downhill) segments, WT_n would correspond to the total race time (TT). However, in trail running, the effort intensity is variable throughout the track, making it necessary to account for potential fluctuations in speed. To conduct this, the weighted time-variability (WTV_n,n+2) is defined (Equation (6)), where WT_n+2 represents the weighted time of a given ascending section (or analogously, a descending section), and WT_n corresponds to the weighted time of the previous section of the same type (ascent or descent). As with the other previously mentioned time-related variables, WT_n+2 is also expressed in hours.

It is worth noting that the subscript notations n and n + 2 do not imply that the sections are adjacent in the track sequence, but rather that they are consecutive in terms of terrain type. For example, in this five-section track where sections 1, 3 and 5 were net uphill, and sections 2 and 4 were net downhill, WTV_n,n+2 reflects the variability between two successive uphill (or downhill) segments, skipping over the intervening section of opposite slope (e.g., WTV_1,3 represents variability between the first and third uphill sections, skipping the downhill section in between). This formulation allows for the assessment of speed loss or gain across comparable terrain types as the race progresses.

{W T V}_{n, n + 2} = \frac{{W T}_{n + 2}}{{W T}_{n}} - 1

(6)

Equation (6). WTV_n,n+2: weighted time-variability, calculated as the relative change in weighted time between two consecutive sections of the same slope type (h); WT_n: weighted time of the earlier section (h); WT_n+2: weighted time of the later section (h).

Given the potential association between some of these variables and runners’ performance levels [10], the checkpoint percentile rank (CPR_n) is also defined. This variable indicates the percentile position of each runner based on their time in the common section of the race (4K/8K), regardless of race modality, when passing the checkpoint delimiting the end of the section.

2.3. Data Analysis

WT_n, WTV_n,n+2 and CPR_n were calculated under three different conditions (Table 2).

The mean and standard deviation (SD) were calculated for TT, WT_n, and WTV_n,n+2 based on three dichotomous grouping variables: race (4K, n = 764; 8K, n = 183), sex (female, n = 76; male, n = 871), and CPR_n quartile [11,12] (Q1, n = 237, Q2–Q4, n = 710).

Most variables did not follow a normal distribution according to the Kolmogorov–Smirnov test; therefore, all statistical analysis were conducted using the bootstrap method with 1000 simple random resamples [13].

Pearson’s correlation coefficient (r) was used to assess linear relationships between TT, T_n, and WT_n.

Paired-sample t-tests were conducted to analyze intra-group differences across race conditions (net positive slope: WT₁–WT₃, WT₃–WT₅; net negative slope: WT₂–WT₄). Independent samples t-tests were used to examine inter-group differences (race modality, sex, and rank_n quartile) for WT_n and WTV_n,n+2. Cohen’s d was calculated to determine effect sizes for these comparisons [14].

Multiple linear regressions were performed using the “enter” method. TT was considered as the dependent variable, while WT_n, WTV_n,n+2 and CPR_n were included as independent variables for both race conditions. Entry and removal criteria were set at p > 0.05 and p > 0.10, respectively. Linearity and independence of residuals were assessed using the Durbin–Watson test, with values between 1 and 3 considered acceptable. Homoscedasticity was evaluated using a plot of standardized residuals vs. standardized predicted values.

Additionally, the Bland–Altman method was employed to assess systematic bias and random error in the prediction models, as well as to determine the limits of agreement (±1.96 SD) [15]. Residual normality was tested using the Shapiro–Wilk test. Multicollinearity was assessed via the variance inflation factor (VIF), with values above 10 indicating excessive multicollinearity. Cases with a Cook’s distance greater than 1 were identified as influential, and residuals exceeding ±3 SD were considered outliers. Both types of cases were excluded from the final analysis.

All statistical tests were conducted with a significance level of p < 0.05. Post hoc statistical power (1-β) was calculated to determine the adequacy of the sample size based on the observed effect sizes (r and d).

The statistical analyses were performed using SPSS software, version 30 (Chicago, IL, USA). Graphical representations of the figures were created using Microsoft Excel, as it provided clearer and more precise visual outputs.

3. Results

The means (±SD) TT for all runners according to the race modality were 9.98 ± 1.70 h (in 4K) and 18.35 ± 3.24 h (in 8K). Table 3 presents the total time (TT) for the race in both modalities (4K and 8K), disaggregated by sex, along with the corresponding number of participants.

TT and WT_n showed stronger linear correlations (WT₁: r = 0.962; WT₂: r = 0.974; WT₃: r = 0.976; WT₄: r = 0.973; WT₅: r = 0.944) than TT and T_n (T₁: r = 0.821; T₂: r = 0.682; T₃: r = 0.713; T₄: r = 0.616; T₅: r = 0.476). All correlations were statistically significant (p < 0.001) with statistical power (1-β) > 0.80. These strong correlations suggest that WT_n is a more robust predictor of total race time than T_n, likely because WT_n accounts for the relative difficulty of each terrain segment, integrating both distance and elevation gain into the performance assessment.

WT_n and WTV_n,n+2 for each terrain type segment (ascent #1, descent and ascent #2) are presented in Table 4. In the first (ascent #1) and second (descent) segments, WT_n showed statistically significant (p < 0.05), high-powered (1-β > 0.80) changes with large effect sizes (d < −0.80) across all grouping variables (entire sample, race modality, sex, and quartile) in intra-group comparisons. This indicates that the estimated TT, adjusted for the time spent in each segment and its relative difficulty, increased between an ascent (or descent) and the subsequent segment with a similar net slope. However, this pattern was not observed in the third terrain type segment (ascent #2), where effect sizes were medium (0.50 < d < 0.80 for the 8K and female groups), small (0.20 < d < 0.50 for the entire sample, male, and Q2–4 groups), or trivial (0.00 < d < 0.20 for the 4K and Q1 groups). Moreover, the observed effect in this segment indicated a decrease in WT_n (d > 0.00) suggesting a reduction in weighted time between sections 3 and 5 (opposite to the trend observed in the first two sections).

Regarding intra-group differences, all WT_n values showed statistically significant and high-powered differences for the race and quartile grouping variables with medium effect sizes (WT₁, WT₃, WT₅ in the quartile comparison) and large effect sizes in the remaining comparisons. However, no significant differences were found in WT_n based on sex (−0.20 < d < 0.20; p > 0.05; 1-β < 0.80).

For WTV_n,n+2, statistically significant and high-powered differences were observed across race groups, with a medium effect size in ascent #1 (d = 0.54), small in descent (d = 0.45), and large in ascent #2 (d = 0.89). The direction of the effect indicated greater WTV_n,n+2 in the 4K race, suggesting higher pace variability in this group and more consistent pacing in the 8K race. For the quartile grouping variable, WTV_n,n+2 differences were small in ascent #1 (d = −0.33), medium in descent (d = 0.64), and trivial in ascent #2 (d = 0.14). These results suggest that runners in Q1 exhibited lower variability in weighted time during the first ascent and greater variability during the descent compared to the rest of the participants. No significant differences in WTV_n,n+2 were found based on sex (−0.20 < d < 0.20; p > 0.05; 1-β < 0.80).

Table 5 presents the three multiple linear regression models for each terrain type segment. The adjusted R² values indicate that at least 96% of the variance in TT is explained by the proposed models (R² ascent #1 = 0.967; R² descent = 0.959; R² ascent #2 = 0.961). The corresponding scatter plots for these models are shown in Figure 1. All models were statistically significant (p < 0.001), and all variables included in the models had a p < 0.001, supporting their inclusion.

The Durbin–Watson test values fell within the acceptable range (1 < D-W < 3), confirming the assumptions of residual linearity and independence. In the partial standardized residual vs. standardized predicted value plots (Figure 2), residuals showed linear trends with respect to individual predictors: strong linearity for WT_n, weak for WTV_n,n+2, and very weak for CPR_n in the ascent segments (s₁ and s₃). However, Bland–Altman plots (Figure 2) showed residuals randomly distributed around the mean of TT and predicted time. Some values fell outside the ±1.96 SD limits, and residuals were not normally distributed, with outliers observed in Q1 and Q4. Despite this, no sport-related justification was found to exclude these cases, so they were retained in the analysis.

All variables had VIF values below 10, indicating no multicollinearity concerns.

4. Discussion

The primary aim of this study was to examine the relationship between TT (total race time) and WT_n (weighted time per section). The results demonstrated that WT_n exhibited stronger correlations with TT than raw section time (T_n), suggesting that WT_n may serve as a more accurate predictor of final performance. This enhanced predictive capacity is likely due to the incorporation of relative difficulty (accounting for both distance and elevation gain) into the WT_n metric, thereby providing a more noticeable representation of the runner’s effort across different terrain segments (uphill/downhill).

This approach aligns with previous models that normalize section speed relative to average race speed to evaluate pacing strategies during competition [16,17]. Similarly to those models, WT_n reflects relative intensity by integrating the time spent in a section with its proportional contribution to the race’s total difficulty.

In support of the first aim, WT_n values for each analyzed section showed a very strong linear relationship (r > 0.950) with the final race time, underscoring its utility as a performance indicator.

The second aim focused on analyzing WT_n differences across race sections and between participant variable groups. WT_n increased between sections during ascent #1 and descent, but remained stable or decreased during ascent #2. These variations suggest that WT_n is sensitive to terrain-specific demands and may reflect the influence of technical factors (surface type [18], slope [19,20]), environmental conditions (temperature and humidity [16,21,22]), and accumulated fatigue [23]. The reduced magnitude of WT_n differences in later segments may indicate a stabilization of pace as the race progresses, consistent with prior findings of greater speed loss in early race stages [17]. These results are consistent with previous research indicating that longer race distances tend to promote more stable pacing strategies, as athletes adopt energy-conserving approaches to manage fatigue over extended durations. Studies have shown that pacing variability decreases as race length increases, reflecting a shift toward more regulated effort distribution in ultra-endurance events [24,25,26]. However, the absence of a significant correlation between overall performance and descriptors of pacing in trail ultramarathons with hilly terrain suggests that pacing behavior in these events may differ from other types of running competitions, highlighting the influence of environmental and course-specific factors on pacing dynamics [17].

WT_n also varied significantly across race modality and performance quartiles, with faster times observed in the 4K group and among top-performing runners (Q1). Interestingly, no significant differences in WT_n were found between sexes, suggesting that male and female runners exhibit similar pacing behavior in this type of trail competition. This finding is consistent with previous research indicating reduced sex-based performance differences in longer-duration events [27,28,29]. Furthermore, no significant sex-based differences were observed in WTV_n,n+2, the metric representing variability in weighted time between consecutive segments of similar slope.

Regarding WTV_n,n+2, greater pace variability was observed in the 4K group, while the 8K group maintained more consistent pacing. This may reflect the higher relative intensity and shorter duration of the 4K race. Additionally, Q1 runners exhibited lower WTV_n,n+2 during the first ascent and higher variability during the descent, possibly due to superior strength and technical skills that enable efficient uphill pacing and faster downhill running [30,31]. These findings support the inclusion of WTV_n,n+2 in predictive models of performance, as it captures meaningful differences in pacing strategies across performance levels.

The third aim involved developing multiple linear regression models incorporating WT_n, WTV_n,n+2, and CPR_n (checkpoint percentile rank) to predict TT. The models for each terrain segment (ascent #1, descent, ascent #2) demonstrated high predictive power (adjusted R² > 0.95), comparable to previous models in endurance running literature [3,8,9]. All three predictors were statistically significant (p < 0.001), with WT_n contributing the most explanatory power (β = 0.952–1.031), followed by WTV_n,n+2 (β = 0.079–0.152) and CPR_n (β = −0.033–0.109). Residual analysis revealed non-normality and increasing variance with race distance and runner position, as illustrated in Figure 1 and Figure 2.

From an applied perspective, these findings suggest that: (1) the ability to efficiently overcome elevation and distance (WT_n) is the strongest predictor of performance; (2) maintaining consistent pacing (low WTV_n,n+2) contributes to performance, although to a lesser extent; and (3) WTV_n,n+2 offers predictive value across sexes and terrain types.

Compared to laboratory-based models [3,9], the proposed model offers practical advantages by relying solely on in-race data, eliminating the need for costly and less accessible testing facilities. However, it requires data from the same race edition, limiting its use for pre-race predictions. Future studies should validate this approach in races with varying characteristics (e.g., elevation, distance, technical difficulty). Despite potential differences in model coefficients, we hypothesize that WT_n, WTV_n,n+2, and CPR_n (calculated after completing approximately 30% of the race, including initial uphill and downhill sections) will consistently demonstrate high predictive capacity.

Therefore, considering the regression model for ascent #1, Equations (7) and (8), have been derived. Both equations can be used to estimate final race time at the Tebarray checkpoint (cp₃) in the 4K and 8K races of the ‘Trail Valle de Tena’ event. These equations incorporate section times (T₁ and T₂) and CPR₁, enabling real-time performance forecasting and strategic decision-making.

{T T}_{4 K} = (0.914 \times \frac{T_{1}}{0.27}) + (4.993 \times [\frac{\frac{T_{3}}{0.16}}{\frac{T_{1}}{0.27}} - 1]) + (1.468 \times \frac{{C P R}_{1}}{n}) + 0.939

(7)

Equation (7). TT_4K: predicted total race time (h) for the 4K race modality; T₁: time spent in section 1 (h); T₃: time spent in section 3 (h); CPR₁: checkpoint percentile rank at cp₁ (ending checkpoint at section 1); n: total number of participants. This equation estimates TT at checkpoint cp₃ using in-race data from the first ascent and descent segments.

{T T}_{8 K} = (0.914 \times \frac{T_{1}}{0.14}) + (4.993 \times [\frac{\frac{T_{3}}{0.08}}{\frac{T_{1}}{0.14}} - 1]) + (1.468 \times \frac{{C P R}_{1}}{n}) + 0.939

(8)

Equation (8). TT_8K: predicted total race time (h) for the 8K race modality; T₁: time spent in section 1 (h); T₃: time spent in section 3 (h); CPR₁: checkpoint percentile rank at cp₁ (ending checkpoint at section 1); n: total number of participants. This equation estimates TT at checkpoint cp₃ using in-race data from the first ascent and descent segments.

4.1. Practical Applications

To enhance the practical utility of this model, we propose several considerations for trail running coaches: first, they should recognize that final race time is largely determined by the runner’s ability to generate and sustain high uphill speed; second, training programs should therefore target both uphill performance and fatigue resistance; third, race plans should account for progressive speed loss during ascents, even among elite runners. Finally, field tests replicating race demands (such as vertical kilometer efforts and repeated ascents) may serve as valuable tools for monitoring performance and informing training interventions.

Moreover, the variables proposed in this study (WT_n, WTV_n,n+2, and CPR_n) offer practical value as accessible, non-laboratory-based predictors. Their use enables athletes and coaches to monitor race progression and adjust pacing strategies in real time, without the need for prior testing or specialized equipment.

4.2. Limitations and Future Research Directions

While the predictive model developed in this study demonstrates strong accuracy and practical applicability within the context of the ‘Trail Valle de Tena’ event, certain considerations should be taken into account when interpreting its generalizability. The model is based on data from a single competition (2017–2019 editions), which provided a consistent and well-documented framework for analysis. Although this enhances internal validity, future studies should explore its applicability to other races with different terrain profiles, distances, and organizational formats.

Additionally, the female subgroup in the sample was relatively small, particularly in the 8K modality. While the model showed consistent predictive performance across sexes, further validation with larger female cohorts would strengthen its robustness in sex-based comparisons.

The Bland–Altman analysis revealed a high level of agreement between predicted and actual race times, although a small number of outliers were observed and residuals were not normally distributed. These findings do not undermine the model’s predictive capacity but highlight areas for refinement in future iterations.

Finally, the model relies on in-race data (e.g., section times and checkpoint rankings), which limits its use for pre-race predictions. However, this design choice aligns with the study’s objective: to provide a real-time, field-based tool for monitoring pacing and forecasting performance during competition.

Future research should aim to replicate and adapt this approach in other endurance events, testing its performance across diverse terrain profiles, race distances, environmental conditions, and athlete subgroups (e.g., elite vs. recreational). Such efforts will help confirm the model’s versatility and position this study as a foundation for further work in trail running performance prediction.

5. Conclusions

In conclusion, WT_n, WTV_n,n+2, and CPR_n demonstrated strong predictive capacity for TT in trail running, specifically in the marathon and ultra-trail race modalities, due to their similar characteristics to the 4K and 8K races analyzed in the present study. The proposed model enables accurate performance estimation using only a portion of the race, without requiring laboratory testing, and appears valid for both male and female athletes.

Author Contributions

Conceptualization, H.G.; methodology, H.G. and A.V.B.-C.; formal analysis, A.V.B.-C. and P.J.B.; investigation, E.P. and C.B.; resources, C.B.; data curation, I.A.; writing—original draft preparation, E.P. and H.G.; writing—review and editing, P.J.B. and A.V.B.-C.; validation, P.J.B. and I.A.; supervision, C.B.; project administration, E.P.; funding acquisition, C.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Departamento de Ciencia, Universidad y Sociedad del Conocimiento, from the Gobierno de Aragón (Government of Aragón, Spain), Research Group ‘ValorA’ (grant S08_23R).

Institutional Review Board Statement

Not applicable. The raw data used in this study consisted of official race records, which are publicly available online. As the data are openly accessible and do not involve any personal or sensitive information, approval from an ethics committee was not deemed necessary for conducting this research.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data processed and analyzed for the development of this work were obtained from the official website of the event: https://trailvalledetena.com/ (last accessed on 29 March 2025). The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank Tempo Finito, the organizing entity of the event, for the recording and provision of the data, especially David Latorre and Gloria Bataller for their support.

Conflicts of Interest

The authors declare that they have no conflicts of interest. The funders had no role in the design of the study; in the collection, analysis, or interpretation of the data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

VO₂max	Maximal oxygen consumption
VT	Ventilatory threshold
TT	Total race time
SEE	Standard error of estimate
SD	Standard deviation
d	Total track distance
ITRA	International Trail Running Association
S⁺	Accumulated positive slope
IDF⁺	ITRA difficulty factor for accumulated uphill-elevation gain
S^-	Accumulated negative slope
IDF^-	ITRA difficulty factor for accumulated downhill-elevation loss
IRDC_n	ITRA relative difficulty coefficient
IDF⁺ⁿ	ITRA difficulty factor calculated for a certain “n” section with a net positive slope
IDF⁻ⁿ	ITRA difficulty factor calculated for a certain “n” section with a net negative slope
s_n	Race section
T_n	Time spent in each section
cp_n	Race checkpoint
4K	4K race modality
8K	8K race modality
Q1	Runners’ time ranked in quartile 1
Q2–4	Runners’ time ranked in quartiles 2, 3 and 4
WT_n	Weighted time in each section
WTV_n,n+2	Weighted time-variability between two consecutive sections of the same slope type
VIF	Variance inflation factor
CPR_n	Checkpoint percentile rank
ActTTs_n	Actual total race time in each section
PredTTs_n	Total race time predicted by the model in each section

References

Pastor, F.S.; Besson, T.; Varesco, G.; Parent, A.; Fanget, M.; Koral, J.; Foschia, C.; Rupp, T.; Rimaud, D.; Féasson, L.; et al. Performance Determinants in Trail-Running Races of Different Distances. Int. J. Sports Physiol. Perform. 2022, 17, 844–851. [Google Scholar] [CrossRef]
de Waal, S.J.; Gomez-Ezeiza, J.; Venter, R.E.; Lamberts, R.P. Physiological Indicators of Trail Running Performance: A Systematic Review. Int. J. Sports Physiol. Perform. 2021, 16, 325–332. [Google Scholar] [CrossRef]
Ehrström, S.; Tartaruga, M.P.; Easthope, C.S.; Brisswalter, J.; Morin, J.-B.; Vercruyssen, F. Short Trail Running Race: Beyond the Classic Model for Endurance Running Performance. Med. Sci. Sports Exerc. 2018, 50, 580–588. [Google Scholar] [CrossRef]
Easthope, C.S.; Nosaka, K.; Caillaud, C.; Vercruyssen, F.; Louis, J.; Brisswalter, J. Reproducibility of performance and fatigue in trail running. J. Sci. Med. Sport 2014, 17, 207–211. [Google Scholar] [CrossRef] [PubMed]
Björklund, G.; Swarén, M.; Born, D.P.; Stöggl, T. Biomechanical Adaptations and Performance Indicators in Short Trail Running. Front. Physiol. 2019, 10, 506. [Google Scholar] [CrossRef] [PubMed]
Coates, A.M.; Berard, J.A.; King, T.J.; Burr, J.F. Physiological Determinants of Ultramarathon Trail-Running Performance. Int. J. Sports Physiol. Perform. 2021, 16, 1454–1461. [Google Scholar] [CrossRef] [PubMed]
Fogliato, R.; Oliveira, N.L.; Yurko, R. TRAP: A predictive framework for the Assessment of Performance in Trail Running. J. Quant. Anal. Sports 2021, 17, 129–143. [Google Scholar] [CrossRef]
Keogh, A.; Smyth, B.; Caulfield, B.; Lawlor, A.; Berndsen, J.; Doherty, C. Prediction Equations for Marathon Performance: A Systematic Review. Int. J. Sports Physiol. Perform. 2019, 14, 1159–1169. [Google Scholar] [CrossRef]
Scheer, V.; Janssen, T.I.; Vieluf, S.; Heitkamp, H.-C. Predicting Trail-Running Performance with Laboratory Exercise Tests and Field-Based Results. Int. J. Sport Physiol. Perform. 2019, 14, 130–133. [Google Scholar] [CrossRef]
Casado, A.; Hanley, B.; Jiménez-Reyes, P.; Renfree, A. Pacing profiles and tactical behaviors of elite runners. J. Sport Health Sci. 2021, 10, 537–549. [Google Scholar] [CrossRef]
Renfree, A.; Crivoi do Carmo, E.; Martin, L. The influence of performance level, age and gender on pacing strategy during a 100-km ultramarathon. Eur. J. Sport Sci. 2016, 16, 409–415. [Google Scholar] [CrossRef] [PubMed]
Corbí-Santamaría, P.; Herrero-Molleda, A.; García-López, J.; Boullosa, D.; García-Tormo, V. Variable Pacing Is Associated with Performance during the OCC^®® Ultra-Trail du Mont-Blanc^®® (2017–2021). Int. J. Environ. Res. Public Health 2023, 20, 3297. [Google Scholar] [CrossRef]
Kostanek, J.; Karolczak, K.; Kuliczkowski, W.; Watala, C. Bootstrap Method as a Tool for Analyzing Data with Atypical Distributions Deviating from Parametric Assumptions: Critique and Effectiveness Evaluation. Data 2024, 9, 95. [Google Scholar] [CrossRef]
Cohen, J. Statistical Power Analysis for the Behavioral Sciences, 2nd ed.; Lawrence Erlbaum Associates: Hillsdale, NJ, USA, 1988. [Google Scholar]
Bland, J.M.; Altman, D.G. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986, 1, 307–310. [Google Scholar] [CrossRef]
Hoffman, M.D. Pacing by Winners of a 161-km Mountain Ultramarathon. Int. J. Sport Physiol. Perform. 2014, 9, 1054–1056. [Google Scholar] [CrossRef]
Kerhervé, H.A.; Cole-Hunter, T.; Wiegand, A.N.; Solomon, C. Pacing during an ultramarathon running event in hilly terrain. PeerJ 2016, 4, e2591. [Google Scholar] [CrossRef]
Larsen, R.J.; Jackson, W.H.; Schmitt, D. Mechanisms for regulating step length while running towards and over an obstacle. Hum. Mov. Sci. 2016, 49, 186–195. [Google Scholar] [CrossRef]
Lemire, M.; Remetter, R.; Hureau, T.J.; Kouassi, B.Y.L.; Lonsdorfer, E.; Geny, B.; Isner-Horobeti, M.E.; Favret, F.; Dufour, S.P. High-intensity downhill running exacerbates heart rate and muscular fatigue in trail runners. J. Sports Sci. 2021, 39, 815–825. [Google Scholar] [CrossRef]
Giandolini, M.; Vernillo, G.; Samozino, P.; Horvais, N.; Edwards, W.B.; Morin, J.B.; Millet, G.Y. Fatigue associated with prolonged graded running. Eur. J. Appl. Physiol. 2016, 116, 1859–1873. [Google Scholar] [CrossRef] [PubMed]
Parise, C.A.; Hoffman, M.D. Influence of Temperature and Performance Level on Pacing a 161 km Trail Ultramarathon. Int. J. Sport Physiol. Perform. 2011, 6, 243–251. [Google Scholar] [CrossRef] [PubMed]
Bouscaren, N.; Faricier, R.; Millet, G.Y.; Racinais, S. Heat Acclimatization, Cooling Strategies, and Hydration during an Ultra-Trail in Warm and Humid Conditions. Nutrients 2021, 13, 1085. [Google Scholar] [CrossRef]
Garbisu-Hualde, A.; Santos-Concejero, J. What are the Limiting Factors During an Ultra—Marathon? A Systematic Review of the Scientific Literature. J. Hum. Kinet. 2020, 72, 129–139. [Google Scholar] [CrossRef] [PubMed]
Stjepanovic, M.; Knechtle, B.; Weiss, K.; Nikolaidis, P.T.; Cuk, I.; Thuany, M.; Sousa, C.V. Changes in pacing variation with increasing race duration in ultra-triathlon races. Sci. Rep. 2023, 13, 3692. [Google Scholar] [CrossRef]
Skorski, S.; Abbiss, C.R. The Manipulation of Pace within Endurance Sport. Front. Physiol. 2017, 8, 102. [Google Scholar] [CrossRef]
Venhorst, A.; Micklewright, D.P.; Noakes, T.D. The Psychophysiological Regulation of Pacing Behaviour and Performance Fatigability During Long-Distance Running with Locomotor Muscle Fatigue and Exercise-Induced Muscle Damage in Highly Trained Runners. Sports Med. Open. 2018, 4, 29. [Google Scholar] [CrossRef]
Hubble, C.; Zhao, J. Gender differences in marathon pacing and performance prediction. J. Sport Anal. 2016, 2, 19–36. [Google Scholar] [CrossRef]
Temesi, J.; Arnal, P.J.; Rupp, T.; Féasson, L.; Cartier, R.; Gergelé, L.; Verges, S.; Martin, V.; Millet, G.Y. Are Females More Resistant to Extreme Neuromuscular Fatigue? Med. Sci. Sports Exerc. 2015, 47, 1372–1382. [Google Scholar] [CrossRef]
Hoffman, M.D. Ultramarathon Trail Running Comparison of Performance-Matched Men and Women. Med. Sci. Sports Exerc. 2008, 40, 1681–1686. [Google Scholar] [CrossRef]
Lemire, M.; Hureau, T.J.; Favret, F.; Geny, B.; Kouassi, B.Y.L.; Boukhari, M.; Lonsdorfer, E.; Remetter, R.; Dufour, S.P. Physiological factors determining downhill vs uphill running endurance performance. J. Sci. Med. Sport 2021, 24, 85–91. [Google Scholar] [CrossRef] [PubMed]
Sheehan, R.C.; Gottschall, J.S. Preferred step frequency during downhill running may be determined by muscle activity. J. Electromyogr. Kinesiol. 2013, 23, 826–830. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Scatter plots and Bland–Altman plots for the three multiple linear regression models corresponding to each terrain type segment: ascent #1 (section 1 [s₁]); descent (section 2 [s₂]); ascent #2 (section 3 [s₃]). The scatter plots (left column) show the relationship between actual total race time and predicted total race time, with adjusted R² values indicating strong model fit. The Bland–Altman plots (right column) display the agreement between predicted and actual times, with mean differences and limits of agreement (±1.96 SD). Model colors: light grey (4K); dark grey (8K). s_n: race section analyzed; ActTTs_n: actual total race time in each section; PredTTs_n: total race time predicted by the model in each section; R²: adjusted coefficient of determination; SD: standard deviation; *: multiplication sign for limits of agreement determination.

Figure 2. Partial standardized residuals (ordinate) plotted against standardized predicted values (abscissa) for three race terrain-segments (ascent #1 or section 1 [s₁]; descent or section 2 [s₂]; ascent #2 or section 3 [s₃]) and key predictors. Each row corresponds to a different model, and each column represents a specific predictor: WT_n (weighted time in each section), WTV_n,n+2 (weighted time-variability, calculated as the relative change in weighted time between two consecutive sections of the same terrain type), and CPR_n (checkpoint percentile rank: percentile position when passing at the ending checkpoint of a section s_n). The plots illustrate varying degrees of linearity between residuals and predictors across models and terrain segments.

Table 1. Values of covered distances, positive/negative slopes and total/relative race-difficulty at the five sections overlapped in the 4K and 8K races of the ‘Trail Valle de Tena’ during the 2017–2019 editions.

Section	aD (km)		sD (km)	sS⁺ (m)	sS⁻ (m)	nS	IDF⁺ⁿ	IDF⁻ⁿ	IRDC_n
Section	4K	8K	sD (km)	sS⁺ (m)	sS⁻ (m)	nS	IDF⁺ⁿ	IDF⁻ⁿ	4K	8K
s₁ (cp₀-cp₁)	5.5	31.0	5.5	1430	−10	+	20.5		0.27	0.14
s₂ (cp₁-cp₂)	11.0	36.5	5.5	205	−1150	−		17.0	0.21	0.11
s₃ (cp₂-cp₃)	16.5	42.0	5.5	650	−80	+	12.0		0.16	0.08
s₄ (cp₃-cp₄)	20.5	46.0	4.0	75	−680	−		11.0	0.13	0.07
s₅ (cp₄-cp₅)	24.5	50.0	4.0	470	−75	+	9.0		0.12	0.06

s_n: sections 1–5; cp_n-cp_n+1: consecutive checkpoints that define each section; aD: accumulated distance from the start to the last checkpoint of each section (km); sD: section-partial distance (km); sS⁺: section uphill-positive slope (m); sS⁻: section downhill-negative slope (m); nS: net slope (sign indicates whether the section has more uphill meters [+] or downhill meters [−]); IDF⁺ⁿ: ITRA difficulty factor calculated for a section with a net positive slope; IDF⁻ⁿ: ITRA difficulty factor calculated for a section with a net negative slope; IRDC_n: ITRA relative difficulty coefficient for each section.

Table 2. Weighted time, weighted time-variability and checkpoint percentile rank across assessed terrain types (ascent/descent).

Terrain Type (Elevation Profile)	WTn		WTV_n,n+2	CPR_n
Terrain Type (Elevation Profile)	4K	8K	WTV_n,n+2	CPR_n
Ascent #1 (s1, s3)	${W T}_{1} = \frac{T_{1}}{0.27}$	${W T}_{1} = \frac{T_{1}}{0.14}$	${W T V}_{1,3} = \frac{W T_{3}}{{W T}_{1}} - 1$	CPR₁ (percentile position at cp₁)
Ascent #1 (s1, s3)	${W T}_{3} = \frac{T_{3}}{0.16}$	${W T}_{3} = \frac{T_{3}}{0.08}$	${W T V}_{1,3} = \frac{W T_{3}}{{W T}_{1}} - 1$	CPR₁ (percentile position at cp₁)
Descent (s2, s4)	$W T_{2} = \frac{T_{2}}{0.21}$	$W T_{2} = \frac{T_{2}}{0.11}$	${W T V}_{2,4} = \frac{{W T}_{4}}{{W T}_{2}} - 1$	CPR₂ (percentile position at cp₂)
Descent (s2, s4)	$W T_{4} = \frac{T_{4}}{0.13}$	$W T_{4} = \frac{T_{4}}{0.07}$	${W T V}_{2,4} = \frac{{W T}_{4}}{{W T}_{2}} - 1$	CPR₂ (percentile position at cp₂)
Ascent #2 (s3, s5)	${W T}_{3} = \frac{T_{3}}{0.16}$	${W T}_{3} = \frac{T_{3}}{0.08}$	${W T V}_{3,5} = \frac{{W T}_{5}}{W T_{3}} - 1$	CPR₃ (percentile position at cp₃)
Ascent #2 (s3, s5)	$W T_{5} = \frac{T_{5}}{0.12}$	$W T_{5} = \frac{T_{5}}{0.06}$	${W T V}_{3,5} = \frac{{W T}_{5}}{W T_{3}} - 1$	CPR₃ (percentile position at cp₃)

s_n: race section; T_n: time spent in each section; WT_n: weighted time in each section; WTV_n,n+2: weighted time-variability, calculated as the relative change in weighted time between two consecutive sections of the same slope type; CPR_n: percentile position of each runner when passing at the checkpoint (cp_n) delimiting the end of section s_n.

Table 3. Total time in hours expressed in mean ± SD for both modalities (4k and 8K) by sex of the runners.

Sex	4K	8K
Males (n = 871)	9.94 ± 1.71 h (n = 698)	18.30 ± 3.26 h (n = 173)
Females (n = 76)	10.42 ± 1.58 h (n = 66)	19.08 ± 2.85 h (n = 10)
Overall sample (n = 947)	9.98 ± 1.70 h (n = 764)	18.35 ± 3.24 h (n = 183)

Table 4. Intra-group differences in WT_n and inter-groups differences in WT_n and WTV_n,n+2 across the three race situations depending on terrain segment: ascent #1, descent, and ascent #2.

			Intra-Group			Mean ± SD	Inter-Group			Mean ± SD	Inter-Group			Mean ± SD	Inter-Group
Segment	Variable	Group	d	p	1-β	Mean ± SD	d	p	1-β	Mean ± SD	d	p	1-β	Mean ± SD	d	p	1-β
Ascent #1						WT₁ (h)				WT₃ (h)				WTV_1,3 (h)
	Race	4K	−2.32 *	<0.001	>0.99	7.30 ± 1.02	−6.29 *	<0.001	>0.99	9.70 ± 1.77	−4.96 *	<0.001	>0.99	0.33 ± 0.12	0.54 *	<0.001	>0.99
	Race	8K	−2.18 *	<0.001	>0.99	16.85 ± 2.75	−6.29 *	<0.001	>0.99	21.23 ± 3.85	−4.96 *	<0.001	>0.99	0.26 ± 0.11	0.54 *	<0.001	>0.99
	Sex	Males	−1.83 *	<0.001	>0.99	9.16 ± 4.13	0.05	0.578	0.11	11.96 ± 5.19	0.07	0.472	0.14	0.31 ± 0.12	0.11	0.262	0.23
	Sex	Females	−2.33 *	<0.001	>0.99	8.94 ± 3.33	0.05	0.578	0.11	11.58 ± 4.16	0.07	0.472	0.14	0.30 ± 0.09	0.11	0.262	0.23
	Quartile	Q1	−2.15 *	<0.001	>0.99	7.40 ± 2.99	−0.58 *	<0.001	>0.99	9.28 ± 3.65	−0.66 *	<0.001	>0.99	0.28 ± 0.10	−0.33 *	<0.001	>0.99
	Quartile	Q2–4	−1.93 *	<0.001	>0.99	9.73 ± 4.21	−0.58 *	<0.001	>0.99	12.81 ± 5.23	−0.66 *	<0.001	>0.99	0.32 ± 0.13	−0.33 *	<0.001	>0.99
	Total		−1.86 *	<0.001	>0.99	9.14 ± 4.10				11.93 ± 5.11				0.31 ± 0.12
Descent						WT₂ (h)				WT₄ (h)				WTV_2,4 (h)
	Race	4K	−2.33 *	<0.001	>0.99	5.57 ± 1.22	−3.83 *	<0.001	>0.99	6.42 ± 1.27	−4.06 *	<0.001	>0.99	0.16 ± 0.12	0.45 *	<0.001	>0.99
	Race	8K	−0.89 *	<0.001	>0.99	11.60 ± 2.58	−3.83 *	<0.001	>0.99	12.71 ± 2.38	−4.06 *	<0.001	>0.99	0.11 ± 0.11	0.45 *	<0.001	>0.99
	Sex	Males	−1.10 *	<0.001	>0.99	6.73 ± 2.89	−0.04	0.730	0.09	7.63 ± 2.95	−0.04	0.733	0.09	0.15 ± 0.12	−0.13	0.110	0.29
	Sex	Females	−1.66 *	<0.001	>0.99	6.83 ± 2.47	−0.04	0.730	0.09	7.74 ± 2.69	−0.04	0.733	0.09	0.14 ± 0.08	−0.13	0.110	0.29
	Quartile	Q1	−1.74 *	<0.001	>0.99	4.96 ± 1.72	−0.92 *	<0.001	>0.99	5.91 ± 2.01	−0.84 *	<0.001	>0.99	0.21 ± 0.11	0.64 *	<0.001	>0.99
	Quartile	Q2–4	−1.00 *	<0.001	>0.99	7.42 ± 2.91	−0.92 *	<0.001	>0.99	8.21 ± 2.96	−0.84 *	<0.001	>0.99	0.13 ± 0.12	0.64 *	<0.001	>0.99
	Total		−1.13 *	<0.001	>0.99	6.74 ± 2.86				7.73 ± 2.93				0.15 ± 0.12
Ascent #2						WT₃ (h)				WT₅ (h)				WTV_3,5 (h)
	Race	4K	−0.02	0.649	0.09	9.70 ± 1.77	−4.96 *	<0.001	>0.99	9.71 ± 1.91	−3.73 *	<0.001	>0.99	0.00 ± 0.10	0.89 *	<0.001	>0.99
	Race	8K	0.67 *	<0.001	>0.99	21.23 ± 3.85	−4.96 *	<0.001	>0.99	19.05 ± 4.15	−3.73 *	<0.001	>0.99	−0.10 ± 0.16	0.89 *	<0.001	>0.99
	Sex	Males	0.20 *	<0.001	>0.99	11.96 ± 5.19	0.07	0.472	0.14	11.57 ± 4.51	0.14	0.190	0.32	−0.13 ± 0.12	0.28	0.002	0.76
	Sex	Females	0.53 *	<0.001	>0.99	11.58 ± 4.16	0.07	0.472	0.14	10.95 ± 3.68	0.14	0.190	0.32	−0.05 ± 0.08	0.28	0.002	0.76
	Quartile	Q1	0.05	0.494	0.12	9.28 ± 3.65	−0.71 *	<0.001	>0.99	8.81 ± 3.03	−0.72 *	<0.001	>0.99	0.00 ± 0.15	0.14	0.121	0.59
	Quartile	Q2–4	0.28 *	<0.001	>0.99	12.81 ± 5.23	−0.71 *	<0.001	>0.99	12.43 ± 4.49	−0.72 *	<0.001	>0.99	−0.02 ± 0.11	0.14	0.121	0.59
	Total		0.21 *	<0.001	>0.99	11.93 ± 5.11				11.52 ± 4.45				−0.02 ± 0.12

Segment: segment type depending on terrain slope; 4K: 4K race modality; 8K: 8K race modality; Q1: runners’ time ranked in quartile 1; Q2–4: runners’ time ranked in quartiles 2, 3 and 4; Total: overall sample within each segment; d: Cohen’s d effect size; Intra-group: significance level (p) and statistical power (1-β) for mean differences between related samples (t test); Inter-group: significance level (p) and statistical power (1-β) for mean differences between independent samples (t test); WT_n: weighted time in each section; WTV_n,n+2: weighted time-variability between two consecutive sections of the same slope type; * effect size with p < 0.05 and 1-β > 0.80.

Table 5. Multiple linear regression models for three race situations depending on terrain segment: ascent #1, descent, ascent #2.

Terrain Type Segment (Coinciding Race Section)	r	R²	adR²	SEE	p	Durbin Watson			B	B-SE	Beta	p	B		VIF
Terrain Type Segment (Coinciding Race Section)	r	R²	adR²	SEE	p	LL95%	UL95%		B	B-SE	Beta	p	LL95%	UL95%	VIF
Ascent #1 (s₁)	0.984	0.967	0.967	0.71	<0.001	1.104	1.444	constant	0.939	0.095		<0.001	0.754	1.135
								WT₁	0.914	0.009	0.952	<0.001	0.896	0.931	1.176
								WTV_1,3	4.993	0.227	0.156	<0.001	4.548	5.444	1.079
								CPR₁	1.468	0.086	0.109	<0.001	1.295	1.633	1.175
Descent (s₂)	0.979	0.959	0.959	0.80	<0.001	1.005	1.316	constant	1.796	0.087		<0.001	1.613	1.613
								WT₂	1.411	0.015	1.031	<0.001	1.389	1.381	1.519
								WTV_2,4	3.321	0.249	0.102	<0.001	2.856	2.849	1.215
								CPR₂	−0.465	0.118	−0.033	<0.001	−0.686	−0.693	1.477
Ascent #2 (s₃)	0.980	0.961	0.961	0.77	<0.001	1.021	1.373	constant	2.295	0.088		<0.001	2.159	2.431
								WT₃	0.746	0.009	0.976	<0.001	0.735	0.757	1.347
								WTV_3,5	2.555	0.211	0.079	<0.001	2.122	2.989	1.138
								CPR₃	0.888	0.090	0.066	<0.001	0.703	1.073	1.202

s₁: race-section 1; s₂: race-section 2; s₃: race-section 3; r: correlation coefficient; R²: determination coefficient; adR²: adjusted determination coefficient; SEE: standard error of estimation; p: significance level; LL95%: lower limit for 95% confidence interval; UL95%: upper limit for 95% confidence interval; B: multiple linear regression coefficients for each variable; B-SE: B’standard error; Beta: standardized coefficients; VIF: variance inflation factor; WT_n: weighted time in each section; WTV_n,n+2: weighted time-variability between two consecutive sections of the same slope type; CPR_n: checkpoint percentile rank.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gutiérrez, H.; Piedrafita, E.; Bascuas, P.J.; Arbonés, I.; Berzosa, C.; Bataller-Cervero, A.V. Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability. Sports 2025, 13, 385. https://doi.org/10.3390/sports13110385

AMA Style

Gutiérrez H, Piedrafita E, Bascuas PJ, Arbonés I, Berzosa C, Bataller-Cervero AV. Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability. Sports. 2025; 13(11):385. https://doi.org/10.3390/sports13110385

Chicago/Turabian Style

Gutiérrez, Héctor, Eduardo Piedrafita, Pablo Jesús Bascuas, Irela Arbonés, César Berzosa, and Ana Vanessa Bataller-Cervero. 2025. "Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability" Sports 13, no. 11: 385. https://doi.org/10.3390/sports13110385

APA Style

Gutiérrez, H., Piedrafita, E., Bascuas, P. J., Arbonés, I., Berzosa, C., & Bataller-Cervero, A. V. (2025). Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability. Sports, 13(11), 385. https://doi.org/10.3390/sports13110385

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Real-Time Performance Prediction in Long-Distance Trail Running: A Practical Model Based on Terrain Difficulty and Pacing Variability

Abstract

1. Introduction

2. Materials and Methods

2.1. Total and Relative Race Difficulty

2.2. Weighted Time, Weighted Time-Variability and Relative Ranking

2.3. Data Analysis

3. Results

4. Discussion

4.1. Practical Applications

4.2. Limitations and Future Research Directions

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI