1. Introduction
The Tokyo Olympic Games, originally scheduled for 2020, were postponed by a year to 2021 due to the novel coronavirus (COVID-19) worldwide. With the postponement of major international events, the traditional training cycle of athletes was destroyed. A too-long training time makes their training load heavier, which may make it tough to reach the best condition in the competition. Once COVID-19 is brought under control, the high density of the competition schedule will further exacerbate athletes’ fatigue. Therefore, the pressure of systematic preparation for the Tokyo 2021 Olympic Games of athletes is quite severe.
In competitions, the pattern of physical load adopted by elite athletes is often the best choice made by evaluating their own abilities as well as the environment of the actual arena [
1,
2]. In the competitions where time is the standard of results, the load distribution in each stage is shown as the pacing strategy that athletes take [
3]. Accordingly, considering vital factors inside the competition as much as possible, which is conducive to promoting the accuracy of the forecast model that helps make the match strategy, is essential for athletes to win the initiative, especially for those from nonhost countries.
In previous research, the meteorology of the arena was often ignored [
4]. However, in outdoor sports events, such as rowing, competition results can be affected by altitude, temperature, humidity and wind direction easily. This mean that not thinking over meteorological information will limit the athletes’ ability to make full preparation to take the best strategy in competitions. As some variables will occur in the Tokyo 2021 Olympic Games due to the spread of COVID-19, on this account, considering the meteorological factors into the choices of tactics is significant for China’s rowing teams.
In the field of competitive sports, physical and technical responses are dependent on athletes’ strategies directly [
5], which gains more and more concern in research. According to the differences in methodology, the current research on pacing strategy can be divided into two categories: the post-test pacing analysis, which refers to inversely inferring the pacing strategy based on the athlete’s performance data; and the a priori pacing analysis, which refers to the evaluation after the athlete takes multiple pacing strategies in the given laboratory conditions. This article belongs to the former. In the post-test pacing analysis, the preliminary research focused on the analysis of the pacing characteristics of top athletes [
6,
7], analysis of the group pacing characteristics of athletes [
8,
9,
10,
11], comparative analysis of pacing characteristics of domestic and foreign athletes [
12,
13,
14,
15,
16,
17], analysis of competition strategy and race pacing [
18,
19,
20], analysis of technical movement and race pacing [
21,
22,
23]. Above is mainly based on analyzing pacing data merely, including analysis for different sports levels [
24,
25,
26], and different race rounds and boat types [
24,
27,
28].
In analyzing the physical load of rowing, physiological and biomechanical data are often used to explain the differences in sports performance [
29,
30]. In rowing, we can often see that the athletes’ most effective distribution of effort shows a high–low–high pattern [
25], and the corresponding pacing strategy is parabolic [
25,
26,
27,
28], that is, in a 2 km race (divided into 4 stages per 500 m), the speed increases rapidly to the peak in the first stage, then decreases and remains flat in the second and the third stage, and returns to a high level in the fourth stage. This is because athletes need to allocate their physical fitness reasonably during the competition, avoiding premature exertion leading to insufficient physical fitness in the later stages and drastically slowing down [
31], so it shows the higher effort of the athlete in the first and last stage [
32]. At this time, the impact of the difference between physiology and biomechanics on sports performance reflects the fatigue effect and fatigue-load management [
33], and the choice of pacing strategies in rowing is also the decision-making process of team events.
External environmental factors, which also exert a particular influence on match load, have received attention recently in studies of pacing strategy. For example, the speeds of skiers on the uphill, downhill and flat ground are different [
1], and the effort distribution in these terrains affects the skier’s overall performance [
34,
35,
36]. In the analyses of the pacing strategy of two marathon runners, who were the best two in the world in 2013, Angus considered the influence mechanism of wind speed during the competition and the slope of the competition field on their performance [
2]. However, containing a small sample of the two athletes, that research only did correlation analysis based on least-squares constraint. Therefore, massive data and systematic research on meteorology’s influence on sports are our urgent scholarly pursuit.
At the Tokyo 2021 Olympic Games, China won the qualifications of six boat types: W1x, W2-, W2x, M2x, W4x and M4x. Moreover, at the 2019 World Championships, the Chinese rowing team also won three gold medals, two of which are the W4x and M2x. Therefore, at this time China’s rowing team will be a strong contender for gold medals. Therefore, any factors that affect China’s rowing results in the Tokyo Olympics Games must be fully considered. Thus, it is urgent to supplement the necessary meteorological information for the selection of the pacing strategy of the Tokyo Olympic Games for Chinese athletes and to obtain the pacing strategy recommendations under different weather conditions.
Accordingly, this study selects hour-by-hour meteorological data of Tokyo from March 1979 to March 2020, and competition data for 6 boat types (W1x, W2-, W2x, M2x, W4x and M4x) in the 2010–2019 Olympic and World Championship finals to: (1) retrieve Tokyo historical weather data, including altitude, temperature, humidity and wind direction data, to predict the weather conditions during the Tokyo Olympics rowing competition; (2) analyze the characteristics of the pacing strategy of athletes at a different specific temperature, humidity and wind direction; (3) provide pacing strategy suggestions according to the characteristics of Chinese athletes.
Our marginal contribution may include: First, based on abundant historical data, this study provides an algorithmic basis for the weather forecast of outdoor competition events, fully mining the value of meteorological data. Second, this study makes up for the absence of meteorology in the analysis of influencing factors of pacing strategy, improving the data dimension of pace forecast, which promotes the accuracy of the forecast model. Third, we propose the best pacing strategy references for China’s rowing teams in the Tokyo 2021 Olympic Games, which will be conducive to optimize the pattern of physical load in the training and match.
3. Results
Based on this, we developed a recommendation system for the rowing pacing strategy. Just input the race date and time, race location, track number and boat type, and that will provide the weather forecast results of the selected moment (including the highest temperature, the lowest temperature, the probability of precipitation, wind speed and direction), as well as the recommended pacing strategy (speed and paddle frequency) for each segment of Q1–Q4. Following the visual way of presenting the methodology in Garnica-Caparrós and Memmert (2021) [
37], the methodology figure workflow is presented in
Figure 3. After more than 10 epoch training, our model can be applied to the actual weather forecast.
3.1. The Influence of Different Meteorological Features on the Pacing Strategy
We adopted linear fitting for modeling. In terms of meteorological features, we selected altitude, temperature, air pressure, humidity and wind speed for analysis. The fitted dependent variables are in six dimensions, which include the average speed difference and the paddle frequency difference of Q1–Q2, Q2–Q3, and Q3–Q4. The model formula is shown in Equation (3):
In order to ensure the rationality in the subsequent analysis, we only selected the top three players in all finals and semifinals as research targets, excluding athletes at the other levels. In sports competitions, we pay more attention to the results of the top three and the bottom three players. The players in other levels usually choose the follow strategy. Therefore, it is enough to cover the distribution we wanted to analyze, and the probability is approximately correct. For more details, please see the Data Availability Statement.
In the subsequent analysis, we regarded p-value 0.95, 0.99, and 0.999 as the first, second and third levels of significance, which are represented by *, **, and *** respectively. Correlation categories included the positive one and the negative one, which are represented by +/−, respectively, and the correlation coefficient is the fitting parameter value . Specifically, the analysis results are as follows.
3.1.1. When the Dependent Variable Is Average Speed Difference of Q1–Q2
The coefficient of goodness of fit (the proportion of the amount of information in the linear fitting to the total amount of information)
. The specific performance of each feature is shown in
Table 1.
In
Table 1, the value of P > |
t| corresponds to 1−
p-value, [0.025 and 0.975] representing the confidence intervals of the coefficients within this confidence range. The following tables are the same as this.
3.1.2. When the Dependent Variable Is Average Speed Difference of Q2–Q3
The coefficient of goodness of fit
. The specific performance of each feature is shown in
Table 2.
3.1.3. When the Dependent Variable Is Average Speed Difference of Q3–Q4
The coefficient of goodness of fit
. The specific performance of each feature is shown in
Table 3.
3.1.4. When the Dependent Variable Is Average Paddle Frequency Difference of Q1–Q2
The coefficient of goodness of fit
. The specific performance of each feature is shown in
Table 4.
3.1.5. When the Dependent Variable Is Average Paddle Frequency Difference of Q2–Q3
The coefficient of goodness of fit
. The specific performance of each feature is shown in
Table 5.
3.1.6. When the Dependent Variable Is Average Paddle Frequency Difference of Q3–Q4
The coefficient of goodness of fit
. The specific performance of each feature is shown in
Table 6.
3.2. The Influence of Track Numbers on Pacing Strategy
Here we also selected the semifinals and finals for analysis. Specifically, we used the t-test method of the pairwise algorithm to analyze differences in the pacing strategies of players in different tracks. In the table below, the upper part of the table is the difference significance rating of the paddle frequency data. The +/− at (track i, track j) indicates that the average paddle frequency of the players on track is significantly greater than / less than track j. The following stars indicate the significance level, which is consistent with the last part. The lower half of the table is the difference significance rating of the average speed data, and the meaning of the symbols is consistent with the upper part. The basis for dividing the upper and lower parts is that the effective amount of information can be represented by half a table. For example, − (**) at (3,6) indicates that the paddle frequency of the players on the third track is significantly less than the sixth track. We can directly use + (**) to represent the significant information of the paddle frequency data at (6,3). In this way, we could directly merge the two tables into one table.
3.2.1. Comparison of Q1–Q2
The comparison of the influence of track numbers on pacing strategy between Q1 and Q2 is shown in
Table 7.
3.2.2. Comparison of Q2–Q3
The comparison of the influence of track numbers on pacing strategy between Q2 and Q3 is shown in
Table 8.
3.2.3. Comparison of Q3–Q4
The comparison of the influence of track numbers on pacing strategy between Q3 and Q4 is shown in
Table 9.
3.3. The Influence of Boat Types on Pacing Strategy
Here, we selected race boats that appear more frequently in the data (including the race data of the top three in the semifinals/finals) to conduct a significant analysis of the difference in pacing strategies between every two pairs (similarly, we only considered the pacing strategies of the top three teams in the semifinals and finals). The selected boat types count as 22. The method adopted was a t-test, and the difference of pacing strategies between different boat types was obtained as shown in the following tables.
Notes: Due to the name of the boat type being too long, we used the numbers 1–22 in the table to mark them, and the numbers 1–22 correspond to Lightweight Men’s Double Sculls, Men’s Double Sculls, Lightweight Men’s Four, Men’s Single Sculls, Women’s Pair, Men’s Four, Men’s Quadruple Sculls, Men’s Eight, Men’s Pair, Women’s Single Sculls, Women’s Quadruple Sculls, Lightweight Women’s Double Sculls, Women’s Eight, Women’s Double Sculls, Lightweight Men’s Pair, Lightweight Women’s Single Sculls, Women’s Four, Lightweight Men’s Single Sculls, Lightweight Men’s Quadruple Sculls, Men’s Coxed Pair, PR1 Men’s Single Sculls, PR3 Mixed Coxed Four.
3.3.1. Comparison of Q1–Q2
The comparison of the influence of boat types on pacing strategy between Q1 and Q2 is shown in
Table 10.
In
Table 10, blue means negative while green means positive. The intensity of the color is proportional to the correlation coefficient. That is, the more significant the correlation is, the darker the color is.
Table 11 and
Table 12 are presented in the same way.
3.3.2. Comparison of Q2–Q3
The comparison of the influence of boat types on pacing strategy between Q2 and Q3 is shown in
Table 11.
3.3.3. Comparison of Q3–Q4
The comparison of the influence of boat types on pacing strategy between Q3 and Q4 is shown in
Table 12.
3.4. The Influence of Player Levels on Pacing Strategy
Regarding the level of players, we divided them into four groups: the top three players in the semifinal/final (F/SF123), the last three players in the semifinal/final (F/SF456), the top three players in other competitions (PT123), and the last three players in other competitions (PT456). We conducted a comparative analysis for them. The method we adopted was the same as that of boat types. Correspondingly, we could get the differences in pacing strategies between players of different levels, which are shown in the following tables.
3.4.1. Comparison of Q1–Q2
The comparison of the influence of player levels on pacing strategy between Q1 and Q2 is shown in
Table 13.
3.4.2. Comparison of Q2–Q3
The comparison of the influence of player levels on pacing strategy between Q2 and Q3 is shown in
Table 14.
3.4.3. Comparison of Q3–Q4
The comparison of the influence of player levels on pacing strategy between Q3 and Q4 is shown in
Table 15.
3.5. Model Prediction Results
After 10 epochs of training, we established the final network model of prediction (on the given training data set, the average MSE of the sample can be as low as about 0.005). For example, we input the situation that needs to be predicted into the model: in the Men’s Four group competition, the altitude is 100 m, the temperature is 20 degrees Celsius, the air pressure is 1000 kpa, the humidity is 60%, the wind speed is 3 m/s, and track number is 3. The pacing strategy that the system returned was [5.721578 5.5232887 5.4479446 5.551791 42.585205 38.44623 38.342327 39.731567].
The returned pacing strategy had a vector of length 8. From front to back were the average speeds of Q1, Q2, Q3, and Q4, and the average paddle frequencies of Q1, Q2, Q3, and Q4. It can be seen that the average speed and average paddle frequency output by our model showed a high–low–high pattern under this particular meteorological condition, which basically agrees with the parabolic pacing strategy proposed by previous scholars [
25,
26,
27,
28,
38] and reverifies the rationality of this kind of load distribution mode. Besides that, we further provided a practical reference for rowing athletes to make accurate pacing adjustment according to meteorology changes before racing.
4. Conclusions and Recommendations
Athletes often adjust their patterns of load based on the actual arena environment. This study aimed to get more features from data of different dimensions to simulate the competition situations, improving the forecast accuracy of pacing strategy. For long-term outdoor events such as rowing, meteorological factors are very influential for physical load, without which it is difficult to get good forecast results for the athlete. Therefore, we explored the pacing strategies adopted in the world’s top competitions in recent years and incorporated meteorology forecast results into them, so as to provide advice for China’s rowing teams about load management in the coming games. The following are our recommendations in five areas.
Strengthen the targeted training of pacing strategy during the sprint stage of Olympic preparations. In the Tokyo Olympic Games Final, on the basis of ensuring an absolutely high average speed of China’s 6 rowing teams, refer to the pacing strategy of top international rowing events in the past 10 years, and get proper training for that in the final preparation.
Achieve the optimal combination of paddle frequency, amplitude, efficiency and physiological indicators. Under the joint constraints of paddle frequency, amplitude, efficiency and physiological indicators, increase to 1–1.5 paddles/minute in each stage of the competition as much as possible. Accordingly, our rowers are required to extend the training of technique, strength, and speed daily. For instance, practice starting sail for different distances on the water, as well as catching the water for the first step after the oar enters the water. In addition, in the practice of returning the paddle on the water and on the dynamometer, attention that the key of the paddle frequency is grasping under the water and the sequence of returning the paddle is fast.
Establish a segment timing database with a 2.5% segment ratio. It is recommended to establish a segment timing database based on the 2.5% segmentation standard, reducing the segment ratio from 25% to 2.5%. Keep the tracking system in a continuous upgrade to collect more refined game data results, which helps analyze different groups’ differences in smaller segments (such as start and sprint).
Choose a training field with a climate similar to Tokyo to prepare for the Olympics. Affected by COVID-19, the 2020 Tokyo Olympic Games was postponed. Therefore, the probability of precipitation, the maximum and the minimum temperature would be expected to change greatly, even in the same period of 2021 and 2020. As a consequence, firstly, the rowing team of China should choose a proper training field, which is closer to the climatic conditions during the Tokyo competition in the final preparation. It is worth noting that the variance of the highest temperature and the lowest temperature reduced by 70.31% and 35.80% from 23 July to 8 August 2021, compared with the original game time. Secondly, it is necessary to emphasize preparing the preventive pacing strategies for large fluctuations in the probability of precipitation, the variance of which will increase by 13.39% from 23 July to 8 August 2021, compared to the original game time.
Improve the scientific level of research on physical load in China comprehensively. First, take the influence of multidimensional factors on athletes’ performance into account and shift the research of training load from an ideal environment to a more actual scene. Second, in addition to regression analysis and variance analysis, it is recommended to introduce interdisciplinary research methods such as least-squares constraints, KS normal distribution test, optimization model, and machine learning. Third, it is suggested to combine wearable devices, computer vision, and the Internet of Things (IoT) in data collection, by which more fine-grained data and more types of data (such as physiological data, biomechanical data, field environment data, etc.) can be obtained. Fourth, physiological and psychological data such as heart rate, oxygen uptake, and RPE can be introduced into the internal load analysis.