Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model

Russell, Brian; McDaid, Andrew; Toscano, William; Hume, Patria

doi:10.3390/s21165442

Open AccessArticle

Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model

¹

Sports Performance Research Institute, Auckland University of Technology, Auckland 0632, New Zealand

²

National Aeronautics and Space Administration, Ames Research Center, Moffett Field, CA 94043, USA

³

Department of Mechanical Engineering, University of Auckland, Auckland 1142, New Zealand

^*

Author to whom correspondence should be addressed.

Sensors 2021, 21(16), 5442; https://doi.org/10.3390/s21165442

Submission received: 14 May 2021 / Revised: 31 July 2021 / Accepted: 7 August 2021 / Published: 12 August 2021

(This article belongs to the Special Issue Sensors for Human Physical Behaviour Monitoring)

Download

Browse Figures

Versions Notes

Abstract

:

Aim: To determine whether an AI model and single sensor measuring acceleration and ECG could model cognitive and physical fatigue for a self-paced trail run. Methods: A field-based protocol of continuous fatigue repeated hourly induced physical (~45 min) and cognitive (~10 min) fatigue on one healthy participant. The physical load was a 3.8 km, 200 m vertical gain, trail run, with acceleration and electrocardiogram (ECG) data collected using a single sensor. Cognitive load was a Multi Attribute Test Battery (MATB) and separate assessment battery included the Finger Tap Test (FTT), Stroop, Trail Making A and B, Spatial Memory, Paced Visual Serial Addition Test (PVSAT), and a vertical jump. A fatigue prediction model was implemented using a Convolutional Neural Network (CNN). Results: When the fatigue test battery results were compared for sensitivity to the protocol load, FTT right hand (R² 0.71) and Jump Height (R² 0.78) were the most sensitive while the other tests were less sensitive (R² values Stroop 0.49, Trail Making A 0.29, Trail Making B 0.05, PVSAT 0.03, spatial memory 0.003). The best prediction results were achieved with a rolling average of 200 predictions (102.4 s), during set activity types, mean absolute error for ‘walk up’ (MAE200 12.5%), and range of absolute error for ‘run down’ (RAE200 16.7%). Conclusions: We were able to measure cognitive and physical fatigue using a single wearable sensor during a practical field protocol, including contextual factors in conjunction with a neural network model. This research has practical application to fatigue research in the field.

Keywords:

fatigue; cognitive; physical; executive decision-making; psychophysiology; artificial intelligence; deep learning; multi-day missions

1. Introduction

1.1. Why We Need to Measure Physical and Cognitive Fatigue in the Field

Measures of physical and cognitive fatigue are needed in the field to improve performance and help improve safe participation in outdoor environments.

Physiological and cognitive fatigue in field environments directly affects performance as a person modulates decisions based on contextual input to maintain resources [1]. Various fields where operational safety is related to fatigue have been investigated, including pilots [2,3], motor vehicle drivers [4,5,6,7,8,9], firefighters [10,11], and shift workers [12]. Physical fatigue relates to reduced force, endurance, level of effort, strength, speed, and coordination [13]. Levels of performance may be modulated by physical load, sleep, nutrition, and psychological factors based on mission duration, pain, levels of perceived exertion [14,15,16,17], intensity, and time on task [18]. Hill [19] won the Noble prize for his work on skeletal muscle and maximum oxygen uptake.

The interaction of central fatigue and motivating factors have been modelled in various forms: Borg’s [20] Rating of Perceived Exertion (RPE); Millet’s [21] Flush model for pacing strategies in ultra-marathons; Noakes’s [17] central fatigue model; and Venhorst’s [22] bio-psycho-social model.

Cognitive fatigue can be viewed as a combination of goal, adaption, and reward trade-offs, including the energetic requirements to achieve a goal [23,24]. Performance psychology [25,26] describes performance as recalling one’s knowledge, skills, and abilities during an event. Cognitive and physical fatigue have a complex interaction of over-lapping redundant systems [27].

1.2. How We Can Measure Physical and Cognitive Fatigue in the Lab and the Field

Mental and physical fatigue have been researched in the lab using different sensing modalities including computer interaction [28], accelerometery, electroencephalogram (EEG), electrooculography (EOG) [29], electromyography (EMG), and electrocardiograph (ECG) [16,30,31,32], however, these techniques are not always practical in a field setting.

Assessment of performance and fatigue has been studied [3] with multiple sensors and neural networks. However, they have not been validated in the field with noise sources such as terrain, slope, and obstacles. Enoka [33] noted that lab-based experiments such as maximum voluntary contractions (MVC) result in task dependency that do not translate into field performance. The reduction of separate effects does not equate to overall performance. The only way to determine performance reductions from fatigue is to measure the response to loads in the field.

Field applications require the number of sensors to be minimized while performing challenging multiday events and to not distract the operator from their mission tasks or add to logistical loads when deploying technology into an operational environment. Where multiple sensors would aid accuracy and redundancy, they may lead to lack of deployment of the entire system, hence a minimum viable solution to maximize use by operators is desirable. A review of sensors used for measuring occupational fatigue [34] showed that the most effective sensors were heart rate and accelerometry. Smartphones with multi-channel inertial sensors and deep learning models have been used for human activity recognition [35,36] in controlled environments for complex activity types. A review of physical and cognitive fatigue has shown a relationship of heart rate and accelerometry with muscle activity, proprioception, and changes in gait [37,38,39]. Gait has been shown to change physical performance with increased mental fatigue [9,16,40], goals [41], and reduced executive function [42]. Terrain has been shown to influence gait and accelerometry readings [43].

Traditional machine learning with feature extraction has been used in applications such human activity recognition [43,44], however this approach assumes the features of interest are known and calculatable. Deep learning uses models which automatically determine feature morphology and significance in the data which may not be observable with traditional statistics and data analysis. Deep learning has been used for areas such as wakefulness detection with accelerometry and ECG [45] and fatigue estimation by Gordienko et al. [46] showed positive results with a repetitive exercises in the gym. Recurrent neural network (RNN) and long short-term memory (LSTM) are often cited as the preferred models for time series data [47]. Convolutional neural networks (CNN) have also been used for time series data [43,48] and do not suffer from the stability issues of RNNs while enabling parallel processing which is not possible with RNN type models. CNN models have shown good performance on physiological time series data for emotion classification, [49] and mental fatigue [50] using EOG, which is not generally practical in field operations with high levels of activity. Accelerometry has been shown to be affected by cognitive fatigue [51].

The aim of this study was to:

-: Determine whether cognitive and physical fatigue could be accurately predicted by an AI model using data from a single sensor capable of being worn in an endurance activity for multiple days, measuring acceleration and ECG in an outdoor environment with voluntary activity.
-: Additionally propose a protocol for data collection in an unsupervised remote environment with no manual labelling by the participant
-: Determine if environmental parameters would affect accuracy, including; random activity, self-pacing, terrain surface (concrete, gravel, dirt, mud grass), and slope (flat, up and down slopes)

2. Materials and Methods

2.1. Ethics

The researcher’s university ethics committee (AUTEC 18/412) approved all procedures in the study and the participant gave written informed consent prior to participating in the study.

2.2. Protocol—Physical and Cognitive Load and Performance Assessments

A protocol was developed that included self-paced running in an unstructured mountain environment and standard performance assessments with no distractions in a laboratory for comparison.

The protocol was developed using physical and cognitive loads in excess of a participants’ critical power [52] to induce fatigue. A one-hour period of fixed load was repeated until the participant voluntarily ceased the protocol. No restart was allowed. Physical load was provided by a trail run (3.8 km, 200 m vertical gain), and cognitive load was provided by 10 min Multi Attribute Test Battery (MATB) [53] (Figure 1).

A goal was set as 100 km distance, 5200 m (17,000 feet) total climb, and 26 h’ time in order to address motivation [14] and psychological perception of pain [54]. The course was prescribed to cover various slope angles and terrain types (concrete, gravel, dirt, grass, boulders) and obstacles (trees, river, gate, fence) and to not require active navigation for safety under fatigue and reduced decision-making capacity [55]. Speed was rewarded by earlier completion of the hourly protocol, resulting in a larger rest period per hour.

For clinical comparison, a battery of performance assessments were completed on an iPad Pro (Apple, Cupertino, CA, USA) using a custom application, implementing tests built with an Apple Research Kit [56]. The battery of assessments was chosen because they have previously shown sensitivity to the protocol loads and fatigue-related diseases, Table 1. These included assessments used for fibro myalgia [57], Parkinson’s [58], and physical [16,59] and cognitive fatigue [60]. Assessments used included Stroop, Finger Tap Test, FTT, Trail Making A, Trail Making B, paced serial addition test, PVSAT, memory, and jump height.

2.3. Data Preparation

The participant wore a chest-mounted BioHarness (Medtronic, MN, USA) [68,69] for acceleration data (100 Hz, vertical x-axis, sagittal z-axis, lateral y-axis) and electrocardiogram (ECG) (250 Hz) and a Garmin Forerunner GPS (Garmin, KS, USA) wrist watch (1 Hz, horizontal accuracy 6 m) to assist with labelling and location.

The trail was divided into twenty-three sections separated by waypoints defined by a change in terrain surface, slope, or obstacle. Terrain descriptors were validated against video (GoPro Hero 4, Garmin, KS, USA). Slope was determined from a mean of GPS altitude measurements at each waypoint. Waypoint location was determined from Google maps to an accuracy of 10 cm. Time at a waypoint was determined when the subject was closest. Walk and Run activity labels were defined by cadence from vertical axis accelerometery zero crossings (100 < Walk < 150 < Run steps per minute) as described in Russel et al. for human activity recognition [43]. Identification of crossing obstacles was based on geographic location and manual observation of the acceleration waveforms Figure 2. Time resolution for labelling was one second.

2.4. Convolutional Neural Network

Figure 2 shows the multi-channel 1-D Convolutional Neural Network (CNN) that was selected to allow learning on separate channels and cross correlation into a single regression output value. The training label was FTT up-sampled to 250 Hz. Data were split by activity type and segmented by input window length. The initial model width for all hidden layers was set at 256, which was approximately one second of data. The model implemented the Adam optimizer and mean absolute error (MAE) as the error term during training. Randomized train test split ratio was 0.33.

Hyper parameter tuning, included window size for each activity type, was performed (64, 128, 256, 512). The lowest MAE activity was selected for further model optimization of hidden layer widths. Optimization was performed separately for three datasets: acceleration; ECG; and combined acceleration and ECG. The final model for comparison was selected for lowest MAE. Performance was assessed using the mean absolute difference (MAE₂₀₀), and range of absolute difference (RAE₂₀₀), between the label values and the average of 200 predictions. RAE was of interest as it indicated the largest error possible when the trained model was used to predict a fatigue value in the future.

2.5. Statistics

Linear regression (Pearson correlation R²) was performed on each performance test to assess sensitivity of the protocol. The performance test results were normalized across the protocol and linearly interpolated to give a long-term linear fit (LTLF). The same tests with highest R² were up-sampled to 250 Hz using inter-test interpolation (ITI), as ITI includes short term fatigue and recovery. LTLF is more representative of long-term fatigue but is only possible with a research protocol designed with a constant load over time. ITI is needed for random field predictions where no assumptions can be made about overall loads.

Time series data were normalized using feature scaling via Equation (1) in preparation for training the CNN. ECG data were base line corrected. All accelerometer axis (x, y, z) and ECG data were transformed into an array (D, W, F) with D rows, W window width, and F number of features.

x_{new} = \frac{x - x_{\min}}{x_{\max} - x_{\min}}

(1)

3. Results

The participant voluntarily ceased the protocol at 11 h (2200 m vertical climb, 41.8 km) due to perceived exhaustion.

Figure 3 shows the representative input to the CNN of the gait waveforms of vertical acceleration on tarseal and dirt at different fatigue levels. Each plot is 50 steps triggered at zero g and plotted with the median waveform in a thick black line. Inter-step variation in acceleration and morphology can be observed between surfaces (a) tarseal and (b) dirt. The changes in waveform shape between surfaces was likely due to surface hardness and variations in surface texture uniformity. Across the protocol, variation was likely due to fatigue reducing peak forces and subsequent gait adaption, as seen on the plots at point (c).

A subset of performance tests (FTT, Jump test, Stroop, PVSAT) completed in the protocol are shown in Figure 4. Jump height and FTT-right-hand were most sensitive to the fatigue protocol.

Figure 4a shows FTT-right-hand and the slower non dominant left hand with separate linear regression lines. Inter-test variation was observed between physical and cognitive tests, with an overall trend having a negative slope showing performance was decreasing over time. Correlation results for all tests are shown in Table 2. Jump height shown in Figure 4b was performed after each physical load period and showed high correlation (R² 0.78) with the protocol. Stroop shown in Figure 4c had two outliers and showed moderate correlation (R² 0.5) with the outliers removed. PVSAT shown in Figure 4d was not correlated with the protocol load. Trail making A (R² 0.29) and spatial memory (R² 0.28) were somewhat correlated to post cognitive load. Trail making B (R² 0.22) was somewhat correlated to post-physical load.

Figure 5 shows the variation of gait for four periods in the protocol illustrating the variation to the accelerometer waveforms for both fatigue levels and terrain.

A training result is shown for a single activity ‘run down’ in Figure 6 for data window 128, epoch 100, individual predictions (light grey), and rolling average of 200 predictions (black). The label for FTT (red) inter-test linear interpolation with discontinuities between time periods due to concatenation.

A total of 108 machine learning experiments were performed to test which input data width and activity type gave the best MAE. Initially, a fixed CNN topology was used (Epoch 50, Batch 256, layer 1 filter 256, layer 2 filter 256, dense layer 128, overlap = 0). Three data group results were compared for: acceleration, ECG, and combined acceleration with ECG. These three conditions were tested for each activity type (‘run’, ’walk’, etc.) over four data window widths (64, 128, 256 and 512). The results for these experiments are shown in Figure 7 by activity, where circle diameter is data window width. Minimum MAE was at ’walk up’ (window width 256, MAE 0.105, samples 1,534,500, windows 5994) and ‘sit’ (window width 256, MAE 0.116, samples 2,662,750, windows 10,401). However, sit was not included as it took place in the lab for cognitive testing. Samples were more numerous for ‘run down’ (window width 256, MAE 0.181, samples 1,843,749, windows 7202) and still gave a larger minimum MAE. This indicates that total sample count is not the main influence on MAE, however the activity with considerably lower samples did show larger MAE values, ‘walk down’ (stride 512, MAE 0.309, samples 20,000, windows 78).

Further experiments were performed for acceleration and ECG with ‘walk up’ to optimize the CNN model hyperparameters, various widths of the first two convolutional layers, and the dense layer. The lowest MAE was found to be the following model: Conv1D 128, Conv1D 128, max_pooling, flatten, dense 128, dense 1.

Table 3 shows the total samples per activity and results for MAE and RAE with the training labels using two methods, linear fit, and inter-test interpolation, window width 128, epoch 100, batch size 256, and a rolling window average of 200 predictions. There was no result for activity of ‘walk-down’ as the total samples divided by the window width of 128 was 156, which was less than the rolling average of 200 predictions. Activity ‘Walk Up’ gave the lowest MAE for both linear interpolation and inter-test interpolation of label data. Activity ‘Run Down’ gave the lowest range of errors, indicating it may be a better activity for field prediction.

4. Discussion

A protocol for cognitive and physical fatigue was performed in the field, with voluntary activity selection and voluntary pacing over various terrain slopes and surfaces. Jump height and FTT-dominant-hand were most sensitive to the protocol. FTT-non-dominant-hand and Stroop were moderately sensitive. FTT was the most sensitive and biomechanically non-specific, as the legs were exposed to physical load and the arms–hand–fingers were tested for neuromuscular performance. It is likely Stroop would be more sensitive if the protocol included sleep deprivation. Spatial Memory was mildly correlated to the cognitive load.

The experiment showed that a field protocol of cognitive and physical load in excess of a critical power will cause failure and modulate standard objective measures of cognitive and physical performance. Mental and physical fatigue led to earlier-than-anticipated termination of the protocol, which aligned with previous studies [16,40].

The use of a machine learning model was required due to the complex gait waveform morphology variations throughout the protocol. The results for acceleration, ECG, and combined acceleration and ECG are shown in Figure 7 across various stride lengths from 64 to 512 samples. While the activity ‘sit’ had low MAE showing how a controlled environment could give good results, our work aimed to determine if it was possible in an uncontrolled field-based environment. Activity ‘walk up’ had low MAE for both inter-test interpolation and long-term linear fit. ‘Run down’ had the lowest RAE. It is recommended that RAE is used, as this represents the results you would get when using the model in the future for inference.

This experiment showed how a single sensor could be used in conjunction with a CNN model to give accurate results of cognitive and physical fatigue equivalent to gold standard objective tests; FTT and Vertical Jump Test. Best results were obtained when model training was specific to activities such as ‘run down’ and ‘walk up’. MAE and RAE performed well for a rolling window of 200 continuous predictions of 102 s. This intuitively makes sense that any one step in a persons’ gait may be influenced by objects, surface, and other distractions, and it is best to use multiple steps of a persons’ gait to determine a fatigue result. Winter [70] showed that the cadence in steps per minute on a uniform surface varied from 84.7 ± 10.4 for slow to 121.6 ± 5.3 for fast.

The input window size of the CNN model has an optimum size. Too small does not allow a full gait or ECG waveform to be analyzed, and too large significantly reduces the number of training samples.

Tests that had the highest sensitivity to the protocol, and indicated a central fatigue component, were the jump test (high physical load on the legs) and the FTT (utilized hand digits which were not significantly utilized during running). Cognitive tests were less sensitive to the protocol, indicating there may have been a mismatch between cognitive and physical loads.

The effectiveness of the protocol was encouraging as it provided proof of concept for translational research to be undertaken in outdoor environments. Future work could examine how team workload and tactical decision-making can be adjusted for cognitive and physical fatigue in real time with no additional data entry for soldiers on multiday missions. Recovery during training missions could be assessed without researchers being present. Adventure sports people could gain insight into their cognitive and physical fatigue, enabling informed training plans. Work rest cycles could be adjusted, and critical tactical and navigation decisions can be chosen based on periods of highest cognitive performance.

This feasibility study researched approaches of protocol design, error sources, calibration techniques, data collection, validation, labelling, and data processing. Given the lessons learnt, data gathering and processing needs to be more automated to reduce the high processing load that occurred for the one participant in this study. Further work is needed to test inter-subject variability to the protocol, test–retest accuracy of the prediction model, longer duration, and additional fatigue modulators including sleep, pain, discomfort, and nutrition.

Limitations

Limitations in validating the experimental objective include a linear protocol and the limited amount of comparison tests, however, this is a natural limitation in the field of cognitive assessments in the field. A long-term linear fit was appropriate for this protocol as the repetitive load could be assumed constant over the longer-term time frame. A random field assessment with no defined load protocol would require training using inter-test interpolation to allow for stochastic loads and recovery cycles. A constant long-term load was required to fit a machine learning model. Future work could compare the results in a long-term non periodic protocol.

The limitations of this test were the duration and the use of a single participant to initially prove the feasibility of the protocol and approach. Further research is required around increasing the duration of the protocol, possibly by reducing the hourly physical load. Additional studies over longer periods are required to generate cognitive fatigue that includes sleep deprivation. The test battery should include assessments immediately after large vertical assents to gather insight into short-term recovery. The addition of cognitive loads and assessment significantly affected the rate of perceived exertion. Future protocols should halve the physical load to lengthen the time to failure. Additionally, this method requires more participants to compare inter-person sensitivity and variability.

5. Conclusions

This paper showed that a single wearable sensor could be used in conjunction with a neural network model to determine cognitive and physical fatigue without performance tests being required during an operation in an outside unstructured environment. This research has the potential to increase safety and operational performance in high-risk environments by indicating the possibility of replacing traditional performance tests with a single wearable device. This work is novel, to the knowledge of the authors, in developing a field-based protocol for human performance with no direct supervision and modulation from ground surface, slope, fatigue, and task motivation. Future research is required for more participants and will require further automation of data labelling to process field data with self-pacing activities.

Author Contributions

Conceptualization and methodology, B.R., A.M., W.T. and P.H.; software, B.R. and A.M.; validation, B.R., W.T. and P.H.; formal analysis, B.R., A.M., W.T. and P.H.; investigation, B.R.; resources, B.R. and P.H.; data curation, B.R.; writing—B.R.; writing—review and editing, A.M., W.T. and P.H.; visualization, B.R.; supervision, A.M. and P.H.; project administration, B.R. and P.H.; funding acquisition, P.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board (or Ethics Committee) of Auckland University of Technology (protocol code AUTEC 18/412.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Phillips, R.O. A review of definitions of fatigue—And a step towards a whole definition. Transp. Res. Part F Traffic Psychol. Behav. 2015, 29, 48–56. [Google Scholar] [CrossRef]
Caldwell, J.A.; Caldwell, J.L.; Brown, D.L.; Smith, J.K. The Effects of 37 Hours of Continuous Wakefulness On the Physiological Arousal, Cognitive Performance, Self-Reported Mood, and Simulator Flight Performance of F-117A Pilots. Mil. Psychol. 2004, 16, 163–181. [Google Scholar] [CrossRef]
Thomas, L.C.; Gast, C.; Grube, R.; Craig, K. Fatigue Detection in Commercial Flight Operations: Results Using Physiological Measures. Procedia Manuf. 2015, 3, 2357–2364. [Google Scholar] [CrossRef] [Green Version]
Vural, E.; Çetin, M.; Erçil, A. Machine Learning Systems for Detecting Driver Drowsiness. In Digital Signal Processing for In-Vehicle Systems and Safety; Springer: Boston, MA, USA, 2007; pp. 16937–16953. [Google Scholar]
Desai, A.V.; Haque, M.A. Vigilance monitoring for operator safety: A simulation study on highway driving. J. Saf. Res. 2006, 37, 139–147. [Google Scholar] [CrossRef] [PubMed]
Correa, A.G.; Orosco, L.; Laciar, E. Automatic detection of drowsiness in EEG records based on multimodal analysis. Med. Eng. Phys. 2014, 36, 244–249. [Google Scholar] [CrossRef] [PubMed]
Von Jan, T.; Karnahl, T.; Seifert, K.; Hilgenstock, J.; Zobel, R. Don’t sleep and drive—VW’s fatigue detection technology. In Proceedings of the 19th International Conference on Enhanced Safety of Vehicles, Washington, DC, USA, 6 June 2005; pp. 1–12. [Google Scholar]
Chen, L.L.; Zhao, Y.; Zhang, J.; Zou, J.Z. Automatic detection of alertness/drowsiness from physiological signals using wavelet-based nonlinear features and machine learning. Expert Syst. Appl. 2015, 42, 7344–7355. [Google Scholar] [CrossRef]
Duncan, M.J.; Fowler, N.; George, O.; Joyce, S.; Hankey, J. Mental Fatigue Negatively Influences Manual Dexterity and Anticipation Timing but not Repeated High-intensity Exercise Performance in Trained Adults. Res. Sports Med. 2015, 23, 1–13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Park, K.; Rosengren, K.S.; Horn, G.P.; Smith, D.L.; Hsiao-Wecksler, E.T. Assessing gait changes in firefighters due to fatigue and protective clothing. Saf. Sci. 2011, 49, 719–726. [Google Scholar] [CrossRef]
Smith, B.P.; Browne, M.; Armstrong, T.A.; Ferguson, S.A. The accuracy of subjective measures for assessing fatigue related decrements in multi-stressor environments. Saf. Sci. 2016, 86, 238–244. [Google Scholar] [CrossRef]
Dawson, D.; Reid, K. Fatigue alcohol and performance impairment. Nature 1997, 388, 235–237. [Google Scholar] [CrossRef] [PubMed]
Davis, M.P.; Walsh, D. Mechanisms of Fatigue. J. Support Oncol. 2010, 8, 164–174. [Google Scholar] [CrossRef]
Noakes, T.D. Fatigue is a brain-derived emotion that regulates the exercise behavior to ensure the protection of whole body homeostasis. Front. Physiol. 2012, 82. [Google Scholar] [CrossRef] [Green Version]
Hampson, D.B.; Gibson, A.S.; Lambert, M.I.; Noakes, T.D. The Influence of Sensory Cues on the Perception of Exertion During Exercise and Central Regulation of Exercise Performance. Sports Med. 2001, 31, 935–952. [Google Scholar] [CrossRef]
Siirtola, P.; Laurinen, P.; Haapalainen, E.; Röning, J.; Kinnunen, H. Clustering-based activity classification with a wrist-worn accelerometer using basic features. In Proceedings of the 2009 IEEE Symp. Comput. Intell. Data Mining, CIDM 2009-Proc., Nashville, TN, USA, 2 April 2009; Volume 44, pp. 95–100. [Google Scholar] [CrossRef]
Noakes, T.D. Physiological models to understand exercise fatigue and the adaptations that predict or enhance athletic performance. Med. Sci. Sports 2000, 123–145. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Van Cutsem, J.; Marcora, S.; De Pauw, K.; Bailey, S.; Meeusen, R.; Roelands, B. The Effects of Mental Fatigue on Physical Performance: A Systematic Review. Sports Med. 2017, 47, 1569–1588. [Google Scholar] [CrossRef] [Green Version]
Hill, A.V.; Long, C.N.H.; Lupton, H. Muscular Exercise Lactic Acid and the Supply and Utilisation of Oxygen. Proc. R. Soc. 1924, 97, 155–176. [Google Scholar] [CrossRef]
Borg, G.; Borg, E. To determine the magnitude of pain with Borg. In Proceedings of the Fechner Day 2014—30th Annual Meeting International Society for Psychophysics, Lund, Sweden, 18 August 2014; Volume 45, p. 16. [Google Scholar]
Millet, G.Y.; Tomazin, K.; Verges, S.; Vincent, C.; Bonnefoy, R.; Boisson, R.C.; Gergelé, L.; Féasson, L.; Martin, V. Neuromuscular consequences of an extreme mountain ultra-marathon. PLoS ONE 2011, 6, e17059. [Google Scholar] [CrossRef] [Green Version]
Venhorst, A.; Micklewright, D.; Noakes, T.D. Perceived Fatigability: Utility of a Three-Dimensional Dynamical Systems Framework to Better Understand the Psychophysiological Regulation of Goal-Directed Exercise Behaviour. Sports Med. 2018, 2479–2495. [Google Scholar] [CrossRef] [PubMed]
Boksem, M.A.; Tops, M. Mental fatigue: Costs and benefits. Brain Res. Rev. 2008, 59, 125–139. [Google Scholar] [CrossRef] [Green Version]
Möckel, T.; Beste, C.; Wascher, E. The Effects of Time on Task in Response Selection—An ERP Study of Mental Fatigue. Nat. Publ. Gr. 2015, 5, 10113. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Aoyagi, M.W.; Portenga, S.T. The Role of Positive Ethics and Virtues in the Context of Sport and Performance Psychology Service Delivery. Prof. Psychol. Res. Pract. 2010, 41, 253–259. [Google Scholar] [CrossRef]
Portenga, S.T.; Aoyagi, M.W.; Cohen, A.B. Helping to build a profession: A working definition of sport and performance psychology. J. Sport Psychol. Action 2017, 8, 47–59. [Google Scholar] [CrossRef]
Lambert, E.V.; Gibson, A.S.C.; Noakes, T.D. Complex systems model of fatigue: Integrative homoeostatic control of peripheral physiological systems during exercise in humans. Br. J. Sports Med. 2004, 39, 52–62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Pimenta, A.; Carneiro, D.; Neves, J.; Novais, P. A neural network to classify fatigue from human-computer interaction. Neurocomputing 2016, 172, 413–426. [Google Scholar] [CrossRef]
Abdulin, E. User Fatigue Detection via Eye Movement Behavior. In Proceedings of 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, Seoul, Korea, 18 April 2015. [Google Scholar]
Gonzalez, K.; Sasangohar, F.; Mehta, R.; Lawley, M.; Erraguntla, M. Measuring Fatigue through Heart Rate Variability and Activity Recognition: A Scoping Literature Review of Machine Learning Techniques. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting, Rome, Italy, 28–30 September 2017; p. 1748. [Google Scholar]
Patel, A.N.; Howard, M.D.; Roach, S.M.; Jones, A.P.; Bryant, N.B.; Robinson, C.S.H.; Clark, V.P.; Pilly, P.K. Mental State Assessment and Validation Using Personalized Physiological Biometrics. Front. Hum. Neurosci. 2018, 12, 221. [Google Scholar] [CrossRef]
Azim, T.; Jaffar, M.A.; Mirza, A.M. Fully automated real time fatigue detection of drivers through Fuzzy Expert Systems. Appl. Soft Comput. J. 2014, 18, 25–38. [Google Scholar] [CrossRef]
Enoka, R.; Duchateau, J. Translating fatigue to human performance. Meical Sci. Sports Exerc. 2016, 48, 2223–2238. [Google Scholar] [CrossRef] [Green Version]
Zhu, Y.; Jankay, R.R.; Pieratt, L.C.; Mehta, R.K. Wearable sensors and their metrics for measuring comprehensive occupational fatigue: A scoping review. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting, Austin, TX, USA, 9–13 October 2017; pp. 1041–1045. [Google Scholar] [CrossRef]
Qi, W.; Su, H.; Aliverti, A. A Smartphone-Based Adaptive Recognition and Real-Time Monitoring System for Human Activities. IEEE Trans. Hum. Mach. Syst. 2020, 50, 414–423. [Google Scholar] [CrossRef]
Qi, W.; Su, H.; Yang, C.; Ferrigno, G.; De Momi, E.; Aliverti, A. A fast and robust deep convolutional neural networks for complex human activity recognition using smartphone. Sensors 2019, 19, 3731. (In Switzerland) [Google Scholar] [CrossRef] [Green Version]
Granacher, U.; Wolf, I.; Wehrle, A.; Bridenbaugh, S.; Kressig, R.W. Effects of muscle fatigue on gait characteristics under single and dual-task conditions in young and older adults. J. Neuroeng. Rehabil. 2010, 7, 56. (In Switzerland) [Google Scholar] [CrossRef] [PubMed] [Green Version]
Fuller, J.T.; Bellenger, C.R.; Thewlis, D.; Arnold, J.; Thomson, R.L.; Tsiros, M.D.; Robertson, E.Y.; Buckley, J.D. Tracking performance changes with running-stride variability when athletes are functionally overreached. Int. J. Sports Physiol. Perform. 2017, 12, 357–363. [Google Scholar] [CrossRef] [PubMed]
Heredia-Jimenez, J.; Latorre-Roman, P.; Santos-Campos, M.; Orantes-Gonzalez, E.; Soto-Hermoso, V.M. Spatio-temporal gait disorder and gait fatigue index in a six-minute walk test in women with fibromyalgia. Clin. Biomech. 2016, 33, 1–6. [Google Scholar] [CrossRef]
Marcora, S.M.; Staiano, W.; Manning, V. Mental fatigue impairs physical performance in humans. J. Appl. Physiol. Publ. 2009, 106, 857–864. [Google Scholar] [CrossRef] [PubMed]
Roelands, B.; De Koning, J.; Foster, C.; Hettinga, F.; Meeusen, R. Neurophysiological determinants of theoretical concepts and mechanisms involved in pacing. Sports Med. 2013, 43, 301–311. [Google Scholar] [CrossRef] [PubMed]
Borghini, G.; Astolfi, L.; Vecchiato, G.; Mattia, D.; Babiloni, F. Measuring neurophysiological signals in aircraft pilots and car drivers for the assessment of mental workload, fatigue and drowsiness. Neurosci. Biobehav. Rev. 2014, 44, 58–75. [Google Scholar] [CrossRef]
Russell, B.; McDaid, A.; Toscano, W.; Hume, P. Moving the Lab into the Mountains: A Pilot Study of Human Activity Recognition in Unstructured Environments. Sensors 2021, 21, 654. [Google Scholar] [CrossRef] [PubMed]
Wang, G.; Li, Q.; Wang, L.; Wang, W.; Wu, M.; Liu, T. Impact of sliding window length in indoor human motion modes and pose pattern recognition based on smartphone sensors. Sensors 2018, 18, 1965. (In Switzerland) [Google Scholar] [CrossRef] [PubMed] [Green Version]
Yoon, H.; Hwan, S.; Ho, S.; Choi, J.; Jin, Y. Wakefulness evaluation during sleep for healthy subjects and OSA patients using a patch-type device. Comput. Methods Progr. Biomed. 2018, 155, 127–138. [Google Scholar] [CrossRef]
Gordienko, Y.; Stirenko, S.; Kochura, Y.; Alienin, O.; Novotarskiy, M.; Gordienko, N. Deep Learning for Fatigue Estimation on the Basis of Multimodal Human-Machine Interactions. arXiv 2017, arXiv:1801.06048. [Google Scholar]
van der Westhuizen, J.; Lasenby, J. A Review of Machine Learning Applied to Time Series; Technical Report; Cambridge University Engineering Department: Cambridge, UK, 2016; CUED/F-INFENG/TR.702:0951-9211. [Google Scholar]
Ignatov, A. Real-time human activity recognition from accelerometer data using Convolutional Neural Networks. Appl. Soft Comput. J. 2018, 62, 915–922. [Google Scholar] [CrossRef]
Tripathi, S.; Acharya, S.; Sharma, R.D.; Mittal, S.; Bhattacharya, S. Using Deep and Convolutional Neural Networks for Accurate Emotion Classification on DEAP Dataset. In Proceedings of the Twenty-Ninth IAAI Conference, San Francisco, CA, USA, 6 February 2017; pp. 4746–4752. [Google Scholar]
Laurent, F.; Valderrama, M.; Besserve, M.; Guillard, M.; Lachaux, J.P.; Martinerie, J.; Florence, G. Multimodal information improves the rapid detection of mental fatigue. Biomed. Signal Process. Control 2013, 8, 400–408. [Google Scholar] [CrossRef]
Grobe, S.; Kakar, R.S.; Smith, M.L.; Mehta, R.; Baghurst, T.; Boolani, A. Impact of cognitive fatigue on gait and sway among older adults: A literature review. Prev. Med. Rep. 2017, 6, 88–93. [Google Scholar] [CrossRef]
Vanhatalo, A.; Jones, A.M.; Burnley, M. Application of Critical Power in Sport What Is the Critical Power Concept. Int. J. Sports Physiol. Perform. 2011, 6, 128–136. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Miyake, S.; Yamada, S.; Shoji, T.; Takae, Y.; Kuge, N.; Yamamura, T. Physiological responses to workload change. A test/retest examination. Appl. Ergon. 2009, 40, 987–996. [Google Scholar] [CrossRef]
Gibson, A.S.C.; Goedecke, J.H.; Harley, Y.X.; Myers, L.J.; Lambert, M.I.; Noakes, T.D.; Lambert, E.V. Metabolic setpoint control mechanisms in different physiological systems at rest and during exercise. J. Theor. Biol. 2005, 236, 60–72. [Google Scholar] [CrossRef] [PubMed]
Wickens, C.D.; Keller, J.W.; Shaw, C. Human Factors in High-Altitude Mountaineering. J. Hum. Perform. Extrem. Environ. 2015, 12, 5–8. [Google Scholar] [CrossRef]
Apple Research Kit. Available online: https://developer.apple.com/researchkit/ (accessed on 12 June 2018).
Cherry, B.J.; Zettel-Watson, L.; Chang, J.C.; Shimizu, R.; Rutledge, D.N.; Jones, C.J. Positive associations between physical and cognitive performance measures in fibromyalgia. Arch. Phys. Med. Rehabil. 2012, 93, 62–71. [Google Scholar] [CrossRef] [PubMed]
Lee, C.Y.; Kang, S.J.; Hong, S.K.; Ma, H.; Lee, U.; Kim, Y.J. A validation study of a smartphone-based finger tapping application for quantitative assessment of bradykinesia in Parkinson’s disease. PLoS ONE 2016, 11, e0158852. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Leyla, A.; Kiziltan, E. Polyphasic Temporal Behavior of Finger-Tapping Performance. J. Mot. Behav. 2016, 48, 72–78. [Google Scholar]
Pageaux, B.; Marcora, S.M.; Rozand, V.; Lepers, R. Mental fatigue induced by prolonged self-regulation does not exacerbate central fatigue during subsequent whole-body endurance exercise. Front. Hum. Neurosci. 2015, 9, 67. [Google Scholar] [CrossRef] [Green Version]
Egner, T.; Hirsch, J. The neural correlates and functional integration of cognitive control in a Stroop task. Neuroimage 2005, 24, 539–547. [Google Scholar] [CrossRef]
Iancheva, D.; Trenova, A.G.; Terziyski, K.; Kandilarova, S.; Mantarova, S. Translational validity of PASAT and the effect of fatigue and mood in patients with relapsing remitting MS: A functional MRI study. J. Eval. Clin. Pract. 2018, 24, 832–838. [Google Scholar] [CrossRef] [PubMed]
Gonzales, J.U.; James, C.R.; Yang, H.S.; Jensen, D.; Atkins, L.; Thompson, B.J.; Al-Khalil, K.; O’Boyle, M. Different cognitive functions discriminate gait performance in younger and older women: A pilot study. Gait Posture 2016, 50, 89–95. [Google Scholar] [CrossRef] [PubMed]
Corsi, P.M. Memory and the Medial Temporal Region of the Brain. Ph.D. Thesis, Department of Psychology McGill University, Montreal, QC, Canada, 1972. [Google Scholar]
Brunetti, R.; Del Gatto, C.; Delogu, F. eCorsi: Implementation and testing of the Corsi block-tapping task for digital tablets. Front. Psychol. 2014, 5, 1–8. [Google Scholar] [CrossRef] [PubMed]
Watkins, C.M.; Barillas, S.R.; Wong, M.A.; Archer, D.C.; Dobbs, I.J.; Lockie, R.G.; Coburn, J.W.; Tran, T.T.; Brown, L.E. Determination of vertical jump as a measure of neuromuscular readiness and fatigue. J. Strength Cond. Res. 2017, 31, 3305–3310. [Google Scholar] [CrossRef]
Kovářová, L.; Pánek, D.; Kovář, K.; Hlinčík, Z. Relationship between subjectively perceived exertion and objective loading in trained athletes and non-athletes. J. Phys. Educ. Sport 2015, 15, 186–193. [Google Scholar] [CrossRef]
Johnstone, J.A.; Ford, P.A.; Hughes, G.; Watson, T.; Garrett, A.T. Bioharness^TM multivariable monitoring device. Part I: Validity. J. Sports Sci. Med. 2012, 11, 400–408. [Google Scholar]
Johnstone, J.A.; Ford, P.A.; Hughes, G.; Watson, T.; Garrett, A.T. Bioharness^TM multivariable monitoring device. Part II: Reliability. J. Sports Sci. Med. 2012, 11, 409–417. [Google Scholar] [PubMed]
Winter, D.A. Kinematic and kinetic patterns in human gait: Variability and compensating effects. Hum. Mov. Sci. 1984, 3, 51–76. [Google Scholar] [CrossRef]

Figure 1. Fatigue Protocol.

Figure 2. Time series example of the one-off activity “climb gate” verses repetitive data “run” and “walk”.

Figure 3. Structure of CNN.

Figure 4. Performance Tests (a) FTT, (b) Jump Test, (c) Stroop, (d) PVSAT.

Figure 5. Acceleration for activity “Run Down” over protocol time (0, 3, 8, and 10 h) for surfaces (a) tarseal, (b) dirt, and (c) feature of interest over fatigue.

Figure 6. Training Results (BLACK) for CNN with activity ‘run down’ with training label (RED) and individual predictions (GREY).

Figure 7. Training Loss, MAE, for (a) Accelerometer, (b) ECG and (c) combined ECG and accelerometer, epoch = 100, circle radius set by data window width (64, 128, 256, 512).

Table 1. Assessment Battery.

Assessment	Bio-Psycho-Central Performance	Reference
Finger Tap Test	Neuro muscular fatigue	[59]
Stroop	Cognitive flexibility and selective attention	[61]
PVSAT	processing speed, attention, working memory	[62]
Trail Making A and B	Motor and executive impairment	[63]
Corsi Block test	Spatial memory, Working memory	[64,65]
Vertical Jump	Neuromuscular fatigue	[66]
Rating of Perceived Exertion	Perceived level of exertion	[16,20,67]

Table 2. Performance test sensitivity, R2, to the protocol load.

Test	All Tests	Post Physical Load	Post Cognitive Load
Jump	0.78	-	-
Finger Tap Test
Dominant Hand	0.72	0.76	0.67
Non Dominant Hand	0.54	0.51	0.60
Stroop (with outliers)	0.04	0.003	0.36
Stroop (no outliers)	0.49	0.37	0.36
PVSAT	0.03	0.11	0.02
Trail Making A	0.19	0.04	0.29
Trail Making B	0.001	0.22	0.05
Spatial Memory	0.00	0.00	0.30

Table 3. Results for linear fit and inter-test interpolated labels.

Activity	Data (250 Hz)	Linear Fit MAE₂₀₀	Linear Fit RAE₂₀₀	Inter-Test Interpolation MAE₂₀₀	Inter-Test Interpolation RAE₂₀₀
Run Up	1,019,002	0.145	0.225	0.134	0.240
Run	732,501	0.151	0.238	0.156	0.232
Run Down	1,843,749	0.130	0.289	0.133	0.167
Walk Up	1,534,500	0.136	0.303	0.125	0.411
Walk	299,997	0.238	0.683	0.235	0.726
Walk Down	20,000	0.219	-	0.239	-
Open Gate	56,750	0.195	0.338	0.199	0.316
Climb Gate	65,249	0.327	0.422	0.313	0.389

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Russell, B.; McDaid, A.; Toscano, W.; Hume, P. Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model. Sensors 2021, 21, 5442. https://doi.org/10.3390/s21165442

AMA Style

Russell B, McDaid A, Toscano W, Hume P. Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model. Sensors. 2021; 21(16):5442. https://doi.org/10.3390/s21165442

Chicago/Turabian Style

Russell, Brian, Andrew McDaid, William Toscano, and Patria Hume. 2021. "Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model" Sensors 21, no. 16: 5442. https://doi.org/10.3390/s21165442

APA Style

Russell, B., McDaid, A., Toscano, W., & Hume, P. (2021). Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model. Sensors, 21(16), 5442. https://doi.org/10.3390/s21165442

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Predicting Fatigue in Long Duration Mountain Events with a Single Sensor and Deep Learning Model

Abstract

1. Introduction

1.1. Why We Need to Measure Physical and Cognitive Fatigue in the Field

1.2. How We Can Measure Physical and Cognitive Fatigue in the Lab and the Field

2. Materials and Methods

2.1. Ethics

2.2. Protocol—Physical and Cognitive Load and Performance Assessments

2.3. Data Preparation

2.4. Convolutional Neural Network

2.5. Statistics

3. Results

4. Discussion

Limitations

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI