Next Article in Journal
MCRBM–CNN: A Hybrid Deep Learning Framework for Robust SSVEP Classification
Previous Article in Journal
SAMViTrack: A Search-Region Adaptive Mamba-ViT Tracker for Real-Time UAV Tracking
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine-Learning-Based Fatigue Trend Analysis on IMU Wearable Sensor Data from Construction Site Workers

1
VTT Technical Research Centre of Finland Ltd., Kaitoväylä 1, P.O. Box 1100, 90571 Oulu, Finland
2
Department of Advanced Robotics (ADVR), Istituto Italiano di Tecnologia, via Morego, 30, 16163 Genoa, Italy
3
Department of Informatics, Bioengineering, Robotics and Systems Engineering (DIBRIS), Università degli Studi di Genova (UniGe), 16145 Genova, Italy
4
Biomimetics and Intelligent Systems Group, Faculty of Information Technology and Electrical Engineering, University of Oulu, Pentti Kaiteran katu 1, P.O. Box 8000, 90014 Oulu, Finland
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(24), 7455; https://doi.org/10.3390/s25247455
Submission received: 29 September 2025 / Revised: 21 November 2025 / Accepted: 2 December 2025 / Published: 8 December 2025
(This article belongs to the Section Wearables)

Abstract

Physical fatigue is a major cause of work-related accidents and musculoskeletal injuries in the construction industry, and additional means are needed for their identification and management to prevent long-term consequences. Based on recent scientific literature, fatigue can be detected with wearable inertial measurement units (IMUs). However, IMUs for detecting fatigue have been so far tested mainly in the laboratory; therefore, a research gap exists in application of IMU sensors for detecting fatigue in real-life work settings. The aim of this paper is to bring the fatigue trend detection with IMUs closer to real-life context by using wearable IMU sensor data from an actual construction site measuring actual workers with simulated work tasks. The paper also presents advancements in fatigue trend detection with frequency domain investigations to gain access to more detailed fatigue relevant features. Machine-learning methods are used to predict fatigue trends based on IMU data, resulting in fatigue trend detection accuracy that advances the state of the art. More knowledge is also unearthed about relevant sensor locations and features.

1. Introduction

In the US construction industry, 33% of all work-related musculoskeletal injuries are attributed to overexertion [1]. In addition, physical fatigue is found to be a major cause of work-related accidents in the building construction industry [2,3]. Therefore, detecting and tracking fatigue is important, so that timely interventions (e.g., breaks) can be introduced [4]. To support physical ergonomics and overall safety at work, personalized fatigue assessments would be beneficial.
Fatigue is usually detected by measuring heart rate (HR) and heart rate variability (HRV), skin temperature, muscle jerk (IMU) and muscle response (electromyography, EMG) metrics [5]. However, HR, HRV, skin temperature and EMG all require a sensor with skin contact, which can be considered obtrusive. Fatigue adversely affects movement control and quality, therefore, demonstrating increases in the jerk values (with IMUs) [6]. In [7], features associated with jerk and acceleration at the wrists and hips were better predictors of physical fatigue than features associated with heart rate [7]. Heart rate data (HRV) might still be applicable to evaluate general fatigue; however, jerk metrics might be more suitable to assess localized fatigue (e.g., lower back, lower limbs) [5,7,8]. Then again, heart rate might not be directly proportional to the individuals’ fatigue state, but rather an indicator of the intensity of the current task [9]. Jerk metrics, on the other hand, can be sensitive to sensor position and the current task (task- and person-specific cutoff values) [5].
Based on the scientific literature presented in the previous paragraph, fatigue can be detected with wearable sensor IMUs. IMUs also present other benefits: IMU sensors are cheap, they can be easily integrated into clothing, and they are relatively unobtrusive. Another clear advantage of IMUs is that the data that they produce is less privacy-intrusive than those captured by, e.g., cameras or health-related sensors, such as heart rate monitors. However, IMUs for detecting fatigue have so far been mainly tested in the laboratory [5]; therefore, a research gap exists in the application of IMU sensors for detecting fatigue in real-life settings, where the volatile environment presents challenges. This research will aim to cover this gap and contribute to the field of wearables and personal wellbeing.
The inspiration for the research presented in this paper is based on previous work [10], where the aim was to detect hazardous situations and accidents by applying human-activity recognition methods on wearable sensor data. A dataset [11] was collected with inertial motion unit (IMU) sensors from real construction work conditions and was analyzed using machine-learning models to recognize physical activities of the workers. The project revealed challenges in both reliable data collection (sensor placements, battery consumption, sensor synchronization, etc.), and the challenge of tracking data from workers, which would reveal essential parts of their wellbeing and safety. However, initial assessments of the sensor data in the frequency domain to track overall fatigue also raised the interest in the possibility of a more detailed setup for tracking fatigue itself.
The research presented in this paper aims to answer the question: How can the fatigue trends of workers in real work environments be accurately detected with non-intrusive IMUs? The methods include algorithms such as pre-processing, filtering, and other frequency domain manipulations to gain access to fatigue relevant features. For the actual data analysis, machine-learning processes and methods were used to detect and classify fatigue and activities. One hypothesis for a potential metric for fatigue build-up over time could be the change related to a starting “normal” baseline. Although fatigue is presumably person-dependent, a general model approach was investigated to find a generalized fatigue model for multiple people. The efficacy of the fatigue models used was benchmarked against existing results from state-of-the-art literature. All these algorithms were used with workers’ data in a real-life construction scenario. During the test, the workers wore the Smart Suit developed at the Istituto Italiano di Tecnologia [12].
The work presented in this paper has been funded by BEEYONDERS EU project. BEEYONDERS’s problem statement is highly relevant regarding the aims of this paper too. Demographic trends in Europe are posing a challenge, e.g., to the construction sector, which is facing labor shortage, and an aging existing workforce. This can be addressed through the integrated adoption of breakthrough technologies like smart wearables and artificial intelligence, which can help construction tasks to be completed with more efficacy, while at the same time, making the sector more attractive to young people, and improving the safety, productivity, quality, and environmental impact of construction projects. The project aims to demonstrate the safety of wearables in a construction environment in cooperation with workers and increase the wellbeing of the construction workforce concerned and develop solutions for monitoring the wellbeing of the workforce and prevention of accidents [13].
In this paper, we present novel contributions: the use of frequency plane (PSD) peaks as features in fatigue detection methodology; these peaks represent the major frequencies of the human movement and illustrate shift in the intensity of the movement. Our proposed PSD features are strongly present in the most important features listing, hinting at their high usability in detecting fatigue trends compared to jerk values and standard machine-learning features. Our approach also achieved promising MAE rates, improving the state of the art regarding the accuracy of IMU-based detection of construction site fatigue trends. We also emphasize that another strength of this research is that the Technology Readiness Level (TRL) of the dataset and approach used in this paper is relatively high, TRL 6, since it is from the actual construction site with actual construction workers.

2. Dataset Description and Machine-Learning Methodology

This chapter aims to describe method development for fatigue trend analysis with wearables for construction workers. The ideal scenario would be to use a minimum amount of non-intrusive IMU sensors to enable realistic usage at work sites; however, during initial method development, all possible sensors and features were used in assessing the fatigue trend detection accuracy.

2.1. Smart Suit and Data Acquisition

The fatigue trend detection uses data coming from the Smart Suit. The Smart Suit has been developed at Istituto Italiano di Tecnologia (IIT) to analyze and monitor worker movements and muscle activations in operational scenarios. Several sensors, e.g., IMU and EMG, have been integrated within a high-visibility Personal Protective Equipment (PPE) in a seamless way, avoiding bulky external devices on the worker. Smart Suit provides a wearable motion capture technology with improved usability for the final user. A patented mechanism ensures the sensor’s anchoring and stability during usage.
The Smart Suit integrates, under its outmost layer, ten magnetometer-free IMUs, making this technology particularly suitable for industrial applications [14]. More details are reported in [12]. Small portable batteries power the sensors and the Raspberry Pi (RPI), the single-board computer. The RPI equips an external data storage device with a capacity of 16 GB that allows continuous data logging of more than two hundred variables at 200 Hz for about 4 h.
The Smart Suit software architecture consists of a distributed framework of nodes on multiple levels. The Core node is responsible for loading, initializing, and managing low- and high-level nodes. The Core node also ensures the synchronization of all the other nodes, such as the human activity recognition [15] and the ergonomic risk assessment [16]. The Streamer node can be used as a bridge between the Smart Suit data and variables, and external IoT devices among the high-level modules.
The data used in this research is measured from six positions of the Smart Suit IMU sensors: left leg, right leg, left arm, right arm, sternum, and pelvis. Arm data are important considering activities that are performed mainly by hand or with the upper body; in these cases, the detection of fatigue could be challenging without the arm data. The Smart Suit was used by IIT to collect data from a field study at a construction site located in Helsinki, Finland. The field study was performed in a dedicated real-life production environment with 5 individual workers (age 30 ± 17 years, height 176 ± 6 cm, and weight 80 ± 4 kg) performing work activities on five different days. The experiment was approved by the Ethical Committee of Liguria (protocol reference number: CER Liguria 001/2019) and complies with the Helsinki Declaration. All the subjects signed an informed consent form prior to participating, after receiving a full explanation of the experimental procedure.
One of the activities that the Smart Suit was tested on workers was hauling. This activity consisted of lifting a heavy object, carrying it a few meters and lowering it, and repeating this cycle. The task was performed between two loading stations placed approximately 10 to 12 m apart from each other. The workers carried out three consecutive tasks, first carrying bricks (total weight 10 kg) 4–5 times, then carrying a cement bag (25 kg), and after this, executing a free walk (no load). This sequence of movements was repeated for the whole duration of the test, and the sequence aimed at keeping the strenuousness of the lifting in a safe range. The duration of every activity was individual to the worker, and the test lasted between 30 and 60 min, depending on the test subject’s pace. The test was performed on two separate days; altogether, two repetitions were recorded for each worker and each task, apart from worker 4, who did only one repetition on day 4, and worker 3, who did not perform on day 2. This resulted in 8 datasets for hauling activities. Table 1 illustrates the overall view for the hauling task datasets.
Along with the Smart Suit data, self-reported effort assessments (force exerted during an activity) were collected from the workers in the beginning, in the middle and at the end of the work tasks. The effort assessment was on a scale of five values, from 1 to 5. Table 2 details the effort levels.

2.2. Pre-Processing

Only the accelerometer data coming from the Smart Suit IMUs went through pre-processing. There was some slight variance in the data frequency and occasional NaN values in the timestamps, but the approximate average frequency of the data was 45 Hz. The accelerometer time series data was pre-processed by checking the average frequency of each sensor, and calculating the total acceleration from the x, y, and z axes of the accelerometer data. Possible null values were filtered out. A low-pass Butterworth filter with a cutoff frequency of 18 Hz was added for the total acceleration data. The accelerometer data was windowed with sliding window technique, with 12 min-long windows sliding 1 min at a time.
The effort values (force exerted during activity) were pre-processed so that the values attained for each worker during a task (from the beginning, from the middle and from the end), are linearly interpolated so that they span the whole task time, providing a target value for each of the time windows generated from the data for the task. These interpolated effort values operate as target values (a baseline) that are used in guiding the machine-learning training and testing.

2.3. Features

The feature calculation phase utilized power spectral density-based features that were already used in the previous project [11]. The power spectral density (PSD) was calculated using Welch’s method [17], using fast Fourier transform to reveal the frequency plane details of the movement data from the accelerometers. From the PSD, the predominant frequencies were analyzed for each time window, and especially focused on the predominant frequency peaks, and their change over time. These peaks are illustrated in Figure 1. The frequency values of the 8 highest peaks of each PSD time window were used as features. In addition, a power weighed sum of these frequency values was calculated and used as a feature.
The power weighed sum of the frequency values is calculated as follows:
x = k = 1 8 f k   p k ,
where x is the power weighed sum, f is the frequency of each frequency peak, and p is the power of each frequency peak.
Other features that were calculated can be divided into standard features and jerk features. Standard features are common features used in machine learning, including mean, standard deviation, median, 75th percentile, and 25th percentile. These features are calculated directly for the windowed total acceleration values. Jerk features are otherwise the same, except the total acceleration windowsare first used to calculate the jerk value, i.e., derivative/gradient of the aforementioned time window. These jerk time windows are then used to calculate the machine-learning features of mean, standard deviation, median, 75th percentile, and 25th percentile. Calculating all these features for all the 6 different sensors resulted in a total of 114 different features being used in the feature selection phase. After feature calculations, the features were normalized by dividing each feature value with the average of the first two values of each feature vector. The normalization is needed to create features that are comparable between different test subjects and to be able to create a generalized fatigue model from many sources. Different data standardization approaches were tried, such as setting the dataset mean to zero and standard deviation to 1 (standard scaling), and by scaling each feature to a given value range (from 0 to 1), but the normalization with the equation below yielded the best results.
f n o r m a l i z e d = f x   /   ( f 1 +   f 2 )
For splitting the data into train and test splits, the leave-one-out method was used, where each of the datasets is left as a test case in their turn and the other datasets are used for model training., e.g., in the case of the hauling task, there are 8 different permutations of the train/test split.
For feature selection, univariate feature selection was used, in which univariate statistics are used to determine the best features. More specifically, variance F-value was analyzed for the feature samples [18]. This is more commonly used in classification tasks but also works well for regression. The feature selection results in the five best features chosen for each train/test split for model training. A random forest regressor was used in model training with the following parameters: the number of trees in the forest is 80; for the function to measure the quality of a split, the mean squared error was used with Friedman’s improvement score, and randomness of the bootstrapping of the samples was used when building trees were set to an integer value of 0 [19]. The whole algorithmic process is explained in Section 2.2 and Section 2.3 and is illustrated in a flowchart in Figure 2.

3. Fatigue Trend Analysis Results

To evaluate the accuracy of the trained fatigue models, the mean absolute error (MAE) was used, which is one of the most appropriate ones to assess how far a predicted fatigue trend curve is from the actual one. MAEs were calculated for each of the train/test splits (for all the datasets), and an overall value was obtained by calculating the mean value of all the calculated MAEs. These values are presented in Table 3. The overall average MAE for all workers is 0.3670.
Our study also provides results regarding the most important sensors and features while testing the fatigue model. Each train/test split dataset with its five best features produced feature importance results. The importance of a feature is computed as the (normalized) total reduction of the criterion brought by that feature, which is also known as the “Gini importance” [19]. Sum scores for the best features of all eight datasets were calculated, and they are depicted in Table 4.
From the information in Table 4, we can see that the pelvis and sternum areas produce the most meaningful features regarding fatigue accumulation. This also became evident when additional model training was performed using only the features from pelvis and sternum sensors, resulting in an average MAE of 0.419, and further on, when only using features from the pelvis sensor, the average MAE was 0.47, resulting in a relatively small reduction in accuracy (0.3670 when using all sensors) even though data from majority of the sensors was not used. Also, interestingly, our proposed new PSD features are strongly present in the most important features, hinting their high usability in detecting fatigue trends compared to jerk values and standard machine-learning features. With additional model training without using the PSD related features, the average MAE worsens to 0.533. Figure 3 depicts all the effort trends predicted by the fatigue model (using all possible features), and the linearly interpolated actual expected values. The fatigue trend detection is well in line with the expected effort values (effort self-assessment collected from workers).
Regarding the model used, random forest regression was chosen after trials with other regression models. In trials, the same model parameters were used for other models as for random forest regression (where applicable); otherwise, default parameters were used. Regression itself was chosen as fatigue over time is, by nature, a continuous and cumulative trend depending on many input variables. Model trial results are depicted in Table 5.

4. Discussion

The research presented aims to answer the question: How can the fatigue trends of workers in real work environments be accurately detected with non-intrusive IMUs? The data from the construction site provided a solid foundation for the machine-learning work on IMU-based fatigue trend detection. Starting with the pre-processing, we chose the initial value-based normalization based on experiences presented in [20] and modified it to better represent the initial non-fatigue state. We tried also other data standardization approaches, but the normalization yielded best results. We also used Butterworth low-pass filtering as in [20] and found that a higher cutoff value of 18 Hz (often used around 10 Hz) provided more detailed features, especially considering our use of PSD features that rely on the frequency plane data. The data was windowed with the sliding window technique, with 12 min-long windows sliding 1 min at a time. Through experimentation, this was found to be the most accurate time-windowing approach compared to shorter time windows, or non-overlapping time-windowing methods.
The literature suggested that supervised machine-learning methods such as random forest [20,21] and regression models [7] are good starting points. Our experiments proved that random forest regression was the best model for fatigue trend detection purposes. Other models were tried, but with more inaccurate results, as depicted in Table 5.
The usage of jerk features is a common phenomenon in the fatigue detection literature like in [7,21], and our novel contribution is the use of frequency plane (PSD) peaks that represent the major frequencies of the movement. PSDs have been previously used in fatigue estimation as in [22,23], but to our knowledge, the PSD peaks have not been utilized in fatigue estimation before. Our proposed PSD peak-based features are strongly present in our most important feature listing, hinting at their high usability in detecting fatigue trends compared to jerk values and standard machine-learning features. Pelvis and sternum areas produced the most meaningful features regarding fatigue accumulation.
Cross-validation is a common method in machine learning, and in the IMU-based fatigue trend detection literature. Compared to work in [7], the strength of our fatigue model method is that all the datasets can be used in testing in cross-validation, not just choosing a couple of the datasets as the testing datasets. All the effort trends predicted by the fatigue model were well in line with the expected effort values and the overall average accuracy (MAE) for all workers is 0.3670. The authors in [7] achieved a MAE value of 2.16 for their fatigue trends. To be able to fully compare to this, we need to calculate the percentage of the MAE in the context of the used effort scale. The authors of [7] used a scale from 6 to 20 with 15 steps; thus, their percentage error was 2.16/15 = 0.144, resulting in a 14.4% MAE in the context of the used scale. Our effort scale was from 1 to 5 with 5 steps, and our percentage error was 0.3670/5 = 0.0734, resulting in a 7.34% MAE in the context of the scale used, thus markedly improving the state of the art regarding the accuracy of IMU-based detection of construction site fatigue trends.
Another fatigue trend detection example from the related field of manufacturing is where the authors [24] used a wearable system, including electrocardiogram (ECG), heart rate (HR), skin temperature and motion signals from the accelerometers and gyroscopes. Test subjects carried out simulated manufacturing tasks with a weighted vest to exaggerate the induced fatigue. The best fatigue trend MAE the authors achieved was 2.01 on a fatigue scale from 0 to 7 with 8 steps; thus, their percentage error was 2.01/8 = 0.25125, resulting in 25.13% [24].
When considering other fatigue trend detection methodologies, an approach [25] was presented by authors by using an electroencephalography (EEG) sensor to monitor brain activity and a photoplethysmography (PPG) sensor to monitor cardiovascular parameters. The authors introduced laboratory-induced fatigue with experimental procedures including cycling and jumping rope. Their ensemble model reached fatigue trend detection with average mean squared error (MSE) values of 0.0007 [25].
For comparison purposes, we calculated MSEs for our datasets between effort predictions by our model and the expected effort values, resulting in our approach having an average MSE of 0.25. This demonstrates that there is a considerable gap in accuracy between gold-standard sensors used in a laboratory by the authors of [25], and more practical sensors used in actual construction sites.
On the other hand, the authors in [26] presented an approach, where the research monitored fourteen construction workers’ fatigue in an on-site work on the repair and maintenance of a utility tunnel and gathered physiological measures and fatigue data simultaneously. Multiple sensors, including electrodermal activity (EDA), skin temperature (ST), and photoplethysmography (PPG), which can monitor heart rate (HR) and heart rate variability (HRV) were used (as opposed to our IMU-only approach). Correlation analysis between ratings of fatigue (ROF) and each physiological measure was performed, resulting in a best correlation coefficient value of 0.33 for EDA [26].
For comparison, we calculated correlation coefficients for our datasets between effort predictions by our model and the expected effort values, resulting in an average correlation coefficient of 0.38. Our approach was deemed to have slightly better accuracy results, but our work tasks were simulated, while the authors in [26] used more intrusive sensors, but tracked actual real-life work tasks. This demonstrates the special challenge of tracking real-world construction site fatigue, since the work tasks at the construction site are multifaceted and vary in pace. Some tasks might induce major fatigue (e.g., heavy hauling), while others do not necessarily manifest in physical fatigue (e.g., electric wiring, door lock installations, etc.) This challenge of gathering fatigue data from real-life environments also manifests itself in our approach, since fatigue levels 4 and 5 were not reported by the workers.
Besides presenting a novel approach that improves the state of the art regarding construction context, we would also emphasize that the strength of this research is that the TRL of the dataset and approach used in this paper is relatively high, around TRL 6, since it is from the actual construction site and from actual construction workers. For comparison, the authors in [7,20,21,25] use laboratory data, which is usually around TRL 4. The exception is [26], where the authors reached TRL 7.
Interesting and most certainly challenging future research would be on the utilization of IMUs and fatigue models in actual everyday construction site work in long-term tests, spanning several weeks at minimum. Organizing this kind of data gathering is highly challenging but would also provide valuable information regarding the most informative sensor locations and features in combination with the activities being performed. Such a dataset would also enable the exploration of personalized fatigue models, and their performance against a general fatigue model. Initially, we intended to use personalized fatigue models to gain more articulate fatigue modeling, but the results were not encouraging, partly due to the low number of different datasets from the same people. A more general fatigue model, where mixed datasets from different people are used for training (besides the test dataset in question during cross-validation) yielded the most encouraging results. In this more general approach, most people had data in training and test phases, besides workers 3 and 4, who had only one dataset and thus had no data in training during their testing phase. Interestingly, the MAEs of workers 3 and 4 were among the lowest, besides having no data in the training phase. Another interesting future work aspect would be the comparison of the hauling task to another kind of work to see possible differences in IMU-based fatigue trend detection accuracies. Although this research focuses on the construction industry, other industries such as steelworks, forestry and healthcare also face the same issues of physically demanding work that could benefit from the fatigue analysis.

5. Conclusions

Physical fatigue is found to be a major cause of work-related accidents and injuries in the construction industry, and additional means are needed for their identification and management. Fatigue can be detected with wearable IMUs, but a research gap exists in application of IMU sensors for detecting fatigue in real-life work settings. This research aims to bring the fatigue trend detection with IMUs closer to the real-life context by using a dataset with a TRL of 6, since it is from an actual construction site with actual construction workers.
We presented a novel contribution: the use of frequency plane (PSD) peaks as features in fatigue detection methodology. These peaks represent the major frequencies of the human movement and illustrate shift in the intensity of the movement. Our proposed PSD features are strongly present in the most important features listing, hinting at their high usability in detecting fatigue trends compared to jerk values and standard machine-learning features. Our approach also achieved promising MAE rates, markedly improving the state of the art regarding the accuracy of IMU-based construction site fatigue trend detection. More knowledge is also unearthed about the most relevant sensor locations and their machine-learning features.

Author Contributions

Conceptualization: J.S.K., S.-M.M. and C.D.N.; funding acquisition, S.-M.M. and C.D.N.; project administration, J.S.K. and C.D.N.; supervision, P.S., A.K. and D.G.C.; investigation, resources and data curation, J.A., S.L. and C.D.N.; formal analysis, J.S.K.; methodology, J.S.K. and P.S.; software, J.S.K., J.A. and S.L.; visualization, J.S.K.; writing—original draft preparation, J.S.K.; writing—review and editing, S.L., C.D.N., S.-M.M., J.A., A.K. and P.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the BEEYONDERS project funded by the European Union’s Horizon Europe research and innovation program under grant agreement N° 101058548.

Institutional Review Board Statement

The Istituto Italiano di Technologia’s experimental protocol was approved by the Ethical Committee of Liguria (protocol reference number: CER Liguria 001/2019) and complies with the Helsinki Declaration.

Informed Consent Statement

All the subjects signed a consent form prior to participating after receiving a full explanation of the experimental procedure.

Data Availability Statement

The datasets generated during the current study are available from IIT: christian.dinatali@iit.it on reasonable request.

Acknowledgments

The data gathering was supported by BEEYONDERS partner Fira Rakennus Oy by providing the field study infrastructure, environment, and study assistance at the site.

Conflicts of Interest

Authors Janne S. Keränen, Satu-Marja Mäkelä and Atte Kinnula were employed by the company VTT Technical Research Centre of Finland Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. U.S. Bureau of Labor Statistics. Nonfatal Occupational Injuries and Illnesses Requiring Days Away from Work 2015. 2016. Available online: https://www.bls.gov/news.release/pdf/osh2.pdf (accessed on 1 December 2025).
  2. Wong, F.K.; Chan, A.P.; Yam, M.C.; Wong, E.Y.; Tse, K.T.; Yip, K.K.; Cheung, E. Findings from a research study of construction safety in Hong Kong. J. Eng. Des. Technol. 2009, 7, 130–142. [Google Scholar] [CrossRef]
  3. Adane, M.M.; Gelaye, K.A.; Beyera, G.K.; Sharma, H.R.; Yalew, W.W. Occupational Injuries Among Building Construction Workers in Gondar City, Ethiopia. Occup. Med. Health Aff. 2013, 1, 5. [Google Scholar] [CrossRef]
  4. Umer, W.; Li, H.; Szeto, G.P.Y.; Wong, A.Y.L. Identification of Biomechanical Risk Factors for the Development of Lower-Back Disorders during Manual Rebar Tying. J. Constr. Eng. Manag. 2017, 143, 04016080. [Google Scholar] [CrossRef]
  5. Anwer, S.; Li, H.; Antwi-Afari, M.F.; Umer, W.; Wong, A.Y.L. Evaluation of physiological metrics as real-time measurement of physical fatigue in construction workers: State-of-the-art review. J. Constr. Eng. Manag. 2021, 147, 03121001. [Google Scholar] [CrossRef]
  6. Zhang, L.; Diraneyya, M.M.; Ryu, J.; Haas, C.T.; Abdel-Rahman, E.M. Jerk as an indicator of physical exertion and fatigue. Autom. Constr. 2019, 104, 120–128. [Google Scholar] [CrossRef]
  7. Maman, Z.S.; Yazdi, M.A.A.; Cavuoto, L.A.; Megahed, F.M. A data-driven approach to modeling physical fatigue in the workplace using wearable sensors. Appl. Ergon. 2017, 65, 515–529. [Google Scholar] [CrossRef] [PubMed]
  8. Zhang, J.; Lockhart, T.E.; Soangra, R. Classifying lower extremity muscle fatigue during walking using machine learning and inertial sensors. Ann. Biomed. Eng. 2014, 42, 600–612. [Google Scholar] [CrossRef] [PubMed]
  9. da Cunha, F.A.; Farinatti, P.T.; Midgley, A.W. Methodological and practical application issues in exercise prescription using the heart rate reserve and oxygen uptake reserve methods. J. Sci. Med. Sport 2011, 14, 46–57. [Google Scholar] [CrossRef] [PubMed]
  10. Häikiö, J.; Kallio, J.; Mäkelä, S.-M.; Keränen, J.S. IoT-based safety monitoring from the perspective of construction site workers. Int. J. Occup. Environ. Saf. 2020, 4, 1–14. [Google Scholar] [CrossRef]
  11. Mäkelä, S.-M.; Lämsä, A.; Keränen, J.S.; Liikka, J.; Ronkainen, J.; Peltola, J.; Häikiö, J.; Järvinen, S.; López, M.B. Introducing VTT-ConIot: A Realistic Dataset for Activity Recognition of Construction Workers Using IMU Devices. Sustainability 2022, 14, 220. [Google Scholar] [CrossRef]
  12. Leggieri, S.; Sencandan, P.; Fanti, V.; Ahmad, J.; Caldwell, D.G.; Natali, C.D. The Smart Suit: A novel tool platform for learning and assessment at work. In Proceedings of the 5th IFSA Winter Conference on Automation, Robotics & Communications for Industry 4.0/5.0 (ARCI’ 2025), Granada, Spain, 19–21 February 2025. [Google Scholar]
  13. BEEYONDERS Project Plan & Problem Statement. Available online: https://beeyonders.eu/project/ (accessed on 1 December 2025).
  14. Ligorio, G.; Bergamini, E.; Truppa, L.; Guaitolini, M.; Raggi, M.; Mannini, A.; Sabatini, A.M.; Vannozzi, G.; Garofalo, P.A. Wearable Magnetometer-Free Motion Capture System: Innovative Solutions for Real-World Applications. IEEE Sens. J. 2020, 20, 8844–8857. [Google Scholar] [CrossRef]
  15. Sochopoulos, A.; Poliero, T.; Ahmad, J.; Caldwell, D.G.; Di Natali, C. Human activity recognition algorithms for manual material handling activities. Sci. Rep. 2025, 15, 10954. [Google Scholar] [CrossRef]
  16. Leggieri, S.; Vasco, F.; Caldwell, D.G.; Di Natali, C. Online ergonomic evaluation in realistic manual material handling task: Proof of concept. Bioengineering 2023, 11, 14. [Google Scholar] [CrossRef] [PubMed]
  17. Welch, P. The use of the fast Fourier Transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 1967, 15, 70–73. [Google Scholar] [CrossRef]
  18. Sckit-Learn: Univariate Feature Selection. Available online: https://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection (accessed on 1 December 2025).
  19. Sckit-Learn: Random Forest Regressor. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html (accessed on 1 December 2025).
  20. Pinto-Bernal, M.J.; Cifuentes, C.A.; Perdomo, O.; Rincón-Roncancio, M.; Múnera, M.A. Data-Driven Approach to Physical Fatigue Management Using Wearable Sensors to Classify Four Diagnostic Fatigue States. Sensors 2021, 21, 6401. [Google Scholar] [CrossRef]
  21. Maman, Z.S.; Chen, Y.-J.; Baghdadi, A.; Lombardo, S.; Cavuoto, L.A.; Megahed, F.M. A data analytic framework for physical fatigue management using wearable sensors. Expert Syst. Appl. 2020, 155, 113405. [Google Scholar] [CrossRef]
  22. Biró, A.; Cuesta-Vargas, A.I.; Szilágyi, L. AI-Assisted Fatigue and Stamina Control for Performance Sports on IMU-Generated Multivariate Times Series Datasets. Sensors 2024, 24, 132. [Google Scholar] [CrossRef]
  23. Moyen-Sylvestre, B.; Goubault, É.; Begon, M.; Côté, J.N.; Bouffard, J.; Dal Maso, F. Power Spectrum of Acceleration and Angular Velocity Signals as Indicators of Muscle Fatigue during Upper Limb Low-Load Repetitive Tasks. Sensors 2022, 22, 8008. [Google Scholar] [CrossRef]
  24. Mohapatra, P.; Aravind, V.; Bisram, M.; Lee, Y.-J.; Jeong, H.; Jinkins, K.; Gardner, R.; Streamer, J.; Bowers, B.; Cavuoto, L.; et al. Wearable network for multilevel physical fatigue prediction in manufacturing workers. Proc. Natl. Acad. Sci. USA 2024, 3, pgae421. [Google Scholar] [CrossRef] [PubMed]
  25. Kim, B.; Preethaa, K.R.S.; Song, S.; Lukacs, R.R.; An, J.; Chen, Z.; An, E.; Kim, S. Internet of things and ensemble learning-based mental and physical fatigue monitoring for smart construction sites. J. Big Data 2024, 11, 115. [Google Scholar] [CrossRef]
  26. Umer, W.; Yu, Y.; Afari, M.F.A.; Anwer, S.; Jamal, A. Towards automated physical fatigue monitoring and prediction among construction workers using physiological signals: An on-site study. Saf. Sci. 2023, 166, 106242. [Google Scholar] [CrossRef]
Figure 1. Frequency peaks of power spectral density of the one time-window of real worker IMU data.
Figure 1. Frequency peaks of power spectral density of the one time-window of real worker IMU data.
Sensors 25 07455 g001
Figure 2. Flowchart depicting the algorithmic process.
Figure 2. Flowchart depicting the algorithmic process.
Sensors 25 07455 g002
Figure 3. Resulting effort predictions and expected values for all hauling datasets.
Figure 3. Resulting effort predictions and expected values for all hauling datasets.
Sensors 25 07455 g003
Table 1. Overall view on the hauling task datasets.
Table 1. Overall view on the hauling task datasets.
DayWorkerDataset Duration
1129.2 min
1215.3 min
2216.6 min
3132.2 min
3326.1 min
4430.2 min
4520.4 min
5533.0 min
Table 2. Used effort values and their explanations.
Table 2. Used effort values and their explanations.
Force Exerted During ActivityExplanation
1Low
2Normal
3Moderate
4High
5Very high
Table 3. Accuracies for all the datasets.
Table 3. Accuracies for all the datasets.
DayWorkerData Split MAE
110.6082
120.3915
220.7070
310.2441
330.2724
440.2264
450.2432
550.2428
Average MAE for all workers0.3670
Table 4. The most important sensors and features.
Table 4. The most important sensors and features.
Sensor Location and FeatureSum Feature Importance
Pelvis median2.452521
Sternum PSD power weighed sum1.900824
Pelvis 8. PSD peak1.335845
Right upper arm median0.786098
Right upper arm 4. PSD peak0.501262
Pelvis jerk standard deviation0.354390
Right thigh jerk mean0.154660
Right upper arm 75 percentile0.149535
Right upper arm 2. PSD peak0.138543
Right thigh 7. PSD peak0.136377
Left upper arm 4. PSD peak0.083994
Left thigh 2. PSD peak0.005953
Table 5. Accuracies for the different tried regression models.
Table 5. Accuracies for the different tried regression models.
Model TriedAverage MAE for all Train/Test Splits
Random forest regression0.366951
Gradient-boosting regression0.414846
Decision tree regression0.432119
Lasso regression0.471001
Support vector regression0.478087
Ridge regression0.561196
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Keränen, J.S.; Ahmad, J.; Leggieri, S.; Mäkelä, S.-M.; Caldwell, D.G.; Di Natali, C.; Kinnula, A.; Siirtola, P. Machine-Learning-Based Fatigue Trend Analysis on IMU Wearable Sensor Data from Construction Site Workers. Sensors 2025, 25, 7455. https://doi.org/10.3390/s25247455

AMA Style

Keränen JS, Ahmad J, Leggieri S, Mäkelä S-M, Caldwell DG, Di Natali C, Kinnula A, Siirtola P. Machine-Learning-Based Fatigue Trend Analysis on IMU Wearable Sensor Data from Construction Site Workers. Sensors. 2025; 25(24):7455. https://doi.org/10.3390/s25247455

Chicago/Turabian Style

Keränen, Janne S., Jamil Ahmad, Sergio Leggieri, Satu-Marja Mäkelä, Darwin G. Caldwell, Christian Di Natali, Atte Kinnula, and Pekka Siirtola. 2025. "Machine-Learning-Based Fatigue Trend Analysis on IMU Wearable Sensor Data from Construction Site Workers" Sensors 25, no. 24: 7455. https://doi.org/10.3390/s25247455

APA Style

Keränen, J. S., Ahmad, J., Leggieri, S., Mäkelä, S.-M., Caldwell, D. G., Di Natali, C., Kinnula, A., & Siirtola, P. (2025). Machine-Learning-Based Fatigue Trend Analysis on IMU Wearable Sensor Data from Construction Site Workers. Sensors, 25(24), 7455. https://doi.org/10.3390/s25247455

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop