Systematic Review

Intention Prediction for Active Upper-Limb Exoskeletons in Industrial Applications: A Systematic Literature Review

by Dominik Hochreiter 1,*,†, Katharina Schmermbeck 2,*,†, Miguel Vazquez-Pufleau 1 and Alois Ferscha 1

1 Institute of Pervasive Computing, Johannes Kepler University, 4040 Linz, Austria
2 Chair of Production Technology, Institute of Mechatronics, University of Innsbruck, 6020 Innsbruck, Austria
* Authors to whom correspondence should be addressed.
† These authors contributed equally to this work.
Sensors 2025, 25(17), 5225; https://doi.org/10.3390/s25175225
Submission received: 18 July 2025 / Revised: 14 August 2025 / Accepted: 14 August 2025 / Published: 22 August 2025
(This article belongs to the Section Sensors and Robotics)

Abstract

Intention prediction is essential for enabling intuitive and adaptive control in upper-limb exoskeletons, especially in dynamic industrial environments. However, the suitability of different cues, sensors, and computational models for real-world industrial applications remains unclear. This systematic review, conducted according to PRISMA guidelines, analyzes 29 studies published between 2007 and 2024 that investigate intention prediction in active exoskeletons. Most studies rely on motion capture (14) and electromyography (14) to estimate joint torque or trajectories, predicting from 450 ms before to 660 ms after motion onset. Approaches include model-based and model-free regression, as well as classification methods, but vary significantly in complexity, sensor setups, and evaluation procedures. Only a subset evaluates usability or support effectiveness, often under laboratory conditions with small, non-representative participant groups. Based on these insights, we outline recommendations for robust and adaptable intention prediction tailored to industrial task requirements. We propose four generalized support modes to guide sensor selection and control strategies in practical applications. Future research should leverage wearable sensors, integrate cognitive and contextual cues, and adopt transfer learning, federated learning, or LLM-based feedback mechanisms. Additionally, studies should prioritize real-world validation, diverse participant samples, and comprehensive evaluation metrics to support scalable, acceptable deployment of exoskeletons in industrial settings.

1. Introduction

Exoskeletons are employed in industrial work settings to relieve specific body parts from physical stress by facilitating, adding, enabling, enhancing, or stabilizing movements [1]. Their primary objective is to lower the risk of work-related musculoskeletal disorders (WMSDs) [2,3], which are especially common in the upper limbs [4]. The effectiveness of an industrial exoskeleton depends on its ability to meet the dynamic requirements of the user, the task, and the environment. The support characteristic of an exoskeleton is described by the available support curves, which, for example, define the relationship between joint angle and the corresponding support force or torque (see Figure 1) [5].
While passive exoskeletons provide fixed support characteristics that cannot be changed during use, the actuation of active exoskeletons allows real-time adjustments. Systems that adapt their support automatically without user input are referred to as adaptive exoskeletons [6]. Developing such systems for upper-limb industrial use is challenging due to the need to accommodate frequent task and posture changes over long work periods [7,8]. Integrating intention prediction offers a promising approach to meet these challenges by enabling timely and appropriate support adjustments based on user behavior, somatic cues, or contextual information. This can reduce the user’s cognitive load and eliminate the need for disruptive manual tuning. In human–robot collaboration, intention prediction has already been applied to improve safety and task coordination [9,10,11]. Similarly, in rehabilitation exoskeletons, detecting residual motor function to guide support is critical for functionality [12,13]. However, despite this progress, intention prediction has received limited attention in industrial contexts, where exoskeletons remain largely passive and non-adaptive, often limiting their usability and effectiveness.
This review aims to provide the first structured overview of intention prediction methods relevant to active industrial exoskeletons and to examine the transferability of rehabilitation-based approaches to industrial applications. Following the PRISMA guidelines for systematic reviews [14], we address two central research questions. How is intention prediction implemented? We address this by analyzing sensor modalities, applied methods, and models used to obtain control outputs, as well as prediction times. Why is intention prediction implemented? We address this by considering the desired support characteristics of systems and the evaluation methodologies. This concise compilation identifies common strategies with their strengths and weaknesses and offers a foundation for guiding future research on intent-aware industrial exoskeletons.

2. Background

2.1. Active Upper-Limb Industrial Exoskeletons

Exoskeletons are wearable systems that provide support to specific body parts to add, facilitate, enable, enhance, or stabilize movements [1]. For industrial applications, they are employed to support human operators during physically demanding tasks, such as heavy material handling or overhead work. Although exoskeletons exist for various body parts, devices targeting the shoulders, back, and elbows are most prevalent in industrial contexts, as these areas are commonly affected by work-related musculoskeletal disorders (WMSDs) [4]. Empirical studies have shown that systems are able to reduce acute physical stress and strain in targeted body areas [2,3], thereby lowering risk factors associated with WMSDs. However, usability and acceptance remain limited due to a range of factors. In addition to issues related to comfort, social acceptance, and general usability [15,16], many industrial exoskeletons are developed for narrowly defined support contexts, tailored to a specific user, task, and environment [2,17]. While passive and active exoskeletons can be mechanically adapted to individual anthropometry prior to use, active systems incorporate actuators (e.g., electric or pneumatic) and control units that enable runtime adjustment of support forces or torques. These adjustments can be triggered manually or automatically via sensor data, using model-based or data-driven approaches [5]. Adaptive exoskeletons, i.e., those with automatic support adjustment, can enhance user experience and support effectiveness by minimizing unexpected or hindering support behavior during operation.
Recent research explores a range of approaches to detect changing support requirements in exoskeletons, leveraging environmental cues, kinematic or physiological signals from the human operator, or multimodal combinations [18]. The intended system behavior is inferred using models ranging from simple linear functions to complex biomechanical models, with control outputs generated by downstream system-level controllers. A comprehensive strategy for achieving adaptive support is to leverage users’ intentions, i.e., users’ future actions. While intention prediction is essential for the functionality of rehabilitation exoskeletons and has been intensively studied in recent years [13,19,20], its application in industrial exoskeletons remains in its infancy. In particular, it remains unclear which approaches from other exoskeleton application fields can be applied to real-world industrial contexts, as these introduce additional requirements: short sensor setup times, higher movement velocities that demand low control latency, the dynamics of human movements ranging from fine manipulation to high force compensation, and constant interaction with the external environment [7,8]. Moreover, as stated by Gantenbein et al. [18], today’s research on the effects of exoskeletons lacks standardized procedures, including clearly defined metrics and protocols. This lack of uniformity, together with the scarcity of publicly available datasets, complicates the comparison and benchmarking of different approaches, impeding the systematic advancement of the field.

2.2. Intention Prediction

To enable proactive control of an exoskeleton, a computational model must effectively learn or predict the user’s intentions, dynamically adjusting support characteristics to meet evolving demands from the user, task, and environment. The distinction between intention recognition and prediction can be difficult due to their overlapping concepts and methods [21]. While intention recognition focuses on understanding current goals, prediction anticipates future intentions. In our structural analysis, we distinguish between biological and non-biological cues, a distinction that proved central to our findings. Muscle activity, for example, is a biological cue that can be measured before movement onset. Such cues act as input patterns that can be used to predict intentions. A variety of biological (somatic) cues can be observed, such as heart rate [22,23], skin conductance and muscle activity [24,25], gaze [26], photoplethysmography, electrodermal activity, brain oxygenation [27], and energy expenditure. Predicting emotional states through facial expressions, vocal cues, or physiological responses provides valuable insights [28,29], and cognitive context involves users’ mental state [30]. Non-biological cues consist of behavioral, social [31,32,33,34], contextual [35,36,37], and habitual patterns [38]. They include visible, intentional, and observable actions or motions, such as gestures [39,40], eye movements [26,41], or series of whole-body movements [42]. Analyzing movements can offer predictive cues about intentions and future actions [43,44].
Cue signals for kinematic data, for example, are monitored and captured using sensor technologies such as inertial measurement units (IMUs) or multiple inertial measurement units (M-IMUs). Applied forces can be measured using pressure sensors integrated into gloves or interfaces. Vision-based sensors, such as cameras, depth cameras, light detection and ranging (LiDAR) [45], or marker-based motion capture systems (MoCap), enable gesture or motion recognition and spatial mapping. Surface electromyography (sEMG) measures muscle activity to infer intended movement [46] or fatigue, while electroencephalography (EEG) records brain activity to detect cognitive processes. Galvanic skin response (GSR) sensors detect changes in skin conductance associated with emotional arousal. In vivo imaging techniques, like functional near-infrared spectroscopy (fNIRS) [27] and functional magnetic resonance imaging (fMRI), visualize brain activity to understand neural correlates of intention, while CMOS and low-cost semiconductor lasers detect vascular structures [18,47].
Computational and algorithmic methods use captured signals to infer intent. These signals are typically pre-processed and filtered before being fed into an algorithm to produce a discrete or continuous output. A distinction can be made between model-free, i.e., data-driven, and model-based approaches [11], the modalities used [18], and the prediction horizon. Implementations range from classical machine learning (SVM, decision trees, random forests) to deep neural network models [48]. For temporal inference of intentions or tasks, sequence models such as time series classification (TSC) can provide predictions, including recurrent neural network (RNN) models that capture the dynamics of sequences. RNN architectures such as long short-term memory (LSTM), extended LSTM (xLSTM) [49], and gated recurrent units (GRUs) model sequential dependencies, while transformers use self-attention mechanisms to weigh the importance of different patterns in sequential applications.
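Whatever classifier or sequence model is used, these pipelines typically share a common front end: the continuous sensor stream is segmented into overlapping windows from which features are extracted. The sketch below illustrates this (it is not taken from any reviewed study; the window and step sizes are hypothetical choices):

```python
import math

def sliding_windows(signal, win, step):
    """Yield overlapping windows of length `win`, advanced by `step` samples."""
    for start in range(0, len(signal) - win + 1, step):
        yield signal[start:start + win]

def window_features(window):
    """Mean absolute value (MAV) and root mean square (RMS),
    two features commonly used in EMG pipelines."""
    mav = sum(abs(x) for x in window) / len(window)
    rms = math.sqrt(sum(x * x for x in window) / len(window))
    return mav, rms

# Example: a 1 s stream at 100 Hz, 200 ms windows with 50% overlap
stream = [math.sin(0.1 * i) for i in range(100)]
feats = [window_features(w) for w in sliding_windows(stream, win=20, step=10)]
```

The resulting feature vectors would then be fed to any of the models named above, from an SVM to an LSTM.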

3. Research Objectives and Methodology

Given these challenges and the fragmented state of current research, it is necessary to systematically evaluate existing intention prediction approaches across application domains to assess their relevance and applicability to industrial exoskeletons. To address this gap, the present review investigates intention prediction models used in upper-limb exoskeletons spanning rehabilitation, assistive, and industrial contexts, with the goal of identifying approaches suitable for real-world deployment in industrial upper-limb exoskeletons. The following section elaborates on the research questions we posed, the literature search, and the PRISMA methodology [14] applied, including search strings, eligibility criteria, and analysis dimensions. This systematic review has not been registered.

3.1. Objectives and Dimensions

Two central research questions were formulated to explore the relevance and applicability of intention prediction research. First, we investigated how intention prediction is implemented, focusing on the types of sensors and computational methods applied, and the temporal characteristics of prediction. Secondly, we explored why researchers use intention prediction for exoskeletons, thereby analyzing target users, evaluation methods, and ways the system adapts based on the predicted intent. To address these research questions systematically during analysis, we defined seven analysis dimensions and formulated guiding sub-questions for each (e.g., A1: “What intention cues are used?”). These sub-questions were not additional research questions but served as a guideline during the review to ensure consistent extraction of relevant information. When a paper addressed or answered a sub-question, it was assigned to the corresponding analysis dimension, and relevant passages were highlighted for later synthesis. Table 1 provides an overview of the main research questions, their guiding sub-questions, and the associated analysis dimensions used to categorize and analyze the selected papers.

3.2. Methodology

This systematic literature review was conducted following the PRISMA guidelines [14], a widely accepted method for systematic and reproducible reviews, in order to reduce the risk of missing data and of introducing bias into our analysis. The data of the complete search process can be found in the Supplementary Materials.

3.2.1. Eligibility Criteria

The inclusion and exclusion criteria, shown in Figure 2, were chosen to ensure that the records’ content fit our research objectives. Prior to the assessment with eligibility criteria, records were excluded if they were not written in English or were workshop articles, from non-scientific literature, or duplicates.
The objective of this review was to identify sensors, models, and evaluation approaches for proactive control in exoskeletons that are transferable to industrial applications. Studies were excluded if their findings could not be directly applied to such control. This included work on intention prediction not designed for exoskeleton control (EC1), studies limited to sensor development (EC7) or algorithm benchmarking (EC2) without application context, and those presenting only preliminary results (EC6). Papers that did not use human poses as input (EC3) were excluded, as industrial exoskeleton support depends on the user’s posture, which defines the load on the joint. Likewise, studies were excluded if they did not propose how their intention detection could be used for control or lacked the timing information necessary to evaluate predictive feasibility (EC4). Applicability to industrial contexts (EC5) was a key criterion that was discussed among the three first authors; in cases of disagreement, studies were retained to avoid missing potentially relevant insights. We excluded reports presenting systems developed solely for rehabilitation purposes, where algorithms were designed for users with little or no movement capability. Moreover, studies presenting lower-limb exoskeletons were critically discussed, as these devices are rarely used in industry and are typically limited to sitting or stair-carrying tasks. Exceptions were made when computational models or evaluation techniques appeared transferable, as in [50,51]. Finally, papers were excluded if their scientific quality was insufficient, for example, due to unclear argumentation, lack of methodological detail, or implausible results.

3.2.2. Databases and Search Strategy

All databases used and a general search query are listed in Figure 2, together with the number of records found. Where the options were available, records were filtered for English language, the time span was restricted from 2004 to the day of the search (3 February 2024), and a search excluding words in references was selected. The search string, which was adapted to each database, was developed and discussed among the authors.

3.2.3. Selection Process

The screening of titles, abstracts, and full texts was conducted by the first three authors. During the title and abstract screening, each author individually decided whether a record met the criteria, and unclear records were discussed. The full-text screening was performed by two authors independently; exclusion criteria were recorded and subsequently discussed. This resulted in 29 relevant records published from 2007 to 2024.

3.2.4. Data Collection and Synthesis

The data extraction was organized by analyzing the final paper selection along seven comparison dimensions, each of which was assigned to a responsible author. Table 1 gives an overview of the dimensions and their relation to the research questions, while Table 2 lists all resulting records and the respective analysis dimensions.
Due to substantial heterogeneity within our dimensions of analysis, a quantitative meta-analysis was not feasible. The included studies employed diverse model approaches (Figures 4 and 5) and only partially overlapping outcome measures (see Appendix A.2, Table A2), while several lacked extractable numerical results or were purely descriptive or review papers. Under these conditions, any statistical pooling would have risked producing misleading or non-interpretable results. In line with the PRISMA 2020 guidance for systematic reviews without synthesis [14], we therefore applied a narrative synthesis and descriptive benchmarking approach, grouping comparable methods to facilitate cross-study comparison while preserving the diversity of approaches in this field.

4. Results

This section summarizes the results of the selected publications, shown in Table 2 along with the dimensions of our analysis. Figure 3 shows an integrated view of the distribution of methods for the entire intention prediction process, from the source of the input to the supported limb.

4.1. Intention Cues

User intent in human motor control involves the central nervous system (CNS), peripheral nervous system (PNS), musculoskeletal system, and environmental interactions, each with measurable state variables reflecting intention [62]. These variables are interconnected; for example, the PNS sends sensory data to the CNS, which plans motion and sends commands back.
The reviews and collections included in this review [11,18,62,74] presented further definitions and classifications of intention prediction cues. The review by Gantenbein et al. [18] addressed techniques for recognizing and estimating intention in the analysis of usability, daily life applications, and user evaluation studies. For this purpose, they adapted the classification scheme of [76] to measure the distribution of input signal source, physiological phenomenon, signal to observer, and sensor technology among the selected studies. In [74], we found a discriminative criterion for sEMG-based motion intention recognition methods, which can be divided into two groups: musculoskeletal model-based and machine learning-based. Similarly, for EMG-based continuous motion prediction models, we found a distinction between model-based and model-free approaches [11]. The review by [62] structured the communication of intent from user to robot into identification, measurement, and interpretation.
We organized the cues and selected methods in terms of periodicity, as shown in Figure 6, and sorted them according to their use. Intention cues were either measured for (i) intention and activity prediction, (ii) activity and movement onset and offset detection, or (iii) intention and activity classification.

4.1.1. Intention and Activity Prediction

We found work that used eye movements as input [66], assuming that patients always fixated their gaze approximately on the target of a reaching movement. In terms of motion prediction, biological signals and muscle activity measured with EMG [11,68] could be used for acceleration and velocity prediction, joint angle and position prediction, and joint torque prediction [53]. Kinematic data and segmentation techniques, with IMU data, could be used to predict a worker’s intended motion; they required a minimum observation window for proactive control, such as 200 ms [72].

4.1.2. Detect Onset and Offset of Activity

The work of Lotti et al. [63] presented a short-term, muscle-based intention estimator. The EMG-driven modeling was connected to the low-level controller of the exosuit. A subsequent comparative study [64] used a mechanical intrinsic controller, which was slower to react, but the exclusion of biosignals from the control framework made the reference torque more stable.
A study on short-term intention estimation in realistic overhead work tasks [75] evaluated the performance of kinematic control and EMG control. They used the user’s reaction time to the given cues as the reference for evaluating the controller’s reaction time in all trials. The onset response of the EMG controller was on average 0.17 s faster than that of the kinematics controller. The study by Heo et al. [57] used sEMG sensors to monitor muscle activities for onset estimation in two types of activities, squatting and bending, and compared the estimation latency between sEMG sensors and kinematic sensors, i.e., IMU-based motion capture. Generally, EMG assistance was triggered earlier than kinematic assistance, and this latency advantage even increased with the weight lifted, because heavier external loads reduced the lifting speed, delaying the onset time of kinematic-based lift detection. In their study [67], a continuation of [66], Novak and Riener presented EMG and torque sensors for onset detection, with an offline-trained prediction model.
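Threshold-based onset detectors of the kind discussed above can be sketched as follows. This is a generic illustration, not the algorithm of any cited study; the baseline length, multiplier `k`, and sustained-sample count are hypothetical tuning values that real systems adjust per user and muscle:

```python
import math

def detect_onset(envelope, baseline_len=50, k=3.0, min_above=5):
    """Return the first index where the signal envelope exceeds
    baseline mean + k * std for `min_above` consecutive samples,
    or None if no sustained burst is found."""
    base = envelope[:baseline_len]
    mean = sum(base) / len(base)
    std = math.sqrt(sum((x - mean) ** 2 for x in base) / len(base))
    thresh = mean + k * std
    run = 0
    for i, x in enumerate(envelope[baseline_len:], start=baseline_len):
        run = run + 1 if x > thresh else 0
        if run >= min_above:
            return i - min_above + 1  # first sample of the sustained burst
    return None

# Quiet, slightly noisy baseline followed by a burst at sample 80
signal = [0.04, 0.06] * 40 + [1.0] * 40
onset = detect_onset(signal)
```

Requiring several consecutive supra-threshold samples trades a few samples of latency for robustness against isolated noise spikes.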
Intention to stand up can be observed at the ankle, knee, and hip angles using goniometers and pressure sensors placed on the sole of the foot [50]. The work of [55] involved developing a human–machine interface based on a wrist-level force sensor; minimal force detection in the six directions in 3D space represented the input to obtain the desired incremental movement of the end-effector in space.

4.1.3. Intention and Activity Classification

The work of Irastorza-Landa et al. [59] developed a classification system for predicting movement intention and intended direction in point-to-point horizontal-plane reaching movements based on the EMG activity of five upper arm muscles of healthy subjects. A classification accuracy of over 70% was achieved in offline evaluations. The kinematic approach of [72] used IMU motion data; a movement intention and sequence prediction model was verified by training an MLP model and predicting the motion sequence of new data. Bandara et al. [52] proposed a non-invasive BCI approach to study the dynamic features of EEG signals occurring during ADL tasks. In an offline analysis, the activated brain regions and frequency ranges were identified for each of the intended user movements, both for individual users and for a common model across all users. The identified signals were then used in real time as input to neural networks and SVM-based classifiers to predict intended movements. The work on eye movement [67] reported target prediction using gaze position alone; in an analysis, the authors found that context alone could predict the target area about 50–60% of the time.
Significantly improving the prediction and classification of human intentions requires the integration of biological signals, such as EMG and EEG, alongside sensing technologies and ML models (see Section 4.3). Although kinematic approaches are slower than EMG-based systems, they provide valuable and robust additional information for understanding movement dynamics.

4.2. Sensors and Signals

To explore how intention cues are captured and which signals are used in predictive models, we analyzed the sensors and physiological signals across studies, distinguishing biological signals originating from neuromuscular or cognitive activation from non-biological signals reflecting external or motion-related characteristics. This separation streamlines the categorization of appropriate sensors for specific applications by taking factors such as precision, response time, and stability into account, and it avoids the added complexity of overlapping categories. Further, this distinction aligns with the practical challenges and requirements of industrial exoskeletons, such as setup time, calibration needs, and environmental robustness.

4.2.1. Biological Signals

Biological signals have the advantage of detecting neuromuscular activation before actual movement onset [68], as shown in Figure 6. There are different signal types, such as muscle activity, eye movement, and brain activity, that are commonly measured using electrodes, with EMG and EEG being the primary modalities.
Portable dry-electrode EMG devices improve user comfort and reduce setup time but often suffer from lower signal-to-noise ratios compared to wet electrodes [11]. Wearable devices like the Myo armband with multiple dry electrodes have been successfully used to estimate joint angles and angular velocities of the shoulder and elbow [69] and movement classes, i.e., rest, flexion, extension, and external load [71]. The majority of studies, however, use wet bipolar Ag/AgCl electrodes. As shown in Table A1, the biceps and triceps brachii are frequently used for estimating elbow torque, while more complex estimations incorporate deltoid or back muscles [53,68]. When applying data-driven classification models, studies often use up to 12 EMG channels. Chen et al. [54], for instance, classified arm motions such as drinking or reaching. Simpler approaches detect lifting motion using leg muscles [57] or arm muscles [75], motion onset [50,67], or reaching directions [59].
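Regardless of electrode type, raw EMG is typically full-wave rectified and smoothed into an activation envelope before torque estimation or classification. A minimal sketch of this standard preprocessing step, with an illustrative window length not drawn from any cited study:

```python
def emg_envelope(raw, win=15):
    """Full-wave rectify a raw EMG trace, then smooth it with a
    causal moving average of `win` samples to obtain an envelope."""
    rect = [abs(x) for x in raw]
    env = []
    acc = 0.0
    for i, x in enumerate(rect):
        acc += x
        if i >= win:
            acc -= rect[i - win]          # drop the sample leaving the window
        env.append(acc / min(i + 1, win))  # shorter average during warm-up
    return env
```

The envelope, rather than the oscillating raw signal, is what threshold detectors and regression models usually consume.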
In addition to myoelectric signals, one study used EEG to infer intended arm movement in 3D space from brain activity [52], while others employed eye tracking to identify reaching goals based on gaze direction [66,67].
In summary, EMG can detect neuromuscular activation before movement onset, especially from the biceps and triceps, and is the most commonly used technique to estimate torque, movement classes, and joint kinematics. This method is advantageous for its fast response times. Although portable dry-electrode devices have improved usability, wet electrodes remain dominant. However, both types of electrodes lack environmental robustness and require calibration. Studies often use multiple EMG channels or combine modalities, such as EEG and eye tracking, to infer more complex movement intentions [66]. This can contribute to stability and precision.

4.2.2. Non-Biological Signals

Kinematic and dynamic sensors are commonly used to capture motion-related information such as acceleration, velocity, orientation, position, and joint or limb forces. IMUs, which typically combine accelerometers, gyroscopes, and magnetometers, are widely used to estimate joint angles or body segment orientations in 3D space. All reviewed studies calculated global angles and angular velocities either through the IMU’s internal algorithms [51,57,64,70,72,75] or external methods such as the Mahony algorithm, which is robust to sensor orientation [65].
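The fusion principle behind such algorithms can be conveyed with a far simpler complementary filter: the gyroscope integral is smooth but drifts, the accelerometer tilt estimate is noisy but drift-free, and a weighted blend keeps the best of both. This is only an illustration of the principle, not the Mahony filter itself; the gain `alpha` and the 100 Hz sample rate are hypothetical:

```python
def complementary_filter(gyro_rates, accel_angles, dt=0.01, alpha=0.98):
    """Fuse gyroscope angular rates [rad/s] with accelerometer-derived
    tilt angles [rad] into a single segment-angle trajectory."""
    angle = 0.0
    out = []
    for w, a in zip(gyro_rates, accel_angles):
        # High-pass trust in the integrated gyro, low-pass trust in accel
        angle = alpha * (angle + w * dt) + (1 - alpha) * a
        out.append(angle)
    return out

# Static segment: zero rotation rate, accelerometer reads ~0.5 rad;
# the estimate converges to the drift-free accelerometer angle
fused = complementary_filter([0.0] * 300, [0.5] * 300)
```

Production systems replace this with quaternion-based filters, but the trade-off they manage is the same.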
Some approaches leverage sensors already embedded in exoskeletons. Lotti et al. [63] used the electrical motor encoder to extract elbow joint positions, while Dinh et al. [7] integrated a flex sensor to monitor elbow bending. For torque prediction, force sensors or load cells are frequently embedded at the user–exoskeleton interface to detect external loads and movement directions [53,55,58,60,66,67]. Two studies employed vision-based systems using IR depth sensors (Kinect® v2) to estimate user poses [56,61].
As observed in the reviewed studies, IMUs offer high flexibility in placement and can transmit data via wireless protocols, making them well-suited for mobile setups and various body segments. In contrast, force sensors and embedded motor sensors like encoders are structurally integrated into the system and require wired connections, but they provide highly precise, low-drift measurements that are advantageous for stable and accurate angle and interaction force determination.

4.3. Computation and Models

Researchers apply classification or regression models to predict user movements or joint torques as discrete or continuous outputs. For continuous prediction, methods are often categorized into model-based and model-free approaches [11,74]. Model-based techniques rely on biomechanical or kinematic knowledge to analytically link sensor signals with joint angles or torques. In contrast, model-free approaches use data-driven techniques, such as neural networks (NNs), to learn input–output relationships.

4.3.1. Classification

As shown in Figure 4, eight of the analyzed research papers used classification to detect user intent or activity. Zhou et al. [75] implemented a binary, threshold-based classifier to distinguish between support and non-support states. They found that IMU-based control achieved higher accuracy than EMG-based control. Other works combined binary classifiers with movement direction prediction. Irastorza-Landa et al. [59] used an EMG-based support vector machine (SVM) to detect onset and reaching direction. Novak and Riener [66,67] initially relied on velocity thresholds and later switched to a hybrid EMG/torque-based threshold method, achieving robust early detection (35.4 ms before motion, with 3.9% false positives).
Other studies implemented multi-class classification to recognize specific activities. Heo et al. [57] implemented a threshold-based finite state machine to detect lifting phases (e.g., grasping, lifting), supporting users only during the lift. Bandara et al. [52] and Chen et al. [54] classified activities of daily living, like drinking or reaching, using EEG and EMG inputs, respectively. Both compared classifiers and found that SVMs outperformed NNs in accuracy. However, Bandara et al. reported faster detection with NNs (300 ms vs. 600 ms on average), while Chen et al. noted that NNs require significantly more training data, which limits performance on smaller datasets.
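A threshold-based finite state machine of the kind used for lifting-phase detection can be sketched as follows. The states, input signals, and thresholds here are illustrative placeholders, not those of Heo et al. [57]:

```python
def lift_fsm(grip_forces, trunk_angles, grip_th=5.0, angle_th=0.5):
    """Step through idle -> grasping -> lifting -> idle using simple
    thresholds on grip force [N] and trunk flexion angle [rad]."""
    state = "idle"
    states = []
    for f, a in zip(grip_forces, trunk_angles):
        if state == "idle" and f > grip_th:
            state = "grasping"            # load picked up
        elif state == "grasping" and a < angle_th:
            state = "lifting"             # trunk straightens under load
        elif state == "lifting" and f <= grip_th:
            state = "idle"                # load released
        states.append(state)
    return states

phases = lift_fsm([0.0, 6.0, 6.0, 6.0, 0.0],
                  [1.0, 1.0, 0.2, 0.2, 0.2])
```

Restricting support to the "lifting" state, as in the study above, avoids hindering the user during grasping and release.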
Binary classification enables fast, low-complexity detection for distinct movement situations, while multi-class classification offers broader situation coverage but demands more data and careful algorithm selection based on the requirements for accuracy, latency, and computational costs.

4.3.2. Model-Based Regression

Researchers use model-based approaches to continuously estimate joint torques or motion trajectories (see Figure 5). These can be grouped into three categories: biomechanical models, which simulate neuromusculoskeletal dynamics; mechanical models, which rely on kinematics or dynamics, i.e., integrate forces and equations of motion; and interaction-based models, which infer intent from user–exoskeleton interface forces without modeling human biomechanics.
Biomechanical models use measured muscle activity and anatomical data to estimate joint torques. Buongiorno et al. [53] and Lotti et al. [63] estimated muscle forces via Hill-type models and OpenSim-based simulations. Lotti et al. calibrated the model through subject-specific maximum voluntary contraction (MVC) and movement trials, while Buongiorno et al. applied optimization methods. Simpler models like those by Lu et al. [65] and Treussart et al. [71] mapped EMG signals to torque using exponential functions with acceptable error levels.
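The exponential EMG-to-torque mappings mentioned above can be sketched with the common exponential activation nonlinearity; the shape parameter A and the maximum torque value below are illustrative placeholders, not values from the cited papers.

```python
import math

def emg_to_activation(u, A=-2.0):
    """Exponential EMG-to-activation nonlinearity for normalized
    EMG u in [0, 1]. A < 0 yields the concave shape commonly
    reported; A -> 0 approaches a linear mapping."""
    return (math.exp(A * u) - 1.0) / (math.exp(A) - 1.0)

def estimate_torque(u, tau_max=40.0, A=-2.0):
    """Scale activation by an assumed maximum joint torque (Nm)."""
    return emg_to_activation(u, A) * tau_max
```

In practice, tau_max and A must be calibrated per subject, e.g., via MVC trials as described for the reviewed models.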
Mechanical models apply physical equations to predict torque or trajectories. Lotti et al. [64] estimated elbow torque using inverse dynamics, integrating IMU data and users’ anthropometrics. Dinh et al. [7] derived torque from measured cable tension and the arm’s modeled potential and kinetic energies. Khan et al. [60] combined a dynamic arm model with an NN to adapt to subject-specific and time-varying dynamics. Vision-based methods by Liao et al. [61] and Gao et al. [56] use kinematic models.
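As a hedged illustration of the mechanical, inverse-dynamics route, a single-link model of the forearm combines a gravity term and an inertial term. The mass, center-of-mass distance, and point-mass inertia approximation are assumptions made for this sketch, not parameters from the cited studies.

```python
import math

def elbow_torque(theta, theta_ddot, m=1.5, l=0.18, g=9.81):
    """Single-link inverse dynamics for the forearm:
    tau = I * theta_ddot + m * g * l * sin(theta),
    with theta measured from the vertical, l the distance to the
    forearm's center of mass, and I approximated as m * l**2."""
    inertia = m * l ** 2
    return inertia * theta_ddot + m * g * l * math.sin(theta)
```

With the arm hanging vertically (theta = 0) and at rest, the required torque is zero; holding the forearm horizontal requires the full gravitational torque m * g * l.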
Interaction-based models extract user intent directly from forces at the physical human–exoskeleton interface to classify motion states and directions [58] or treat force readings as joystick-like inputs [55].
Biomechanical and mechanical models offer physically grounded and accurate torque predictions but require detailed modeling of both the user and the system, which makes the setup more complex and makes calibration procedures inevitable. Interaction-based models are simpler and rely on embedded sensors, but their measurements are confined to the exoskeleton’s structure.

4.3.3. Model-Free Regression

Model-free regression approaches use machine learning to learn the relationship between sensor inputs and torque or trajectory outputs based on training data. Sedighi et al. [68] combined a CNN and LSTM to predict shoulder and elbow trajectories from EMG signals and joint angles. Sun et al. [69] employed EMG data and a NARX network with an unscented Kalman filter to estimate joint angles. Woo et al. [72] segmented lower-limb movements using k-means clustering on IMU data and applied an NN to generalize predictions across subjects. Toro-Ossaba et al. [70] predicted elbow torque from recent biceps and triceps EMG using an NN. Zhang et al. [51] applied random forest models for locomotion mode and gait phase classification based on hip and shank kinematics. While model-free methods do not require prior knowledge of the system’s dynamics or biomechanics, their performance depends on data quality and quantity and they require additional effort to generalize beyond the trained conditions.
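As a minimal stand-in for these model-free regressors, the pipeline of windowed feature extraction followed by a learned input-output mapping can be sketched with a closed-form linear fit. The reviewed systems use NNs or random forests instead, and the window length here is illustrative.

```python
def window_features(emg, win):
    """Mean absolute value per non-overlapping window of length win."""
    return [sum(abs(x) for x in emg[i:i + win]) / win
            for i in range(0, len(emg) - win + 1, win)]

def fit_linear(xs, ys):
    """Closed-form least squares for y = a*x + b, standing in for
    the trained regression models used in the reviewed studies."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
        sum((x - mx) ** 2 for x in xs)
    return a, my - a * mx
```

The same two-step structure (features from a recent signal window, then a learned mapping to torque or angle) underlies the NN-based approaches above; only the mapping's capacity differs.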
Classification and regression methods each offer advantages depending on the requirements, such as available sensors, modeling capabilities, and application context. Classification is particularly effective for recognizing discrete motion states or transitions, such as detecting movement onset or distinguishing between lifting and standing. Threshold-based approaches enable simple, fast, and robust implementations. Regression methods, on the other hand, allow for continuous prediction of joint angles or torques and are better suited if support characteristics must change continuously. Model-based regression requires detailed biomechanical or mechanical modeling and subject-specific calibration, while model-free regression relies mainly on training data, offering greater flexibility but demanding careful data preparation and generalization strategies.

4.4. Time

The timing of intention detection and system reaction plays an important role in providing effective and natural support. As shown in Figure 6, different aspects of time must be considered, such as the computational delay, the mechanical system delay, and the chosen prediction and analysis windows. Moreover, these time components are intertwined with the chosen prediction method. When applying classification methods, for example, the detection time of activity onset is critical for the support effect [57].

4.4.1. System Response Time

The system response time is the total duration from the initial sensor input signal to the resulting measurable torque or force applied to the user’s limb. Time delays can be due to different causes, including sensor type, algorithms, filtering, and hardware [51]. Using biological signals, such as brain or muscle activity, allows one to detect motion prior to its actual execution. This electromechanical delay, also shown in Figure 6, between the EMG activity and a mechanically measurable muscle contraction is quantified in the literature as being between 30 and 150 ms [74,77], depending on the physiological state, sensor type, and threshold criterion. Sedighi et al. measured the EMG activity an average of 150 ms before the movement onset [68]. Brain activity, on the other hand, can be measured even earlier before muscle contraction, but is prone to false positives [78]. Multiple authors compared the computational delays of biomechanical and mechanical models predicting torque, with biomechanical approaches being significantly faster because biological sensors deliver their signals earlier than non-biological ones [57,64,75].
Researchers present different approaches to reduce computational and inherent system delays. Zhang et al. [51] applied a time-delay compensation parameter to account for filter and hardware delays, which reduced their system response time by around 43 ms, with a total delay of 0.745 ± 0.077 s. Sedighi et al. predicted the user’s arm position 450 ms into the future to compensate for system delays [68]. Liao et al. reduced their system response delay by 38% to 0.5 s for fast motions by applying a Kalman filter [61]. Riener and Novak optimized their response time by learning the onset threshold of the EMG signals for each user separately [67].
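Predicting ahead to compensate for delays, as in the 450 ms horizon used by Sedighi et al. [68], can be illustrated in its simplest form by a constant-velocity extrapolation. The first-order model and the lead time below are a deliberate simplification of the learned predictors actually used.

```python
def extrapolate_position(x, v, t_lead=0.45):
    """First-order extrapolation of limb position t_lead seconds
    ahead, assuming constant velocity over the horizon. t_lead
    mirrors the 450 ms prediction horizon discussed above."""
    return x + v * t_lead
```

Such an extrapolated reference is then fed to the controller instead of the current measurement, so the mechanical output arrives roughly in phase with the user's actual motion.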

4.4.2. Analysis Times

Researchers analyze sensor signals at different frequencies and time spans (Figure 6). For models with continuous output, the sampling frequency determines the analysis time. EEG as well as EMG signals are commonly sampled at around 1 kHz and subsequently filtered [11,50,52,53,57,63,65,71,75]. Dinh et al. and Lotti et al. also sampled their motor control, IMU, and position signals at 1 kHz [7,63,64]. Woo et al. used IMUs sampling at 64 Hz [72]. Vision-based methods, such as Vicon, Kinect, or eye-tracking, have considerably lower sampling rates of 330 Hz [73], 30 Hz [61], and 60 Hz [66], respectively. For classification or model-free regression methods, a sliding window approach with overlap is used for feature extraction [11]. Chen et al. used a sliding window of 540 ms with 81 ms of overlap to extract features from EMG signals for training and a 135 ms window during online classification [54]. Irastorza-Landa et al. compared sliding window lengths for EMG feature extraction and found that 200 ms and 500 ms windows did not differ in classification accuracy, but the shorter window allowed a faster system response [59]. Similarly, Toro-Ossaba et al. used a sliding window of around 200 ms with 50% overlap [70], and Zhang et al. used a sliding window of around 300 ms but with 96% overlap [51]. Sedighi et al. used a 30 ms sliding window with 2 ms steps on EMG and joint angle data for a CNN-LSTM model [68].
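The sliding-window analysis described above can be sketched as follows; the window and step sizes are examples (200 samples with a 100-sample step corresponds to a 200 ms window with 50% overlap at a 1 kHz sampling rate), and RMS is one of several features commonly extracted per window.

```python
import math

def sliding_windows(n_samples, win, step):
    """Start/stop sample indices for overlapping analysis windows."""
    return [(s, s + win) for s in range(0, n_samples - win + 1, step)]

def rms(segment):
    """Root-mean-square amplitude of one window, a standard
    EMG feature alongside mean absolute value."""
    return math.sqrt(sum(x * x for x in segment) / len(segment))
```

A smaller step (larger overlap) raises the decision rate at the cost of more computation per second, which is the trade-off visible in the 50% versus 96% overlaps reported above.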

4.5. Support Characteristic

The support characteristic defines the type and amount of support an exoskeleton provides and is critical to both its effectiveness and usability. It is typically described by the targeted joint(s), the control strategy, and the resulting force or torque profile applied to the human limb [5]. As shown in Figure 4 and Figure 5, support strategies are broadly categorized into classification or regression approaches to derive either a target torque or a desired trajectory. These strategies differ in implementation complexity, resemblance to natural human movement, and adaptability to external factors such as load variations.
Torque-based control is often implemented for a single joint, most commonly the elbow, shoulder, or hip. Notable exceptions include Treussart et al. [71] and Buongiorno et al. [53], who simultaneously support both elbow and shoulder. Biomechanical models estimate joint torque using muscle activity signals, aiming for a direct equivalence between biological and assistive torques (τ_joint = τ_exo). These models typically employ admittance control, where the exoskeleton’s response depends on measured or calculated forces. Lotti et al. [63] and Buongiorno et al. [53] implemented such systems using detailed neuromusculoskeletal models and subject-specific calibration. Simpler variants using approximated exponential mappings offer reduced accuracy but benefit from faster computation [65,70,71]. Lu et al. fed torque predictions into a closed-loop position control system [65], while Treussart et al. applied a Dead Zone Integral corrector to refine torque control [71]. Mechanical models apply classical dynamic equations to estimate support torques. These are typically executed using admittance control, as seen in Dinh et al. [7] and Lotti et al. [64]. In contrast, interaction-based models use simpler mappings between body kinematics or interface forces and control output. These are often realized via closed-loop pressure control in pneumatic systems, providing gravity compensation or constant support torques [57,75].
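A minimal sketch of the admittance idea referenced above: the measured interaction force drives a virtual mass–damper, and the resulting reference velocity is handed to the inner position/velocity controller. The virtual mass, damping, and time step are purely illustrative.

```python
def admittance_step(f_ext, v, m=2.0, b=8.0, dt=0.001):
    """One explicit Euler step of the admittance law
    m * a + b * v = f_ext, returning the updated reference
    velocity for the exoskeleton's inner control loop."""
    a = (f_ext - b * v) / m
    return v + a * dt
```

Pushing harder (larger f_ext) accelerates the virtual mass, so the exoskeleton yields in the direction of the applied force; at f_ext = b * v the reference velocity is in steady state.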
In trajectory-following strategies, the goal is to ensure that the exoskeleton follows the user’s joint motion (θ_joint = θ_exo), often enhanced by gravity compensation. Biomechanical models, such as the CNN-LSTM network by Sedighi et al. [68], infer joint trajectories from EMG signals and implement position control, enabling automatic adjustment to varying loads. Mechanical approaches incorporate limb dynamics into impedance control architectures, sometimes augmented with learning algorithms (e.g., Extreme Learning Machines) to forecast future states [60]. Simpler strategies based on vision-based motion tracking directly map observed limb positions to control inputs without estimating interaction forces [56,61]. Researchers most commonly employ interaction-based trajectory controllers, which either estimate movement paths online or follow predefined trajectories. Gandolla et al. [55] estimated the motion trajectory from end-effector force vectors and applied it within a gravity-compensated position control scheme. Huo et al. [58] classified movement modes and used Kalman filter-based admittance control based on link-specific force sensors. Other systems rely on predefined trajectories derived from optimality criteria such as energy minimization [73], minimum jerk [66], or straight-line motion [67], executed through standard position control. Zhang et al. [51] supported hip motion using fixed trajectories tailored to walking scenarios classified in real time.
As shown in the reviewed studies, the choice of control scheme depends on whether the exoskeleton is designed to achieve a certain torque or trajectory. Torque-based approaches typically use admittance control to adjust support based on user-generated forces, particularly in biomechanical and mechanical models. Some also apply closed-loop pressure or position control, in one case combined with a corrective control scheme. In contrast, trajectory-based methods rely mainly on position control to follow user motion, with impedance or admittance control used when accounting for interaction forces or dynamic predictions.

4.6. Users and Evaluation

The majority of reviewed studies developed intention prediction for rehabilitation or assistive exoskeletons, with only 20% targeting an industrial application. Two publications [51,53] did not specify the application context. Evaluations were exclusively conducted in laboratory settings with mostly healthy participants (1–10 per study, mean = 6 participants; 72% male; mean age = 27.2 years). Only Gandolla et al. [55] and Zabaleta et al. [50] conducted preliminary tests with one impaired user. No field studies were reported.
Study objectives generally fell into two categories: model evaluation or assessment of usability and effectiveness. Classification algorithms were evaluated using classification accuracy on recorded test datasets, as seen in Table A2. Only Riener and Novak [67] reported false positive rates for movement intention detection. Lotti et al. [64] compared mechanical and biomechanical models based on response and settling times after perturbations, showing faster computation but reduced stability for the biomechanical model. Continuous regression models were evaluated by comparing predicted and measured trajectories or torques. Most reported quantitative error metrics, e.g., root mean square error (RMSE) or the coefficient of determination (R²), though some provided only qualitative assessments [55,58,61]. Protocols included single-plane joint motions (e.g., elbow flexion [7,63]) or functional tasks (e.g., drinking [60], object manipulation [64], activities of daily living [73]). Only interaction-based models were tested for industrial-like tasks, such as overhead work and lifting [57,75].
Support effectiveness was primarily assessed via changes in muscle activity. All elbow-supporting exoskeletons led to reductions in biceps activation [7,63,64,65], with Lotti et al. also noting reduced triceps, brachioradialis, and deltoid activity. Treussart et al. [71] found decreased activation in the biceps, deltoid, and erector spinae during load lifting, but no significant changes in other muscles. Heo et al. [57] observed reduced activity in lumbar and leg muscles during squat lifting with their back exoskeleton, but no significant changes in the gluteus maximus or biceps. Their EMG-triggered system also reduced fatigue of the longissimus thoracis during stoop lifting, measured by the decreasing mean frequency of the sEMG power spectrum. Mechanical work was evaluated in two studies. Lotti et al. reported lower user effort with their biomechanical model [64], while Heo et al. identified an optimal control delay (0–100 ms), balancing mechanical support and muscle activity reduction [57]. Lotti et al. [64] assessed kinematic changes, finding no significant alterations in range of motion or peak joint velocities of elbow and shoulder flexion/extension. Zhang et al. [51] reported a significantly higher maximum thigh angle and peak velocity during support.
Four studies included user feedback through interviews or questionnaires [56,58,66,71]. Participants generally reported good support, especially for the arms and shoulders, but noted reduced efficiency compared to traditional assistance. Perceptions of comfort, fatigue, and control were also discussed. However, Huo et al. and Novak and Riener did not disclose the questionnaire content [58,66].
Evaluations remained limited to lab settings with small numbers of healthy participants who did not constitute target user groups. While classification and regression models were assessed using standard performance metrics, support effectiveness was mainly analyzed through EMG-based muscle activity, with some analyses of fatigue, mechanical work, or kinematics. Only a minority of studies incorporated subjective user feedback, and standardized usability evaluation protocols were largely absent.

5. Discussion

This systematic review provides a comprehensive overview of studies evaluating active exoskeletons’ control strategies and the inclusion of prediction methods that could be used in industrial applications. This work builds upon and extends existing reviews in the field, such as those by [11,18,62,74], with a particular emphasis on methods for industrial contexts. By narrowing its focus, it aims to address research gaps and provide insights into the development and evaluation of industrial exoskeletons. The choice of search terms and eligibility criteria influences which papers are identified during the search process. Our chosen search criteria (Figure 2) may have excluded other relevant terms, such as “motion prediction”. The results were biased by our search focus, which favored work applicable to an industrial context. This included multimodal control strategies, exoskeleton actuation, task execution, and sensor perception. However, to ensure that all relevant research was considered, the full texts were evaluated separately by the first three authors, with particular attention to applicability in industrial contexts, and disagreements were discussed.
The possibility of predicting intentions must be critically assessed. The questions raised by Boncheck-Dokow and Kaminka [79] highlight the limitations of using computational methods for this purpose: How is intention prediction possible when only a failed sequence of actions is demonstrated? How is intention prediction possible when the actions are performed on novel objects about which the observer seemingly has no prior knowledge? This challenges Leibniz’s “Calculemus”: humans can only estimate what might happen on the basis of observations and rational assumptions, and even the latter holds only partially. According to psychologist Daniel Kahneman, people prefer individual experiences over probability judgements [80]. The process of predicting the intentions of an agent, whether human or machine, is defined by its stages and indicators. Humans can form associations and hypotheses about the possible reasons and goals behind actions or behavior, a process known as “mind-reading” [81]. This, in turn, can be used to predict future outcomes, such as sequences of actions, and provides a blueprint for computational models. However, the result remains a best estimate.

5.1. Intention Cues

The collected work shows a number of different cues that can support an intention-based control strategy for active exoskeletons. As shown in Figure 6, the cues analyzed in this review range from short-term estimation based on muscle activity, used to predict the start of a movement, to classifiers that can take several hundred milliseconds to predict the actual activity. Somatic cues, such as the most commonly used EMG-based muscle activity, tend to enable fast prediction, while behavioral or situational-context cues, such as signals derived from movements, tend to be slower because they require a longer observation window for online prediction [63]. For behavioral cues, such as kinematic signals measured by IMUs or cameras, an observable activity must already have occurred, but this makes them less prone to errors and yields more stable control signals.
Implementing intention prediction in upper-limb exoskeletons involves processing intention cues derived from biological and non-biological signals. Biological signals such as muscle activity (EMG), brain activity (EEG), and somatic indicators are valuable because they reflect neuromuscular activation before physical movement occurs, enabling the early detection of user intent [11,79]. For instance, EMG-based methods can predict motion onset up to 450 ms in advance, thereby minimizing response latency in dynamic environments [68,70]. Non-biological signals, including kinematic data, gaze patterns, and environmental interactions, can be captured using technologies such as IMUs, motion capture systems, and gaze trackers [18,56]. Although these signals lack the temporal immediacy of biological cues, they provide stable and robust data, thereby enhancing system reliability [70,71]. Integrating these cues requires sensor technologies and computational models. Biological signals are captured using EMG and EEG sensors, while non-biological signals rely on IMUs and cameras [6,56]. The data is then processed using machine learning models ranging from classical approaches such as SVMs to advanced deep learning techniques like CNNs and LSTMs [7,63]. Depending on application needs, these models classify discrete states or predict continuous variables such as joint torque [68,70].
A key challenge lies in balancing the sensitivity of biological signals with the stability of non-biological ones. Multimodal approaches combining both types of signals can enhance accuracy and robustness, ensuring effective responses to diverse tasks [65,70]. However, variability in user behavior and the dynamic nature of industrial tasks necessitate more adaptable models. Future research should therefore focus on integrating multimodal cues, developing standardized evaluation protocols, and validating systems in real-world industrial settings, with the aim of improving usability and effectiveness [56].

5.2. Sensors and Signals

Sensors can improve exoskeleton control by capturing user intent signals and, with proper models, predicting pose and loads to reduce musculoskeletal stress.
Electromyography (EMG) was the most widely used biological signal in the reviewed studies. Its main advantage is the ability to implicitly capture changes in external load. However, EMG faces several practical challenges that limit its industrial applicability. EMG signals are highly user-specific, non-stationary, and sensitive to electrode placement and skin condition, making long-term reliability problematic [11]. Calibration, such as MVC scaling, is typically required for threshold-based methods [63,75], increasing setup time. Sweat-related degradation further complicates prolonged or high-mobility use [57]. Dry EMG sensors present a viable alternative, with comparable signal quality and faster setup than traditional gel electrodes [71,82].
EMG signals exhibit significant variability due to individual differences and environmental factors, which can differ not only between users but also across experimental sessions for the same individual [11,57]. The commonly employed normalization method, MVC, serves as a reference standard but often yields inconsistent maximum values across sessions, further complicating its application. These challenges highlight the necessity of more robust and adaptive normalization techniques to improve the reliability and reproducibility of EMG-based systems in industrial settings. One possible step toward this goal is real-time normalization or drift compensation, which uses adaptive Kalman or moving average filters to track trends and normalize EMG relative to recent activity levels. Dynamic, task-based scaling, which calibrates based on functional tasks, could produce repeatable results for control inputs. Other possible steps include the inclusion of musculoskeletal models tailored to the subject’s anatomy or the deployment of subject-specific training to learn EMG patterns without relying on explicit MVC scaling.
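One of the directions suggested above, normalizing EMG relative to recent activity levels instead of an MVC reference, can be sketched with an exponentially decaying running peak. The decay rate alpha and the floor eps are illustrative, and this is a simplification of the adaptive Kalman or moving-average filters mentioned above.

```python
def adaptive_normalize(emg, alpha=0.01, eps=1e-6):
    """Normalize rectified EMG by an exponentially decaying
    running peak, so the output tracks recent activity levels
    rather than a fixed MVC calibration value."""
    peak = eps
    out = []
    for x in emg:
        r = abs(x)
        peak = max(r, (1 - alpha) * peak)   # decay unless exceeded
        out.append(min(r / peak, 1.0))
    return out
```

Because the reference decays over time, the scheme adapts to session-to-session drift, at the cost that an early, unusually strong contraction temporarily compresses subsequent outputs.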
Recent developments in textile-embedded EMG, e.g., integration into underwear or compression garments, promise further improvements in usability and setup time, paving the way for real-world deployment [83,84,85]. Beyond EMG, other textile-based sensors measuring strain, respiration, or load handling may broaden intent inference and enhance multimodal systems [86,87,88].
Non-biological sensors, such as force, position, and velocity sensors, offer robust, long-term performance without the variability or degradation issues typical of biosignals. For example, force-based inverse dynamics models can estimate joint torques using mechanical signals alone, showing stable and reliable performance over time [64]. Such models also avoid issues like signal loss or sweat interference and are often simpler to implement. However, they may react more slowly and fail to account for user intent under changing external dynamics; additionally, model set-up can be time-intensive. Interaction forces can also support gaze-based interfaces by resolving selection ambiguity (e.g., the “Midas touch” problem) [66]. Additionally, context-aware sensing, such as BLE-based or UWB-based proximity or tool recognition, may be used to infer the weight or type of object the user is interacting with [89,90], further enhancing prediction accuracy.
Biological signals such as EMG enable fast and intuitive intent detection but require user-specific calibration and are sensitive to environmental conditions. Recent advances in dry and textile-integrated EMG sensors offer promising improvements in usability and robustness, increasing their suitability for industrial applications. In contrast, non-biological sensors provide greater long-term reliability and ease of integration but may compromise responsiveness to user intent.
When evaluating sensor technologies for industrial exoskeletons under conditions such as moisture, mechanical friction, and long-term alignment shifts, non-biological sensors such as IMUs and force sensors emerge as the most robust and reliable options [64]. IMUs can be placed in various locations, transmit data wirelessly, and may require filtering to mitigate vibration-induced noise. Force sensors are embedded at the user–exoskeleton interface, providing stable measurements even under external loads. These sensors are unaffected by sweat and maintain alignment over time due to their structural integration. EMG signals are highly user-specific, non-stationary, and sensitive to electrode placement and skin conditions (changes in electrical impedance) [11,64], which complicates long-term reliability and increases setup time (effort required for threshold tuning) [57,71], making them less practical for industrial applications. Vision-based systems, such as those using depth cameras or IR sensors, are immune to mechanical noise, though they may struggle in dynamic or cluttered environments. Sensor selection should align with application-specific needs, such as precision and robustness. Multimodal approaches that combine biological and non-biological sensors can enhance reliability and adaptability further in industrial settings.

5.3. Computation and Models

In line with previous reviews [11,74,91], approaches for predicting user motion or joint torque can be broadly categorized into classification and regression. This two-stage strategy is especially valuable in dynamic environments, where initial classification allows prediction models or control strategies to adapt to task-specific requirements [51,57]. For instance, integrating activity recognition techniques, like those from Pesenti et al. [92], who used LSTM networks to detect lifting actions and payloads, could support context-aware regression models for more precise intent estimation under varying industrial conditions.
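The two-stage strategy, classifying the context first and then dispatching to a context-specific prediction model, can be sketched as follows. The classifier and regressor interfaces here are hypothetical and stand in for the task-specific models discussed above.

```python
def two_stage_predict(features, classify, regressors):
    """Stage 1: classify the current activity or task context.
    Stage 2: dispatch to the regression model registered for
    that context and return (context, prediction)."""
    mode = classify(features)
    return mode, regressors[mode](features)
```

This structure lets each regression model stay simple and well calibrated for its own task (e.g., lifting versus idle), while the classifier absorbs the variability between industrial scenarios.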
The reviewed classification models included threshold-based methods, SVMs, NNs, and recurrent architectures such as LSTMs. These models are typically optimized for prediction accuracy, but their applicability in industrial contexts is constrained by challenges in reliably distinguishing between user states, computational latency, and extensive data requirements. Although SVMs have demonstrated strong performance in online prediction tasks [52,54], the reported computational cost is high, which may hinder deployment in real-time systems. Key design parameters, such as sliding window size and feature selection, significantly influence performance and latency. A critical bottleneck, especially for deep learning-based models, is the limited availability of annotated datasets reflecting realistic industrial scenarios. Only a few multimodal datasets, such as [93,94], closely resemble actual workplace conditions. Broader access to such benchmarks would improve the generalizability and comparability of classification models. Recent research indicates that large language models (LLMs) can help address this gap. For example, LLMs have been used to automatically annotate sensor data streams with semantic labels [95] and enable human-in-the-loop model retraining through natural language interaction [96,97], potentially accelerating adaptation to diverse real-world environments.
Regression models, particularly those based on biomechanical or physical interaction modeling, support continuous estimation of joint torques or motion trajectories and inherently incorporate variations in external dynamics [57,64]. These models provide a detailed and interpretable representation of the support context but require extensive calibration specific to both the user and the device, making them sensitive to deviations from underlying assumptions. As a result, their adaptability to different users, tasks, or deployment scenarios is limited, reducing scalability in diverse industrial settings. In contrast, model-free regression methods offer greater flexibility and can adapt to changing support conditions through retraining. However, they require substantial amounts of training data and careful hyperparameter tuning and often function as black-box models, which can hinder transparency and trust, especially in time-sensitive or safety-critical industrial applications.
Ultimately, the choice between classification, regression, or hybrid approaches depends on the variability and dynamics of the environment and users. Classification excels at detecting discrete actions but depends on accurate labeling and discrete task definitions. Biomechanical regression provides interpretable estimates incorporating external dynamics but requires careful calibration and modeling. Model-free regression offers adaptability but is data-intensive and less transparent, posing challenges for rapid, trustworthy deployment in industrial contexts. Importantly, these methodological trade-offs are closely linked to real-time constraints and the timing of intention prediction, which determine how effectively the system can synchronize with the user’s movements during task execution. Trajectory-following approaches, in particular, must remain strictly aligned with user motions, whereas torque-based methods are generally less time-critical, though too early or late torque onset can reduce support efficiency [57]. Timing aspects depending on used signals and models are examined in detail in Section 5.4.

5.4. Time

The timing of intention prediction in upper-limb exoskeletons varies significantly across methods due to differences in underlying mechanisms. Biological signals, such as EMG and EEG, enable early detection of user intent by capturing neuromuscular or cognitive activity before physical movement occurs. However, biological signals are prone to noise and require computational resources for processing, with electromechanical delays further contributing to timing variability [11,68].
Non-biological signals, such as those from IMUs or motion capture systems, rely on observable physical movement, introducing inherent delays. While these methods are more stable and less sensitive to environmental factors, they lack the temporal immediacy of biological signals. For example, IMU-based systems detect motion onset only after a velocity or acceleration threshold is exceeded, resulting in slower response times [18,71].
Computational models also influence prediction timing. Classification models, such as SVMs, are faster but less flexible, while regression models, including deep learning approaches like LSTMs, offer greater adaptability at the cost of additional latency due to algorithmic complexity. Hardware and software configurations, such as sensor sampling rates and data filtering, further impact response times [7,30].
The specific application context also plays a role. Tasks requiring high precision may tolerate longer response times, while rapid or high-force movements demand shorter delays. Subsequent research should focus on optimizing multimodal approaches, improving real-time processing, and tailoring systems to industrial tasks to balance responsiveness and stability [20,68]. Studies should also incorporate user reaction times to a given start signal to relate system response times, as in [75]. Additionally, the system’s total response time should be reported to enhance comparability between different implementations. To capture habitual cues, longer observation windows can provide insights into repetitive behavior in context, which could be used to weight current predictions and translate them into new ones by capturing typical user execution patterns.

5.5. Exoskeleton Implementations

Researchers implement intention-based support in industrial exoskeletons either to achieve accurate joint angle trajectory tracking or to provide support torques aligned with the user’s joint torques. Among regression-based trajectory-following approaches, Sedighi et al. uniquely employed a detailed biomechanical model, while most others used interaction-based methods with online-estimated or predefined trajectories targeting fixed or detected points. As noted by Riener and Novak [67], trajectory planning simplifies when the number of start and target points is limited, which motivates approaches like object fixation to infer grasping targets. Eye-tracking techniques show promise in structured environments, such as precision tasks at single workstations (see Table 3), and align with intelligent worker information systems [98] or LLM-based approaches [96]. Scenario classification combined with trajectory generation, as implemented by [50,75], is well suited for repetitive upper-limb movements typical of assembly or lifting activities. Kinematic mechanical models using video tracking [56,61] also rely on calibrated cameras fixed in the workspace and are thus applicable for high-precision, fixed workstation tasks.
For torque prediction, most studies derive support torque from muscle activity measurements. Biomechanical models automatically compensate for varying external loads [64,65] but become complex for multi-degree-of-freedom exoskeletons and require extensive user- and device-specific calibration, often demanding expert knowledge and additional measurement devices. Buongiorno et al. demonstrated that genetic algorithm optimization can automate model adaptation based on user characteristics [53]. Mechanical models have shown robust performance under fixed loads [64] and greater stability against perturbations, which is critical for safe, real-time control in variable industrial conditions. Such models may be preferred for high-force or load-handling tasks (see Table 3), typically combined with torque or admittance control. Incorporating interaction force measurements or context cues about handled loads can further enhance model performance. While admittance control allows for real-time adjustment, ensuring stability in complex interactions remains challenging [91]. Industrial exoskeletons for shoulder [75] and back support [57] often implement state-machine control triggered by threshold crossings, which can provide reliable support when the thresholds are well defined, for example, in repetitive tasks such as assembly-line or lifting activities.
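A minimal sketch of such threshold-triggered state-machine control follows; the thresholds, hysteresis band, and support torque are invented for illustration and are not taken from [57,75]:

```python
from enum import Enum

class Mode(Enum):
    IDLE = 0
    SUPPORT = 1

class ThresholdController:
    """Minimal state machine: engage support when the activation signal
    (e.g., a normalized EMG envelope) crosses `on_threshold`, release it
    when the signal falls below `off_threshold`. The hysteresis band
    avoids chattering around a single threshold."""

    def __init__(self, on_threshold=0.3, off_threshold=0.15, support_torque=5.0):
        self.on_threshold = on_threshold
        self.off_threshold = off_threshold
        self.support_torque = support_torque  # Nm, illustrative value
        self.mode = Mode.IDLE

    def step(self, activation):
        if self.mode is Mode.IDLE and activation > self.on_threshold:
            self.mode = Mode.SUPPORT
        elif self.mode is Mode.SUPPORT and activation < self.off_threshold:
            self.mode = Mode.IDLE
        return self.support_torque if self.mode is Mode.SUPPORT else 0.0

# One lifting cycle: activation rises, support engages, then releases.
ctrl = ThresholdController()
trace = [0.05, 0.2, 0.4, 0.35, 0.2, 0.1, 0.05]
torques = [ctrl.step(a) for a in trace]
```

Note that support is held at an activation of 0.2 on the way down (above the release threshold) but not triggered at 0.2 on the way up, which is exactly the chatter-suppressing effect of the hysteresis band.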
Our review supports prior findings [3,18,99] that evaluation protocols for exoskeleton support and intention models lack standardization, complicating direct comparisons. Most studies are conducted in laboratory settings with non-target users and simulated tasks, limiting their relevance to real industrial environments. Only two studies explicitly evaluated models in industrial tasks [57,75]. Some research includes complex daily activities, such as drinking or object manipulation, which partially resemble industrial motions. In the future, promising methods (see Table 3) could be implemented in active industrial exoskeletons and tested in a multi-stage evaluation process to ensure efficacy and usability in industrial contexts while reducing evaluation efforts (see Section 6).
Beyond the lack of representative evaluation needed to ensure model efficacy and usability, there are additional general barriers to successful exoskeleton implementation in industrial contexts. Selecting appropriate exoskeleton hardware must be done in close alignment with the specific work processes, movement patterns, and the types of tools and loads handled, as highlighted by [100]. Independent of proactive control, mechanical design parameters and the choice of supported joints should be matched to the actual demands of the work process to maximize both support efficiency and user acceptance. Furthermore, expanding the number of well-designed field studies (see Section 6) would not only provide a clearer understanding of real-world efficacy and usability but also help companies to better assess the return on their investment. Such evidence is critical for fostering trust, accelerating adoption, and informing the development of common standards and regulations, a need previously emphasized by [101].
In the context of worker safety, identifying the hazards associated with intention-aware, active, and adaptive industrial exoskeletons requires recognizing risks such as mispredictions, unintended movements, and ergonomic mismatches. Poorly calibrated or ill-fitting systems can lead to strain or injury. Additionally, sensor malfunctions, such as errors in EMG or IMU units, pose safety concerns. An effective risk assessment requires task-specific analysis of activities such as high-force lifting or precision tasks. Furthermore, variability in user anthropometry and potential failure modes, such as power loss, software glitches, and model errors, must be thoroughly evaluated to mitigate risks and enhance safety during deployment. This also includes the impact of exoskeleton-, task-, or human-induced domain shifts on human activity or intention recognition models and methods. Such systems must comply with occupational safety regulations or ISO standards upon deployment. From a workplace perspective, there should be an opt-out culture if discomfort or safety concerns arise.
Recent statistics show no significant reduction in ergonomic risks in industrial and service occupations [102]. Safety-II principles for worker safety [103,104] can be aligned with predictive exoskeletons to reduce risks further. Safety-II is based on the idea that performance adjustment and variability are key factors and that proactive safety management involves making adjustments before negative outcomes occur. Exoskeletons with adjustable support characteristics and multi-modal sensors can respond appropriately to varying tasks and loads, overcoming the bimodality principle in the safety domain and focusing on the presence of positives and on how work is actually performed (i.e., “Work-As-Done”).

6. Future Research and Recommendations

The future development of industrial exoskeleton control should emphasize robustness, adaptability, and user acceptance. Mechanical and model-free approaches using IMUs or force sensors offer high fault tolerance, making them well suited for industrial settings, though they often react more slowly than EMG-based biomechanical models, which can detect movement before it is initiated. Integrating contextual knowledge, such as loads, tools, or activities, can improve the support characteristics. To enhance generalizability across domains, transfer learning techniques, including pre-training with motion capture data and adapting to simpler sensor setups, are promising [105]. Furthermore, federated and reinforcement learning allow for on-site model adaptation without centralized data storage. Large language models (LLMs) present a novel opportunity by enabling real-time retraining of pre-trained models through user feedback, thereby improving their generalization.
In industrial environments, sensors must be easy to apply with minimal setup. IMUs, force sensors, or embedded sensors in the exoskeleton are most practical. While EMG sensors are widely used in research, their sensitivity to placement and inter-user variability limits their applicability to industrial contexts. However, recent progress in dry textile EMG sensors suggests that they could be a feasible alternative in the future.
Although most analyzed studies originate from rehabilitation, many of the underlying methods, especially biomechanical and load-based models, are directly applicable to industrial tasks, as they account for external loads that increase joint stress and the risk of musculoskeletal disorders. Recognizing such loads enables automatic adjustment of the support. Before field deployment, these models should be validated in standardized lab tests. Classification-based models can often be reused with adjusted thresholds or retrained on task-specific data. Cognitive signals, such as fatigue or stress, can extend prediction models, enhancing personalization [106]. Long-term monitoring of habitual patterns may also improve intent estimation over time [107,108].
For implementation, we recommend beginning with generalizable task modes, as shown in Table 3. These include force, speed, precision, and varying activity types, each representing common industrial activities. For example, the force mode targets lifting and carrying tasks, speed mode supports rapid motion, precision mode assists fine motor tasks like assembly or welding, and varying mode enables selective support during non-repetitive tasks, such as construction or setup work. The choice between torque- or position-based control should be based on task-specific demands for force and movement.
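As a hedged illustration, the four support modes could be encoded as a simple lookup that a controller consults at configuration time; the task examples and sensor/control pairings below paraphrase the discussion above and are illustrative, not prescriptive:

```python
# Illustrative mapping of the four generalized support modes to plausible
# sensor and control choices (invented pairings for demonstration only).
SUPPORT_MODES = {
    "force":     {"tasks": "lifting, carrying",        "control": "torque/admittance",
                  "sensors": ["force", "IMU"]},
    "speed":     {"tasks": "rapid motion",             "control": "position/trajectory",
                  "sensors": ["IMU"]},
    "precision": {"tasks": "assembly, welding",        "control": "position/trajectory",
                  "sensors": ["motion capture", "eye tracking"]},
    "varying":   {"tasks": "construction, setup work", "control": "torque/admittance",
                  "sensors": ["IMU", "force", "context cues"]},
}

def select_control(mode):
    """Look up the suggested control scheme for a given task mode."""
    return SUPPORT_MODES[mode]["control"]
```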
For reliable evaluation of industrial exoskeletons controlled via intention prediction, we propose a three-level evaluation process that integrates technical model performance assessment, preliminary controlled laboratory testing with simulated work tasks, and long-term field trials with the actual target group to capture realistic work patterns and side tasks often absent from laboratory studies [18,109,110,111]. In the first stage, model performance can be evaluated independently of real-world applications using the standard outcome measures identified in this review (see Table A2). For classification, we recommend the F1-score over accuracy to account for imbalanced datasets. Metrics should be consistently reported and accompanied by open data to facilitate cross-study comparisons. If model performance meets predefined thresholds, evaluation should proceed to controlled laboratory testing with standardized simulated work tasks, as proposed by [112,113], using established exoskeleton evaluation metrics. Extending prior work on evaluation criteria [99], we recommend measuring the following:
  • EMG-based muscle activity of targeted, antagonistic, and non-target muscle groups to assess whole-body load and stress distribution.
  • Motion capture-based kinematic changes in whole-body movements, analyzed via principal component analysis (PCA); see, for example, [114].
  • Somatic indicators such as metabolic cost (via respirometry), heart rate, and skin temperature.
  • Functional task performance (execution time and execution quality).
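The PCA-based analysis of whole-body kinematic changes listed above can be sketched with synthetic data; the joint-angle signals, dimensions, and noise level here are invented, and a real analysis as in [114] would use measured marker trajectories:

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic "whole-body kinematics": 200 time samples x 12 joint angles,
# generated from two underlying movement synergies plus sensor noise.
t = np.linspace(0.0, 4.0 * np.pi, 200)
synergies = np.stack([np.sin(t), np.cos(0.5 * t)], axis=1)   # (200, 2)
mixing = rng.standard_normal((2, 12))                        # (2, 12)
angles = synergies @ mixing + 0.05 * rng.standard_normal((200, 12))

# PCA via SVD of the mean-centered data matrix.
centered = angles - angles.mean(axis=0)
_, s, _ = np.linalg.svd(centered, full_matrices=False)
explained = s**2 / np.sum(s**2)   # variance ratio per principal component

# With two dominant synergies, the first two PCs capture most variance;
# a shift in this distribution between exoskeleton conditions would
# indicate a changed whole-body movement strategy.
var_top2 = explained[:2].sum()
```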
All metrics should be benchmarked against both baseline (no exoskeleton) and passive-mode conditions without active support control. Promising configurations should then advance to long-term field trials with end-users. While initial efficacy testing in simulated environments provides valuable technical insights, it lacks demographic representativeness and may not capture real-world usability challenges. To mitigate this, we emphasize the importance of subsequent long-term field studies with actual end-users in their workplace contexts. Although sample sizes may still be limited, these real-world evaluations ensure that the tested populations better reflect the industrial workforce, including a diverse range of ages and gender identities. Given the demographic trend towards an aging workforce, particular care should be taken to include older users in usability assessments to capture age-related ergonomic and acceptance issues. The selection of appropriate exoskeleton hardware should match workplace conditions, following the recommendations in [100].

Usability evaluation should begin with a closely monitored one-week familiarization phase, allowing users to adjust mechanical and software settings and address open challenges. This should be followed by a self-managed six- to eight-week structured test phase, during which users provide brief post-use feedback on usage time and usability via short questionnaires focusing on the key factors driving acceptance [16]. The questionnaire should be accessible online or offline and include Likert-scale ratings of perceived work performance, physical strain relief, and wearing comfort. Coupled with usage frequency, duration logs, and structured interviews at the start, mid-point, and end, this approach can yield detailed yet manageable insights into the usability and effectiveness of proactively controlled exoskeletons in real industrial environments.
Following this framework can reduce variability in reported metrics and provide a reproducible basis for systematic benchmarking. The staged process also enables early identification of model-related issues before resource-intensive field testing, while ensuring that full software–hardware integration is ultimately validated under real working conditions. To facilitate transparent comparison across studies and accelerate progress in the field, we strongly encourage researchers to publish their datasets along with detailed descriptions of experimental protocols and metrics so that intention prediction models and exoskeleton control strategies can be assessed on common grounds.
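The stage-one recommendation to prefer the F1-score over accuracy can be illustrated with a small, invented example of an imbalanced onset-detection dataset:

```python
def accuracy_and_f1(y_true, y_pred, positive=1):
    """Compute accuracy and the F1-score of the positive class."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    acc = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return acc, f1

# Imbalanced case: 90 "no lift onset" samples, 10 "lift onset" samples.
# A model that never predicts the rare class still scores 90% accuracy
# but 0% F1 on the class that actually matters for support triggering.
y_true = [0] * 90 + [1] * 10
y_pred = [0] * 100
acc, f1 = accuracy_and_f1(y_true, y_pred)
```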

7. Conclusions

This systematic review, conducted following the PRISMA guidelines, identified 29 studies published between 2007 and 2024 that investigated intention prediction in active exoskeletons for industrial applications. We examined the motivation behind integrating intention prediction and assessed the methods’ relevance for industrial applications. In response to our first research question, “How is intention prediction in upper-limb industrial exoskeletons implemented?”, we found that current approaches employ a variety of sensor modalities, most commonly motion capture and electromyography, in combination with model-based and model-free regression and classification approaches. Predictions target joint angles or joint torques, with temporal prediction windows ranging from several hundred milliseconds before motion onset to shortly after movement initiation. Secondly, we asked, “Why is intention prediction in upper-limb industrial exoskeletons implemented?” This review revealed that studies typically aimed to reduce joint loads or enable smooth trajectory following. To evaluate the former, researchers mostly used muscle activation reduction in the targeted muscles as a metric. For trajectory following, most researchers reported the deviation between user and exoskeleton joint movement using, for example, root mean square errors. However, evaluations were predominantly performed in controlled laboratory settings with small, non-representative participant groups and heterogeneous performance metrics and evaluation protocols. Building on these findings, we propose targeted recommendations for developing robust, adaptable, and industry-relevant intention prediction strategies based on the requirements of the support context and emphasize real-world validation, broader performance metrics, and long-term field studies.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/s25175225/s1. File S1: PRISMA Process Data; File S2: PRISMA Checklist; File S3: PRISMA Abstract Checklist.

Author Contributions

D.H. and K.S. equally contributed to this work. Conceptualization, D.H. and K.S.; methodology, D.H., K.S., and A.F.; validation, D.H. and K.S.; formal analysis, D.H., K.S., and M.V.-P.; investigation, D.H. and K.S.; resources, D.H. and K.S.; data curation, D.H., K.S., and M.V.-P.; writing—original draft preparation, D.H. and K.S.; writing—review and editing, D.H., K.S., and A.F.; visualization, D.H. and K.S.; supervision, A.F.; project administration, A.F.; funding acquisition, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Österreichische Forschungsförderungsgesellschaft (FFG) as part of the project EMPOWER, project number FO999909720, and by the Johannes Kepler Open Access Publishing Fund and the federal state Upper Austria.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the lead author.

Acknowledgments

Supported by Johannes Kepler University Open Access Publishing Fund.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Detailed Results

Appendix A.1. Biological Signals

Table A1. Placement of gel-based sEMG control sensors, grouped by estimation objectives and limb region. The estimation objectives (table columns) are elbow torque, shoulder torque, elbow angle, shoulder angle, and motion classification, covering the studies [70], [64], [63], [65], [7], [53], [68], [75], [59], [54], [57], [67], and [50]; the per-study sensor placements are indicated in the original table. The instrumented muscles are as follows.
Upper limb: Biceps Brachii, Triceps Brachii, Brachioradialis, Anterior Deltoid, Posterior Deltoid, Medial Deltoid, Lateral Deltoid, Supraspinatus, Infraspinatus, Teres Major, Pectoralis Major, Trapezius, Wrist Flexor, Wrist Extensor.
Lower limb: Gluteus Maximus, Biceps Femoris, Vastus Lateralis.

Appendix A.2. Users and Evaluation

Table A2. Summary of model performances, categorized by classification and regression approaches. Models are grouped by type and compared using reported evaluation metrics. Only studies with quantitative results are included; if multiple models were reported, only the best-performing one is shown. (RMSE = root mean square error; R2 = coefficient of determination).
Classification:
  • [52] SVM, activity classification: accuracy (offline) 73.1%
  • [54] SVM, activity classification: accuracy (offline) 90–98%; accuracy (online) 81–99%
  • [59] SVM, movement intention: accuracy 93.03%; SVM, movement direction: accuracy 78.28%
  • [51] Random Forest: accuracy 97.0 ± 0.9% (no support), 93.0 ± 3.1% (support)
  • [55] Thresholding, kinematics-based: accuracy 99%
  • [57] Thresholding, kinematics-based: accuracy 92.6%; thresholding, EMG-based: accuracy 96.2%
  • [67] Thresholding, EMG/force-based: false positives (online) 3.9%
  • [50] Thresholding, kinematics/force-based: accuracy 100% (unimpaired), 0% (impaired)
  • [75] Thresholding, kinematics-based: accuracy 98.7–99.7%; thresholding, EMG-based: accuracy 57.4–95.6%
Regression:
  • [53] Biomechanical model: R2 0.90 ± 0.04 (elbow), 0.92 ± 0.03 (shoulder); RMSE 1.39 ± 0.25 Nm (elbow), 1.73 ± 0.36 Nm (shoulder)
  • [63] Biomechanical model: R2 0.87 ± 0.04; RMSE 0.01 ± 0.005 Nm/kg
  • [65] Biomechanical model: correlation coefficient 98.04%; RMSE 1.90 Nm
  • [71] Biomechanical model: RMSE 0.04 ± 0.01 Nm
  • [73] Mechanical geodesics model: RMSE 0.02–0.13 mm
  • [51] Random Forest: R2 0.90 ± 0.02; RMSE 0.13 ± 0.01 rad/s
  • [60] Extreme Learning Machine: RMSE 4.2 ± 6°
  • [68] Hybrid CNN-LSTM model: accuracy 91.2%
  • [69] NARX neural network: RMSE 0.87–1.1°
  • [70] Multilayer neural network: mean absolute error 0.58 (at 0 kg), 1.53 (at 5 lb)

References

  1. Weidner, R.; Linnenberg, C.; Hoffmann, N.; Prokop, G.; Edwards, V. Exoskelette für den industriellen Kontext: Systematisches Review und Klassifikation. In Proceedings of the Gesellschaft für Arbeitswissenschaft e.V.: Dokumentation des 66. Arbeitswissenschaftlichen Kongresses, Berlin, Germany, 16–18 March 2020. [Google Scholar]
  2. de Looze, M.P.; Bosch, T.; Krause, F.; Stadler, K.S.; O’Sullivan, L.W. Exoskeletons for industrial application and their potential effects on physical work load. Ergonomics 2016, 59, 671–681. [Google Scholar] [CrossRef]
  3. Bär, M.; Steinhilber, B.; Rieger, M.A.; Luger, T. The influence of using exoskeletons during occupational tasks on acute physical stress and strain compared to no exoskeleton—A systematic review and meta-analysis. Appl. Ergon. 2021, 94, 103385. [Google Scholar] [CrossRef]
  4. European Agency for Safety and Health at Work; IKEI; Panteia; de Kok, J.; Vroonhof, P.; Snijders, J.; Roullis, G.; Clarke, M.; Peereboom, K.; van Dorst, P.; et al. Work-Related Musculoskeletal Disorders: Prevalence, Costs and Demographics in the EU; Publications Office: Luxembourg, 2019. [Google Scholar] [CrossRef]
  5. Ott, O.; Ralfs, L.; Weidner, R. Framework for qualifying exoskeletons as adaptive support technology. Front. Robot. AI 2022, 9, 951382. [Google Scholar] [CrossRef]
  6. Wandke, H. Assistance in human–machine interaction: A conceptual framework and a proposal for a taxonomy. Theor. Issues Ergon. Sci. 2005, 6, 129–155. [Google Scholar] [CrossRef]
  7. Dinh, B.K.; Xiloyannis, M.; Antuvan, C.W.; Cappello, L.; Masia, L. Hierarchical Cascade Controller for Assistance Modulation in a Soft Wearable Arm Exoskeleton. IEEE Robot. Autom. Lett. 2017, 2, 1786–1793. [Google Scholar] [CrossRef]
  8. De Bock, S.; Ghillebert, J.; Govaerts, R.; Elprama, S.A.; Marusic, U.; Serrien, B.; Jacobs, A.; Geeroms, J.; Meeusen, R.; De Pauw, K. Passive Shoulder Exoskeletons: More Effective in the Lab Than in the Field? IEEE Trans. Neural Syst. Rehabil. Eng. 2021, 29, 173–183. [Google Scholar] [CrossRef]
  9. Wang, W.; Li, R.; Chen, Y.; Jia, Y. Human Intention Prediction in Human-Robot Collaborative Tasks. In Proceedings of the Companion of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, Chicago, IL, USA, 5–8 March 2018; pp. 279–280. [Google Scholar] [CrossRef]
  10. Wang, W.; Li, R.; Chen, Y.; Sun, Y.; Jia, Y. Predicting Human Intentions in Human–Robot Hand-Over Tasks Through Multimodal Learning. IEEE Trans. Autom. Sci. Eng. 2022, 19, 2339–2353. [Google Scholar] [CrossRef]
  11. Bi, L.; Feleke, A.G.; Guan, C. A review on EMG-based motor intention prediction of continuous human upper limb motion for human-robot collaboration. Biomed. Signal Process. Control 2019, 51, 113–127. [Google Scholar] [CrossRef]
  12. Wei, Z.; Zhang, Z.Q.; Xie, S.Q. Continuous Motion Intention Prediction Using sEMG for Upper-Limb Rehabilitation: A Systematic Review of Model-Based and Model-Free Approaches. IEEE Trans. Neural Syst. Rehabil. Eng. 2024, 32, 1487–1504. [Google Scholar] [CrossRef] [PubMed]
  13. Zongxing, L.; Jie, Z.; Ligang, Y.; Jinshui, C.; Hongbin, L. The Human–Machine Interaction Methods and Strategies for Upper and Lower Extremity Rehabilitation Robots: A Review. IEEE Sens. J. 2024, 24, 13773–13787. [Google Scholar] [CrossRef]
  14. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
  15. Kim, S.; Nussbaum, M.A.; Smets, M. Usability, User Acceptance, and Health Outcomes of Arm-Support Exoskeleton Use in Automotive Assembly: An 18-month Field Study. JOEM 2022, 64, 202–211. [Google Scholar] [CrossRef]
  16. Siedl, S.M.; Mara, M. What Drives Acceptance of Occupational Exoskeletons? Focus Group Insights from Workers in Food Retail and Corporate Logistics. Int. J. Hum.-Comput. Int. 2022, 39, 4080–4089. [Google Scholar] [CrossRef]
  17. Nasr, A.; Ferguson, S.; McPhee, J. Model-Based Design and Optimization of Passive Shoulder Exoskeletons. J. Comput. Nonlinear Dyn. 2022, 17, 051004. [Google Scholar] [CrossRef]
  18. Gantenbein, J.; Dittli, J.; Meyer, J.T.; Gassert, R.; Lambercy, O. Intention Detection Strategies for Robotic Upper-Limb Orthoses: A Scoping Review Considering Usability, Daily Life Application, and User Evaluation. Front. Neurorobot. 2022, 16, 815693. [Google Scholar] [CrossRef] [PubMed]
  19. Li, J.; Gu, X.; Qiu, S.; Zhou, X.; Cangelosi, A.; Loo, C.K.; Liu, X. A Survey of Wearable Lower Extremity Neurorehabilitation Exoskeleton: Sensing, Gait Dynamics, and Human–Robot Collaboration. IEEE Trans. Syst. Man. Cybern. Syst. 2024, 54, 3675–3693. [Google Scholar] [CrossRef]
  20. Luo, S.; Meng, Q.; Li, S.; Yu, H. Research of Intent Recognition in Rehabilitation Robots: A Systematic Review. Disabil. Rehabil. Assist. Technol. 2024, 19, 1307–1318. [Google Scholar] [CrossRef]
  21. Smith, G.B.; Belle, V.; Petrick, R.P.A. Intention Recognition with ProbLog. Front. Artif. Intell. 2022, 5, 806262. [Google Scholar] [CrossRef]
  22. Shibata, Y.; Tsuji, K.; Loo, C.K.; Sun, G.; Yamazaki, Y.; Hashimoto, T. Vision-based HRV Measurement for HRI Study. In Proceedings of the 2018 IEEE Symposium Series on Computational Intelligence (SSCI), Bengaluru, India, 18–21 November 2018; pp. 1779–1784. [Google Scholar] [CrossRef]
  23. Kothig, A.; Muñoz, J.; Akgun, S.A.; Aroyo, A.M.; Dautenhahn, K. Connecting Humans and Robots Using Physiological Signals—Closing-the-Loop in HRI. In Proceedings of the 2021 30th IEEE International Conference on Robot & Human Interactive Communication (RO-MAN), Vancouver, BC, Canada, 8–12 August 2021; pp. 735–742. [Google Scholar] [CrossRef]
  24. Mulla, D.M.; Keir, P.J. Neuromuscular control: From a biomechanist’s perspective. Front. Sports Act. Living 2023, 5, 1217009. [Google Scholar] [CrossRef]
  25. Asghar, A.; Jawaid Khan, S.; Azim, F.; Shakeel, C.S.; Hussain, A.; Niazi, I.K. Review on electromyography based intention for upper limb control using pattern recognition for human-machine interaction. Proc. Inst. Mech. Eng. H 2022, 236, 628–645. [Google Scholar] [CrossRef]
  26. Belardinelli, A. Gaze-Based Intention Estimation: Principles, Methodologies, and Applications in HRI. ACM Trans. Hum.-Robot Interact. 2024, 13, 1–30. [Google Scholar] [CrossRef]
  27. Canning, C.; Scheutz, M. Functional near-infrared spectroscopy in human-robot interaction. ACM Trans. Hum.-Robot Interact. 2013, 2, 62–84. [Google Scholar] [CrossRef]
  28. Heredia, J.; Lopes-Silva, E.; Cardinale, Y.; Diaz-Amado, J.; Dongo, I.; Graterol, W.; Aguilera, A. Adaptive Multimodal Emotion Detection Architecture for Social Robots. IEEE Access 2022, 10, 20727–20744. [Google Scholar] [CrossRef]
  29. Schröder, T.; Stewart, T.C.; Thagard, P. Intention, Emotion, and Action: A Neural Theory Based on Semantic Pointers. Cogn. Sci. 2014, 38, 851–880. [Google Scholar] [CrossRef] [PubMed]
  30. Cohen, A.O.; Dellarco, D.V.; Breiner, K.; Helion, C.; Heller, A.S.; Rahdar, A.; Pedersen, G.; Chein, J.; Dyke, J.P.; Galvan, A.; et al. The Impact of Emotional States on Cognitive Control Circuitry and Function. J. Cogn. Neurosci. 2016, 28, 446–459. [Google Scholar] [CrossRef]
  31. Admoni, H.; Scassellati, B. Social eye gaze in human-robot interaction: A review. ACM Trans. Hum.-Robot Interact. 2017, 6, 25–63. [Google Scholar] [CrossRef]
  32. Lübbert, A.; Göschl, F.; Krause, H.; Schneider, T.R.; Maye, A.; Engel, A.K. Socializing Sensorimotor Contingencies. Front. Hum. Neurosci. 2021, 15, 624610. [Google Scholar] [CrossRef]
  33. Bremers, A.; Pabst, A.; Parreira, M.T.; Ju, W. Using Social Cues to Recognize Task Failures for HRI: Overview, State-of-the-Art, and Future Directions. arXiv 2024, arXiv:2301.11972. [Google Scholar] [CrossRef]
  34. Wyer, R.S.; Srull, T.K. Human cognition in its social context. Psychol. Rev. 1986, 93, 322–359. [Google Scholar] [CrossRef] [PubMed]
  35. Niemann, F.; Lüdtke, S.; Bartelt, C.; Hompel, M. Context-Aware Human Activity Recognition in Industrial Processes. Sensors 2021, 22, 134. [Google Scholar] [CrossRef]
  36. Riboni, D.; Bettini, C. OWL 2 modeling and reasoning with complex human activities. Pervasive Mob. Comput. 2011, 7, 379–395. [Google Scholar] [CrossRef]
  37. Wei, D.; Chen, L.; Zhao, L.; Zhou, H.; Huang, B. A Vision-Based Measure of Environmental Effects on Inferring Human Intention During Human Robot Interaction. IEEE Sens. J. 2022, 22, 4246–4256. [Google Scholar] [CrossRef]
  38. Danner, U.N.; Aarts, H.; de Vries, N.K. Habit vs. intention in the prediction of future behaviour: The role of frequency, context stability and mental accessibility of past behaviour. Br. J. Soc. Psychol. 2008, 47, 245–265. [Google Scholar] [CrossRef]
  39. Wang, X.; Shen, H.; Yu, H.; Guo, J.; Wei, X. Hand and Arm Gesture-based Human-Robot Interaction: A Review. In Proceedings of the 6th International Conference on Algorithms, Computing and Systems, ICACS ’22, Larissa, Greece, 16–18 September 2023; Association for Computing Machinery: New York, NY, USA, 2023; pp. 1–7. [Google Scholar] [CrossRef]
  40. Shrinah, A.; Bahraini, M.S.; Khan, F.; Asif, S.; Lohse, N.; Eder, K. On the Design of Human-Robot Collaboration Gestures. arXiv 2024, arXiv:2402.19058. [Google Scholar] [CrossRef]
  41. Koochaki, F.; Najafizadeh, L. Predicting Intention Through Eye Gaze Patterns. In Proceedings of the 2018 IEEE Biomedical Circuits and Systems Conference (BioCAS), Cleveland, OH, USA, 17–19 October 2018; pp. 1–4. [Google Scholar] [CrossRef]
  42. Menolotto, M.; Komaris, D.S.; Tedesco, S.; O’Flynn, B.; Walsh, M. Motion Capture Technology in Industrial Applications: A Systematic Review. Sensors 2020, 20, 5687. [Google Scholar] [CrossRef]
  43. Ji, Y.; Yang, Y.; Shen, F.; Shen, H.T.; Li, X. A Survey of Human Action Analysis in HRI Applications. IEEE Trans. Circ. Syst. Video Technol. 2020, 30, 2114–2128. [Google Scholar] [CrossRef]
  44. Zunino, A.; Cavazza, J.; Koul, A.; Cavallo, A.; Becchio, C.; Murino, V. Predicting Human Intentions from Motion Cues Only: A 2D+3D Fusion Approach. In Proceedings of the 25th ACM International Conference on Multimedia, Mountain View, CA, USA, 27–31 October 2017; pp. 591–599. [Google Scholar] [CrossRef]
  45. Rinchi, O.; Ghazzai, H.; Alsharoa, A.; Massoud, Y. LiDAR Technology for Human Activity Recognition: Outlooks and Challenges. IEEE Internet Things Mag. 2023, 6, 143–150. [Google Scholar] [CrossRef]
  46. Farina, D.; Merletti, R.; Enoka, R.M. The extraction of neural strategies from the surface EMG. J. Appl. Physiol. 2004, 96, 1486–1495. [Google Scholar] [CrossRef] [PubMed]
  47. Bello, V.; Bodo, E.; Pizzurro, S.; Merlo, S. In Vivo Recognition of Vascular Structures by Near-Infrared Transillumination. Proceedings 2019, 42, 24. [Google Scholar] [CrossRef]
  48. Zhang, S.; Li, Y.; Zhang, S.; Shahabi, F.; Xia, S.; Deng, Y.; Alshurafa, N. Deep Learning in Human Activity Recognition with Wearable Sensors: A Review on Advances. Sensors 2022, 22, 1476. [Google Scholar] [CrossRef]
  49. Beck, M.; Pöppel, K.; Spanring, M.; Auer, A.; Prudnikova, O.; Kopp, M.; Klambauer, G.; Brandstetter, J.; Hochreiter, S. xLSTM: Extended Long Short-Term Memory. arXiv 2024, arXiv:2405.04517. [Google Scholar] [CrossRef]
  50. Zabaleta, H.; Bureau, M.; Eizmendi, G.; Olaiz, E.; Medina, J.; Perez, M. Exoskeleton design for functional rehabilitation in patients with neurological disorders and stroke. In Proceedings of the 2007 IEEE 10th International Conference on Rehabilitation Robotics, Noordwijk, The Netherlands, 13–15 June 2007; pp. 112–118. [Google Scholar] [CrossRef]
  51. Zhang, X.; Tricomi, E.; Missiroli, F.; Lotti, N.; Masia, L. Real-Time Assistive Control via IMU Locomotion Mode Detection in a Soft Exosuit: An Effective Approach to Enhance Walking Metabolic Efficiency. IEEE/ASME Trans. Mechatron. 2023, 29, 1797–1808. [Google Scholar] [CrossRef]
  52. Bandara, D.S.V.; Arata, J.; Kiguchi, K. A noninvasive brain-computer interface approach for predicting motion intention of activities of daily living tasks for an upper-limb wearable robot. Int. J. Adv. Robot. Syst. 2018, 15, 1729881418767310. [Google Scholar] [CrossRef]
  53. Buongiorno, D.; Barsotti, M.; Barone, F.; Bevilacqua, V.; Frisoli, A. A Linear Approach to Optimize an EMG-Driven Neuromusculoskeletal Model for Movement Intention Detection in Myo-Control: A Case Study on Shoulder and Elbow Joints. Front. Neurorobot. 2018, 12, e00074. [Google Scholar] [CrossRef]
  54. Chen, B.; Zhou, Y.; Chen, C.; Sayeed, Z.; Hu, J.; Qi, J.; Frush, T.; Goitz, H.; Hovorka, J.; Cheng, M.; et al. Volitional control of upper-limb exoskeleton empowered by EMG sensors and machine learning computing. Array 2023, 17, 100277. [Google Scholar] [CrossRef]
  55. Gandolla, M.; Luciani, B.; Pirovano, D.; Pedrocchi, A.; Braghin, F. A force-based human machine interface to drive a motorized upper limb exoskeleton. a pilot study. In Proceedings of the 2022 International Conference on Rehabilitation Robotics (ICORR), Rotterdam, The Netherlands, 25–29 July 2022; pp. 1–6. [Google Scholar] [CrossRef]
  56. Gao, Y.; Su, Y.; Dong, W.; Du, Z.; Wu, Y. Intention detection in upper limb kinematics rehabilitation using a GP-based control strategy. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, 28 September–2 October 2015; pp. 5032–5038. [Google Scholar] [CrossRef]
  57. Heo, U.; Feng, J.; Kim, S.J.; Kim, J. sEMG-Triggered Fast Assistance Strategy for a Pneumatic Back Support Exoskeleton. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 2175–2185. [Google Scholar] [CrossRef]
  58. Huo, W.; Huang, J.; Wang, Y.; Wu, J.; Cheng, L. Control of upper-limb power-assist exoskeleton based on motion intention recognition. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 2243–2248. [Google Scholar] [CrossRef]
  59. Irastorza-Landa, N.; Sarasola-Sanz, A.; López-Larraz, E.; Bibián, C.; Shiman, P.; Birbaumer, N.; Ramos-Murguialday, A. Design of continuous EMG classification approaches towards the control of a robotic exoskeleton in reaching movements. In Proceedings of the 2017 International Conference on Rehabilitation Robotics (ICORR), London, UK, 17–20 July 2017; pp. 128–133. [Google Scholar] [CrossRef]
  60. Khan, A.M.; Yun, D.W.; Zuhaib, K.M.; Iqbal, J.; Yan, R.J.; Khan, F.; Han, C. Estimation of Desired Motion Intention and Compliance Control for Upper Limb Assist Exoskeleton. Int. J. Control Autom. Syst. 2017, 15, 802–814. [Google Scholar] [CrossRef]
  61. Liao, Y.T.; Yang, H.; Lee, H.H.; Tanaka, E. Development and Evaluation of a Kinect-Based Motion Recognition System based on Kalman Filter for Upper-Limb Assistive Device. In Proceedings of the 2019 58th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE), Hiroshima, Japan, 10–13 September 2019; pp. 1621–1626. [Google Scholar] [CrossRef]
  62. Losey, D.P.; McDonald, C.G.; Battaglia, E.; O’Malley, M.K. A Review of Intent Detection, Arbitration, and Communication Aspects of Shared Control for Physical Human-Robot Interaction. Appl. Mech. Rev. 2018, 70, 010804. [Google Scholar] [CrossRef]
  63. Lotti, N.; Xiloyannis, M.; Durandau, G.; Galofaro, E.; Sanguineti, V.; Masia, L.; Sartori, M. Adaptive Model-Based Myoelectric Control for a Soft Wearable Arm Exosuit: A New Generation of Wearable Robot Control. IEEE Robot. Autom. Mag. 2020, 27, 43–53. [Google Scholar] [CrossRef]
  64. Lotti, N.; Xiloyannis, M.; Missiroli, F.; Bokranz, C.; Chiaradia, D.; Frisoli, A.; Riener, R.; Masia, L. Myoelectric or Force Control? A Comparative Study on a Soft Arm Exosuit. IEEE Trans. Robot. 2022, 38, 1363–1379. [Google Scholar] [CrossRef]
  65. Lu, L.; Wu, Q.; Chen, X.; Shao, Z.; Chen, B.; Wu, H. Development of a sEMG-based torque estimation control strategy for a soft elbow exoskeleton. Robot. Auton. Syst. 2019, 111, 88–98. [Google Scholar] [CrossRef]
  66. Novak, D.; Riener, R. Enhancing patient freedom in rehabilitation robotics using gaze-based intention detection. In Proceedings of the 2013 IEEE 13th International Conference on Rehabilitation Robotics (ICORR), Seattle, WA, USA, 24–26 June 2013; pp. 1–6. [Google Scholar] [CrossRef]
  67. Riener, R.; Novak, D. Movement Onset Detection and Target Estimation for Robot-Aided Arm Training. Automatisierungstechnik 2015, 63, 286–298. [Google Scholar] [CrossRef]
  68. Sedighi, P.; Li, X.; Tavakoli, M. EMG-Based Intention Detection Using Deep Learning for Shared Control in Upper-Limb Assistive Exoskeletons. IEEE Robot. Autom. Lett. 2024, 9, 41–48. [Google Scholar] [CrossRef]
  69. Sun, L.; An, H.; Ma, H.; Gao, J. Real-Time Human Intention Recognition of Multi-Joints Based on MYO. IEEE Access 2020, 8, 4235–4243. [Google Scholar] [CrossRef]
  70. Toro-Ossaba, A.; Tejada, J.C.; Rúa, S.; Núñez, J.D.; Peña, A. Myoelectric Model Reference Adaptive Control with Adaptive Kalman Filter for a soft elbow exoskeleton. Control Eng. Pract. 2024, 142, 105774. [Google Scholar] [CrossRef]
  71. Treussart, B.; Geffard, F.; Vignais, N.; Marin, F. Controlling an upper-limb exoskeleton by EMG signal while carrying unknown load. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 9107–9113. [Google Scholar] [CrossRef]
  72. Woo, J.H.; Sahithi, K.K.; Kim, S.T.; Choi, G.R.; Kim, B.S.; Shin, J.G.; Kim, S.H. Machine Learning Based Recognition of Elements in Lower-Limb Movement Sequence for Proactive Control of Exoskeletons to Assist Lifting. IEEE Access 2023, 11, 127107–127118. [Google Scholar] [CrossRef]
  73. Zarrin, R.S.; Zeiaee, A.; Langari, R.; Buchanan, J.J.; Robson, N. Towards autonomous ergonomic upper-limb exoskeletons: A computational approach for planning a human-like path. Robot. Auton. Syst. 2021, 145, 103843. [Google Scholar] [CrossRef]
  74. Zhang, L.; Liu, G.; Han, B.; Wang, Z.; Zhang, T.; Payandeh, S. sEMG Based Human Motion Intention Recognition. J. Robot. 2019, 2019, 3679174. [Google Scholar] [CrossRef]
  75. Zhou, Y.M.; Hohimer, C.; Proietti, T.; O’Neill, C.T.; Walsh, C.J. Kinematics-Based Control of an Inflatable Soft Wearable Robot for Assisting the Shoulder of Industrial Workers. IEEE Robot. Autom. Lett. 2021, 6, 2155–2162. [Google Scholar] [CrossRef]
  76. Lobo-Prat, J.; Kooren, P.N.; Stienen, A.H.; Herder, J.L.; Koopman, B.F.; Veltink, P.H. Non-invasive control interfaces for intention detection in active movement-assistive devices. J. Neuroeng. Rehabil. 2014, 11, 168. [Google Scholar] [CrossRef]
  77. Cavanagh, P.R.; Komi, P.V. Electromechanical delay in human skeletal muscle under concentric and eccentric contractions. Eur. J. Appl. Physiol. Occup. Physiol. 1979, 42, 159–163. [Google Scholar] [CrossRef]
  78. Bai, O.; Rathi, V.; Lin, P.; Huang, D.; Battapady, H.; Fei, D.Y.; Schneider, L.; Houdayer, E.; Chen, X.; Hallett, M. Prediction of human voluntary movement before it occurs. Clin. Neurophysiol. 2010, 122, 364. [Google Scholar] [CrossRef] [PubMed]
  79. Bonchek-Dokow, E.; Kaminka, G.A. Towards computational models of intention detection and intention prediction. Cogn. Syst. Res. 2014, 28, 44–79. [Google Scholar] [CrossRef]
  80. Tversky, A.; Kahneman, D.; Slovic, P. Judgment Under Uncertainty: Heuristics and Biases; Cambridge University Press: Cambridge, UK, 1982. [Google Scholar]
  81. Gallese, V.; Goldman, A. Mirror neurons and the simulation theory of mind-reading. Trends Cogn. Sci. 1998, 2, 493–501. [Google Scholar] [CrossRef] [PubMed]
  82. Laferriere, P.; Lemaire, E.D.; Chan, A.D.C. Surface Electromyographic Signals Using Dry Electrodes. IEEE Trans. Instrum. Meas. 2011, 60, 3259–3268. [Google Scholar] [CrossRef]
  83. Rescio, G.; Sciurti, E.; Giampetruzzi, L.; Carluccio, A.M.; Francioso, L.; Leone, A. Preliminary Study on Wearable Smart Socks with Hydrogel Electrodes for Surface Electromyography-Based Muscle Activity Assessment. Sensors 2025, 25, 1618. [Google Scholar] [CrossRef]
  84. Martins, J.; Cerqueira, S.M.; Catarino, A.W.; da Silva, A.F.; Rocha, A.M.; Vale, J.; Ângelo, M.; Santos, C.P. Integrating sEMG and IMU Sensors in an e-Textile Smart Vest for Forward Posture Monitoring: First Steps. Sensors 2024, 24, 4717. [Google Scholar] [CrossRef]
  85. Ju, B.; Mark, J.I.; Youn, S.; Ugale, P.; Sennik, B.; Adcock, B.; Mills, A.C. Feasibility assessment of textile electromyography sensors for a wearable telehealth biofeedback system. Wearable Technol. 2025, 6, e26. [Google Scholar] [CrossRef]
  86. Hossain, G.; Rahman, M.; Hossain, I.Z.; Khan, A. Wearable socks with single electrode triboelectric textile sensors for monitoring footsteps. Sens. Actuators A Phys. 2022, 333, 113316. [Google Scholar] [CrossRef]
  87. Khan, A.; Rashid, M.; Grabher, G.; Hossain, G. Autonomous Triboelectric Smart Textile Sensor for Vital Sign Monitoring. ACS Appl. Mater. Interfaces 2024, 16, 31807–31816. [Google Scholar] [CrossRef]
  88. Alam, T.; Saidane, F.; Faisal, A.A.; Khan, A.; Hossain, G. Smart-textile strain sensor for human joint monitoring. Sens. Actuators A Phys. 2022, 341, 113587. [Google Scholar] [CrossRef]
  89. Cortesi, S.; Crabolu, M.; Mekikis, P.V.; Bellusci, G.; Vogt, C.; Magno, M. A Proximity-Based Approach for Dynamically Matching Industrial Assets and Their Operators Using Low-Power IoT Devices. IEEE Internet Things J. 2025, 12, 3350–3362. [Google Scholar] [CrossRef]
  90. Al-Okby, M.F.R.; Junginger, S.; Roddelkopf, T.; Thurow, K. UWB-Based Real-Time Indoor Positioning Systems: A Comprehensive Review. Appl. Sci. 2024, 14, 1005. [Google Scholar] [CrossRef]
  91. Wu, X.; Liang, J.; Yu, Y.; Li, G.; Yen, G.G.; Yu, H. Embodied Perception, Interaction, and Cognition for Wearable Robotics: A Survey. IEEE Trans. Cogn. Dev. Syst. 2024, 1–18. [Google Scholar] [CrossRef]
  92. Pesenti, M.; Invernizzi, G.; Mazzella, J.; Bocciolone, M.; Pedrocchi, A.; Gandolla, M. IMU-based human activity recognition and payload classification for low-back exoskeletons. Sci. Rep. 2023, 13, 1184. [Google Scholar] [CrossRef]
  93. Niemann, F.; Reining, C.; Moya Rueda, F.; Nair, N.R.; Steffens, J.A.; Fink, G.A.; ten Hompel, M. LARa: Creating a Dataset for Human Activity Recognition in Logistics Using Semantic Attributes. Sensors 2020, 20, 4083. [Google Scholar] [CrossRef]
  94. Yoshimura, N.; Morales, J.; Maekawa, T.; Hara, T. OpenPack: A Large-Scale Dataset for Recognizing Packaging Works in IoT-Enabled Logistic Environments. In Proceedings of the 2024 IEEE International Conference on Pervasive Computing and Communications (PerCom), Biarritz, France, 11–15 March 2024; pp. 90–97. [Google Scholar] [CrossRef]
  95. Yan, H.; Tan, H.; Ding, Y.; Zhou, P.; Namboodiri, V.; Yang, Y. Language-centered Human Activity Recognition. arXiv 2024, arXiv:2410.00003. [Google Scholar] [CrossRef]
  96. Zhang, Z.; Yang, B.; Chen, X.; Shi, W.; Wang, H.; Luo, W.; Huang, J. MindEye-OmniAssist: A Gaze-Driven LLM-Enhanced Assistive Robot System for Implicit Intention Recognition and Task Execution. arXiv 2025, arXiv:2503.13250. [Google Scholar] [CrossRef]
  97. Ahmadi, E.; Wang, C. Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning. arXiv 2025, arXiv:2504.15263. [Google Scholar] [CrossRef]
  98. Zheng, T.; Glock, C.H.; Grosse, E.H. Opportunities for using eye tracking technology in manufacturing and logistics: Systematic literature review and research agenda. Comput. Ind. Eng. 2022, 171, 108444. [Google Scholar] [CrossRef]
  99. Pesenti, M.; Antonietti, A.; Gandolla, M.; Pedrocchi, A. Towards a Functional Performance Validation Standard for Industrial Low-Back Exoskeletons: State of the Art Review. Sensors 2021, 21, 808. [Google Scholar] [CrossRef]
  100. Ralfs, L.; Hoffmann, N.; Glitsch, U.; Heinrich, K.; Johns, J.; Weidner, R. Insights into evaluating and using industrial exoskeletons: Summary report, guideline, and lessons learned from the interdisciplinary project “Exo@Work”. Int. J. Ind. Ergon. 2023, 97, 103494. [Google Scholar] [CrossRef]
  101. Crea, S.; Beckerle, P.; Looze, M.D.; Pauw, K.D.; Grazi, L.; Kermavnar, T.; Masood, J.; O’Sullivan, L.W.; Pacifico, I.; Rodriguez-Guerrero, C.; et al. Occupational exoskeletons: A roadmap toward large-scale adoption. Methodology and challenges of bringing exoskeletons to workplaces. Wearable Technol. 2021, 2, e11. [Google Scholar] [CrossRef]
  102. Lieck, L.; Anyfantis, I.; Irastorza, X.; Munar, L.; Müller, B.; Milczarek, M.; Cockburn, W.; Smith, A. Occupational Safety and Health in Europe—State and Trends 2023; Publications Office of the European Union: Luxembourg, 2023. [Google Scholar] [CrossRef]
  103. Hollnagel, E.; Wears, R.L.; Braithwaite, J. From Safety-I to Safety-II: A White Paper. The resilient health care net: Published simultaneously by the University of Southern Denmark, University of Florida, USA, and Macquarie University, Australia. 2015. Available online: https://www.england.nhs.uk/signuptosafety/wp-content/uploads/sites/16/2015/10/safety-1-safety-2-whte-papr.pdf (accessed on 13 July 2025).
  104. Vanderhaegen, F. Erik Hollnagel: Safety-I and Safety-II, the past and future of safety management. Cogn. Tech. Work 2015, 17, 461–464. [Google Scholar] [CrossRef]
  105. Ray, A.; Kolekar, M.H. Transfer learning and its extensive appositeness in human activity recognition: A survey. Expert Syst. Appl. 2024, 240, 122538. [Google Scholar] [CrossRef]
  106. Brolin, A.; Thorvald, P.; Case, K. Experimental study of cognitive aspects affecting human performance in manual assembly. Prod. Manuf. Res. 2017, 5, 141–163. [Google Scholar] [CrossRef]
  107. Wood, W.; Mazar, A.; Neal, D.T. Habits and Goals in Human Behavior: Separate but Interacting Systems. Perspect. Psychol. Sci. 2022, 17, 590–605. [Google Scholar] [CrossRef] [PubMed]
  108. Li, H.; Zhang, Z.; Wang, W. HabitAction: A Video Dataset for Human Habitual Behavior Recognition. arXiv 2024, arXiv:2408.13463. [Google Scholar] [CrossRef]
  109. Siedl, S.M.; Mara, M.; Stiglbauer, B. The Role of Social Feedback in Technology Acceptance: A One-Week Diary Study with Exoskeleton Users at the Workplace. Int. J. Hum.-Comput. Interact. 2024, 13, 8210–8223. [Google Scholar] [CrossRef]
  110. Siedl, S.M.; Mara, M. Exoskeleton acceptance and its relationship to self-efficacy enhancement, perceived usefulness, and physical relief: A field study among logistics workers. Wearable Technol. 2021, 2, e10. [Google Scholar] [CrossRef] [PubMed]
  111. Davis, K.G.; Reid, C.R.; Rempel, D.D.; Treaster, D. Introduction to the human factors special issue on user-centered design for exoskeleton. Hum. Factors 2020, 62, 333–336. [Google Scholar] [CrossRef] [PubMed]
  112. Ralfs, L.; Hoffmann, N.; Weidner, R. Method and Test Course for the Evaluation of Industrial Exoskeletons. Appl. Sci. 2021, 11, 9614. [Google Scholar] [CrossRef]
  113. Kopp, V.; Holl, M.; Schalk, M.; Daub, U.; Bances, E.; García, B.; Schalk, I.; Siegert, J.; Schneider, U. Exoworkathlon: A prospective study approach for the evaluation of industrial exoskeletons. Wearable Technol. 2022, 3, e22. [Google Scholar] [CrossRef] [PubMed]
  114. Van Andel, S.; Mohr, M.; Schmidt, A.; Werner, I.; Federolf, P. Whole-body movement analysis using principal component analysis: What is the internal consistency between outcomes originating from the same movement simultaneously recorded with different measurement devices? Front. Bioeng. Biotechnol. 2022, 10, 1006670. [Google Scholar] [CrossRef] [PubMed]
Figure 1. An active upper limb exoskeleton is defined by its mechanical properties and support characteristic, which is the set of available support curves. The effectiveness of the support depends on the requirements from the support context, which encompasses the exoskeleton, the human user, and the task. Modeling intention enables one to interpret the current requirements of the support context by analyzing various signals to select an appropriate support curve and adjust it during runtime. The figure illustrates this process using a torque-controlled shoulder exoskeleton as an example.
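To make the support-curve concept from Figure 1 concrete, a minimal sketch follows; it is our own illustration, not a controller from any reviewed study, and the sinusoidal curve shape, gain values, and task labels are assumptions:

```python
import math

def gravity_support(angle_rad, gain):
    """Illustrative support torque as a function of shoulder elevation angle.

    The sine shape mimics gravity compensation: assistance peaks when the
    upper arm is horizontal and vanishes when it hangs down or points up.
    """
    return gain * math.sin(angle_rad)

# A "support characteristic" as a set of selectable support curves
# (gain values in Nm are hypothetical).
support_curves = {
    "low":  lambda q: gravity_support(q, gain=2.0),
    "high": lambda q: gravity_support(q, gain=6.0),
}

def select_support(task_context):
    """Interpret the support context (here reduced to a task label)
    and select an appropriate support curve at runtime."""
    return support_curves["high" if task_context == "overhead_work" else "low"]

curve = select_support("overhead_work")
print(round(curve(math.pi / 2), 2))  # peak assistance with the arm horizontal: 6.0
```

A real controller would condition the selection on richer context signals and adjust the gain continuously rather than switching between two discrete curves.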
Figure 2. Detailed flow chart of PRISMA literature retrieval including the eligibility criteria (adapted from [14] to additionally show results of title and abstract screening). The top row displays the search terms and filters used in the databases. The asterisk (*) indicates that wildcard search was used, if possible, to find results including variations of the word.
Figure 3. An overview of the distribution of methods included in this review, indicating the approaches used to predict intention and to control an exoskeleton. The numbers give the count of publications that used, applied, or addressed each item in a category. From left to right, the chain traces the process from sensor signals measured on the human body to the actuated limb, covering all intermediate steps addressed in this review.
Figure 4. Classification-based intention prediction typically uses threshold methods to detect activities (1 [52], 2 [50]), activity states (3 [55], 4 [75]), or motion onset (5 [57], 6 [64], 7 [65], 8 [71]). The support characteristic is then modeled as a constant or a function of joint position (dashed = torque, dotted = trajectory). Line endings within the stream indicate unspecified support characteristics.
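As an illustration of such threshold-based onset detection, the following sketch applies a moving-average envelope to a rectified EMG signal; the window length, threshold, and synthetic signal are our own assumptions, not parameters from the cited studies:

```python
def moving_average(signal, window):
    """Simple envelope: moving average over a rectified EMG signal."""
    out = []
    for i in range(len(signal)):
        chunk = signal[max(0, i - window + 1):i + 1]
        out.append(sum(abs(x) for x in chunk) / len(chunk))
    return out

def detect_onset(emg, threshold, window=3):
    """Return the first sample index where the envelope crosses the
    threshold, or None if no onset is detected."""
    for i, value in enumerate(moving_average(emg, window)):
        if value > threshold:
            return i
    return None

# Synthetic rectified EMG: rest, then a burst of muscle activity.
emg = [0.05, 0.04, 0.06, 0.05, 0.50, 0.80, 0.90, 0.85]
print(detect_onset(emg, threshold=0.2))  # onset detected at sample 4
```

In a deployed system, the crossing would trigger a transition in the support characteristic, e.g., switching from zero torque to a predefined assistance curve.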
Figure 5. Regression-based intention prediction is implemented using either model-free approaches (1 [51], 2 [70], 3 [68], 4 [69], 5 [72]) or model-based approaches, including biomechanical (6 [71], 7 [65], 8 [53], 9 [63]), mechanical (10 [64], 11 [7], 12 [60], 13 [61], 14 [56], 15 [73]), or interaction-based models (16 [55], 17 [58]). The support characteristic is then modeled as a constant or a function of joint position (dashed = torque, dotted = trajectory). Line endings within the stream indicate unspecified support characteristics.
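The model-free regression branch can be sketched with a toy example; the single EMG amplitude feature, the linear model, and the calibration data are our own simplifications, not the methods of the cited studies:

```python
def fit_linear(xs, ys):
    """Ordinary least squares for y = a*x + b with a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    a = cov / var
    b = mean_y - a * mean_x
    return a, b

# Toy calibration data: EMG amplitude feature -> measured elbow torque (Nm).
emg_feature = [0.1, 0.2, 0.3, 0.4, 0.5]
torque      = [1.0, 2.0, 3.0, 4.0, 5.0]

a, b = fit_linear(emg_feature, torque)
estimate = a * 0.25 + b  # online torque estimate for a new EMG sample
print(round(estimate, 2))  # 2.5
```

The reviewed studies replace this linear map with neural networks, Gaussian processes, or adaptive filters, and feed the estimated torque or trajectory into the exoskeleton's low-level controller.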
Figure 6. Overview of symbolic time windows to contextualize intention prediction in existing studies (1 [77], 2 [68], 3 [67], 4 [11], 5 [64], 6 [63], 7 [52], 8 [57], 9 [72], 10 [54], 11 [51], 12 [59], 13 [75]). The left vertical line indicates movement onset, serving as a key reference to distinguish between prediction of movement onset (left of the line) and classification and regression of movements (right of the line). Movement onset is defined as the point at which a measurable, predefined threshold is crossed (e.g., velocity or muscle activation). Some entries, such as EEG-based approaches, are placed approximately due to imprecise onset definitions in the original studies. To improve comparability between studies, EMG-based onset timings were adjusted by subtracting the electromechanical delay (indicated by red arrows), accounting for the lag between muscle activation and mechanical movement.
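The delay adjustment indicated by the red arrows amounts to a constant shift of EMG-based onset times; in this sketch the 50 ms delay value and the sample onsets are assumed placeholders (reported electromechanical delays are on the order of tens of milliseconds [77]):

```python
ELECTROMECHANICAL_DELAY_MS = 50  # assumed placeholder value

def align_onset(onset_ms, cue):
    """Map a reported onset time (ms relative to mechanical movement onset)
    into a comparable time base: EMG-based onsets are shifted earlier by the
    electromechanical delay, since muscle activation precedes movement."""
    if cue == "emg":
        return onset_ms - ELECTROMECHANICAL_DELAY_MS
    return onset_ms

# Hypothetical reported onsets (ms relative to movement onset) and their cues.
reported = [("emg", 0), ("kinematic", 120), ("emg", -100)]
aligned = [align_onset(t, cue) for cue, t in reported]
print(aligned)  # [-50, 120, -150]
```

Negative values correspond to predictions made before mechanical movement onset, i.e., the left side of the vertical reference line in Figure 6.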
Table 1. Research questions on intention prediction in upper-limb exoskeletons, with guiding sub-questions and corresponding analysis dimensions used for systematic categorization and information extraction.
Research Question | Analysis Dimensions
(RQ1) How?
  A. Sensing and Prediction Approach
    A1. What intention cues are used? | Intention Cue
    A2. What types of sensors are employed? | Sensors
    A3. What methods are applied to predict intention? | Method
  B. Temporal Context Consideration
    B1. What temporal aspects are considered? | Timing
    B2. How do sensors influence the temporal context? | Sensors
(RQ2) Why?
  C. Purpose and Targeting
    C1. What is the prediction objective? | Support Characteristic
    C2. Who are the target users? | Target Users
  D. Control and Evaluation
    D1. How is the intention prediction integrated into control? | Support Characteristic
    D2. How is the system evaluated? | Evaluation, Target Users
Table 2. Included publications and their contribution to the dimensions of analysis.
Reference | Intention Cue | Sensors | Method | Timing | Support Charac. | Target Users | Evaluation
Bandara et al. [52]---
Bi et al. [11]---
Buongiorno et al. [53]---
Chen et al. [54]---
Dinh et al. [7]-
Gandolla et al. [55]-
Gantenbein et al. [18]----
Gao et al. [56]--
Heo et al. [57]
Huo et al. [58]--
Irastorza-Landa et al. [59]---
Khan et al. [60]--
Liao et al. [61]-
Losey et al. [62]---
Lotti et al., 2020 [63]-
Lotti et al., 2022 [64]
Lu et al. [65]--
Novak and Riener [66]
Riener and Novak [67]
Sedighi et al. [68]-
Sun et al. [69]---
Toro-Ossaba et al. [70]-
Treussart et al. [71]--
Woo et al. [72]----
Zabaleta et al. [50]-
Zarrin et al. [73]--
Zhang et al., 2019 [74]----
Zhang et al., 2023 [51]-
Zhou et al. [75]-
Table 3. Recommendations for the development of intent-aware exoskeletons for specific fields of application. We distinguish four activity classes: force, varying, speed, and precision activities, each with specific requirements on torque or motion accuracy.
Activity Type | Description | Examples | Signal, Sensor | Control | References
Force | high weights, low velocity, small motion range | lifting objects, heavy tool handling, hammering | kinematic, mechanical cues; IMUs, embedded motor sensors, force/torque sensors | admittance control, torque control | [7,53,58,63,71]
Varying | different weights, different velocities, small motion range | construction work, commissioning, workplace setup | kinematic, mechanical cues; IMUs, embedded motor sensors, force/torque sensors, context information | torque control, Model Reference Adaptive control | [57,67,70,71,75]
Speed | low-middle weights, high velocity, high motion range | object sorting, transporting, drilling, cleaning | kinematic cues; IMUs, embedded motor sensors | position control | [51,55,68,73]
Precision | low weights, low velocity, small motion range | small part assembly, painting, welding | kinematic cues; IMUs, embedded motor sensors, camera | admittance control, position control | [55,58,60,61,68,73]
Icon legend: High Torque Accuracy; High Motion Accuracy.
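The recommendations in Table 3 could be encoded as a simple lookup when configuring an intent-aware controller; this sketch is our own, the identifier names are hypothetical, and the assignments mirror the table:

```python
# Activity classes from Table 3 mapped to recommended cues and control strategies.
SUPPORT_MODES = {
    "force":     {"cues": ["kinematic", "mechanical"],
                  "control": ["admittance", "torque"]},
    "varying":   {"cues": ["kinematic", "mechanical", "context"],
                  "control": ["torque", "model_reference_adaptive"]},
    "speed":     {"cues": ["kinematic"],
                  "control": ["position"]},
    "precision": {"cues": ["kinematic"],
                  "control": ["admittance", "position"]},
}

def recommend(activity):
    """Return the recommended cues and control strategies for an activity class."""
    if activity not in SUPPORT_MODES:
        raise ValueError(f"unknown activity class: {activity}")
    return SUPPORT_MODES[activity]

print(recommend("speed")["control"])  # ['position']
```

Such a table-driven configuration would let an integrator pick a support mode per workstation without re-tuning the underlying prediction pipeline.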

Share and Cite

MDPI and ACS Style

Hochreiter, D.; Schmermbeck, K.; Vazquez-Pufleau, M.; Ferscha, A. Intention Prediction for Active Upper-Limb Exoskeletons in Industrial Applications: A Systematic Literature Review. Sensors 2025, 25, 5225. https://doi.org/10.3390/s25175225

