1. Introduction
In recent years, remote-controlled robotic arms have been widely used in industrial production, hazardous environment detection, medical surgery, and other fields [
1,
2,
3]. It can replace manual labor to complete complex and dangerous operation tasks, which greatly improves work efficiency and safety. However, due to the complexity and variability of the teleoperation environment [
4,
5], the high degree of coupling and the nonlinearity of the robotic arm system [
6], and the long history of continuous work, the teleoperated robotic arm has a high incidence of failures in actual operation [
7], which seriously affects its reliability and work performance. The reliability of the remote-controlled robotic arm system depends on the performance of each robotic arm and its communication quality [
8]. Failure of a single arm can degrade the performance of its neighboring arms through communication, thus threatening the whole arm system [
9]. Therefore, there is an urgent need for a method that can detect and diagnose abnormalities in teleoperated robotic arm systems at the initial stage of failure. Efficient and accurate fault diagnosis techniques [
10] can not only buy valuable time for fault handling but also significantly reduce the system maintenance cost [
11], reduce the risk of serious accidents leading to the paralysis of the whole system [
12], and thus effectively guarantee the stable operation of the system and improve its reliability [
13]. In this paper, we follow the terminology and methodologies outlined by Isermann and Ballé in their seminal work on model-based fault detection and diagnosis [
14].
Most existing fault diagnosis methods for leader–follower robotic systems are based on analytical or observer-based models. These model-based approaches are known for their high accuracy and real-time performance under ideal conditions. For example, Quan et al. [
15] proposed a distributed fault detection and isolation scheme using sliding-mode observers to estimate system states based on both local and neighboring agent information. Kang et al. [
16] developed a fault diagnosis and fault-tolerant control strategy for multiple manipulators, which combined a learning observer with a radial basis function (RBF) neural network to approximate disturbances. Zhang et al. [
17] introduced a robust distributed detection and isolation method leveraging Linear Matrix Inequality (LMI) techniques, while Zhong et al. [
18] applied distributed PID control and observer-based techniques for fault detection within the context of multi-agent formation control.
However, despite their effectiveness, model-based methods depend heavily on the availability of accurate system models, which can be difficult to construct for complex, nonlinear, or dynamically changing systems [
19]. These methods are often sensitive to model uncertainties and external disturbances, which can compromise their robustness and generalization capability [
20]. Furthermore, the design and tuning of such systems typically require extensive expert knowledge and manual adjustment [
21], making them less efficient and more difficult to adapt to unseen or evolving fault conditions.
In contrast, data-driven approaches, particularly those using deep learning, have emerged as powerful alternatives for fault diagnosis tasks [
22,
23]. These methods can automatically learn features from raw sensor data, model complex nonlinear relationships, and generalize well across a variety of operating scenarios. By eliminating the need for explicit modeling and manual feature extraction, deep learning techniques improve robustness, scalability, and adaptability, making them especially suitable for remote-controlled robotic arm systems where uncertainty and real-time requirements are prevalent.
Techniques such as the convolutional neural network (CNN) and recurrent neural network (RNN) have been successfully applied to various fault diagnosis tasks. Liu et al. [
24] presented a deep-learning-based fault diagnosis method for leader–follower systems using CNN with multisensor data fusion. Xiao et al. [
25] proposed a fault diagnosis method for asynchronous motors using Long Short-Term Memory (LSTM) neural networks to directly process raw vibration data, achieving high accuracy without feature engineering. Additionally, some researchers tried to incorporate the fuzzy method into deep learning models to handle data with high uncertainty and noise. For example, Deng et al. [
26] proposed a hierarchical fused fuzzy deep neural network (FDNN) to enhance deep learning capability for data classification.
One significant challenge faced by typical deep neural networks during the backpropagation process is the vanishing gradient problem. This issue arises when gradients become exceedingly small during training, particularly in deep architectures, which hinders the effective updating of weights and slows down the learning process. The echo state network (ESN) is a novel type of recurrent neural network that forms part of reservoir computing models [
27]. In ESN, the reservoir has fixed, random weights. This unique structure eliminates the need for backpropagation through the recurrent layers, thereby mitigating the vanishing gradient problem.
Covariance matrix adaptation evolution strategy (CMA-ES) is a continuous optimization algorithm designed to solve non-linear and non-convex optimization problems, particularly effective in exploring complex, high-dimensional search spaces without requiring gradient information [
28]. The algorithm iteratively adapts a multivariate normal distribution, using the covariance matrix to guide the search towards promising regions, thereby enabling efficient exploration and exploitation of the hyperparameter space. Applying CMA-ES [
28] to optimize deep fuzzy echo state networks (DFESNs) [
29] helps address the complexity involved in tuning multiple hyperparameters that significantly influence model performance. The covariance matrix captures the dependencies among hyperparameters, generating well-suited candidates for improving model accuracy and efficiency.
In general, three challenging points remain in fault diagnosis for leader–follower robotic arm systems. First, model uncertainty due to the nonlinear and dynamic nature of the system poses significant difficulties. In addition, data variability and noise can obscure fault signals, complicating the diagnosis process. Finally, real-time diagnosis is critical as delays can result in significant performance degradation or system failures.
To address the aforementioned challenges, this paper proposes a fault diagnosis method for remote-controlled robotic arm systems based on a deep fuzzy echo state network (DFESN) [
29], whose hyperparameters are optimized using the covariance matrix adaptation evolution strategy (CMA-ES) [
28]. To overcome the difficulty of modeling nonlinear and dynamic systems, the proposed method does not rely on an explicit mathematical model of the robot but instead leverages the reservoir computing ability of the ESN to capture system dynamics directly from data. To tackle the challenge of data variability and noise, a fuzzy logic mechanism is incorporated into the ESN to enhance robustness and generalization. This allows the model to effectively identify fault patterns in noisy and uncertain teleoperation environments. The method is specifically designed to support online real-time fault diagnosis by maintaining low computational complexity, making it suitable for high-frequency feedback control in robotic systems. Compared to Zhang’s original DFESN framework [
29], which focused on general machinery fault diagnosis, our approach expands the application to a more complex and interactive leader–follower architecture. Moreover, we introduce CMA-ES for the simultaneous optimization of multiple key hyperparameters—including reservoir size, spectral radius, sparsity, and the number of fuzzy clusters—which significantly improves diagnostic accuracy and adaptability. The model is capable of identifying diverse fault types, including sensor faults, actuator faults, and communication faults, under continuous operational conditions without requiring task interruption. The use of a fuzzy ESN further enables efficient processing of smaller three-dimensional datasets, ensuring robust performance under practical operating conditions. Experimental results on a leader–follower robotic arm system validate the effectiveness of the proposed technique, demonstrating its ability to accurately diagnose faults in real time.
The remainder of this paper is organized as follows.
Section 2 provides a comprehensive overview of the system of remote-controlled robotic arms and details the types of faults considered in the study.
Section 3 presents the DFESN for fault classification, including an explanation of the architecture, training algorithm, the use of CMA-ES for optimizing hyperparameters, and an analysis of the computational complexity compared to other methods.
Section 4 showcases the experimental setup and results, validating the effectiveness of the proposed method. The paper concludes with
Section 5, which discusses the conclusion.
2. System of Remote-Controlled Robotic Arms
This section provides a technical overview of the remote-controlled robotic arm system under investigation. It introduces the main hardware components, including the leader (Geomagic Touch) and the follower (QArm), as well as the communication protocol used between them. Additionally, the types of faults considered in this study—sensor faults, actuator faults, and communication faults—are defined. This background serves as a foundation for understanding the fault diagnosis methodology proposed in the subsequent sections.
2.1. Leader: Geomagic Touch
The leading arm is Geomagic Touch, which is also named Omni Bundle. It is a six-joint robot, with three active and three passive joints, enabling the end effector to move across the workspace. Position measurements are obtained using digital encoders and potentiometers. Integrated with QUARC software within MATLAB/Simulink, it allows precise control and feedback.
2.2. Follower: QArm
The QArm, acting as the follower in this robotic system, has a roll–tilt–roll configuration with four joints: base, shoulder, elbow, and wrist. Controlled by QUARC software within MATLAB/Simulink, it ensures precise synchronization with the leader robot. While the QArm follows the three active joints of the leader, the wrist joint remains inactive. This setup enables the QArm to reliably replicate the leader’s actions, ensuring coordination in complex tasks.
2.3. Communication Protocol Between Leader and Follower
The communication protocol uses the QUARC software’s Stream modules, employing TCP/IP for reliable data transmission. Data packets are sent every 0.02 s, with typical transmission delays below 10 ms. Zero matrices are used to initialize the data buffer, ensuring correct data alignment and handling. This allows the QArm to accurately synchronize with the Geomagic Touch, maintaining real-time coordination and operational efficiency.
2.4. The Type of Faults
In this remote-controlled robotic arm system, the three joint angles of the leader,
,
, and
, are transmitted to the follower through the controller designed in [
30] to make corresponding joint angles of the follower,
,
, and
, synchronize with
,
, and
, respectively. Sensor faults, actuator faults, and communication faults are incorporated into the system as illustrated in
Figure 1.
and
, where
, are real outputs of the leader and follower, respectively.
In this paper, three typical faults of the remote-controlled robotic arm system are under consideration: sensor faults, actuator faults, and communication interruption (see
Figure 1). Sensor faults consist of leader sensor
faults and follower sensor faults
, which are modeled as follows:
An actuator fault is represented as follows:
where
is the input of the follower, and
is the rate of loss from 0 to 1. This type of fault is more representative of the robotic arm, such as actuator wear, energy supply issues, or transmission efficiency losses, where the output performance degrades in proportion to the input signal. Such faults are more commonly observed in robotic manipulator systems [
31,
32,
33].
The communication fault being investigated is classified as an interruption fault. When it occurs, the communication between the leader and the follower is disrupted, and they no longer work in coordination.
In the rest of the paper, an online fault diagnosis scheme is designed to identify the abovementioned faults, only through the follower output, that is , and .
4. Experiment
4.1. Fault Description
In this section, the DFESN is applied for fault diagnosis in remote-controlled robotic arms to validate its effectiveness. The experimental scenario is shown in
Figure 4. We first trained the DFESN by collecting data from the follower. The trained DFESN was then deployed to the remote-controlled robotic arm system for online fault diagnosis. The parameter of the sensor is shown in
Table 1. The categories of faults under investigation are shown in
Table 2.
4.2. Data Collection
Different types of faults are introduced into the remote-controlled robotic arm system, with data recorded from three sensors in the follower. Real-time data are collected via Simulink, with a sampling interval of 0.02 s. In each fault scenario, real-time data from the follower robotic arm were collected over a period of 30 s, resulting in 1500 data samples per scenario. In total, 13,500 data samples were obtained across 9 different fault scenarios. Among these, 80% of the datasets were randomly selected for training, while the remaining 20% were used for testing in the experiments.
4.3. Experimental Results
In this experiment, the covariance matrix adaptation evolution strategy (CMA-ES) is used to optimize the hyperparameters of the deep fuzzy echo state network (DFESN). The four hyperparameters evaluated are reservoir size, spectral radius, sparsity parameter, and the number of fuzzy clusters, with respective ranges of [100–1000], [0–1], [0–0.5], and [5–50]. In the CMA-ES algorithm, as four parameters have been chosen as decision variables, which means , and according to , is found to be 80, i.e., the population size is 80, the search step length is set at 0.1, and the maximum number of iterations is 200. The values for step length and the maximum number of iterations are determined based on initial experiments to balance computational efficiency and optimization effectiveness. The step length of 0.1 was chosen to ensure a moderate exploration rate in the search space. A higher step length may lead to larger jumps and potentially miss local optima, while a lower step length might result in slow convergence. The maximum number of iterations was set to 200 to provide sufficient exploration without incurring excessive computational time, given the complexity of the model.
Figure 5 illustrates the variation in Mean Squared Error (MSE) of the DFESN throughout the optimization process. Despite some fluctuations in the curve, an overall decreasing trend in the MSE can be observed. The MSE reached its minimum value at iteration 124, at which point the best hyperparameters are found to be a reservoir size of 500, a spectral radius of 0.1, a sparsity parameter of 0.01, and a number of fuzzy clusters of 30.
In addition, to assist users intending to apply this network to other datasets, we recommend starting with an optimization strategy similar to CMA-ES. Initial hyperparameter values can be set using the same ranges as described in this study—reservoir size [100–1000], spectral radius [0–1], sparsity [0–0.5], and number of fuzzy clusters [5–50]. These ranges have proven effective across a variety of fault types in this study and provide a reasonable starting point.
The test sets for different fault scenarios were input into the trained DFESN model, and the results are shown in
Table 3. DFESN model achieved high accuracy rates exceeding 90% across the 9 fault types. Notably, fault types 3, 4, 6, 8, and 9 achieved a classification accuracy of 100%.
The confusion matrix presented in
Figure 6 provides a detailed evaluation of the DFESN model’s performance in fault classification across different fault types for the leader–follower robotic arm system. Each coordinate from 1 to 9 represents a distinct fault type as described in
Table 3, including normal operation, communication faults, actuator faults, and different sensor faults. The DFESN model exhibits a high classification accuracy for most of the fault types, achieving a perfect accuracy of 1.00 for fault types 1, 3, 4, 5, 6, 8, and 9. This indicates that these faults were correctly identified without any misclassification, highlighting the model’s robustness in diagnosing these particular types of faults. Notably, the perfect classification for sensor faults (types 4, 5, 6, 8, and 9) shows that the model can effectively handle fault detection that involves complex sensor data.
However, for fault types 2 (communication fault) and 7 (leader sensor 2 fault), there are slight misclassifications, with accuracies of 0.97 and 0.99, respectively. These minor discrepancies suggest that there may be some overlap in the features representing these faults, possibly due to similarities in the observed data patterns under these conditions. The communication fault (type 2) seems particularly challenging for the model, possibly because interruptions in communication can manifest in a manner similar to other faults, making it harder for the model to distinguish.
To comprehensively evaluate the robustness of the DFESN model against various real-world conditions, we further analyzed its performance under different types of noise: Gaussian, Uniform, and Impulse noise.
Figure 7 shows the classification accuracy across different signal-to-noise ratios (SNRs). At lower SNR levels (e.g., 5 dB), classification accuracy is significantly impacted by noise, dropping to around 60% for all noise types, highlighting the sensitivity of the model in highly noisy environments. However, as SNR increases to 10 dB and above, classification accuracy improves substantially, surpassing 80%. At an SNR of 20 dB and higher, the accuracy approaches nearly 100%, indicating excellent robustness of the DFESN model under typical operational noise conditions. Moreover, the results show no significant difference among Gaussian, Uniform, and Impulse noise types at higher SNR levels, demonstrating that the DFESN method maintains strong performance irrespective of the noise distribution encountered. This analysis confirms the suitability of the proposed DFESN model for practical scenarios where varying noise conditions are common.
Table 4 shows the parameter configurations for each approach, which were established through parameter tuning experiments. This ensures that all parameters were optimally adjusted to enable a fair comparison across methods. To further validate the effectiveness of DFESN, we performed comparative studies with commonly used methods including ESN, CNN, LSTM, and BP neural networks. DFESN outperformed these methods in terms of both accuracy and real-time efficiency, as shown in the
Table 5. The training time of the ESN method was 1.45 s, which is the shortest of all of the methods. However, this method achieved the lowest accuracy. The LSTM, CNN, BP, and DFESN methods all achieve a mean accuracy exceeding 0.9, yet their training times vary significantly. The LSTM method has the longest training time at 320.59 s. Both the BP and DFESN methods achieve similarly high mean classification accuracies, with the DFESN method slightly outperforming at 0.9947 compared to BP’s 0.9925. Notably, the DFESN method has the shortest training time (4.75 s) and online diagnosis time (0.0105 s) among all the compared methods. This indicates that the proposed DFESN method is highly suitable for real-time fault diagnosis applications, providing both high accuracy and efficient processing times, which is crucial for handling the demands of real-time online diagnostics in complex systems.
4.4. Discussions
The aforementioned fuzzy ESN is a fundamental component of the DFESN illustrated in
Figure 3. In the DFESN model, the original input signal passes through multiple fuzzy ESNs for feature extraction, ultimately being classified by a final ESN.
The results of this study confirm the effectiveness of our fault diagnosis approach for single follower robotic arm, supporting our hypothesis that advanced diagnostic algorithms enhance fault detection accuracy. This aligns with previous research but goes further by demonstrating robustness in dynamic scenarios. However, the approach’s scalability to systems involving multiple follower arms remains untested, particularly in complex environments requiring high inter-arm coordination. Additionally, the current reliance on a limited set of data sources may not capture all potential fault conditions, potentially leading to missed detections or false positives. Although the presented method is validated for the fault types listed, it can potentially be extended to other fault categories. For instance, the same architecture could be applied to detect faults in additional sensors or actuators not explicitly considered in this study. The DFESN model’s design, which allows for efficient adaptation through hyperparameter optimization, suggests that it could maintain high performance under different conditions, provided that appropriate training data are available.
Although the current results demonstrate promising performance, several avenues for further research remain open. Future studies should assess the generalizability of the proposed DFESN approach across diverse robotic platforms and multi-arm collaborative systems. It is worth noting that this study focuses primarily on fault classification, specifically identifying the component and type of fault (sensor, actuator, or communication). Future research will aim to extend this framework to include fault detection, isolation, and identification (FDII), enabling not only localization but also root cause analysis of faults. Additionally, while our current work provides comparisons between DFESN and standard methods such as ESN, CNN, LSTM, and BP neural networks, subsequent research could further extend these comparisons by exploring performance under more varied operational scenarios, including complex and combined fault conditions. While this study focused primarily on common fault types encountered in remote-controlled robotic arms—such as sensor faults, actuator faults, and communication faults—future investigations should incorporate more challenging fault scenarios, including simultaneous multi-sensor failures, sophisticated actuator degradations, and variable communication delay scenarios, to comprehensively evaluate the robustness of the proposed method. Finally, although real-time performance was evaluated primarily using online diagnosis time metrics, further research is necessary to explicitly examine other critical real-time factors, such as communication delays and tracking latency, thus ensuring more thorough insights into operational reliability under realistic working conditions.
5. Conclusions
In this work, we proposed a novel fault diagnosis technique for remote-controlled robotic arm systems, utilizing a neural network inspired by the DFESN. The DFESN model addresses the significant challenges posed by the high uncertainty and noise typical in teleoperated environments. By effectively handling smaller, three-dimensional datasets, the DFESN model proves to be both efficient and accurate in real-time fault diagnosis. Furthermore, we applied the covariance matrix adaptation evolution strategy (CMA-ES) to optimize the hyperparameters of the DFESN model, significantly enhancing its performance. This optimization was crucial in adapting the DFESN model for the complex dynamics involved in robotic fault diagnosis in real-time environments.
Our extensive experiments on a leader–follower robotic arm system demonstrate that the DFESN model achieves superior performance compared to traditional methods. Specifically, it achieves the highest mean accuracy of 99.5% while maintaining the shortest training time (4.75 s) and online diagnosis time (0.0105 s). These results highlight the DFESN model’s robustness and its ability to perform real-time fault diagnosis, which is critical for maintaining the reliability and safety of robotic arm systems in various applications.
The high classification accuracy across different fault types, with perfect classification for several fault categories, underscores the model’s effectiveness in distinguishing between various faults. The minimal misclassification rates further validate the DFESN model’s potential in practical applications where prompt and accurate fault detection is crucial.