Condition Monitoring and Fault Prediction in PMSM Drives Using Machine Learning for Elevator Applications

Vlachou, Vasileios I.; Karakatsanis, Theoklitos S.; Efstathiou, Dimitrios E.; Vlachou, Eftychios I.; Vologiannidis, Stavros D.; Balaska, Vasiliki E.; Gasteratos, Antonios C.

doi:10.3390/machines13070549


by Vasileios I. Vlachou ¹, Theoklitos S. Karakatsanis ²,*, Dimitrios E. Efstathiou ³, Eftychios I. Vlachou ³, Stavros D. Vologiannidis ³, Vasiliki E. Balaska ² and Antonios C. Gasteratos ²

¹ School of Electrical and Computer Engineering, National Technical University of Athens, 15780 Athens, Greece

² Department of Production and Management Engineering, Democritus University of Thrace, 67100 Xanthi, Greece

³ Department of Computer, Informatics and Telecommunications, International Hellenic University, 62124 Serres, Greece

* Author to whom correspondence should be addressed.

Machines 2025, 13(7), 549; https://doi.org/10.3390/machines13070549

Submission received: 10 May 2025 / Revised: 14 June 2025 / Accepted: 23 June 2025 / Published: 24 June 2025

(This article belongs to the Section Electrical Machines and Drives)


Abstract

Elevators are a vital part of urban infrastructure, playing a key role in smart cities where increasing population density has driven the rise in taller buildings. As an essential means of vertical transportation, elevators have become an integral part of daily life, making their design, construction, and maintenance crucial to ensuring safety and compliance with evolving industry standards. The safety of elevator systems depends on the continuous monitoring and fault-free operation of Permanent Magnet Synchronous Motor (PMSM) drives, which are critical to their performance. Furthermore, the fault-free operation of PMSM drives reduces operating costs, increases service life, and improves reliability. The PMSM drive components may be susceptible to electrical, mechanical, and thermal faults that, if undetected, can lead to operational disruptions or safety risks. The integration of artificial intelligence and Internet of Things (IoT) technologies can enhance fault prediction, reducing downtime and improving efficiency. Ongoing challenges such as managing machine thermal load and developing more durable materials for PMSMs require the development of suitable models that are adapted to existing drive systems. The proposed framework for fault prediction is validated on a real residential elevator equipped with a PMSM drive. Multimodal signal data is processed through a Generative Adversarial Network (GAN)-enhanced Positive Unlabeled (PU) classifier and a Reinforcement Learning (RL)-based adaptive decision engine, enabling robust and scalable fault prediction in a non-intrusive fashion.

Keywords: PMSM drives; fault prediction; condition monitoring; deep machine learning; predictive maintenance; elevator systems; smart sensors

1. Introduction

As urban populations continue to grow, cities are increasingly expanding vertically, with high-rise buildings becoming a defining feature of modern urban landscapes. Elevators play a critical role in supporting this urbanization by enabling efficient vertical transportation. From residential and commercial complexes to industrial applications, elevators are an integral part of modern life [1]. Their reliability and efficiency directly affect the quality of life, productivity, and safety of urban populations. Consequently, the design, construction, and maintenance of elevators are areas of significant technological focus, particularly given the need to meet rigorous safety standards and ensure operational efficiency [2].

At the heart of most elevator systems are Permanent Magnet Synchronous Motors (PMSMs), which are favored for their energy efficiency, compact size, high power density, and precise torque control. Combined with sophisticated drives and control systems, PMSMs ensure smooth, safe, and efficient elevator operation [3]. However, like any complex electromechanical system, PMSMs and their associated drives are prone to faults, including electrical failures (e.g., winding faults, short circuits), mechanical issues (e.g., bearing wear, rotor misalignment), and thermal problems (e.g., overheating due to continuous operation). If not addressed promptly, these faults can lead to operational disruptions, costly downtime, and, in extreme cases, safety hazards for passengers [4].

Traditional maintenance strategies, such as scheduled inspections or reactive repairs, are often inadequate for the demands of modern elevator systems. These approaches are either inefficient, as they rely on fixed intervals rather than actual system conditions, or costly, as they address faults only after failures occur [5]. Predictive maintenance has emerged as a superior alternative, leveraging condition monitoring to assess the health of elevator systems in real-time and to predict potential failures before they occur. Central to this paradigm is the continuous monitoring of PMSM drives using data collected from smart sensors, which measure electrical, mechanical, and thermal parameters [6].

Despite its advantages, predictive maintenance presents significant technical challenges. The complexity of PMSM drives, coupled with the high-dimensional and noisy nature of sensor data, requires sophisticated techniques for data processing, fault diagnosis, and failure prediction [7]. Traditional analytical methods, while useful, often fail to capture the complex, nonlinear relationships in the data. This has led to the increasing adoption of machine learning (ML) techniques, which can uncover hidden patterns in large datasets and improve the accuracy of fault detection and prediction [8].

Machine learning techniques have shown considerable promise in predictive maintenance applications. Supervised learning models, such as Support Vector Machines (SVM), Random Forests, and Neural Networks, are commonly used for classifying operational states or predicting faults based on labeled data. However, these methods often require large amounts of labeled fault data, which can be challenging to obtain in real-world elevator systems due to the rarity of certain faults [9,10]. To overcome this limitation, unsupervised learning and hybrid methods, such as Positive Unlabeled (PU) learning, have been explored. PU learning, in particular, is advantageous for leveraging unlabeled operational data alongside a small subset of labeled fault cases, enabling robust fault detection even in data-scarce environments [11].

The success of machine learning models in predictive maintenance depends heavily on the quality of the data input. Raw sensor data is often noisy, high-dimensional, and contains irrelevant or redundant information [12]. To address these issues, several preprocessing steps are employed. Sensor readings, such as voltage, current, vibration, and temperature, often have varying ranges and units. Normalization ensures that all features are scaled to a consistent range, typically [0, 1] or [−1, 1], to improve model performance and convergence. Noisy data can obscure meaningful patterns, making accurate fault detection difficult [13]. Advanced filtering techniques, such as the Kalman Filter, Gaussian Filter, and Low-pass Filters, are used to reduce noise while preserving important signal characteristics [14,15].
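As a minimal illustration of the normalization step described above, the following Python sketch rescales two sensor channels with min–max scaling. The function name `min_max_scale` and the sample readings are hypothetical and are not taken from the study; the sketch only shows how channels with different units can be mapped to [0, 1] or [−1, 1].

```python
def min_max_scale(values, lo=0.0, hi=1.0):
    """Linearly rescale a sequence of readings to the interval [lo, hi]."""
    v_min, v_max = min(values), max(values)
    span = v_max - v_min
    if span == 0:
        # A constant channel carries no information; map it to the midpoint.
        return [(lo + hi) / 2.0 for _ in values]
    return [lo + (v - v_min) * (hi - lo) / span for v in values]


# Hypothetical readings: phase current in amperes, winding temperature in deg C.
current_a = [4.0, 6.0, 8.0, 10.0]
temperature_c = [35.0, 40.0, 55.0, 60.0]

current_01 = min_max_scale(current_a)                      # scaled to [0, 1]
temperature_pm1 = min_max_scale(temperature_c, -1.0, 1.0)  # scaled to [-1, 1]
```

In practice the minimum and maximum would be taken from the training data and reused unchanged at inference time, so that new readings are scaled consistently.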

The Kalman Filter is effective for dynamic systems: it provides an optimal estimate of the system state by combining noisy measurements with prior knowledge. The Gaussian Filter smooths the data by averaging values within a Gaussian window, reducing high-frequency noise [16]. Time domain features are basic statistical metrics, such as mean, standard deviation, skewness, and kurtosis, that capture signal properties. Frequency domain features use the Fourier Transform or Wavelet Transform to analyze signal frequencies, identifying harmonics or transient anomalies. Time–frequency features combine time and frequency analysis, using techniques such as the Short-Time Fourier Transform (STFT) or the Continuous Wavelet Transform (CWT), to capture transient fault characteristics [17].
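The time domain features listed above can be sketched in a few lines of Python. This is an illustrative example only: the function name `time_domain_features` is hypothetical, the moments are the biased (population) estimates, and the kurtosis is reported in its non-excess form (a Gaussian signal gives a value of about 3).

```python
import math


def time_domain_features(signal):
    """Return mean, standard deviation, skewness and kurtosis of one signal window."""
    n = len(signal)
    mean = sum(signal) / n
    # Central moments of order 2, 3 and 4 (population estimates).
    m2 = sum((x - mean) ** 2 for x in signal) / n
    m3 = sum((x - mean) ** 3 for x in signal) / n
    m4 = sum((x - mean) ** 4 for x in signal) / n
    std = math.sqrt(m2)
    # Standardised moments; a constant window has no defined shape, so return 0.
    skewness = m3 / m2 ** 1.5 if m2 > 0 else 0.0
    kurtosis = m4 / m2 ** 2 if m2 > 0 else 0.0
    return {"mean": mean, "std": std, "skewness": skewness, "kurtosis": kurtosis}


# Hypothetical window of five samples.
features = time_domain_features([1.0, 2.0, 3.0, 4.0, 5.0])
```

For this symmetric window the skewness is zero, which is what one would expect from a signal without a dominant one-sided impulse.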

This study introduces a machine learning-based framework for condition monitoring and fault prediction in PMSM drives, tailored specifically to elevator systems. By leveraging advanced signal processing and feature engineering, the framework addresses the challenges of noisy data and complex fault patterns. The methodology includes smart sensor integration with real-time electrical, mechanical, and thermal data collection. Data preprocessing using normalization and filtering techniques enhances signal quality and extracts characteristic averages from raw sensor data, improving fault detection accuracy. The development of Positive Unlabeled (PU) learning models and other advanced AI algorithms for classifying operational states and predicting potential faults has been shown to enhance the reliability and safety of elevator systems. This underscores the transformative role of machine learning in predictive maintenance.

The main objective of this paper is to integrate sensor data into appropriate machine learning models in order to estimate the probability of faults and to avoid critical errors through continuous monitoring of the PMSM state. Section 2 provides a literature review of the key elements of elevator systems, common PMSM failures, and existing machine learning techniques in predictive maintenance. Section 3 describes the proposed methodology framework, including data acquisition, preprocessing, feature extraction, and machine learning models. Section 4 presents the experimental results by evaluating the framework through simulations and experimental setups and highlighting its effectiveness in fault detection and prediction. Section 5 analyzes the results, discusses the limitations of the proposed methodology, and explores potential applications. Finally, Section 6 summarizes the strong points of the proposed methodology and suggests directions for further research. This structured approach aims to provide a comprehensive exploration of machine learning techniques for condition monitoring and fault prediction in PMSM drives, with a focus on enhancing the safety, efficiency, and reliability of elevator systems.

2. Advanced Techniques in Elevator Systems

Elevators play a central role in the vertical movement of passengers and loads in any building, operating as complex electromechanical systems. Their subsystems are divided into electrical, mechanical, and safety devices [18]. Using wireless networks, data can be collected, processed, and transmitted to the cloud. The introduction of automation systems contributes to safety in such drive systems by achieving the timely and efficient diagnosis of various types of faults [19]. The integration of machine learning techniques with elevator systems enables fault prediction and condition monitoring at unprecedented levels of accuracy. Among these techniques, Positive Unlabeled (PU) learning and Reinforcement Learning (RL) stand out for their potential to address challenges such as data scarcity and dynamic operational conditions. The subsequent sections provide an in-depth discussion of their applications, advantages, and challenges in predictive maintenance for elevator systems.

2.1. Fault Diagnosis and Condition Monitoring in PMSMs

The continuous use of elevators places significant stress on PMSMs, damaging the electrical, mechanical, and magnetic parts of the motor. In high-voltage cases, the highest percentage of faults (66%) is found in the stator windings, while in low-voltage cases (48 V to 380 V), the highest percentage of faults (41%) is due to bearing failures [20]. To address such faults, it is necessary to maintain the equipment at regular intervals. Reactive maintenance, also known as breakdown maintenance, involves repairs carried out after equipment failure [21].

The development of the Internet of Things (IoT) has contributed to predictive maintenance, as the integration of a new generation of smart sensors, in both the motor and the lift system, helps to collect large amounts of data in real time [22]. IoT-based elevator emergency notification systems allow technicians and maintenance companies to monitor elevator status in real time via smartphone and provide alerts in case of emergency [23].

Similar techniques have been applied with suitable efficient data reduction algorithms adapted to Industrial Internet of Things (IIoT) nodes for bearing fault diagnosis in PMSMs, where data is filtered and compressed [24]. Sensor specifications play a critical role for accurate data collection. A major problem identified is that the collected data may be very noisy due to external interference. Stochastic models have been proposed to address the noise problem, utilizing wavelets and thresholding techniques to extract reliable features; however, these techniques significantly increase the complexity [25].

The accuracy of the measurements is determined by the specifications of the sensors meeting international standards such as ISO 20816-1 [26]. The most typical analysis and fault detection techniques for monitoring the operational status of the PMSMs are focused on vibration, power, thermal analysis, stator windings with current analysis, torque, and speed measurements [27]. Recent research has highlighted the importance of using multimodal data for elevators by measuring vibration, sound intensity, and thermal imaging. The use of these methods provides a comprehensive understanding of the technical condition of the device [28].

Several research efforts have focused on the development and application of fault diagnosis methods. As far as electrical faults are concerned, the most typical faults are located in the stator windings (open circuit and short circuit) [29]. To detect internal stator coil faults at an early stage, an improved method based on advanced signal processing was proposed, which relied on Motor Current Signature Analysis (MCSA) combined with a transformation technique to obtain time–frequency information with improved resolution [30].

Zhang et al. reported high accuracy and time efficiency in predicting electrical problems. The study focused on three states of current, identifying changes in the deviations between predicted and measured currents, locating the faulty phase, and concluding with the diagnosis of the fault type [31]. In addition to the current signal, the noise signal was also leveraged [32].

An open circuit diagnostic method based on a mixed-logic dynamics model was applied to multiphase PMSMs. This methodology was found to be highly reliable for both single and multiple open circuit faults, achieving a reduced diagnosis time of 60% of the current cycle [33]. The most typical cause of electrical failures is insulation faults. These faults can be divided into three types: inter-turn short circuit (ITSC), phase-to-phase short circuit, and earth short circuit [34].

The most common method of diagnosing such faults is frequency domain signal analysis. The main advantages that make it the most widely used technique are related to its low computational cost and the real-time condition monitoring capability of the machine [35]. One of the main disadvantages of frequency domain signal processing is that it assumes signal stationarity, which can limit its effectiveness in modern electric motor drive systems, where signals often exhibit non-stationary behavior [36].

A comparison of similar methodologies, namely the Short-Time Fourier Transform (STFT), Undecimated Discrete Wavelet Transform (UDWT), Wigner–Ville Distribution (WVD), and Choi–Williams Distribution (CWD), demonstrated the accuracy and effectiveness of the UDWT method in diagnosing faults occurring during transient operating conditions [37].

Mechanical problems account for the highest percentage of failures in PMSMs (40–50%) and stem from the lifetime of the mechanical equipment, improper maintenance, poor lubrication, or incorrect design and assembly. These faults mainly include bearing faults, shaft misalignment, imbalance, and demagnetization [38]. Bearing wear can cause shaft problems, increased frictional losses, and high noise creating magnetic flux inside the motor, which is a major cause of stator winding insulation failure [39].

To identify eccentricity failures, the harmonics of the stator current are analyzed, and the indices of standard deviation, kurtosis, skewness, peak factor, purity, and shape factor are extracted [40]. Time domain methods use statistical data compared to specific fault samples in already damaged bearings to draw conclusions about wear localization. However, fault diagnosis is impossible under low load conditions [41,42].
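Two of the indices named above, the peak factor and the shape factor, can be sketched as follows. The definitions used here are the common textbook ones (peak factor as peak amplitude over RMS, shape factor as RMS over mean absolute value) and are an assumption, since the cited works may define them differently; the function name `shape_indices` and the sample window are hypothetical.

```python
import math


def shape_indices(signal):
    """Return RMS, peak factor and shape factor of one stator-current window."""
    n = len(signal)
    rms = math.sqrt(sum(x * x for x in signal) / n)
    if rms == 0:
        raise ValueError("indices are undefined for an all-zero window")
    peak = max(abs(x) for x in signal)
    mean_abs = sum(abs(x) for x in signal) / n
    return {
        "rms": rms,
        "peak_factor": peak / rms,       # large values point to impulsive content
        "shape_factor": rms / mean_abs,  # depends on waveform shape, not on amplitude
    }


# Hypothetical window with one dominant excursion.
indices = shape_indices([3.0, -4.0, 0.0, 0.0])
```

Both ratios are dimensionless, so they can be compared across operating points with different current amplitudes.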

The most widespread technique for fault detection in dynamic drive situations is the zoom Fast Fourier Transform (FFT). The main signal considered in this case is the mechanical vibration, and the method is distinguished by its high performance in accurately diagnosing the condition of bearings and detecting faults. The damage indicators are the characteristic harmonic frequencies that occur depending on the specific mechanical damage and operating speed [43,44].
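The idea of inspecting only a narrow band around an expected fault harmonic can be illustrated with a direct evaluation of the discrete Fourier transform on that band. This is a simplified stand-in and not the zoom FFT algorithm itself, which typically relies on frequency shifting, low-pass filtering, and decimation. The function name `band_spectrum`, the sampling rate, and the 87 Hz "fault" component are hypothetical values chosen for the example.

```python
import cmath
import math


def band_spectrum(signal, fs, f_lo, f_hi, df):
    """Return (frequencies, amplitudes) of the DFT evaluated only on [f_lo, f_hi]."""
    n = len(signal)
    steps = int(round((f_hi - f_lo) / df))
    freqs, amps = [], []
    for k in range(steps + 1):
        f = f_lo + k * df
        acc = sum(x * cmath.exp(-2j * math.pi * f * i / fs)
                  for i, x in enumerate(signal))
        freqs.append(f)
        amps.append(2.0 * abs(acc) / n)  # single-sided amplitude estimate
    return freqs, amps


# Synthetic vibration: a 50 Hz component plus a weak 87 Hz component
# standing in for a speed-dependent bearing harmonic (assumed values).
fs, n = 1000.0, 1000
vibration = [math.sin(2 * math.pi * 50.0 * i / fs)
             + 0.2 * math.sin(2 * math.pi * 87.0 * i / fs) for i in range(n)]

freqs, amps = band_spectrum(vibration, fs, 80.0, 95.0, 0.5)
peak_index = max(range(len(amps)), key=amps.__getitem__)
peak_freq, peak_amp = freqs[peak_index], amps[peak_index]
```

Restricting the evaluation to the band of interest keeps the weak harmonic from being overlooked next to the much stronger 50 Hz component.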

Other frequency domain methodologies based on noise and vibration signal analysis, cepstrum analysis, and the extended Park's vector approach (EPVA) were also proposed. A large number of bearing failure symptoms can be effectively detected by stray flux analysis with additional sensors fitted, but at the same time, it can be applied to the motor with magnetic asymmetry [45].

By analyzing the motor rotation speed signal, damaged bearings can be detected even at low rotation speeds, which is an advantage over the spectral kurtosis of the speed signal and the three current-based methods [46]. The largest percentage (about 80%) of mechanical faults is due to eccentricity, that is, non-uniformity in the distribution of the air gap between the stator and rotor. Eccentricity is divided into three categories: static, dynamic, and mixed [47].

2.2. Preprocessing Signals and Deep Learning Methodologies

Advances in elevator technology have led to the widespread adoption of Permanent Magnet Synchronous Motors (PMSMs), which are favored for their high power density, superior efficiency, and precise torque control. Unlike traditional induction motors, PMSMs offer reduced energy consumption and smoother operation, making them the preferred choice in modern elevator systems [48].

Recent developments in power electronics, such as vector control and direct torque control, have further optimized PMSM performance, improving responsiveness and operational safety [49]. However, the increased complexity of these systems has also introduced new challenges in maintenance, particularly in detecting and diagnosing faults that could compromise safety and functionality. Electrical faults, mechanical failures, and thermal issues are the primary concerns that necessitate the development of advanced diagnostic and monitoring systems [50].

Research has highlighted the critical role of predictive maintenance in addressing these challenges. PMSMs offer significant energy savings compared to traditional induction motors, particularly in high-rise applications, due to their superior torque characteristics and compact design [51,52]. Similarly, advancements in control mechanisms, such as direct torque control, have enhanced the fault tolerance of PMSMs in dynamic elevator systems [53].

Additionally, other studies in sustainability further emphasize the importance of PMSMs in elevator systems for improved performance [38] and discuss high-order sliding mode magnetometers for detecting excitation faults in elevator traction motors, highlighting their contribution to improving fault detection accuracy. Studies of the lifecycle benefits of PMSMs report that these motors reduce operational costs by up to 20% over a 15-year period [48].

Wang et al. also emphasize the importance of multi-sensor integration in PMSM applications, enabling enhanced diagnostics and greater reliability [54]. The reliability and safety of elevator systems depend on effective fault diagnosis and condition monitoring. Fault diagnosis involves identifying and classifying issues in real-time, whereas condition monitoring focuses on tracking the health of components over time to predict potential failures. PMSM drives, being critical components of elevator systems, are prone to faults that can disrupt operations and compromise safety. These faults can be broadly categorized into electrical, mechanical, and thermal issues.

Electrical faults, such as stator winding short circuits, rotor demagnetization, and power electronics failures, are among the most common issues. Mechanical faults (including bearing wear, rotor misalignment, and excessive vibrations) are equally significant, often leading to operational inefficiencies. Thermal faults, resulting from inadequate cooling or prolonged operation, can exacerbate these problems by accelerating component degradation.

Various diagnostic and monitoring techniques have been developed to address these challenges. Vibration analysis, for instance, utilizes sensors to detect anomalies in mechanical components by analyzing vibration patterns. Thermal imaging [55] provides a non-invasive method to identify overheating in electrical and mechanical parts, while electrical signal monitoring analyzes current and voltage waveforms to detect abnormalities [56,57]. These methods rely on data collected from smart sensors, such as accelerometers, infrared cameras, and current sensors, which provide the real-time information necessary for effective monitoring and diagnostics [58].

Recent studies have explored how these techniques complement each other in enhancing system reliability. For instance, vibration analysis has proven highly effective for mechanical fault detection but may not capture the subtleties of thermal or electrical issues. Combining it with thermal imaging and electrical signal monitoring allows for a more comprehensive diagnostic approach, as evidenced by research from Wang et al., which highlighted the synergistic benefits of multi-sensor data fusion in elevator fault diagnosis [54].

Condition monitoring, beyond fault detection, encompasses the continuous assessment of system performance under varying operational conditions. Advanced methods integrate historical data with real-time measurements to predict trends and flag deviations from normal behavior. This approach enables proactive interventions, reducing unplanned downtime and improving safety. A notable review article discusses the challenges faced in predictive maintenance (PdM), particularly focusing on the complexities of implementing PdM systems. It examines the difficulties in data collection, analysis, and integration, and highlights the challenges related to sensor limitations, data quality, and the scalability of predictive models. It also emphasizes the need for advanced technologies, such as machine learning and artificial intelligence, to address these challenges and improve the reliability and efficiency of PdM systems. Key obstacles include the management of large datasets, the adaptation of models to different environments, and the interpretation of results in real-time operations [59].

Data collected from sensors in elevator systems is often noisy and requires preprocessing to extract meaningful insights. Effective data processing involves filtering, normalization, and feature extraction, each of which plays a critical role in enhancing the quality and usability of sensor data [60,61].

The Kalman Filter is a widely used technique for dynamic state estimation, particularly in noisy environments [62,63]. By combining measurements and prior knowledge, it provides optimal estimates of system states, making it highly effective for real-time applications. However, its reliance on accurate system modeling and its computational intensity can limit its effectiveness in certain scenarios. In contrast, the Gaussian Filter offers a simpler approach to noise reduction by averaging values within a Gaussian window, making it suitable for steady-state conditions. While efficient for reducing high-frequency noise, it may blur sharp transitions, limiting its effectiveness in dynamic environments. Low-pass Filters, which remove high-frequency components, are commonly used in vibration and current signal analysis. Though straightforward and effective, they can distort important signal details in high-frequency ranges, posing challenges in applications requiring high-resolution data [64]. Adaptive filtering methods have further enhanced the utility of these techniques [65]. For example, the application of Extended Kalman Filters (EKFs) in real-time systems accounts for nonlinearities, which is a significant advantage in dynamic elevator environments [66]. Kim et al. introduce a fault diagnosis algorithm for Permanent Magnet Synchronous Motors (PMSMs) to identify stator open-phase and inter-turn short circuit faults. The methodology integrates an EKF for phase current estimation and a Multiple Model (MM) filter for fault classification [14].
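The predict and update cycle of the Kalman Filter described above can be sketched for a single scalar quantity. The example assumes a random-walk state model (the quantity is nearly constant between samples), which is a simplification and not the EKF or Multiple Model filter of the cited works; the function name `kalman_1d`, the noise variances `q` and `r`, and the measurement sequence are hypothetical.

```python
def kalman_1d(measurements, q, r, x0, p0):
    """Scalar Kalman filter with a random-walk state model.

    q: process noise variance, r: measurement noise variance,
    x0, p0: initial state estimate and its variance.
    """
    x, p = x0, p0
    estimates = []
    for z in measurements:
        # Predict: the state is assumed unchanged, its uncertainty grows by q.
        p = p + q
        # Update: blend prediction and measurement using the Kalman gain.
        gain = p / (p + r)
        x = x + gain * (z - x)
        p = (1.0 - gain) * p
        estimates.append(x)
    return estimates


# Hypothetical temperature readings: true value 50 with alternating +/-2 noise.
measurements = [52.0 if i % 2 == 0 else 48.0 for i in range(40)]
estimates = kalman_1d(measurements, q=1e-4, r=4.0, x0=measurements[0], p0=1.0)
```

A small `q` relative to `r` makes the filter trust its prediction more than each new reading, which smooths the output at the cost of a slower response to genuine changes.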

Normalization is another crucial preprocessing step, ensuring the consistent scaling of sensor data to improve the performance of machine learning models. Methods such as Min–Max Scaling and Z-Score Normalization standardize data ranges and distributions, reducing variability that could negatively impact model training and inference. Feature extraction, on the other hand, transforms raw sensor data into meaningful representations that capture relevant patterns. Time domain features, such as the Root Mean Square (RMS) value, skewness, and kurtosis, provide basic statistical insights, while frequency domain features derived from Fourier Transform reveal harmonics and spectral energy indicative of faults. Time-frequency features, obtained through Wavelet Transform, combine temporal and spectral analysis to detect transient anomalies, offering a comprehensive view of system behavior [13,15,25].
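Z-Score Normalization, mentioned above alongside Min–Max Scaling, can be sketched as follows. The function name `z_score` and the sample values are hypothetical, and the population standard deviation is used as an assumption; a sample-based estimate would differ slightly for short windows.

```python
import math


def z_score(values):
    """Standardise a channel to zero mean and unit (population) standard deviation."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    if std == 0:
        # A constant channel has no spread; return zeros instead of dividing by zero.
        return [0.0] * n
    return [(v - mean) / std for v in values]


# Hypothetical vibration amplitudes in arbitrary units.
standardised = z_score([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])
```

Unlike min–max scaling, the result is not bounded to a fixed interval, which makes it less sensitive to a single extreme reading.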

Recent advances in signal processing have further improved fault detection capabilities. Multi-resolution analysis techniques, such as Empirical Mode Decomposition (EMD), have shown promise in decomposing complex signals into simpler components, facilitating the identification of subtle fault signatures [67,68]. Similarly, adaptive filtering techniques have enabled dynamic noise reduction, tailoring the filtering process to varying operational conditions.

Machine learning (ML) has emerged as a transformative approach in predictive maintenance, enabling automated fault detection and prediction. By analyzing large volumes of sensor data, ML techniques can identify patterns indicative of faults, even in their early stages. These techniques can be broadly categorized into supervised learning, unsupervised learning, and hybrid approaches.

Supervised learning methods, such as Support Vector Machines (SVMs) and Neural Networks (NNs), rely on labeled datasets to classify faults. These models are highly accurate and capable of handling complex patterns, but their dependence on extensive labeled data can be a limitation in real-world applications. Pietrzak et al. [69] focus on detecting and classifying inter-turn short circuits in PMSMs using a spectral analysis of stator phase current signals for feature extraction and SVMs for fault classification in PMSM drives, achieving high accuracy with limited feature sets. Additionally, Dou et al. [70] explore the application of SVMs in fault diagnosis for key elevator structures. By modeling elevator faults and employing SVMs for fault detection, separation, classification, and evaluation, the research aims to enhance maintenance strategies. The approach ensures the accurate and timely identification of faults, reducing maintenance costs and improving the reliability of elevator operations. Similarly, Mishra et al. [25,71] highlighted the effectiveness of deep Neural Networks (DNNs) in capturing intricate fault patterns.

Unsupervised learning techniques, such as clustering and anomaly detection, are particularly valuable when labeled data is scarce. Clustering algorithms, including K-means, Hierarchical Clustering, Density-Based Spatial Clustering of Applications with Noise (DBSCAN), Gaussian Mixture Models (GMMs), and Self-Organizing Maps (SOMs), are used to group data points based on their similarities or distances [72]. Unlike supervised learning, which relies on labeled data to train models, clustering methods do not require labels and instead identify natural groupings within the data using distance metrics like Euclidean distance or cosine similarity. These techniques analyze patterns in the data to group similar instances together and detect deviations from normal operating conditions without the need for predefined fault labels [73]. Chen et al. [74] present a fault diagnosis approach for elevators based on artificial intelligence techniques. The study explores various methods, particularly machine learning and artificial Neural Networks, and employs clustering algorithms to detect anomalies in elevator system performance, highlighting the potential of unsupervised approaches in real-time applications.
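A distance-based use of clustering for anomaly detection, as outlined above, can be sketched with a small K-means implementation. The example is illustrative only: the two-dimensional feature vectors, the evenly spaced centroid initialization, and the threshold of three times the largest training distance are all assumptions made for the sketch, and `k_means`, `anomaly_score`, and `is_anomalous` are hypothetical names.

```python
import math


def k_means(points, k, iterations=20):
    """Plain K-means with Euclidean distance and a deterministic initialisation."""
    step = len(points) // k
    centroids = [points[j * step] for j in range(k)]  # k evenly spaced samples
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: math.dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Move each centroid to the mean of its members (keep it if empty).
        centroids = [
            tuple(sum(axis) / len(members) for axis in zip(*members))
            if members else centroids[j]
            for j, members in enumerate(clusters)
        ]
    return centroids


def anomaly_score(point, centroids):
    """Distance from a sample to the nearest cluster centre."""
    return min(math.dist(point, c) for c in centroids)


# Hypothetical normalised features (e.g. RMS current, RMS vibration)
# recorded in two normal operating modes.
healthy = [(0.20, 0.21), (0.22, 0.19), (0.19, 0.20), (0.21, 0.22),
           (0.60, 0.30), (0.62, 0.31), (0.59, 0.29), (0.61, 0.30)]
centroids = k_means(healthy, k=2)
threshold = 3.0 * max(anomaly_score(p, centroids) for p in healthy)


def is_anomalous(point):
    return anomaly_score(point, centroids) > threshold
```

No fault labels are needed: a sample is flagged only because it lies far from every cluster learned from normal operation.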

Hybrid methods, such as Positive Unlabeled (PU) learning, are particularly effective in scenarios where labeled fault data is scarce but large amounts of unlabeled operational data are available. This approach relies on identifying reliable negative samples within the unlabeled dataset, leveraging both labeled and unlabeled data to train classifiers. Park et al. [75] explored the use of PU learning for high-dimensional data, proposing methods to address challenges in such datasets effectively.
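The two-step idea described above (first select reliable negatives from the unlabeled data, then train a classifier on positives versus those negatives) can be sketched as follows. This is a deliberately simple stand-in that uses distances to class centroids; it is not the method of Park et al. nor the GAN-enhanced PU classifier proposed in this work. All function names and data points are hypothetical, and the labeled positives are assumed to be fault cases.

```python
import math


def centroid(points):
    return tuple(sum(axis) / len(points) for axis in zip(*points))


def reliable_negatives(positives, unlabeled, fraction=0.5):
    """Step 1: keep the unlabeled samples farthest from the positive centroid."""
    c_pos = centroid(positives)
    ranked = sorted(unlabeled, key=lambda p: math.dist(p, c_pos), reverse=True)
    return ranked[:max(1, int(len(ranked) * fraction))]


def train_pu(positives, unlabeled, fraction=0.5):
    """Step 2: fit a nearest-centroid classifier on positives vs. reliable negatives."""
    negatives = reliable_negatives(positives, unlabeled, fraction)
    return centroid(positives), centroid(negatives)


def predict(model, sample):
    """Return True when the sample is closer to the fault centroid."""
    c_pos, c_neg = model
    return math.dist(sample, c_pos) < math.dist(sample, c_neg)


# Hypothetical feature vectors: a few labeled faults, and unlabeled data that is
# mostly healthy but hides two unlabeled faults.
positives = [(0.80, 0.90), (0.85, 0.88), (0.78, 0.92)]
unlabeled = [(0.20, 0.20), (0.22, 0.18), (0.18, 0.21), (0.21, 0.23),
             (0.19, 0.19), (0.23, 0.20), (0.82, 0.86), (0.79, 0.91)]
model = train_pu(positives, unlabeled)
```

The `fraction` parameter controls how conservative the first step is: a smaller value yields fewer but more trustworthy negatives.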

Deep learning models, particularly Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), have shown significant potential in fault prediction. CNNs excel in feature extraction from high-dimensional sensor data, while RNNs are well-suited for capturing temporal dependencies in sequential data. Recent studies have demonstrated the integration of these models in hybrid architectures, enhancing their ability to handle diverse fault scenarios and operational conditions.

The integration of Reinforcement Learning (RL) into predictive maintenance offers a novel approach for optimizing maintenance strategies in real time. RL improves decision-making by learning from interactions with the environment, dynamically adjusting processes based on feedback. Through a balance of exploration and exploitation, RL agents adapt to changing operational conditions, minimizing unplanned downtime and enhancing system reliability. In elevator systems, RL algorithms can adjust maintenance schedules dynamically, utilizing real-time sensor data to ensure efficient resource allocation and proactive fault prevention. These models continuously refine their decision-making to optimize maintenance schedules based on ongoing performance feedback. Dogru et al., in their review paper [76], provide an overarching view of RL techniques in process industries, covering applications such as sensors, control, fault detection, and optimization, highlighting its potential in complex systems, where operational conditions fluctuate frequently, demanding adaptive fault prediction models.
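The exploration and exploitation balance described above can be illustrated with tabular Q-learning on a toy maintenance problem. Everything in this sketch is an assumption made for illustration: the three health states, the two actions, the deterministic transitions, and the reward values are hypothetical and do not represent the RL-based decision engine of this work.

```python
import random

HEALTHY, DEGRADED, FAILED = 0, 1, 2
CONTINUE, MAINTAIN = 0, 1


def step(state, action):
    """Hypothetical deterministic environment: returns (next_state, reward)."""
    if state == FAILED:
        return HEALTHY, -10.0  # forced repair and downtime after a breakdown
    if action == MAINTAIN:
        return HEALTHY, -1.0   # cost of a preventive maintenance visit
    return state + 1, 1.0      # keep running: service reward, wear increases


def q_learning(steps=20000, alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(3)]  # q[state][action]
    state = HEALTHY
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(2)  # explore
        else:
            action = max((CONTINUE, MAINTAIN), key=lambda a: q[state][a])  # exploit
        next_state, reward = step(state, action)
        # Temporal-difference update towards reward + discounted best next value.
        target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return q


q_table = q_learning()
policy = [max((CONTINUE, MAINTAIN), key=lambda a: q_table[s][a]) for s in range(3)]
```

With these assumed rewards, the learned policy keeps the drive running while it is healthy and schedules maintenance once it is degraded, because the breakdown penalty outweighs the cost of a preventive visit.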

Recent research presents a fault diagnosis model integrating Principal Component Analysis (PCA) and Long Short-Term Memory (LSTM) Neural Networks [77]. Using operational data from elevators, the model improves fault prediction accuracy by identifying patterns and trends associated with different failure types. Another study proposes a deep learning model that combines a Graph Neural Network (GNN) for structural relationships with LSTM for time-series analysis and a Bayesian Deep Adversarial Neural Network (BDANN) for robust fault prediction in elevator door systems by analyzing acoustic signals generated during operation [78]. The deep learning model enhances the ability to deal with the complex and variable nature of elevator operations and considers environmental changes and signal acquisition methods, improving fault prediction accuracy. The authors of [79] propose a risk evaluation method for elevator operation based on fuzzy logic theory and machine learning algorithms. By using sensor data and evaluation indices, the model enhances accuracy and efficiency in risk assessments compared to traditional approaches. The study in [80] proposes a generic Multi-Layer Perceptron (MLP) Neural Network model for fault detection in elevator systems. The model demonstrates robust classification capabilities and resistance to overfitting, aiding predictive maintenance systems in reducing false alarms and unnecessary service visits; it achieved nearly 100% accuracy in fault detection, effectively minimizing false positives.

Despite their advantages, ML techniques face challenges, including high computational requirements, difficulties in generalizing models across different systems, and the need for real-time processing capabilities. Addressing these issues requires further research and the development of scalable, efficient solutions. Federated learning has emerged as a promising approach to address data privacy concerns and enhance model generalization by training ML models across decentralized datasets without sharing sensitive data. Ahn et al. [81] provide a comprehensive survey on federated learning frameworks for IoT-based smart city applications, addressing challenges like latency and data privacy in predictive maintenance systems.

While significant progress has been made in fault diagnosis and predictive maintenance for elevator systems, several gaps remain. The lack of comprehensive datasets for training robust ML models is a critical barrier, limiting the applicability of advanced techniques in diverse operational contexts. Integration challenges with legacy elevator systems further complicate the deployment of modern predictive maintenance solutions. Additionally, real-time processing constraints pose difficulties in scaling these approaches for large-scale systems. Reference [82] reviews recent advances in artificial intelligence that have led to widespread industrial adoption, focusing on machine learning interpretability methods. The study presents a literature review and taxonomy of these methods, as well as links to their programming implementations.

Future trends in this domain include the integration of IoT technologies for enhanced connectivity and real-time data acquisition. Edge computing offers a promising solution to reduce latency and reliance on cloud infrastructure, enabling localized data processing for faster and more efficient fault detection. Explainable AI (XAI) is another emerging trend, providing transparency and interpretability in ML models to improve trust and adoption in safety-critical systems. The survey in [83] structures and analyzes challenges, techniques, and methods for developing AI-based safety-critical systems, focusing on industrial and transportation domains.

To conclude, recent advancements in machine learning and IoT technologies have significantly enhanced fault detection and predictive maintenance capabilities in building systems, leading to safer elevator operations, reduced downtime and maintenance costs, and energy savings. Studies such as the one by Hodavand et al. [84] emphasize the integration of digital twin technology with machine learning, enabling real-time monitoring and fault diagnosis to improve infrastructure reliability and safety. The study in [85] explores the use of artificial intelligence in smart elevators to enhance time and energy management, emphasizing the benefits of AI integration in elevator systems. The study in [86] examines how machine learning models can optimize elevator energy consumption by predicting usage patterns in real time, contributing to energy-efficient building operations. Nelson et al. [87] highlight the transformative potential of machine learning across smart building operations, focusing on fault detection methodologies tailored to subsystems like Heating, Ventilation, and Air Conditioning (HVAC) and elevator systems. Reference [88] implements a deep learning algorithm for fault detection and prediction in elevator systems, showcasing the effectiveness of deep learning in maintenance prediction. A systematic review in [89] delves into predictive maintenance techniques in Industry 4.0; its sensor-based monitoring and failure prediction methods adapt to diverse operational contexts, which makes them relevant to elevator systems. Finally, Alanne et al. [90] explore machine learning applications for sustainable building systems, underlining the growing role of these technologies in ensuring the efficient and fault-free functioning of elevators as part of broader smart infrastructure initiatives.
These advancements highlight the potential of combining cutting-edge technologies to overcome current limitations and lay a foundation for developing robust methodologies to address the unique challenges of condition monitoring and fault prediction in PMSM-driven elevator systems. Additionally, collaborative research initiatives and open data repositories can play a crucial role in addressing data scarcity, fostering innovation, and accelerating the adoption of advanced predictive maintenance strategies.

Compared to existing diagnostic solutions that rely on static classifiers or unimodal signal sources (e.g., vibration or current only), the proposed method integrates GAN-augmented PU learning and Reinforcement Learning (RL) within a multimodal framework. This architecture allows for real-time adaptability under real-world uncertainties, including varying load conditions and movement profiles typical of elevator systems.

Recent work by Xu et al. [91] proposed a reduced-order interval observer for simultaneous fault diagnosis in inverter-fed induction motor (IM) drives. Their method effectively detects both open-switch and current sensor faults with high precision. However, it assumes full model observability and requires direct access to the inverter’s switching logic conditions, which are not feasible in the context of commercial elevator systems, where such access is restricted by safety regulations and manufacturer-imposed limitations. In contrast, the proposed method in this paper operates in a non-intrusive manner, without accessing or modifying the motor, inverter, or controller. It leverages only externally measurable signals, such as current and vibration, to infer fault states. This makes it suitable for practical deployment in warranty-protected elevator systems.

Based on the literature, while several machine learning approaches address isolated faults in PMSM-based systems, few utilize multimodal data from real elevators operating in residential, commercial, or industrial buildings. Moreover, the integration of PU learning and RL remains underexplored in elevator fault prediction. These gaps in fault prediction for elevator systems motivate the comprehensive, real-world-validated framework proposed in this study.

3. Proposed Methodology

In the field of diagnostics, the quality of data plays a crucial role in the accuracy and reliability of the methodologies. Signals collected by sensors often contain noise resulting from many factors. Thus, the use of appropriate filters to remove noise helps to improve the quality of the signals, ensuring that the input data to diagnostic algorithms is clean, smooth, and informative. By reducing noise and preserving the basic characteristics of the signals, key features important for detecting potential anomalies are extracted. This chapter focuses on the proposed predictive maintenance methodology which was deployed on a fully operational elevator system installed in a residential apartment building. Unlike a laboratory-scale testbench, this installation reflects the real conditions of elevator operation, with unpredictable loading profiles, variable trip frequencies, and natural signal disturbances. The proposed methodology includes the description of the elevator installation for data collection through smart sensors, preprocessing of the signals using appropriate and effective filters, and the extraction of significant indicators by studying the signal in the time and frequency domain. In this installation, data collection is performed using a network of non-intrusive smart sensors compliant with international standards. The entire signal acquisition and preprocessing pipeline operates in real-time, ensuring minimal latency and accurate temporal correlation across sensor modalities. This infrastructure supports a non-intrusive diagnostic process, fully compatible with the manufacturer’s warranty and safety restrictions, as no access to the inverter firmware or motor internals is required.

3.1. Real Elevator Installation and Sensor Deployment in a Residential Apartment Building

The accuracy of data collection is determined by the selection of suitable smart sensors and the placement of the sensors based on international regulations. The placement of the accelerometer was carried out based on the new ISO 20816-3 standard [92], while the energy sensor contributed to the recording of critical energy quantities such as current, voltage, and Total Harmonic Distortion (THD). The elevator system used in this study consists of a gearless traction elevator designed for six passengers (450 kg) and nine stops, equipped with a three-phase, surface-mounted PMSM rated at 5.1 kW and operating at 160 rpm under normal service conditions. It is a multi-pole (12), low-nominal-frequency (16 Hz), and high-torque (350 Nm) motor with a 36-slot stator and distributed two-layer winding. The nominal specifications of the elevator and motor are presented in Table 1 and Table 2. The motor’s constructional features are shown in Table 2 to complement the nominal electrical and geometrical characteristics. Data was sampled at 5 kHz and at 10 kHz using a Hioki PW3390 (Hioki E.E. Corporation, Ueda, Japan) and transmitted via Ethernet and wireless interfaces. Fault observations were captured under real load transitions.

Figure 1 presents an overview of the residential apartment elevator installation, and the main subsystems involved in the implementation of the proposed methodology. It includes a combination of photographs depicting the automation control panel, the inverter, the PMSM installed inside the elevator shaft, and the positions of the deployed sensors. The complete data acquisition, local storage, and transmission system are also shown, highlighting the integration of the energy analyzer, the communication interfaces, and the electrical infrastructure. All components are non-invasively installed on the existing elevator system, preserving operational safety and manufacturer compliance.

Gearless Interior Permanent Magnet (IPM).
Automation Panel Elevator System.
Control board.
Raspberry pi compute module 4G.
Variable Voltage Variable Frequency (VVVF) inverter.
Vibration sensor.
Power quality analyzer Hioki PW3390.
Data collector.
Magnetic field current transformers.
Device for sending the data to the cloud.
Automatic switch.
Three-phase circuit breaker.
Ethernet cable from data collector to raspberry pi module 4G.
Uninterruptible power supply (UPS).
Mean Well Enterprises HDR 30-24 power supply (Mean Well Electronics Co., Guangzhou, China).
Rail socket for connecting the device that will send the data to the cloud.

Figure 2a presents the cross-section view (one-quarter section) of the PMSM, highlighting the arrangement of stator teeth, windings, and permanent magnets. It is a surface-mounted permanent magnet motor, utilizing NdFe-B magnets with a remanent flux density of 1.23 T, to ensure strong magnetic performance and reliability over long periods of operation. Figure 2b represents a simplified layout of stator winding in the PMSM, showing the potential location of a short circuit fault in phase A.

3.2. Data Processing and Transmission

The next step of our proposed methodology is the preprocessing of the collected data based on the experimental setup. In the preprocessing phase the combination of two filters—the Gaussian and Extended Kalman Filter (EKF)—was used due to their significant advantages. The Gaussian Filter is used to remove noise at high frequencies, smoothing the signal by preserving the basic characteristics of the signal and enhancing feature extraction such as RMS and FFT. The Gaussian Filter’s impulse response is given by:

g (x) = \frac{1}{\sqrt{2 π σ^{2}}} \exp (- \frac{x^{2}}{2 σ^{2}})

(1)

where σ is the standard deviation.

The filtered signal s^{'} (t) is calculated as the convolution of the collected data from the measurements (signal s(t)) with the Gaussian Filter impulse response g(t):

s^{'} (t) = s (t) * g (t) = \int_{- \infty}^{\infty} s (t - τ) \cdot g (τ) d τ

(2)

Discrete signals are calculated as follows:

s^{'} [k] = \sum_{n = - N}^{N} s [k - n] \cdot g [n]

(3)

where N is the half-length of the filter’s impulse response, so that the kernel spans 2N + 1 samples.
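As an illustration of Equations (1) and (3), the following minimal sketch samples the Gaussian impulse response and applies the discrete convolution with NumPy. Several choices here are assumptions and not taken from the study: σ is expressed in samples, the kernel is normalized so its taps sum to one, `half_len` plays the role of N, and the 16 Hz test signal is purely synthetic.

```python
import numpy as np

def gaussian_kernel(sigma: float, half_len: int) -> np.ndarray:
    """Sample Equation (1) at n = -N..N and normalize the taps to sum to 1 (assumption)."""
    n = np.arange(-half_len, half_len + 1)
    g = np.exp(-(n ** 2) / (2.0 * sigma ** 2)) / np.sqrt(2.0 * np.pi * sigma ** 2)
    return g / g.sum()

def gaussian_smooth(s: np.ndarray, sigma: float = 0.75, half_len: int = 3) -> np.ndarray:
    """Discrete convolution of Equation (3); mode='same' keeps the input length."""
    return np.convolve(s, gaussian_kernel(sigma, half_len), mode="same")

# Hypothetical test signal: a 16 Hz sine sampled at 5 kHz plus white noise.
rng = np.random.default_rng(0)
t = np.arange(0, 0.5, 1 / 5000)
clean = np.sin(2 * np.pi * 16 * t)
noisy = clean + 0.2 * rng.standard_normal(t.size)
smoothed = gaussian_smooth(noisy)
```

With a narrow kernel such as this one, the low-frequency component passes almost unchanged while sample-to-sample noise is attenuated.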

The Extended Kalman Filter (EKF) is suitable for nonlinear systems when the model’s objective is the accuracy of results. The EKF can linearize the system’s nonlinearities through the Jacobian matrix (current, voltage, and acceleration) in order to study the dynamic behavior of the system. The system’s state is described by the expression:

x_{k} = f (x_{k - 1}, u_{k - 1}) + ω_{k - 1}

(4)

where x_{k} is the system state at time step k, u_{k} = s_{k}^{'} is the system input, and ω_{k} is the process noise.

The measurement equation:

z_{k} = h (x_{k}) + υ_{k}

(5)

where z_{k} is the measurement vector at time step k, h (x_{k}) is the nonlinear function that describes how the state is mapped to the measurements, and υ_{k} is the measurement noise.

The State Prediction is defined as follows:

{\hat{x}}_{k | k - 1} = f ({\hat{x}}_{k - 1 | k - 1}, u_{k - 1})

(6)

P_{k | k - 1} = A_{k} P_{k - 1 | k - 1} A_{k}^{T} + Q

(7)

where A_{k} is the Jacobian matrix in dynamic operation and Q is the process noise covariance matrix.

Gaussian Filtering was applied with a standard deviation of σ = 0.75, which was empirically selected to optimize the trade-off between noise suppression and the preservation of essential signal features. The Gaussian Filter was used to smooth the raw vibration and current signals, reducing high-frequency measurement noise without distorting the underlying waveform shape. Subsequently, the EKF was applied to estimate the system’s dynamic state evolution and further refine the filtered signal trajectories. EKF tuning was performed offline using representative datasets from both healthy and faulty operating conditions, configuring the process noise covariance matrix Q = diag(0.01, 0.01, 0.005) and the measurement noise covariance matrix R = diag(0.05).
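To make the prediction step of Equations (6) and (7) concrete, the sketch below runs one EKF iteration using the Q and R values quoted above. The state transition `f`, its Jacobian `jac_f`, and the measurement function `h` are hypothetical toy models chosen only so the example runs; they are not the motor model used in the study. The correction step is the standard textbook EKF update, which the text does not spell out.

```python
import numpy as np

Q = np.diag([0.01, 0.01, 0.005])   # process noise covariance quoted in the text
R = np.diag([0.05])                # measurement noise covariance quoted in the text

def f(x, u):
    """Hypothetical nonlinear state transition (toy model, not the paper's)."""
    return np.array([x[0] + 0.1 * x[1],
                     0.9 * x[1] + 0.1 * np.sin(x[2]) + 0.1 * u,
                     x[2]])

def jac_f(x, u):
    """Jacobian A_k of the toy transition with respect to the state."""
    return np.array([[1.0, 0.1, 0.0],
                     [0.0, 0.9, 0.1 * np.cos(x[2])],
                     [0.0, 0.0, 1.0]])

def h(x):
    """Hypothetical measurement function: only the first state is observed."""
    return np.array([x[0]])

H = np.array([[1.0, 0.0, 0.0]])    # Jacobian of h

def ekf_step(x_est, P, u, z):
    # Prediction, Equations (6) and (7)
    A = jac_f(x_est, u)
    x_pred = f(x_est, u)
    P_pred = A @ P @ A.T + Q
    # Standard EKF correction (textbook form, assumed here)
    S = H @ P_pred @ H.T + R
    K = P_pred @ H.T @ np.linalg.inv(S)
    x_new = x_pred + K @ (z - h(x_pred))
    P_new = (np.eye(3) - K @ H) @ P_pred
    return x_new, P_new

x_est, P = ekf_step(np.zeros(3), np.eye(3), u=1.0, z=np.array([0.2]))
```

Because R is small relative to the predicted covariance, the updated first state moves most of the way toward the measurement.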

The combined use of Gaussian Filtering and EKF showed superior performance over classic filtering techniques such as median filtering and Low-pass Butterworth Filters. Specifically, the proposed approach improved the F1-score by 3–7%, as it preserved transient characteristics and anomalies critical for accurate fault detection, while minimizing both high-frequency noise and smoothing-induced distortion. This improvement is attributed to the method’s ability to suppress irrelevant noise while preserving transient events and diagnostic anomalies. By retaining key fault-related signal features, the proposed two-stage filtering approach enhances the robustness and reliability of downstream fault classification models.

The next step involves the application of Short-Time Fourier Transform (STFT), which is one of the most important tools for analyzing the time-frequency characteristics of signals. STFT provides a way to examine the variation in a signal’s spectrum in time, making it suitable for detecting dynamic phenomena and transient faults. The ability to monitor the signal spectrum in real time offers accurate and efficient monitoring of the frequency sidebands. The STFT of a signal x(t) is defined as:

Χ (τ, f) = \int_{- \infty}^{\infty} x (t) ω (t - τ) e^{- j 2 π f t} d t

(8)

where Χ (τ, f) is the short-time–frequency spectrum, x(t) is the signal of analysis, ω (τ) is the window function which defines the local time interval, f is the frequency, and t is the time.
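A discrete counterpart of Equation (8) can be sketched with NumPy alone by windowing overlapping frames and taking an FFT of each. The Hann window, the frame length of 1024 samples, the hop of 256 samples, and the synthetic signal (a 16 Hz tone with a 400 Hz component appearing after 1 s) are illustrative assumptions, not settings reported in the study.

```python
import numpy as np

def stft(x, fs, win_len=1024, hop=256):
    """Frame the signal, apply a Hann window, and FFT each frame."""
    window = np.hanning(win_len)
    starts = np.arange(0, len(x) - win_len + 1, hop)
    frames = np.stack([x[s:s + win_len] * window for s in starts])
    X = np.fft.rfft(frames, axis=1)               # rows: time frames, columns: frequency bins
    freqs = np.fft.rfftfreq(win_len, d=1 / fs)
    times = (starts + win_len / 2) / fs           # frame centers in seconds
    return freqs, times, X

fs = 5000
t = np.arange(0, 2.0, 1 / fs)
x = np.sin(2 * np.pi * 16 * t)
x[t >= 1.0] += 0.5 * np.sin(2 * np.pi * 400 * t[t >= 1.0])   # transient component

freqs, times, X = stft(x, fs)
mag = np.abs(X)
k400 = int(np.argmin(np.abs(freqs - 400)))        # bin closest to 400 Hz
```

Comparing `mag[:, k400]` before and after 1 s shows how a component that appears mid-record is localized in time, which a single FFT of the whole record would not reveal.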

The analysis of the indicators provides quantitative information on the overall operation of the motor and is vital for fault detection. These include indicators in frequency bands as well as statistical indicators. The total energy and mean frequency of a signal in a specific frequency band from f_{1} to f_{2} are calculated as:

E (t, f_{1}, f_{2}) = \int_{f_{1}}^{f_{2}} {| X (t, f) |}^{2} d f

(9)

F_{mean} (t) = \frac{\sum_{i = 1}^{N} f_{i} \cdot P (f_{i})}{\sum_{i = 1}^{N} P (f_{i})}

(10)

where f_{i} is the frequency of the i-th sample in the frequency spectrum, P (f_{i}) is the power spectral density or the magnitude of the spectral component at frequency f_{i}, and N is the total number of samples (frequencies) in the spectrum.
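The band energy and mean frequency can be approximated on a discrete spectrum as follows. This is a sketch under stated assumptions: the integral in Equation (9) is replaced by a Riemann sum over FFT bins, the mean frequency is normalized by the total power, a single spectrum is used instead of a time-indexed one, and the 50 Hz test tone is synthetic.

```python
import numpy as np

def band_energy(freqs, spectrum, f1, f2):
    """Equation (9), approximated by a Riemann sum over the bins in [f1, f2]."""
    mask = (freqs >= f1) & (freqs <= f2)
    df = freqs[1] - freqs[0]
    return float(np.sum(np.abs(spectrum[mask]) ** 2) * df)

def mean_frequency(freqs, power):
    """Equation (10): power-weighted average frequency."""
    return float(np.sum(freqs * power) / np.sum(power))

fs = 5000
t = np.arange(0, 1.0, 1 / fs)
x = np.sin(2 * np.pi * 50 * t)                 # hypothetical 50 Hz test tone
spectrum = np.fft.rfft(x)
freqs = np.fft.rfftfreq(x.size, d=1 / fs)
power = np.abs(spectrum) ** 2

share_40_60 = band_energy(freqs, spectrum, 40, 60) / band_energy(freqs, spectrum, 0, fs / 2)
f_mean = mean_frequency(freqs, power)
```

For a pure tone, nearly all the energy falls in the band around the tone and the mean frequency coincides with it; fault-related sidebands would shift both quantities.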

Crest factor measures the peak spectral magnitude relative to the root-mean-square (RMS) magnitude:

CF (t) = \frac{\max_{f} | X (t, f) |}{\sqrt{\frac{1}{F} \int_{0}^{F} {| X (t, f) |}^{2} d f}}

(11)

Another critical indicator for the detection of transient damage and vibration characteristics is kurtosis:

Κ = \frac{\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - μ)^{4}}{(\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - μ)^{2})^{2}}

(12)

where x_{i} are the signal values, μ is the average signal value, and N is the number of samples.

Skewness is an indicator that measures the asymmetry of the distribution of data around the mean value. Indications of asymmetry can reveal mechanical faults, such as rotor imbalance or magnetic asymmetries:

S = \frac{\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - μ)^{3}}{(\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - μ)^{2})^{3 / 2}}

(13)

Entropy measures the complexity or randomness of the signal.

Entropy = - \sum_{x_{i} \in X} p (x_{i}) \log p (x_{i})

(14)

where p (x_{i}) is the probability of x_{i}.
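The statistical indicators of Equations (11) to (14) translate directly into a few NumPy functions. The sketch below is illustrative: the histogram with 32 bins used to estimate p(x_i) for the entropy is an assumption, and the two test signals (Gaussian noise, and the same noise with sparse spikes added) are synthetic stand-ins for healthy and impulsive vibration.

```python
import numpy as np

def crest_factor(mag):
    """Equation (11) on a discrete magnitude spectrum: peak over RMS."""
    return float(np.max(mag) / np.sqrt(np.mean(mag ** 2)))

def kurtosis(x):
    """Equation (12): fourth central moment over squared variance."""
    mu = x.mean()
    return float(np.mean((x - mu) ** 4) / np.mean((x - mu) ** 2) ** 2)

def skewness(x):
    """Equation (13): third central moment over variance to the power 3/2."""
    mu = x.mean()
    return float(np.mean((x - mu) ** 3) / np.mean((x - mu) ** 2) ** 1.5)

def shannon_entropy(x, bins=32):
    """Equation (14), with p(x_i) estimated from a histogram (assumption)."""
    counts, _ = np.histogram(x, bins=bins)
    p = counts[counts > 0] / counts.sum()
    return float(-np.sum(p * np.log(p)))

rng = np.random.default_rng(1)
gauss = rng.standard_normal(100_000)     # stand-in for a healthy signal
impulsive = gauss.copy()
impulsive[::1000] += 20.0                # sparse positive spikes
```

Gaussian noise has a kurtosis close to 3 and a skewness close to 0, while the spiky signal scores much higher on both, which is why these indicators are sensitive to transient events.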

Principal Component Analysis (PCA) reduces dimensionality by projecting data onto the components that capture the maximum variance. Mutual Information (MI) quantifies the dependency between two variables, identifying the features most relevant to the target output. Min–Max and Z-Score Normalization bring features onto a common scale: Min–Max Normalization rescales each feature to a fixed range, typically [0, 1] or [−1, 1], while Z-Score Normalization standardizes each feature to zero mean and unit variance. This process is critical for preventing the dominance of features with larger ranges over others in model training.
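The scaling and projection steps can be sketched as below. The feature matrix is a made-up example with two columns of very different ranges, and the PCA is computed via a singular value decomposition of the centered data, which is one common way to obtain the principal components; neither detail is taken from the study.

```python
import numpy as np

def min_max(X, lo=0.0, hi=1.0):
    """Rescale each column to [lo, hi]; assumes no column is constant."""
    x_min, x_max = X.min(axis=0), X.max(axis=0)
    return lo + (X - x_min) * (hi - lo) / (x_max - x_min)

def z_score(X):
    """Standardize each column to zero mean and unit variance."""
    return (X - X.mean(axis=0)) / X.std(axis=0)

def pca(X, k):
    """Project centered data onto the k directions of largest variance."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T

# Hypothetical features: column 0 in amperes, column 1 in arbitrary vibration units
X = np.array([[0.5, 200.0], [1.5, 400.0], [2.5, 900.0]])
X_mm = min_max(X)
X_z = z_score(X)
X_pc = pca(X_z, 1)
```

Scaling before PCA matters here: without it, the second column would dominate the first principal component simply because of its larger numeric range.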

Before the data feature matrix is extracted, the dataset must be balanced so that no class is severely under- or over-represented. Data balancing is an important process in machine learning problems where there is an imbalance in the categories of the dataset. This occurs when one category (e.g., positive or negative) is much less frequent than the other, which can affect the performance of the model. An appropriate combination of oversampling techniques for the minority class, such as the Synthetic Minority Oversampling Technique (SMOTE) or Adaptive Synthetic Sampling (ADASYN), with undersampling techniques for the majority class, such as Tomek Links or Cluster-Based Undersampling, helps achieve balance while limiting overfitting and loss of information.
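The idea behind synthetic minority oversampling can be illustrated with a simplified, NumPy-only interpolation scheme: each synthetic sample is placed on the line segment between a minority sample and one of its nearest minority neighbours. This is a didactic sketch, not the reference implementation of any particular library, and the function name `smote_like`, the neighbour count `k`, and the random data are hypothetical.

```python
import numpy as np

def smote_like(X_min, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples by neighbour interpolation."""
    rng = np.random.default_rng(seed)
    # Pairwise Euclidean distances among minority samples
    d = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)                    # a sample is not its own neighbour
    neighbours = np.argsort(d, axis=1)[:, :k]      # indices of the k nearest neighbours
    base = rng.integers(0, len(X_min), size=n_new)
    pick = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))                   # interpolation factor in [0, 1)
    return X_min[base] + gap * (X_min[pick] - X_min[base])

rng = np.random.default_rng(42)
X_minority = rng.normal(size=(10, 2))              # hypothetical fault-class features
X_synth = smote_like(X_minority, n_new=40)
```

Because every synthetic point is a convex combination of two real minority samples, the generated data stays inside the region already occupied by the minority class.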

The final processed dataset is a feature vector table where each sample must have a label. The label is y = 1 for fault-positive samples only, while the largest percentage of the vector samples is undefined and contains both normal (healthy) and faulty states, making it difficult to train a supervised classifier directly. The integration of PU machine learning aims to train the model to identify undefined signals and classify them accordingly as positive or negative.

3.3. Model Training and Fault Classification

The setup of the problem has two sets: a set of positively labeled samples P and a set of unlabeled samples U, which may contain both positive and negative instances. The goal is to train an initial classifier f(Z) that predicts whether a new sample z is positive or negative.

The use of encoder–decoder architectures in PU learning is a powerful tool that enhances the extraction of meaningful representations from complex, high-dimensional data, such as spectrogram patterns derived from STFT.

The encoder maps the input high-dimensional data X into a low-dimensional latent space Z.

Z = Encoder (X; θ_{e})

(15)

where X \in R^{n × d} is the input data with n samples and d features; Z \in R^{n × k} is the latent representation, where k < d; and θ_e represents the trainable parameters of the encoder.

The encoder compresses the information in X by learning important patterns or features.

The decoder reconstructs the input X from the latent representation Z.

\hat{X} = Decoder (Z; θ_{d})

(16)

where \hat{X} is the reconstructed version of the original input and θ_d represents the trainable parameters of the decoder.

To ensure that Z retains the essential features of X, the reconstruction loss, typically the Mean Squared Error (MSE), is minimized.

L_{reconstruction} = \frac{1}{n} \sum_{i = 1}^{n} {‖X_{i} - {\hat{X}}_{i}‖}^{2}

(17)

The latent space representation Z from the encoder is passed to the PU classifier to estimate the probability of a sample being positive p(y=1|Z).

This probability can be expressed as:

p (y = 1| Z) = p (y = 1| Z, s = 1) p (s = 1| Z)

(18)

where s represents whether a sample is labeled (s = 1) or unlabeled (s = 0), and p (s = 1| Z) is the propensity score, indicating the likelihood of a sample being labeled given its features.

By modeling p (s = 1| Z) and using the labeled data, the classifier f(Z) can infer the true class probabilities.

f (Z) = ψ (W \cdot Z + b)

(19)

where

W and b are the weights and biases of the PU classifier;

ψ is the activation function.

The Rectified Linear Unit (ReLU) activation function is used in encoder–decoder architectures due to its simplicity and effectiveness in addressing the vanishing gradient problem and in introducing nonlinearity and sparsity. It is defined as:

ReLU (x) = \max (0, x)

(20)

ReLU is applied to the hidden layers in both the encoder and decoder to activate the meaningful features only:

h_{i} = ReLU (W_{i} \cdot h_{i - 1} + b_{i})

(21)
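A forward pass through a single-layer encoder and decoder, together with the reconstruction loss of Equation (17), can be sketched as follows. The layer sizes, the random untrained weights, the linear decoder output, and the row-vector convention (X @ W instead of W · h) are assumptions made to keep the example short; the study does not specify the network depth or dimensions at this point.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 8, 16, 4                      # hypothetical sizes with k < d

def relu(x):
    """Equation (20)."""
    return np.maximum(0.0, x)

# Randomly initialized, untrained parameters theta_e and theta_d
W_e, b_e = rng.normal(0, 0.1, (d, k)), np.zeros(k)
W_d, b_d = rng.normal(0, 0.1, (k, d)), np.zeros(d)

def encoder(X):
    """Equation (15) with one ReLU hidden layer, Equation (21)."""
    return relu(X @ W_e + b_e)

def decoder(Z):
    """Equation (16) with a linear output layer (assumption)."""
    return Z @ W_d + b_d

def reconstruction_loss(X, X_hat):
    """Equation (17): mean squared reconstruction error per sample."""
    return float(np.mean(np.sum((X - X_hat) ** 2, axis=1)))

X = rng.normal(size=(n, d))
Z = encoder(X)
X_hat = decoder(Z)
loss = reconstruction_loss(X, X_hat)
```

Training would adjust the weights to reduce `loss`; here the weights are random, so the value only demonstrates the shapes and the loss computation.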

The expected risk R(f) for the classifier f(Z) is defined as:

R (f) = 𝔼_{(Z, y)} [l (f (Z), y)]

(22)

where l (f (Z), y) is the binary cross-entropy loss function.

Since y is not directly available for unlabeled samples, the risk is decomposed as:

R (f) = π_{p} R_{p} (f) + (1 - π_{p}) R_{n} (f)

(23)

where π_{p} is the estimated proportion of positive samples in the unlabeled dataset U; R_{p} (f) = 𝔼_{Z ~ P} [l (f (Z), 1)] is the risk for positive samples; and R_{n} (f) = 𝔼_{Z ~ U} [l (f (Z), 0)] is the risk for negative samples, approximated using the unlabeled data.

So the decomposed risk estimation is:

R (f) = π_{p} 𝔼_{Z ~ P} [l (f (Z), 1)] + 𝔼_{Z ~ U} [l (f (Z), 0)] - π_{p} 𝔼_{Z ~ U} [l (f (Z), 0)]

(24)

The total loss function combines the risk and reconstruction losses:

L_{total} = L_{reconstruction} + λ L_{PU}

(25)

where L_PU is derived from the risk estimation and λ is a regularization parameter to balance reconstruction and classification.
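Equations (24) and (25) can be evaluated on mini-batches by replacing the expectations with sample means. In this sketch the classifier outputs, the class prior π_p = 0.3, and λ are made-up values used only to exercise the functions.

```python
import numpy as np

def bce(p, y, eps=1e-12):
    """Binary cross-entropy l(f(Z), y) for predicted probabilities p."""
    p = np.clip(p, eps, 1 - eps)
    return -(y * np.log(p) + (1 - y) * np.log(1 - p))

def pu_risk(f_pos, f_unl, pi_p):
    """Equation (24) with expectations replaced by sample means."""
    r_p = bce(f_pos, 1).mean()          # E_{Z~P}[l(f(Z), 1)]
    r_u = bce(f_unl, 0).mean()          # E_{Z~U}[l(f(Z), 0)]
    return float(pi_p * r_p + r_u - pi_p * r_u)

def total_loss(l_recon, l_pu, lam):
    """Equation (25)."""
    return l_recon + lam * l_pu

# Hypothetical classifier outputs on labeled-positive and unlabeled batches
f_pos = np.array([0.9, 0.8])
f_unl = np.array([0.1, 0.2, 0.3])
risk = pu_risk(f_pos, f_unl, pi_p=0.3)
```

A classifier that scores labeled positives high and most unlabeled samples low obtains a smaller risk than an uninformative one that outputs 0.5 everywhere.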

The encoder–decoder is trained to minimize the L_{reconstruction} ensuring that Z captures meaningful features. The classifier is trained with Z as the input to the PU classifier to minimize the L_PU using the decomposed risk function. Optimization is achieved by fine tuning both components to minimize the total loss function. The model performs an initial estimation and classification of fault diagnosis and anomaly detection at the output, where the data exhibits nonlinear and high-dimensional features even with limited labeled data.

Reinforcement Learning (RL) is used to refine the model’s decision-making process and to optimize its performance before applying data augmentation. By integrating RL after PU learning, the system learns to make optimal decisions based on feedback from its environment, improving its ability to classify or predict outcomes effectively.

In the RL framework, the model operates in an environment characterized by the state (s_t), the actions (a_t), the reward (r_t), and the policy (π(α|s)). The goal is to learn a policy π(α|s) that maximizes the cumulative reward over time.

The state includes all the information, such as the latent representation derived from the PU learning output, the load information (L_t) of the elevator, and the direction indicator (D_t) of ascent or descent.

s_{t} = \{Z, p (y = 1| Z), L_{t}, D_{t}, L_{PU}\}

(26)

where

L_t is the normalized load as percentage of the elevator’s maximum capacity;

D_t is the indicator of the movement direction modeled as a binary variable.

L_{t} = \frac{\text{Current Load}}{\text{Maximum Capacity}} \in [0, 1]

(27)

D_{t} = \begin{cases} 1, & \text{for ascent} \\ 0, & \text{for descent} \end{cases}

(28)

Actions (a_t) are decisions made by the RL agent, including modifying decision thresholds according to normalized load and movement conditions, prioritizing fault diagnosis, and addressing increased loads or changes in operating conditions.

a_{t} = \{\text{Threshold Adjustment}, \text{Error Correction Parameter}\}

(29)

The reward (r_t) is designed to encourage behavior that improves classification accuracy, reduces misclassification, or enhances robustness and optimization while considering load and direction conditions.

r_{t} = a \cdot Accuracy (L_{t}, D_{t}) - b \cdot Misclassification (L_{t}, D_{t}) - c \cdot EnergyCost (L_{t}, D_{t})

(30)

where a, b, and c are weighting factors.
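The load and direction terms of the state and the reward of Equation (30) can be expressed in a few lines. This is a simplified sketch: the state omits the latent vector Z and L_PU for brevity, the clamping of the load to [0, 1] is an added safeguard, and the weights a, b, and c are placeholder values, since the study does not list them here. The 450 kg capacity is the rated load given for the installation.

```python
from dataclasses import dataclass

@dataclass
class ElevatorState:
    fault_prob: float   # p(y = 1 | Z) from the PU classifier
    load: float         # L_t in [0, 1], Equation (27)
    direction: int      # D_t: 1 for ascent, 0 for descent, Equation (28)

def normalized_load(current_kg: float, max_kg: float = 450.0) -> float:
    """Equation (27), clamped to [0, 1] (the clamp is an assumption)."""
    return min(max(current_kg / max_kg, 0.0), 1.0)

def reward(accuracy: float, misclassification: float, energy_cost: float,
           a: float = 1.0, b: float = 1.0, c: float = 0.1) -> float:
    """Equation (30) with hypothetical weights a, b, c."""
    return a * accuracy - b * misclassification - c * energy_cost

state = ElevatorState(fault_prob=0.8, load=normalized_load(225.0), direction=1)
r = reward(accuracy=0.9, misclassification=0.1, energy_cost=0.5)
```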

The policy π(α|s) is modeled using a Neural Network with parameters θ_π:

π (α | s; θ_{π}) = softmax (W_{π} \cdot s + b_{π})

(31)

where W_{π} and b_{π} are trainable weights and biases, while softmax ensures the output is a valid probability distribution over actions.
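A linear softmax policy of the form in Equation (31) can be sketched as below. The state dimension, the three discrete actions (for example lower, keep, or raise a decision threshold), and the random weights are hypothetical choices for illustration.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D vector of logits."""
    e = np.exp(z - np.max(z))
    return e / e.sum()

rng = np.random.default_rng(0)
state_dim, n_actions = 4, 3                    # hypothetical sizes
W_pi = rng.normal(0, 0.1, (n_actions, state_dim))
b_pi = np.zeros(n_actions)

def policy(s):
    """Equation (31): probability of each action given state s."""
    return softmax(W_pi @ s + b_pi)

s = np.array([0.7, 0.5, 1.0, 0.2])             # e.g. fault probability, load, direction, loss
probs = policy(s)
action = int(rng.choice(n_actions, p=probs))   # sample an action for exploration
```

Sampling from `probs` rather than always taking the most likely action is what provides exploration during training.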

The objective in RL is to maximize the expected cumulative reward G_{t}.

G_{t} = \sum_{k = 0}^{\infty} γ^{k} r_{t + k}

(32)

where γ \in [0, 1) is the discount factor, prioritizing immediate rewards over distant ones.
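For a finite episode, the discounted return of Equation (32) is usually computed backwards through the recursion G_t = r_t + γ G_{t+1}. The reward sequence and the value of γ below are illustrative only.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t for every step of a finite episode, Equation (32)."""
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g          # G_t = r_t + gamma * G_{t+1}
        returns.append(g)
    return returns[::-1]

# Three steps with reward 1 and gamma = 0.5
example = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)   # [1.75, 1.5, 1.0]
```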

The policy is optimized to maximize the expected reward J(θ_{π}) using the gradient method.

J (θ_{π}) = 𝔼_{π} [G_{t}] .

(33)

\nabla_{θ_{π}} J (θ_{π}) = 𝔼_{π} [\nabla_{θ_{π}} \log π (α | s; θ_{π}) G_{t}]

(34)

The value function V^{π} (s) estimates the expected return starting from the state s under the policy π.

V^{π} (s) = 𝔼_{π} [G_{t} | s_{t} = s] .

(35)

The advantage function A^{π} (s, a) quantifies the benefit of taking action a in state s, compared to the average performance.

A^{π} (s, a) = Ω^{π} (s, a) - V^{π} (s)

(36)

where Ω^{π} (s, a) is the action-value function:

Ω^{π} (s, a) = 𝔼_{π} [G_{t} | s_{t} = s, a_{t} = a]

(37)

The training process begins with the initialization of the policy network π (α | s; θ_{π}), which is set with random weights. The state s_t is defined using the encoder–decoder’s latent space Z derived from PU learning. During exploration, actions a_t are sampled from the policy π(α|s; θ_π) to interact with the environment and collect rewards r_t. The policy is updated by computing the gradient of the policy objective using the policy gradient theorem:

\nabla_{θ_{π}} J (θ_{π}) = 𝔼_{π} [A^{π} (s, a) \nabla_{θ_{π}} \log π (α | s; θ_{π})]

(38)

Additionally, a value network V (s; θ_{υ}) is trained to approximate the value function V^{π} (s) by minimizing the loss function L_value:

L_{value} = 𝔼_{π} [{(G_{t} - V (s; θ_{υ}))}^{2}]

(39)

Optimization is carried out by performing gradient ascent to update θ_{π} for the policy network and gradient descent on L_value to update the parameters θ_{υ} for the value network.
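One combined update of the policy and value parameters, in the spirit of Equations (38) and (39), might look like the sketch below. It assumes a linear softmax policy and a linear value function, uses the sampled return G minus the value estimate as a one-sample estimate of the advantage, and absorbs the factor 2 from the squared-error gradient into the learning rate. The function name `actor_critic_step`, the learning rates, and all numbers are hypothetical.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - np.max(z))
    return e / e.sum()

def actor_critic_step(W, b, v_w, s, a, G, lr_pi=0.1, lr_v=0.1):
    """One policy-gradient update with a learned value baseline."""
    probs = softmax(W @ s + b)
    value = float(v_w @ s)                     # linear value estimate V(s)
    advantage = G - value                      # sample estimate of the advantage
    grad_logp = -probs                         # gradient of log pi(a|s) w.r.t. the logits
    grad_logp[a] += 1.0
    W = W + lr_pi * advantage * np.outer(grad_logp, s)   # gradient ascent on J
    b = b + lr_pi * advantage * grad_logp
    v_w = v_w + lr_v * (G - value) * s         # gradient descent on (G - V)^2
    return W, b, v_w

s = np.array([1.0, 0.5])                       # hypothetical two-feature state
W0, b0, v0 = np.zeros((3, 2)), np.zeros(3), np.zeros(2)
W1, b1, v1 = actor_critic_step(W0, b0, v0, s, a=1, G=2.0)
p_before = softmax(W0 @ s + b0)[1]
p_after = softmax(W1 @ s + b1)[1]
```

Because the sampled return exceeds the baseline, the update raises the probability of the chosen action and moves the value estimate toward the observed return.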

Data augmentation using Generative Adversarial Networks (GANs) involves two main components: a generator and a discriminator. The generator creates synthetic data, while the discriminator evaluates the authenticity of the data by distinguishing between real and synthetic samples.

The generator takes random noise z ~ 𝒩(0, 1) sampled from the latent space as the input and maps it to the data space X_aug, producing synthetic samples x' = G (z; θ_{G}). The generator’s objective is to create realistic data that can fool the discriminator. The discriminator receives both real data x and synthetic data x′ as the input. It outputs a probability D (x; θ_{D}) or D (x'; θ_{D}) indicating whether the data is real (D (x) \to 1) or synthetic (D (x') \to 0).

So, the generator and discriminator have competing objectives. The discriminator aims to minimize its loss L_D by correctly classifying real and synthetic data, while the generator aims to minimize L_G by producing data that the discriminator classifies as real with high probability.

L_{D} = - E_{x ~ p_{data}} [\log D (x)] - E_{z ~ p_{z}} [\log (1 - D (G (z)))]

(40)

L_{G} = - E_{z ~ p_{z}} [\log D (G (z))]

(41)

The training loop for GANs involves iterative updates to the generator and discriminator to achieve adversarial optimization. In each iteration, the discriminator is updated to improve its ability to distinguish between real and synthetic data, adjusting its parameters θ_{D} by gradient descent on L_D with a learning rate η_{D}. Conversely, the generator is updated to reduce the likelihood of the discriminator correctly identifying synthetic samples, adjusting its parameters θ_{G} by gradient descent on L_G with a learning rate η_{G}. This adversarial optimization alternates updates between θ_{D} and θ_{G} to maintain balance, ensuring neither the generator nor the discriminator becomes too dominant.

θ_{D} \leftarrow θ_{D} - η_{D} \nabla_{θ_{D}} L_{D}

(42)

θ_{G} \leftarrow θ_{G} - η_{G} \nabla_{θ_{G}} L_{G}

(43)

This dynamic interaction drives the generator to produce increasingly realistic synthetic data while challenging the discriminator to refine its classification performance. GAN training reaches convergence when the discriminator’s output approximates D (x) = D (x') = 0.5 for all samples.
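The two loss terms of Equations (40) and (41) can be evaluated from batches of discriminator outputs, which also gives a simple numerical check of the convergence condition. The small constant `eps` guarding the logarithm and the example output values are assumptions.

```python
import numpy as np

def discriminator_loss(d_real, d_fake, eps=1e-12):
    """Equation (40), with expectations replaced by batch means."""
    return float(-np.mean(np.log(d_real + eps)) - np.mean(np.log(1.0 - d_fake + eps)))

def generator_loss(d_fake, eps=1e-12):
    """Equation (41), with the expectation replaced by a batch mean."""
    return float(-np.mean(np.log(d_fake + eps)))

# At the equilibrium D(x) = D(x') = 0.5 the losses equal 2 ln 2 and ln 2
half = np.full(4, 0.5)
l_d_eq = discriminator_loss(half, half)
l_g_eq = generator_loss(half)

# A confident discriminator (hypothetical outputs) has a lower loss
l_d_sharp = discriminator_loss(np.array([0.99]), np.array([0.01]))
```

Monitoring how far the two losses are from these equilibrium values is one practical, though imperfect, indicator of whether either network is dominating.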

The augmented dataset is represented as (X_aug, Y_aug), where X_aug = {x₁, x₂, …, x_n}; each x_i is a feature vector, and Y_aug = {y₁, y₂, …, y_n} contains the corresponding labels. The dataset is split into training and validation subsets, (X_train, Y_train) and (X_val, Y_val), and is used to train multiple predictive models, including Random Forest (RF), Support Vector Machine (SVM), and Deep Neural Networks (DNNs).

A Random Forest consists of T decision trees {h_{t} (x; θ_{t})}_{t = 1}^{T} trained on different bootstrap samples of the training data. For an input x_i, each tree h_t outputs a class label ŷ_t.

The final prediction is obtained by majority voting:

ŷ = \mathrm{mode} \{h_{t} (x; θ_{t}) : t = 1, \dots, T\}

(44)

The objective of training is for each tree to minimize the impurity of the splitting nodes, measured here by the Gini index:

I = \sum_{c} p_{c} (1 - p_{c})

(45)

where p_c is the proportion of class c in the split.

The class c represents one of the possible categories or labels in the classification problem. The Gini index quantifies the impurity of the node, measuring how mixed the classes are. A lower value indicates a purer split, meaning the samples in the node predominantly belong to a single class.
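The majority vote of Equation (44) and the impurity measure of Equation (45), whose form Σ p_c (1 − p_c) is commonly known as the Gini impurity, can be sketched in plain Python. The label lists are invented examples.

```python
from collections import Counter

def majority_vote(tree_predictions):
    """Equation (44): the most frequent class label among the trees."""
    return Counter(tree_predictions).most_common(1)[0][0]

def impurity(labels):
    """Equation (45): sum over classes of p_c * (1 - p_c)."""
    n = len(labels)
    return sum((c / n) * (1 - c / n) for c in Counter(labels).values())

vote = majority_vote([1, 0, 1, 1, 0])      # three of five hypothetical trees predict a fault
mixed = impurity([0, 0, 1, 1])             # evenly mixed node
pure = impurity([1, 1, 1])                 # pure node
```

An evenly mixed binary node gives the maximum value of 0.5, while a pure node gives 0, which is what a split search tries to approach.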

SVM aims to find a hyperplane w^{T} x + b = 0 that separates the classes with the maximum margin. The optimization problem is formulated as:

\min_{w, b} \frac{1}{2} {‖w‖}^{2} + C \sum_{i = 1}^{n} \max (0, 1 - y_{i} (w^{T} x_{i} + b))

(46)

where C is a regularization parameter.

For data that is not linearly separable, a kernel function K (x_{m}, x_{n}) implicitly maps the data to a higher-dimensional space, enabling separation in the transformed space:

K (x_{m}, x_{n}) = φ {(x_{m})}^{T} φ (x_{n})

(47)

For a new input x, the decision function is:

f (x) = \mathrm{sign} (\sum_{m = 1}^{n} λ_{m} y_{m} K (x_{m}, x) + b)

(48)

where λ_{m} are the support vector coefficients.
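The kernel decision function of Equation (48) can be written out directly. The radial basis function kernel, its width `gamma`, and the two hand-picked support vectors with unit coefficients are assumptions for illustration; the study does not state which kernel was used, and in practice the coefficients come from solving the optimization problem.

```python
import numpy as np

def rbf_kernel(x_m, x_n, gamma=0.5):
    """Radial basis function kernel (assumed kernel choice)."""
    return float(np.exp(-gamma * np.sum((x_m - x_n) ** 2)))

def svm_decision(x, support_vectors, labels, coeffs, b=0.0, kernel=rbf_kernel):
    """Equation (48): sign of the kernel-weighted sum over support vectors."""
    score = sum(lam * y * kernel(sv, x)
                for lam, y, sv in zip(coeffs, labels, support_vectors))
    return int(np.sign(score + b))

# Hypothetical support vectors: one per class, labels in {-1, +1}
support_vectors = np.array([[0.0, 0.0], [2.0, 2.0]])
labels = np.array([-1, 1])
coeffs = np.array([1.0, 1.0])

near_positive = svm_decision(np.array([1.9, 2.1]), support_vectors, labels, coeffs)
near_negative = svm_decision(np.array([0.1, -0.1]), support_vectors, labels, coeffs)
```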

A DNN consists of L layers, including input, hidden, and output layers.

The output of each layer l is computed as:

a^{(l)} = ReLU (W^{(l)} a^{(l - 1)} + b^{(l)})

(49)

where W^{(l)} and b^{(l)} are the weight matrix and bias vector, and ReLU is the activation function ReLU (z) = \max (0, z).

For classification, the final layer applies a softmax activation:

ŷ_{i} = \frac{\exp (z_{i})}{\sum_{j = 1}^{k} \exp (z_{j})}

(50)

where

z_{i}

is the raw output for class i.

The cross-entropy loss is minimized to train the network:

L = - \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{k} y_{i j} \log ŷ_{i j}

(51)

where

y_{i j}

is the true label and

ŷ_{i j}

is the predicted probability.

The DNN parameters

\{W, b\}

are updated using the Adaptive Moment Estimation (Adam) optimizer, which is suitable for various machine learning problems, including large-scale and sparse datasets.
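To make Equations (49) to (51) concrete, the sketch below runs one forward pass through a tiny network and evaluates the cross-entropy loss. The layer sizes, weights, and input vector are hypothetical values chosen for this example; the parameter update itself (for instance with Adam) is not implemented here.

```python
import math


def relu(z):
    """Element-wise ReLU(z) = max(0, z)."""
    return [max(0.0, v) for v in z]


def dense(weights, activations, bias):
    """Affine map W a + b for one layer (weights is a list of rows)."""
    return [sum(w * a for w, a in zip(row, activations)) + b
            for row, b in zip(weights, bias)]


def softmax(z):
    """Softmax of Eq. (50); subtracting max(z) avoids numerical overflow."""
    shift = max(z)
    exps = [math.exp(v - shift) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(y_true, y_pred, eps=1e-12):
    """Mean cross-entropy of Eq. (51) over n samples with one-hot labels."""
    n = len(y_true)
    total = 0.0
    for target, probs in zip(y_true, y_pred):
        total += sum(t * math.log(p + eps) for t, p in zip(target, probs))
    return -total / n


# Hypothetical 2-2-2 network: one hidden ReLU layer and a softmax output.
W1, b1 = [[1.0, -1.0], [0.5, 0.5]], [0.0, 0.1]
W2, b2 = [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]

x = [0.2, 0.8]                            # toy feature vector
hidden = relu(dense(W1, x, b1))           # Eq. (49)
probs = softmax(dense(W2, hidden, b2))    # Eq. (50)
loss = cross_entropy([[0.0, 1.0]], [probs])  # Eq. (51), true class = 1
print(probs, loss)
```

The small constant eps inside the logarithm is a numerical safeguard added in this sketch and is not part of Equation (51).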

The integration of PU learning, GANs, and RL forms a collaborative mechanism to handle practical elevator conditions. PU learning enables classification from positive and unlabeled data, GANs generate synthetic rare-fault examples, and RL dynamically adjusts thresholds based on load and direction. Feature extraction techniques include RMS, THD, kurtosis, FFT, harmonic amplitudes, crest factor, and skewness. The fault types identified include winding asymmetries, load imbalance, and mild mechanical misalignment.

3.4. Model Evaluation and Database Update

Randomly sampled data is preprocessed to ensure consistency and compatibility with each trained model. The preprocessed test data is evaluated against the trained model to produce predictions compared to the true labels. These predictions are analyzed using multiple metrics to determine the model’s performance.

\mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}

(52)

\mathrm{Precision} = \frac{TP}{TP + FP}

(53)

\mathrm{Recall} = \frac{TP}{TP + FN}

(54)

F1 = 2 \cdot \frac{\mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}

(55)

where TP and TN denote the numbers of true positives and true negatives, and FP and FN denote the numbers of false positives and false negatives.

The F1-score is a performance metric for classification models that represents the harmonic mean of Precision and Recall. It is particularly useful in cases of imbalanced datasets, where accuracy alone may not provide an accurate assessment of the model’s effectiveness. The F1-score balances false positives and false negatives, ensuring a trade-off between Precision (the proportion of true positive predictions among all predicted positives) and Recall (the proportion of true positive predictions among all actual positives). A high F1-score indicates a well-performing model in terms of both Precision and Recall.
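The following sketch evaluates Equations (52) to (55) from confusion-matrix counts and illustrates why accuracy alone can mislead on imbalanced data. The counts are hypothetical and do not correspond to the results reported in this paper. Returning 0.0 when a denominator is zero is a convention assumed for this example.

```python
def classification_metrics(tp, tn, fp, fn):
    """Accuracy, Precision, Recall and F1 from confusion-matrix counts."""
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical imbalanced test set: 95 healthy and 5 faulty samples.
# A classifier that never raises a fault alarm still reaches 95% accuracy.
always_healthy = classification_metrics(tp=0, tn=95, fp=0, fn=5)

# A classifier that finds 4 of the 5 faults at the cost of 3 false alarms.
useful = classification_metrics(tp=4, tn=92, fp=3, fn=1)

print(always_healthy)
print(useful)
```

The first classifier has a Recall and F1-score of zero despite its high accuracy, whereas the second has a similar accuracy but a clearly non-zero F1-score.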

The Receiver Operating Characteristic Area Under Curve (ROC-AUC) score evaluates the model’s ability to distinguish between classes:

\mathrm{AUC} = \int_{0}^{1} \mathrm{TPR} (\mathrm{FPR}) \, d (\mathrm{FPR})

(56)

where

\mathrm{TPR} = \frac{TP}{TP + FN}

is the true positive rate and

\mathrm{FPR} = \frac{FP}{FP + TN}

is the false positive rate.
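In practice, the integral in Equation (56) is approximated numerically. The sketch below sweeps a decision threshold over the classifier scores, collects the resulting (FPR, TPR) points, and integrates them with the trapezoidal rule. The labels and scores are hypothetical, and the function names are assumptions of this example.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) points obtained by lowering the decision threshold."""
    pos = sum(y_true)
    neg = len(y_true) - pos
    points = [(0.0, 0.0)]
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= thr and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= thr and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc_trapezoid(points):
    """Trapezoidal approximation of the area under the ROC curve (Eq. 56)."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical labels (1 = positive) and classifier scores.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

auc = auc_trapezoid(roc_points(y_true, scores))
print(auc)
```

For these six samples, eight of the nine positive-negative pairs are ranked correctly, so the area equals 8/9. A classifier that ranks every positive above every negative would reach an area of 1.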

For graphical analysis, visual tools are used, such as the confusion matrix, ROC curve, Precision–Recall curve, and heatmaps, to enhance interpretability and provide insights into the model’s performance.

The Pareto front is used for multi-objective optimization, balancing multiple performance metrics (e.g., Precision vs. Recall). A set of solutions

{o_{1}, o_{2}, \dots, o_{n}}

represents candidate optimal solutions in the multi-objective optimization process. A solution o_i is Pareto-optimal if no other solution exists that improves one objective function without degrading another.

Mathematically, this is defined as:

\forall o_{j}: \text{if } f_{k} (o_{j}) \geq f_{k} (o_{i}) \text{ for all criteria } k, \text{ then } f_{k} (o_{j}) = f_{k} (o_{i}) \; \forall k

(57)

The scatter plots of performance metrics are used to identify the Pareto front.
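The Pareto condition of Equation (57) can be checked directly for a small set of candidates. In the sketch below, each candidate is a hypothetical (Precision, Recall) pair with both objectives to be maximized. The values and function names are assumptions of this example.

```python
def dominates(a, b):
    """True if a is at least as good as b in every objective and better in one."""
    return (all(x >= y for x, y in zip(a, b))
            and any(x > y for x, y in zip(a, b)))


def pareto_front(solutions):
    """Keep the solutions that no other solution dominates (cf. Eq. 57)."""
    return [s for s in solutions
            if not any(dominates(other, s) for other in solutions)]


# Hypothetical (Precision, Recall) pairs for five candidate models.
candidates = [(0.99, 0.80), (0.95, 0.95), (0.90, 0.97),
              (0.90, 0.90), (0.85, 0.96)]

front = pareto_front(candidates)
print(front)
```

Here (0.90, 0.90) is dominated by (0.95, 0.95) and (0.85, 0.96) is dominated by (0.90, 0.97), so only the first three candidates remain on the front.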

Using the computed metrics and visual analysis, predictions of malfunctions are made. The insights from graphical analysis, metrics, and Pareto optimization are integrated into a decision-support system. Predicted errors or anomalies are stored in a knowledge base for further evaluation or feedback.

Figure 3 presents a detailed flowchart of the proposed methodology, which consists of the following stages: data acquisition (from the elevator system), signal preprocessing (Gaussian Filtering, EKF), data transmission, data processing, feature extraction, data balancing and pattern analysis, PU learning, Reinforcement Learning (RL), data augmentation (GAN-based), model training (RF, SVM, DNN), model evaluation, alarm generation, and database update.

The flowchart illustrates a comprehensive methodology for monitoring, analyzing, and maintaining an elevator system using advanced machine learning techniques and data-driven processes. The data begins with sensors attached to the elevator system, capturing raw signals such as vibrations, current, and temperature. These signals undergo preprocessing using Gaussian and Extended Kalman Filters to remove noise and enhance quality before being saved locally. The preprocessed data is transmitted via secure protocols such as MQTT, HTTPS, and TLS, ensuring encrypted and reliable communication. The data packets are formatted with timestamps and sensor IDs, enabling traceability and efficient cloud monitoring for storage and accessibility.

Once transmitted, the data undergoes further processing, integrating historical and real-time information from the database. Feature extraction techniques such as RMS, THD, STFT, and statistical measures like kurtosis and skewness are applied to derive meaningful patterns. Balancing the data ensures that all conditions, including elevator movement during ascent and descent, are adequately represented, and patterns are analyzed to form representative sample vectors for further learning stages.
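Several of the time-domain features named above can be computed with a few lines of Python. The sketch below covers RMS, crest factor, skewness, and excess kurtosis for a sampled signal; THD and STFT are not included. The test signal is a synthetic unit-amplitude sine, not measured data, and the use of excess kurtosis (kurtosis minus 3) is an assumption, since the text does not state which convention is used.

```python
import math


def signal_features(x):
    """RMS, crest factor, skewness and excess kurtosis of a sampled signal."""
    n = len(x)
    mean = sum(x) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in x) / n)
    rms = math.sqrt(sum(v * v for v in x) / n)
    crest = max(abs(v) for v in x) / rms
    skew = sum((v - mean) ** 3 for v in x) / (n * std ** 3)
    kurt = sum((v - mean) ** 4 for v in x) / (n * std ** 4) - 3.0
    return {"rms": rms, "crest_factor": crest,
            "skewness": skew, "kurtosis": kurt}


# Synthetic test signal: one full period of a unit-amplitude sine.
N = 1000
sine = [math.sin(2 * math.pi * k / N) for k in range(N)]

features = signal_features(sine)
print(features)
```

For a pure sine, the RMS is 1/sqrt(2), the crest factor is sqrt(2), the skewness is zero, and the excess kurtosis is -1.5, which gives a convenient sanity check before applying the function to measured signals.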

The methodology includes generating alarms to monitor system conditions in real-time. Thresholds are predefined based on system specifications and operational requirements. Alerts are categorized into normal, marginal, and critical levels, and dispatched to relevant personnel through an automated system. Alerts are archived for future reference, creating a condition monitoring log that supports effective decision-making and rapid responses to potential malfunctions.
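A threshold-based alert categorization of this kind can be sketched as follows. The two limits and the readings are placeholder values chosen for illustration; they are not the thresholds used in this study, nor are they taken from any standard.

```python
from collections import Counter


def classify_alert(value, marginal_limit, critical_limit):
    """Map a monitored value to Normal, Marginal or Critical."""
    if value >= critical_limit:
        return "Critical"
    if value >= marginal_limit:
        return "Marginal"
    return "Normal"


# Placeholder limits and readings (hypothetical, for illustration only).
MARGINAL_LIMIT = 2.0
CRITICAL_LIMIT = 4.0
readings = [0.9, 1.2, 2.6, 4.8, 1.0]

levels = [classify_alert(v, MARGINAL_LIMIT, CRITICAL_LIMIT) for v in readings]
summary = Counter(levels)   # number of alerts per category, e.g. for a log
print(levels)
print(summary)
```

The per-category counts can then be archived together with the individual alerts to build the condition monitoring log described above.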

Finally, the database is continually updated through secure connections, allowing new data to be inserted while maintaining existing records. Labels are improved through iterative learning, and historical data is archived for long-term analysis. The updated database facilitates the generation of detailed reports, supports decision-making processes, and integrates preventive maintenance scheduling based on statistical insights. This ensures a robust and adaptive framework for the monitoring, analysis, and maintenance of the elevator system, enhancing its operational efficiency and safety.

This system operates independently of the elevator’s control panel, functioning as a non-intrusive monitoring and diagnostic tool. It does not interfere with the elevator’s operational processes or its built-in safety mechanisms, ensuring compliance with all regulatory standards and preserving the integrity of the elevator’s original design. By collecting and analyzing data from external sensors, the system provides valuable insights without altering the core functionality or safety features of the elevator.

The proposed system presents numerous advantages. First, it reduces maintenance costs by enabling predictive and condition-based maintenance strategies. By identifying potential issues before they escalate, unnecessary repairs are minimized, and service schedules can be optimized. The second advantage is the minimization of downtime by proactively addressing faults, ensuring the elevator remains operational and reducing inconvenience for users. The third advantage is that the system enhances the overall safety and reliability of the elevator by continuously monitoring critical parameters and generating real-time alerts when thresholds are exceeded. Additionally, it extends the lifespan of the equipment by ensuring timely interventions and preventing excessive wear and tear.

The proposed system introduces several innovations. It leverages advanced machine learning techniques, including PU learning, Reinforcement Learning, and GAN-based data augmentation, to improve fault detection and classification accuracy. The use of encrypted communication protocols ensures secure data transmission, while the integration of cloud monitoring provides scalability and remote access to data. Furthermore, the system’s ability to adapt to various operating conditions, such as different load profiles during elevator ascent and descent, showcases its robustness and versatility. By incorporating a knowledge base for error analysis and decision support, the system also facilitates informed decision-making, enhancing maintenance efficiency and reducing human error. These innovative features make the system a cutting-edge solution for modern elevator monitoring and diagnostics.

4. Results

This section presents the results of the proposed methodology with an emphasis on the accuracy and reliability of the analysis. The algorithms were developed in Python, and the graphs capture the two functional states of the machine, providing strong evidence for the effectiveness of the approach. The software stack for the analysis is shown in Table 3.

4.1. Experimental Current Signals

The data collected through the smart energy and vibration sensors at this stage (Section 3.3 of this article) are presented in Figure 4a,b, which show the current waveforms during the upward and downward movements of the chamber for different testing loads. In addition, Figure 5a,b show the measured current waveforms of the faulty motor, in which a short circuit of the winding has occurred.

Based on the results of the measurements and the comparative investigation of the current waveforms, we conclude that the elevator could be put in safe mode and serve loads of only 0 to 5 persons during ascent and 2 to 6 persons during descent. However, since it is called upon to perform simultaneous movements in both directions, it follows that under short circuit conditions, the motor could serve loads of 2 to 5 persons, i.e., 150 kg to 375 kg, respectively, in both directions of movement, corresponding to an active force on the motor shaft of 150 kg, i.e., approximately half of the rated force.

4.2. Preprocessing Data

4.2.1. Signal Filtering and Normalization

The use of the inverter and the influence of external factors in the network causes distortions in the various current signals. The healthy signals show pure periodicity. At high loads (300 kg, 415 kg) a small amount of noise is present, which causes fluctuations in the system’s operation. Gaussian and Extended Kalman Filters (EKFs) were used to reduce the noise and capture the dynamic characteristics of the system (see Figure 6 and Figure 7).

Thus, we observe a significant smoothing of the signal using a Gaussian Filter, especially for low loads, while correspondingly, the noise is significantly reduced at high loads, ensuring a smoother representation of the data. Similarly, with the EKF, the dynamic and natural behavior of the signal as well as the points with strong fluctuations are preserved, showing accuracy in the machine’s changing operational situations. At higher loads, the EKF accurately reproduces current variations without eliminating peaks.

Figure 8a,b and Figure 9a,b present the current waveforms using Gaussian Filter and EKF, respectively. The Gaussian Filter smooths out the signal and reduces noise, but does not react well to fast signal changes, as it smooths out the rapidly changing spikes. The use of EKF offers accuracy in monitoring the dynamic state of the machine and helps to isolate fault-related features by facilitating the diagnostic process.
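The smoothing behavior of the Gaussian Filter described above, including its attenuation of sharp spikes, can be reproduced with a short sketch. The value of sigma, the kernel radius of three standard deviations, and the edge handling are assumptions of this example, and the input is a synthetic spike rather than a measured current. The EKF is not sketched here because it depends on a system model that is not restated in this section.

```python
import math


def gaussian_kernel(sigma, radius):
    """Discrete Gaussian weights on [-radius, radius], normalized to sum to 1."""
    weights = [math.exp(-(k * k) / (2 * sigma * sigma))
               for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def gaussian_smooth(signal, sigma=2.0):
    """Convolve the signal with a Gaussian kernel, clamping indices at the edges."""
    radius = int(3 * sigma)
    kernel = gaussian_kernel(sigma, radius)
    n = len(signal)
    smoothed = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)  # repeat the edge samples
            acc += w * signal[j]
        smoothed.append(acc)
    return smoothed


# Synthetic input: a single unit spike in an otherwise flat signal.
spike = [0.0] * 10 + [1.0] + [0.0] * 10
smoothed_spike = gaussian_smooth(spike, sigma=2.0)
print(max(spike), max(smoothed_spike))
```

The spike is spread over its neighbors and its peak drops well below 1, which mirrors the loss of fast transients noted for the Gaussian Filter.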

4.2.2. Harmonic Analysis

In the case of mechanical faults, one of the most popular and traditional techniques, with a high accuracy and low cost, is Fast Fourier Transform (FFT). FFT transforms the vibration signal from the time domain to the frequency domain, as illustrated in Figure 10a,b and Figure 11a,b for the healthy and faulty state in the motor.

Comparing the two signals in the time domain, we observe that in the healthy state, the signal is relatively smooth and limited to small amplitude values, while in the faulty signal, the amplitude of the signal is higher and there are strong fluctuations, as large peaks are observed, indicating strong anomalies. Similarly, in the frequency domain in the healthy state, the FFT has higher energy at low frequencies, while at higher frequencies, the amplitude decreases very quickly with no significant high frequency components observed, indicating the absence of errors. The faulty waveforms show enhanced components at the low frequency elements, indicating the occurrence of an eccentricity error while higher frequencies are associated with bearing wear.
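The transformation from the time domain to the frequency domain can be illustrated with a direct discrete Fourier transform, which yields the same spectrum as the FFT at a higher computational cost. The sampling rate, signal length, and the two sinusoidal components below are hypothetical values chosen for this example; a library routine such as numpy.fft.rfft would normally be preferred for real data.

```python
import cmath
import math


def dft_amplitudes(x, fs):
    """Single-sided amplitude spectrum of x sampled at fs Hz (naive O(N^2) DFT)."""
    n = len(x)
    freqs, amps = [], []
    for k in range(n // 2 + 1):
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n))
        # DC and Nyquist bins appear once; all other bins are doubled.
        scale = 1.0 / n if (k == 0 or 2 * k == n) else 2.0 / n
        freqs.append(k * fs / n)
        amps.append(abs(coeff) * scale)
    return freqs, amps


# Synthetic signal: 50 Hz component plus a weaker 150 Hz component.
fs = 1000          # sampling rate in Hz (assumed)
n = 200            # number of samples, giving a 5 Hz frequency resolution
x = [math.sin(2 * math.pi * 50 * t / fs)
     + 0.3 * math.sin(2 * math.pi * 150 * t / fs) for t in range(n)]

freqs, amps = dft_amplitudes(x, fs)
dominant = freqs[max(range(len(amps)), key=amps.__getitem__)]
print(dominant, amps[10], amps[30])
```

The two components appear as isolated peaks at 50 Hz and 150 Hz with amplitudes close to 1.0 and 0.3. Additional or enlarged peaks of this kind are what the comparison between the healthy and faulty spectra looks for.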

The ability of the Short-Time Fourier Transform (STFT) to track how the frequency content of a signal evolves over time is a critical factor for dynamic fault diagnosis problems. Figure 12a,b, Figure 13a,b, Figure 14a,b, and Figure 15a,b illustrate the spectrum of the STFT signals for various chamber motion loads. Table 4 and Table 5 present the main characteristics of the signals and are important factors in determining the operating condition of the machine. The same method is used to analyze the faulty current signal, as shown in Figure 16a,b for switching frequencies of 5 kHz and 12 kHz.

Based on the harmonic analysis of the PMSM sound, we observe a smooth spectral distribution over the whole range of the tested loads. In all cases, a smooth distribution of spectral energy at low frequencies (0–100 Hz) with strong peaks of constant intensity is observed. At 0 kg and 150 kg the fundamental frequency remains dominant, while at 300 kg and 415 kg the higher frequencies are slightly more pronounced due to increased mechanical noise.

In the motor with a shorted winding, the spectral energy distribution is observed over a wider frequency range. The higher frequencies are particularly pronounced, indicating the presence of noise and vibration, showing parasitic harmonics at low frequencies. The above conclusions can be easily understood by extracting the characteristics of the signals in the comparative study in Table 4 and Table 5. The healthy motor has a flatter distribution, as shown by more negative kurtosis, while the faulty motor shows sharper peaks. The skewness in both cases is close to zero, indicating relative symmetry in the signal, while the healthy machine shows better distributed peaks, such as in the crest factor. Similarly, in the case of a healthy PMSM, a larger

F_{m e a n}

occurs during the descent, while the shorted one shows a decrease in the average frequency due to spectral energy dispersion. The entropy is significantly higher in the case of a healthy motor due to the complexity of dynamic operations, while the short circuit machine presents lower spectral complexity.

Figure 17a,b show the vibration signal collected during motor operation in the chamber. The healthy signal is characterized by normal spectral energy distribution at low frequencies without strong vibrations, and stable operation with no evidence of mechanical and electromagnetic faults. In the faulty signal we observe intense energy bands at elevated frequencies, showing an instability in time with fluctuations in dynamic operation.

Based on the data in Table 6, the main cause may be a mechanical malfunction due to eccentricity, bearing wear, and misalignment of the system. However, these failures can also be caused by electrical anomalies in the motor stator windings and unwanted vibrations due to uneven torque generation.

4.2.3. Pattern Feature Extraction

The pattern feature extraction process is a critical stage after signal preprocessing. This process provides important information for certain key features regarding load conditions and elevator motion (mean value, skewness, kurtosis, Principal Component Analysis (PCA)). The analysis of these features allows for pattern discrimination and combined with data normalization and balancing ensures that the model obtains uniformly distributed data.

Figure 18 compares the distribution of the original (Figure 18a) and normalized features where we observe that normalization balances the data in a single range [0, 1] (Figure 18b). This ensures that all features have comparable values, thus improving the performance of machine learning algorithms before the model training process begins.
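Scaling a feature to the range [0, 1] corresponds to min-max normalization, sketched below for a single feature column. The sample values are hypothetical, and mapping a constant column to zeros is a convention assumed for this example.

```python
def min_max_normalize(values):
    """Rescale values linearly so that min -> 0 and max -> 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical raw values of one feature across four samples.
raw = [2.0, 4.0, 6.0, 3.0]
scaled = min_max_normalize(raw)
print(scaled)
```

Applying the same function to every feature column places features with very different physical units on a comparable scale before training.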

Skewness is a statistical measure that describes the degree of asymmetry of a distribution around its mean. In our application it is a critical factor, as it may indicate changes in the operating profile of the lifting system. According to Figure 19, clear differences between the signals are observed: the healthy signals are more dispersed, indicating that the skewness of the normal system varies within normal limits, while the faulty signals are clustered around specific skewness values, suggesting a pattern of deviation from normal operation.

In order to better capture and monitor the effect of each indicator on our system, we created the correlation table in Figure 20 which shows the relationship between different attributes, with a color scale indicating the strength of the correlation. We distinguish a very high positive correlation (0.98) between standard deviation and energy, where energy consumption can be predicted from signal variation. Similar characteristics of strong correlation appear between the kurtosis and crest factor (0.91), which are particularly important indicators in suggesting elevator faults, indicating abrupt changes in motor voltage and current. Negative values, especially in variables such as mean and entropy, indicate that a higher mean value leads to less uncertainty in the signal.
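A correlation table of this kind is built from pairwise Pearson coefficients. The sketch below computes them for three features; the feature values are hypothetical and are not the data behind Figure 20. The energy values are set to the squares of the standard deviations, as would hold up to a constant factor for zero-mean signals of equal length.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def correlation_matrix(features):
    """Nested dict of pairwise correlations between named feature columns."""
    names = list(features)
    return {a: {b: pearson(features[a], features[b]) for b in names}
            for a in names}


# Hypothetical feature columns for four signal segments.
features = {
    "std_dev": [1.0, 2.0, 3.0, 4.0],
    "energy": [1.0, 4.0, 9.0, 16.0],
    "entropy": [4.0, 3.0, 2.0, 1.0],
}

matrix = correlation_matrix(features)
print(matrix["std_dev"]["energy"], matrix["std_dev"]["entropy"])
```

Values close to +1 or -1 indicate features that carry largely redundant information, which can guide feature selection before model training.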

The PU and Reinforcement Learning methods are used to achieve correct data classification when only positive samples and unlabeled data are available, and to optimize the operation of an agent that makes experience-based decisions. In this way, they create a self-improving system for prediction and anomaly detection, even when labeled data is limited. Figure 21 shows the confusion matrix that captures how well the model classifies faulty and healthy samples, with labels 0 and 1, respectively. The model performs well, since the numbers of true positives (TPs) and true negatives (TNs) are high and the numbers of false positives (FPs) and false negatives (FNs) are low, although some prediction errors remain.

The confusion matrix reveals false negatives during descent under heavy load, indicating the partial masking of fault symptoms. To mitigate this, synthetic signals generated by GANs were used to enhance boundary cases, and PU learning improved classifier robustness under data imbalance. This reduced false negatives by 12% over baseline classifiers.

We used Reinforcement Learning with appropriate adaptation of the reward function to achieve the correction of the wrong predictions with a high accuracy, reaching 90.91%. Figure 22 shows the 2D representation of the original data and its classification into labels (blue = healthy (1), red = faulty (0)). The points of the two classes are relatively separated, which means that PU learning has been able to recognize the structure of the data; however, outliers and points where the two classes overlap are observed, which affects the training process.

Data augmentation is a way to strengthen under-represented classes in deep learning models. By creating synthetic data, the dataset is enriched, allowing the models to learn better. Generative Adversarial Networks (GANs) create realistic data samples, improving the ability of models to generalize to real data; this helps compensate for small sample sizes and often reduces overfitting. Realistic synthetic samples also allow the model to better detect anomalies and unusual situations [93].

An examination of the comparative distribution of important features for the successful implementation of the method is shown in Figure 23. The synthetic data follows the original data (shown in Figure 24) to a significant degree, highlighting the statistical sequence between the data. Most features follow the distribution of the real ones, thereby helping to train a more generalizable model. Similarly, in mean and std_dev there is more overlap between the real and synthetic data, indicating better generalization. In addition, the existence of peaks in the synthetic data indicates overpopulation at specific values where the GAN preferred some values over others, while the data does not exhibit the same natural stochasticity as the original data.

4.2.4. Model Training Algorithms

Recent advances in machine learning (ML) have enabled the adoption of combinatorial approaches that incorporate different algorithms, such as Deep Neural Networks (DNNs), Support Vector Machines (SVMs), and Random Forests (RFs), to improve accuracy and generalization in fault analysis. The hybrid model we propose focuses on the advantages of several algorithms for fault classification. Table 7 shows the calculated key feature parameters for each algorithm considered, which are crucial for evaluating the performance of each model.

The Receiver Operating Characteristic (ROC) and Precision–Recall curves in Figure 25 evaluate the performance of the classifiers. In Figure 25a the curves of the Random Forest and Ensemble models show a perfect ability to discriminate between classes. Similarly, the SVM and DNN, although showing worse performance, are found to be highly efficient. The steep slope of the RF and Ensemble curves demonstrates the ability of these models to identify positive cases. In Figure 25b, the flat line of the RF and Ensemble models means that there are zero false positives, i.e., the models do not classify negative samples as positive. The SVM and DNN curves drop more gradually, showing that Precision degrades as coverage increases, while the steep drop towards the end of the curve shows the inability of the models to maintain high Precision when Recall increases too much, as is the case with DNN.

The histogram in Figure 26 shows the performance of four different classification algorithms (Random Forest, DNN, SVM, and Ensemble) based on five different evaluation metrics. According to the results, SVM has the lowest performance, especially on parameters such as accuracy and Recall, but has absolute Precision, indicating that it is probably overly conservative, avoiding errors but missing several important cases. The Random Forest and Ensemble models show the best accuracy and are characterized by high reliability and balance, making them an excellent choice for classification. Finally, the DNN shows a slightly lower Recall than RF, which means that it may miss some positive cases.

The diagram below in Figure 27 illustrates the confusion matrices for the four different algorithms, showing the total samples that were correctly classified and how many were incorrectly classified. The RF and Ensemble models are characterized as the most accurate classifiers, with 5 and 14 total errors, respectively. SVM shows 74 FNs, failing particularly to identify class 2, which affects its Recall.

The multi-criteria optimization of the two key parameters of accuracy and Precision for the different algorithms tested is described in Figure 28. Random Forest presents the best performance for both metrics, constituting the optimal solution for malfunctions occurring at accuracy values in the range of 80–90%. The Ensemble presents the second-best performance, indicating a high accuracy and a Precision reaching approximately 99%, with very few malfunctions, indicating that the model has a very low failure rate. SVM maintains high accuracy, but its Precision gradually drops, with accuracy values in the range of 90–99%, indicating that it fails at a certain accuracy threshold. DNN shows an even larger drop in Precision compared to accuracy, which may indicate that the model has difficulty in correctly predicting positive cases with failures occurring even when the accuracy is high.

In order to achieve better monitoring of the state of the model’s operation, specific thresholds were set for each metric. The alerts corresponding to a number of threshold values were created in order to evaluate the metric performance and categorize them into three states: Normal, Marginal, and Critical. Figure 29 shows the results of condition monitoring where 11 (55%) alerts were found and classified as Normal, 5 (25%) as Critical, and 4 as Marginal (20%), with a total of 20 alerts. In general, what we find is the satisfactory operation of the majority of the alert elements, which indicates that the system is working properly.

4.2.5. Update Database

Proper monitoring of the operational status of the machine and the elevator system requires the process of updating and managing the data in real time. A data collection and processing platform with the ability to monitor vibration and energy values was built. The platform design was based on the new ISO 20816-3 standard, as shown in Figure 30. It distinguishes the operating state of the motor, considering the RMS value of the vibration velocity and the estimated change of state. To make any malfunction more distinct, the states are color-coded.

Based on the results of the predictive maintenance, the state of health of the machine is classified as Normal, indicating that the equipment is operating within acceptable limits. The vibration velocity measurement is 0.94 mm/s, it is classified as Newly Commissioned Machinery, and no immediate intervention is required. In addition, taking into account temperature variations, the estimated remaining health lifetime is highest at 3 months, an extremely positive indicator of the machine’s functionality.

The possibility of monitoring the energy quantities allows us to transmit online electrical characteristics of the stator windings and the system in general. In Figure 31 we can distinguish the state of the motor based on the energy data, the manufacturer’s specifications, and the active power. The data includes the recording of the rms values of voltage and current in the three phases of the motor and the total apparent, reactive, and active values, as well as the harmonic distortion of voltage and current. Also, for better visualization of the results, the platform has the ability to display the voltage waveform as shown in Figure 31.

To ensure reproducibility and to support further research, the complete dataset used in this study, including synchronized current and vibration signals under healthy and faulty conditions, has been made available through an open access repository [94].

5. Discussion

The proposed methodology relies on machine learning techniques in order to achieve early fault detection in elevator drive systems by monitoring the functionality of the PMSM. The simulation results demonstrate the accuracy and effectiveness of the proposed method as well as categorization of system operating states for immediate intervention. The main novelty of this paper is the utilization of two basic signals for studying electrical and mechanical problems, their processing for feature extraction, the combination of categorization and data generation techniques, as well as the comparative investigation of various machine learning algorithms. Unlike other traditional techniques, the work presented in this paper does not rely on reactive maintenance but adopts predictive maintenance, thereby minimizing cost and improving output. Other ML approaches require a large amount of labeled data, while PU learning overcomes this limitation.

Some future extensions of the technique could be as follows:

Using Edge AI with a local analysis capability, reducing the need for cloud computing and response delays;
Incorporating Explainable AI into machine learning algorithms to better understand the causes of faults;
Evaluation of alternative RL approaches by implementing Multi-Agent RL (MARL) and Meta-RL;
Digital twin technology to simulate different lift faults;
A combined study of other machine learning algorithms such as Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Bayesian Neural Networks (BNN);
Combining Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) to extract spatial and temporal features from sensors;
Enhancement of feature extraction via Wavelet Packet Decomposition (WPD) and Empirical Mode Decomposition (EMD) for the detection of non-stationary and nonlinear patterns in signals from PMSMs.

6. Conclusions

Focusing on the key points of this paper, we can see the significant contribution of advanced signal analysis methods as well as an improvement in fault prediction. Our research proposes a PMSM maintenance framework for elevator applications based on machine learning methodologies to monitor the system status in real operating conditions, avoiding potential outages and ensuring robust reliability.

The effectiveness of the proposed method is distinguished through the use of PU learning, Reinforcement Learning, and deep machine learning models for more accurate diagnosis and correct fault classification. The results confirm that the algorithm was found to be highly effective against key challenges such as a lack of labeled data. Furthermore, the use of Gaussian Filter and Extended Kalman Filter (EKF) led to a significant improvement of the signals by removing noise in the sensor data, while signal analysis via Short-Time Fourier Transform (STFT) and Fast Fourier Transform (FFT) allowed for an initial analysis of possible electrical and mechanical faults.

With advanced data visualization and analysis evaluating the performance metrics, the accuracy and other factors confirmed the reliability of the system, while visualization techniques helped to better understand fault patterns. Experimental measurements showed that the system can operate independently of the lift control panel, offering a non-intrusive solution for preventive maintenance. The probability of fault occurrence based on the data processing reached 35%, while the proposed maintenance time was calculated at 5 days based on the analysis of the measured current values. Finally, it was considered necessary for the maintainer to monitor the alarms in the Marginal and Critical states particularly, in order to be able to intervene immediately and to reassess the situation in a short time.

This work demonstrates the feasibility of a non-intrusive, multimodal, AI-driven fault prediction system on a real operational elevator with a PMSM drive. The method operates without requiring access to the controller or motor internals, ensuring compatibility with warranty and safety regulations. Future extensions of this research study could include a sensitivity analysis of the parameters affecting the data training model, as well as the implementation of predictive maintenance based on data collected using different motor types and elevator architectures with additional smart sensors. In addition, Finite Element Analysis (FEA) could be employed to simulate fault conditions in the PMSM, enabling the system to detect fault types, estimate their probability of occurrence, and define corresponding threshold values.

Author Contributions

Conceptualization, V.I.V.; methodology, V.I.V., T.S.K., D.E.E., E.I.V., S.D.V., V.E.B. and A.C.G.; software, V.I.V.; validation, V.I.V., T.S.K., D.E.E., E.I.V., S.D.V., V.E.B. and A.C.G.; formal analysis, V.I.V.; investigation, V.I.V., T.S.K., D.E.E., E.I.V., S.D.V., V.E.B. and A.C.G.; data curation, V.I.V. and T.S.K.; writing—original draft preparation, V.I.V., T.S.K., D.E.E., E.I.V., S.D.V., V.E.B. and A.C.G.; writing—review and editing, T.S.K. and D.E.E.; visualization, V.I.V., T.S.K., D.E.E., E.I.V., S.D.V., V.E.B. and A.C.G.; supervision, T.S.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The datasets generated and analyzed during the current study are available in an open access repository [94].

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Hou, Y.; Dong, J. Design and Application of Elevator Variable Frequency Speed Control System based on PLC. In Proceedings of the International Conference on Advances in Electrical and Computer Applications (AEECA), Dalian, China, 20–21 August 2021; pp. 344–348.
2. Al-Kodmany, K. Elevator Technology Improvements: A Snapshot. Encyclopedia 2023, 3, 530–548.
3. Avsar, Y.; Fenercioglu, A.; Soyaslan, M. Design Optimization of PM Synchronous Motor: Rail Mounted Belt Drive Elevator Systems. IEEE Trans. Ind. Appl. 2024, 60, 301–311.
4. Niu, D.; Song, D. Model-Based Robust Fault Diagnosis of Incipient ITSC for PMSM in Elevator Traction System. IEEE Trans. Instrum. Meas. 2023, 72, 3533512.
5. Gupta, S.; Kumar, A.; Maiti, J. A critical review on system architecture, techniques, trends and challenges in intelligent predictive maintenance. Saf. Sci. 2024, 177, 106590.
6. Vlachou, V.I.; Karakatsanis, T.S.; Kladas, A.G. Current trends in elevator systems protection including fault tolerance and condition monitoring techniques implemented in emerging synchronous motor drives. In Proceedings of the Protection, Automation & Control World Conference (Pac World 2024), Athens, Greece, 17–21 June 2024; pp. 1–22. Available online: https://www.researchgate.net/publication/381800209 (accessed on 30 April 2025).
7. Mahmoud, M.S.; Huynh, V.K.; Senanayaka, J.S.L.; Robbersmyr, K.G. Robust Multiple-Fault Diagnosis of PMSM Drives Under Variant Operations and Noisy Conditions. IEEE Open J. Ind. Electron. Soc. 2024, 4, 762–772.
8. Wang, Z.; Yang, J.; Ye, H.; Zhou, W. A review of Permanent Magnet Synchronous Motor fault diagnosis. In Proceedings of the IEEE Conference and Expo Transportation Electrification Asia-Pacific (ITEC Asia-Pacific), Beijing, China, 31 August–3 September 2014.
9. Shih, K.J.; Hsieh, M.F.; Chen, B.J.; Huang, S.F. Machine Learning for Inter-Turn Short-Circuit Fault Diagnosis in Permanent Magnet Synchronous Motors. IEEE Trans. Magn. 2022, 58, 8204307.
10. Du, B.; Huang, W.; Cheng, Y.; Chen, J.; Tao, R.; Cui, S. Fault Diagnosis and Separation of PMSM Rotor Faults Using Search Coil Based on MVSA and Random Forest. IEEE Trans. Ind. Electron. 2024, 71, 15089–15099.
11. Sevetlidis, V.; Pavlidis, G.; Mouroutsos, S.G.; Karakatsanis, T.; Gasteratos, A. Positive and Unlabelled Learning with Data Approximation for Photovoltaic Defect Detection. In Proceedings of the International Conference on Imaging Systems and Techniques (IST), Tokyo, Japan, 14–16 October 2024.
12. Dong, T.; Zang, C.; Zeng, P. Fault diagnosis study of elevator based on stochastic configuration networks. In Proceedings of the International Conference on Industrial Artificial Intelligence (IAI), Shenyang, China, 24–27 August 2022; pp. 1–6.
13. Akbar, S.; Vaimann, T.; Asad, B.; Kallaste, A.; Sardar, M.U.; Kudelina, K. State-of-Art Techniques for Fault Diagnosis in Electrical Machines: Advancements and Future Directions. Energies 2023, 16, 6345.
14. Kim, M.; Park, J.; Moon, H.J.; Ko, S. EKF/MM-based Fault Diagnosis Algorithm of PMSM in the Presence of Sensor, Open and Short-Circuit Faults. In Proceedings of the International Conference on Control, Automation and Diagnosis (ICCAD), Paris, France, 15–17 May 2024.
15. Orlowska-Kowalska, T.; Wolkiewicz, M.; Pietrzak, P.; Skowron, M.; Ewert, P.; Tarchala, G. Fault Diagnosis and Fault-Tolerant Control of PMSM Drives-State of the Art and Future Challenges. IEEE Access 2022, 10, 59979–60024.
16. Pietrzak, P.; Wolkiewicz, M. Fault Diagnosis of PMSM Stator Winding Based on Continuous Wavelet Transform Analysis of Stator Phase Current Signal and Selected Artificial Intelligence Techniques. Electronics 2023, 12, 1543.
17. Vlachou, V.I.; Karakatsanis, T.S.; Kladas, A.G. Fault Tolerant Real Time Monitored Elevator System Development. In Proceedings of the 14th Mediterranean Conference on Power Generation, Transmission, Distribution and Energy Conversion (MEDPOWER 2024), Athens, Greece, 3–6 November 2024.
18. Karakatsanis, T.S. Modeling of elevators as Driving systems. In Proceedings of the International Associate of Elevator Engineering (IAEE), Thessaloniki, Greece, 11–13 June 2008.
19. Vlachou, E.I.; Vlachou, V.I.; Efstathiou, D.E.; Karakatsanis, T.S. Overview of IoT Security Challenges and Sensors Specifications in PMSM in Elevator Applications. Machines 2024, 12, 839.
20. Pietrzak, P.; Wolkiewicz, M. Comparison of Selected Methods for the Stator Winding Condition Monitoring of a PMSM Using the Stator Phase Currents. Energies 2021, 14, 1630.
21. Wang, K.; Dai, G.; Guo, L. Intelligent Predictive Maintenance (IPdM) for Elevator Service. In Proceedings of the International Workshop of Advanced Manufacturing and Automation (IWAMA 2016), Wuyi, China, 10–11 November 2016; pp. 1–6.
22. Lai, C.T.A.; Jiang, W.; Jackson, P.R. Internet of Things enabling condition-based maintenance in elevator service. J. Qual. Maint. Eng. 2019, 25, 563–588.
23. Sharma, A.; Chatterjee, A.; Thakur, P.K.; Jha, S.; Sriharipriya, K. IoT Based Automated Elevator Emergency Alert System Using Android Mobile Application. In Proceedings of the International Conference on Data Science and Information System (ICDSIS), Hassan, India, 29–30 July 2022.
24. Wang, X.; Lu, S.; Huang, W.; Wang, Q.; Zhang, S.; Xia, M. Efficient Data Reduction at the Edge of Industrial Internet of Things for PMSM Bearing Fault Diagnosis. IEEE Trans. Instrum. Meas. 2021, 70, 3508612.
25. Mishra, K.M.; Huhtala, K. Elevator Fault Detection Using Profile Extraction and Deep Autoencoder Feature Extraction for Acceleration and Magnetic Signals. Appl. Sci. 2019, 9, 2990.
26. ISO 20816-1:2016; Mechanical Vibration—Measurement and Evaluation of Machine Vibration—Part 1: General Guidelines. International Organization for Standardization (ISO): Geneva, Switzerland, 2016.
27. De las Morenas, J.; Moya-Fernandez, F.; Lopez-Gomez, J.A. The Edge Application of Machine Learning Techniques for Fault Diagnosis in Electrical Machines. Sensors 2023, 23, 2649.
28. Szydlo, K.; Longwic, R. Diagnostics of the Passenger Lift Winch. Adv. Sci. Technol. 2018, 12, 26–35.
29. Duan, Y.; Toliyat, H. A review of condition monitoring and fault diagnosis for permanent magnet machines. In Proceedings of the IEEE Power and Energy Society General Meeting, San Diego, CA, USA, 22–26 July 2012.
30. Siddiqui, K.M.; Bakhsh, F.I.; Ahmad, R.; Solanki, V. Advanced Signal Processing Based Condition Monitoring of PMSM for Stator-inter Turn Fault. In Proceedings of the Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON), Dehradun, India, 11–13 November 2021.
31. Zhang, Y.; Mao, Y.; Wang, X.; Wang, Z.; Xiao, D.; Fang, G. Current Prediction-Based Fast Diagnosis of Electrical Faults in PMSM Drives. IEEE Trans. Transp. Electrif. 2022, 8, 4622–4632.
32. Liang, Y. Diagnosis of Inter-Turn Short-Circuit Stator Winding Fault in PMSM Based on Stator Current and Noise. In Proceedings of the International Conference on Industrial Technology (ICIT), Busan, Republic of Korea, 26 February–1 March 2014; pp. 138–142.
33. Huang, W.; Huang, Y.; Luo, L.; Zhou, L. A Mixed Logical Dynamic Model-Based Open-Circuit Fault Diagnosis Method for Five-Phase PMSM Drives. In Proceedings of the International Conference on Electrical Machines and Systems (ICEMS), Zhuhai, China, 5–8 November 2023; pp. 3641–3646.
34. Zhao, J.; Guan, X.; Li, C.; Mou, Q.; Chen, Z. Comprehensive Evaluation of Inter-Turn Short Circuit Faults in PMSM Used for Electric Vehicles. IEEE Trans. Intell. Transp. Syst. 2021, 22, 611–621.
35. Jiang, Y.; Ji, B.; Zhang, J.; Yan, J.; Li, W. An Overview of Diagnosis Methods of Stator Winding Inter-Turn Short Faults in Permanent Magnet Synchronous Motors for Electric Vehicles. World Electr. Veh. J. 2024, 15, 165.
36. Antoni, J. A Tutorial Review on Time-Frequency Analysis of Non-Stationary Vibration Signals with Nonlinear Dynamics Applications. Mech. Syst. Signal Process. 2009, 23, 182–217.
37. Strangas, E.G.; Aviyente, S.; Zaidi, S.S.H. Time-Frequency Analysis for Efficient Fault Diagnosis and Failure Prognosis for Interior Permanent-Magnet AC Motors. IEEE Trans. Ind. Electron. 2008, 55, 4191–4199.
38. Chen, Y.; Liang, S.; Li, W.; Liang, H.; Wang, C. Faults and Diagnosis Methods of Permanent Magnet Synchronous Motors: A Review. Appl. Sci. 2019, 9, 2116.
39. Krichen, M.; Benhadj, N.; Chaieb, M.; Neji, R. Fault Detection and Diagnosis Methods in Permanent Magnet Synchronous Machines: A Review. In Proceedings of the International Conference on Recent Advances in Electrical Systems, Hammamet, Tunisia, 22–24 December 2017; pp. 229–237. Available online: https://www.researchgate.net/publication/328610743 (accessed on 30 April 2025).
40. Mbo’o, C.P.; Hameyer, K. Bearing damage diagnosis by means of the linear discriminant analysis of stator current feature. In Proceedings of the International Symposium on Diagnostics for Electrical Machines, Power Electronics and Drives (SDEMPED), Guarda, Portugal, 1–4 September 2015; pp. 296–302.
41. Beldjaatit, C.; Sebbagh, T.; Guentri, H. Time Domain Approach for Rolling Element Bearing Fault Detection and Diagnosis in Vibration Monitoring. In Proceedings of the International Conference of Innovation in Science and Engineering (ICISE), Kuala Lumpur, Malaysia, 23–24 December 2022; pp. 96–102. Available online: https://www.researchgate.net/publication/372595835 (accessed on 30 April 2025).
42. Sreejith, S.; Verma, A.S.; Kumar, C.S.; Kar, I.N. Fault Diagnosis of Rolling Element Bearing Using Time-Domain Features and Neural Networks. In Proceedings of the 2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems, Kharagpur, India, 8–10 December 2008; pp. 201–205.
43. Lee, J.S.; Yoon, T.M.; Lee, K.B. Bearing fault detection of IPMSMs using zoom FFT. J. Electr. Eng. Technol. 2016, 11, 1235–1241.
44. Yang, M.; Chai, N.; Liu, Z.; Ren, B.; Xu, D. Motor speed signature analysis for local bearing fault detection with noise and cancellation based on improved drive algorithm. IEEE Trans. Ind. Electron. 2020, 67, 4172–4182.
45. Gurusamy, V.; Baruti, K.H.; Zafarani, M.; Lee, W.; Akin, B. Effect of magnets asymmetry on stray magnetic flux based bearing damage detection in PMSM. IEEE Access 2021, 9, 68849–68860.
46. Ren, B.; Yang, M.; Chai, N.; Li, Y.; Xu, D. Fault Diagnosis of Motor Bearing Based on Speed Signal Kurtosis Spectrum Analysis. In Proceedings of the International Conference on Electrical Machines and Systems (ICEMS), Harbin, China, 11–14 August 2019.
47. Ebrahimi, B.M.; Faiz, J. Configuration impacts on eccentricity fault detection in permanent magnet synchronous motors. IEEE Trans. Magn. 2012, 48, 903–906.
48. Vlachou, V.I.; Karakatsanis, T.S.; Kladas, A.G. Energy Savings in Elevators by Using a Particular Permanent-Magnet Motor Drive. Energies 2023, 16, 4716.
49. Rao, A.; Mircevski, S. Possibilities for Energy Saving Predictions in Elevators. Power Electron. Drives 2021, 6, 218–228.
50. Bhuiyan, E.A.; Maeenul, A.A.; Sajal, K.D.; Ali, F.; Tasneem, Z.; Islam, R.; Saha, D.K.; Badal, F.R.; Ahamed, H.; Moyeen, S.I. A Survey on Fault Diagnosis and Fault Tolerant Methodologies for Permanent Magnet Synchronous Machines. Int. J. Autom. Comput. 2020, 17, 763–787.
51. Jong, P.; Hakala, M. The Advantage of PMSM Technology in High-Rise Buildings. J. Build. Eng. 2020, 32, 101799.
52. Chen, K.-Y.; Huang, M.-S.; Fung, R.-F. Dynamic Modelling and Input-Energy Comparison for the Elevator System. Appl. Math. Model. 2014, 38, 2037–2050.
53. Merabet, A. (Ed.) Advanced Control Systems for Electric Drives; MDPI: Basel, Switzerland, 2020; 342p.
54. Wang, Q.; Chen, L.; Xiao, G.; Wang, P.; Gu, Y.; Lu, J. Elevator Fault Diagnosis Based on Digital Twin and PINNs-e-RGCN. Sci. Rep. 2024, 14, 30713.
55. Wang, R.; Zhan, X.; Bai, H.; Dong, E.; Cheng, Z.; Jia, X. A Review of Fault Diagnosis Methods for Rotating Machinery Using Infrared Thermography. Micromachines 2022, 13, 1644.
56. Kuang, Z.; Wu, S.; Du, B.; Xu, H.; Cui, S.; Chan, C.C. Thermal Analysis of Fifteen-Phase Permanent Magnet Synchronous Motor Under Different Fault Tolerant Operations. IEEE Access 2019, 7, 81466–81480.
57. Liu, Y.; Zhang, B.; Zong, M.; Feng, G. Thermal Analysis of a Modular Permanent Magnet Machine under Open-Circuit Fault with Asymmetric Temperature Distribution. Electronics 2023, 12, 1623.
58. Senanayaka, J.S.L.; Van Khang, H.; Robbersmyr, K.G. Online Fault Diagnosis System for Electric Powertrains Using Advanced Signal Processing and Machine Learning. In Proceedings of the 2018 IEEE International Conference on Electrical Machines (ICELMACH), Alexandroupoli, Greece, 3–6 September 2018; pp. 1–6.
59. Nunes, P.; Santos, J.; Rocha, E. Challenges in Predictive Maintenance—A Review. CIRP J. Manuf. Sci. Technol. 2023, 40, 53–67.
60. Heydarzadeh, M.; Zafarani, M.; Ugur, E.; Akin, B.; Nourani, M. A Model-Based Signal Processing Method for Fault Diagnosis in PMSM Machine. In Proceedings of the 2017 IEEE Energy Conversion Congress and Exposition (ECCE), Cincinnati, OH, USA, 1–5 October 2017; pp. 5663–5670.
61. Saha, S.; Kar, U. Signal-Based Position Sensor Fault Diagnosis Applied to PMSM Drives for Fault-Tolerant Operation in Electric Vehicles. World Electr. Veh. J. 2023, 14, 123.
62. Ma’arif, A.; Iswanto; Nuryono, A.A.; Alfian, R.I. Kalman Filter for Noise Reducer on Sensor Readings. Signal Image Process. Lett. 2020, 1, 11–22.
63. Wen, Z.; Fu, Z.; Gao, Y.; Wang, B.; Huang, R.; Long, F. Non-invasive Method for Elevator’s Movement Monitoring Based on MEMS Sensor and Kalman Filter. In Proceedings of the 2018 14th IEEE International Conference on Signal Processing (ICSP), Beijing, China, 12–16 August 2018; pp. 1383–1387.
64. Gao, F.; Yin, Z.; Li, L.; Li, T.; Liu, J. Gaussian Noise Suppression in Deadbeat Predictive Current Control of Permanent Magnet Synchronous Motors Based on Augmented Fading Kalman Filter. IEEE Trans. Energy Convers. 2023, 38, 1410–1420.
65. Peroutka, Z.; Smidl, V.; Vosmik, D. Challenges and Limits of Extended Kalman Filter-Based Sensorless Control of Permanent Magnet Synchronous Machine Drives. In Proceedings of the 13th European Conference on Power Electronics and Applications (EPE ’09), Barcelona, Spain, 8–10 September 2009; pp. 1–11.
66. Belkhadir, A.; Pusca, R.; Romary, R.; Belkhayat, D.; Zidani, Y. Detection of External Rotor PMSM Inter-Turn Short Circuit Fault Using Extended Kalman Filter. In Proceedings of the 14th International Symposium on Diagnostics for Electrical Machines, Power Electronics and Drives (SDEMPED), Chania, Greece, 28–31 August 2023; IEEE: New York, NY, USA, 2023.
67. Rosero, J.A.; Romeral, L.; Ortega, J.A.; Rosero, E. Short-Circuit Detection by Means of Empirical Mode Decomposition and Wigner–Ville Distribution for PMSM Running Under Dynamic Condition. IEEE Trans. Ind. Electron. 2009, 56, 4534–4547.
68. Selçuk, R.; Doğan, Z. A Diagnosis of Stator Winding Fault Based on Empirical Mode Decomposition in PMSMs. Balk. J. Electr. Comput. Eng. 2020, 8, 73–80.
69. Pietrzak, P.; Wolkiewicz, M. Application of Support Vector Machine to Stator Winding Fault Detection and Classification of Permanent Magnet Synchronous Motor. In Proceedings of the 2021 IEEE 19th International Power Electronics and Motion Control Conference (PEMC), Gliwice, Poland, 25–29 April 2021.
70. Dou, Y.; Guo, L.; Li, Y.; Zhu, Y. Application of Support Vector Machine in Fault Diagnosis of Elevator. MATEC Web Conf. 2019, 267, 01008.
71. Mishra, K.M.; Huhtala, K. Condition Monitoring of Elevator Systems using Deep Neural Network. J. Sens. Actuator Netw. 2021, 10, 14.
72. Ezugwu, A.E.; Ikotun, A.M.; Oyelade, O.O.; Abualigah, L.; Agushaka, J.O.; Eke, C.I.; Akinyelu, A.A. A Comprehensive Survey of Clustering Algorithms: State-of-the-Art Machine Learning Applications, Taxonomy, Challenges, and Future Research Prospects. Eng. Appl. Artif. Intell. 2022, 110, 104743.
73. Dahiya, S.; Nanda, H.; Artwani, J.; Varshney, J. Using Clustering Techniques and Classification Mechanisms for Fault Diagnosis. Int. J. Adv. Trends Comput. Sci. Eng. 2020, 9, 2138–2146.
74. Chen, L.; Lan, S.; Jiang, S. Elevators Fault Diagnosis Based on Artificial Intelligence. J. Phys. Conf. Ser. 2019, 1345, 042024.
75. Park, C.H. Multi-Class Positive and Unlabeled Learning for High Dimensional Data Based on Outlier Detection in a Low Dimensional Embedding Space. Electronics 2022, 11, 2789.
76. Dogru, O.; Xie, J.; Prakash, O.; Chiplunkar, R.; Soesanto, J.; Chen, H.; Velswamy, K.; Ibrahim, F.; Huang, B. Reinforcement Learning in Process Industries: Review and Perspective. IEEE/CAA J. Autom. Sin. 2024, 11, 283–300.
77. Chen, C.; Ren, X.; Cheng, G. Research on Distributed Fault Diagnosis Model of Elevator Based on PCA-LSTM. Algorithms 2024, 17, 250.
78. Pan, J.; Shao, C.; Dai, Y.; Wei, Y.; Chen, W.; Lin, Z. Research on Fault Prediction Method of Elevator Door System Based on Transfer Learning. Sensors 2024, 24, 2135.
79. Pan, W.; Xiang, Y.; Gong, W.; Shen, H. Risk Evaluation of Elevators Based on Fuzzy Theory and Machine Learning Algorithms. Mathematics 2024, 12, 113.
80. Mishra, K.M.; Huhtala, K.J. Fault Detection of Elevator Systems Using Multilayer Perceptron Neural Network. In Proceedings of the 2019 24th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), Zaragoza, Spain, 10–13 September 2019; pp. 904–909.
81. Ahn, J.; Lee, Y.; Kim, N.; Park, C.; Jeong, J. Federated Learning for Predictive Maintenance and Anomaly Detection Using Time Series Data Distribution Shifts in Manufacturing Processes. Sensors 2023, 23, 7331.
82. Linardatos, P.; Papastefanopoulos, V.; Kotsiantis, S. Explainable AI: A Review of Machine Learning Interpretability Methods. Entropy 2021, 23, 18.
83. Perez-Cerrolaza, J.; Abella, J.; Borg, M.; Donzella, C.; Cerquides, J.; Cazorla, F.J.; Englund, C.; Tauber, M.; Nikolakopoulos, G.; Flores, J.L. Artificial Intelligence for Safety-Critical Systems in Industrial and Transportation Domains: A Survey. ACM Comput. Surv. 2024, 56, 1–40.
84. Hodavand, F.; Ramaji, I.J.; Sadeghi, N. Digital Twin for Fault Detection and Diagnosis of Building Operations: A Systematic Review. Buildings 2023, 13, 1426.
85. Rashed, A.N.Z.; Yarrarapu, M.; Prabu, R.T.; Raj Antony, G.S.; Edeswaran, L.; Kumar, E.S.; Aswitha, K.; Snehith, N.; Ahammad, S.H. Connected smart elevator systems for smart power and time saving. Sci. Rep. 2024, 14, 19330.
86. Wang, B.; Zhang, T.; Zhang, T.; Yu, C.; Zou, H. Optimizing Elevator Energy Use with Machine Learning-Based Real-Time Prediction. In Proceedings of the IEEE International Conference on Smart Systems and Technologies, Hangzhou, China, 12–14 July 2024; pp. 123–128.
87. Nelson, W.; Culp, C. Machine Learning Methods for Automated Fault Detection and Diagnostics in Building Systems—A Review. Energies 2022, 15, 5534.
88. Verma, G.; Awatramani, S. Implementing Deep Learning Model to Predict the Maintenance of an Elevator System. In Proceedings of the 2021 IEEE International Conference on Artificial Intelligence and Computer Engineering (ICAICE), Belagavi, India, 21–23 May 2021; pp. 1–5.
89. Zonta, T.; da Costa, C.A.; Righi, R.d.R.; de Lima, M.J.; da Trindade, E.S.; Li, G.P. Predictive Maintenance in the Industry 4.0: A Systematic Literature Review. Comput. Ind. Eng. 2020, 150, 106889.
90. Alanne, K.; Sierla, S. An Overview of Machine Learning Applications for Smart Buildings. Sustain. Cities Soc. 2022, 76, 103445.
91. Xu, S.; Yu, H.; Wang, H.; Chai, H.; Ma, M.; Chen, H.; Zheng, W.X. Simultaneous Diagnosis of Open-Switch and Current Sensor Faults of Inverters in IM Drives Through Reduced-Order Interval Observer. IEEE Trans. Ind. Electron. 2024, 71, 6485–6496.
92. ISO 20816-3; Mechanical Vibration-Measurements and Evaluation of Machine Vibration—Part 3: Industrial Machines with Nominal Power Above 15 kW and Nominal Speeds Between 120 r/min and 30,000 r/min When Measured In Situ. International Organization for Standardization: Geneva, Switzerland, 2022.
93. Sevetlidis, V.; Pavlidis, G.; Mouroutsos, S.G.; Gasteratos, A. Dense-PU: Learning a Density-Based Boundary for Positive and Unlabeled Learning. IEEE Access 2024, 12, 90287–90298.
94. Vlachou, V.I.; Karakatsanis, T.S.; Efstathiou, D.E.; Vlachou, E.I.; Vologiannidis, S.D.; Balaska, V.E.; Gasteratos, A.C. Multimodal Signal Dataset for Fault Detection in PMSM-Driven Elevators Under Real Operating Conditions. Zenodo, 2019. Dataset. Available online: https://zenodo.org/records/15613954 (accessed on 7 June 2025).

Figure 1. Overview of the residential apartment elevator installation used in this study.

Figure 2. (a) Cross-section view (one-quarter section) of the PMSM. (b) Simplified layout of stator winding in PMSM showing the potential location of a short circuit fault in phase A.

Figure 3. Diagram of the developed framework.

Figure 4. Healthy current graphs in a PMSM: (a) Upward movement; (b) downward movement.

Figure 5. Measured faulty signal: (a) $f_{1}$ = 5 kHz; (b) $f_{2}$ = 12 kHz.

Figure 6. Gaussian Filtering in a healthy current signal in a PMSM: (a) Upward movement; (b) downward movement.

Figure 7. Extended Kalman Filtering in a healthy current signal in a PMSM: (a) Upward movement; (b) downward movement.

Figure 8. Gaussian Filtering in a faulty current signal in a PMSM: (a) $f_{1}$ = 5 kHz; (b) $f_{2}$ = 12 kHz.

Figure 9. Extended Kalman Filtering in a faulty current signal in a PMSM: (a) $f_{1}$ = 5 kHz; (b) $f_{2}$ = 12 kHz.

Figure 10. Measured healthy vibration signal in a PMSM: (a) Time domain; (b) frequency domain.

Figure 11. Measured faulty vibration signal in a PMSM: (a) Time domain; (b) frequency domain.

Figure 12. STFT healthy current signal in a PMSM for 0 kg: (a) Upward motion; (b) downward motion.

Figure 13. STFT healthy current signal in a PMSM for 150 kg: (a) Upward motion; (b) downward motion.

Figure 14. STFT healthy current signal in a PMSM for 300 kg: (a) Upward motion; (b) downward motion.

Figure 15. STFT healthy current signal in a PMSM for 415 kg: (a) Upward motion; (b) downward motion.

Figure 16. STFT faulty current signal in a PMSM: (a) $f_{1}$ = 5 kHz; (b) $f_{2}$ = 12 kHz.

Figure 17. STFT vibration signal in a PMSM: (a) Healthy; (b) faulty.

Figure 18. Feature distribution: (a) Original; (b) normalized.

Figure 19. Skewness distribution for healthy and faulty data.

Figure 20. Feature correlation heatmap.

Figure 21. Confusion matrix for the classifications of the original samples.

Figure 22. Feature visualization.

Figure 23. Comparison of characteristic distribution.

Figure 24. Original (blue) and synthetic data (green) visualization.

Figure 25. Basic characteristic curves of the proposed model: (a) ROC curve; (b) Precision–Recall curve.

Figure 26. Comparative analysis metric characteristics for each algorithm.

Figure 27. Confusion matrix for different classification models.

Figure 28. Pareto front and malfunction multi-criteria optimization.

Figure 29. Condition monitoring of the operation system.

Figure 30. Dashboard with vibration signal and predictive maintenance in a PMSM.

Figure 31. Dashboard with energy data and condition monitoring in elevator system.

Table 1. Nominal parameters of the elevator system.

Quantity	Value	Units
Total Weight Chamber (P)	495	kg
Nominal Load (Q)	450	kg
Counterweight (G)	720	kg
Rated Speed (U)	1	m/s
Diameter of the Friction Pulley (D)	240	mm
Force Power (F)	225	kg
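
The values in Table 1 are mutually consistent under a common traction-elevator sizing rule, in which the counterweight balances the chamber (car) plus a fraction of the nominal load. The sketch below assumes a balance factor of 0.5; this factor is inferred from the tabulated numbers and is not stated in the table, and the function names are illustrative only.

```python
# Consistency check of Table 1 under an ASSUMED balance factor of 0.5
# (inferred from the tabulated values, not stated in the source).

def counterweight_mass(car_kg: float, nominal_load_kg: float, balance: float = 0.5) -> float:
    """Counterweight that balances the car plus a fraction of the nominal load."""
    return car_kg + balance * nominal_load_kg


def unbalanced_mass(car_kg: float, load_kg: float, counterweight_kg: float) -> float:
    """Net mass difference seen at the pulley (car side minus counterweight side)."""
    return car_kg + load_kg - counterweight_kg


P_KG = 495.0  # Total Weight Chamber (P), Table 1
Q_KG = 450.0  # Nominal Load (Q), Table 1

G_KG = counterweight_mass(P_KG, Q_KG)
full_car = unbalanced_mass(P_KG, Q_KG, G_KG)   # car loaded with Q
empty_car = unbalanced_mass(P_KG, 0.0, G_KG)   # car with no load

print(f"counterweight G = {G_KG:.0f} kg")
print(f"unbalanced mass, full car  = {full_car:+.0f} kg")
print(f"unbalanced mass, empty car = {empty_car:+.0f} kg")
```

Under this assumption the computed counterweight equals the tabulated G = 720 kg, and the magnitude of the unbalanced mass in both extreme cases equals the tabulated F = 225 kg.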

Table 2. Nominal characteristics and design parameters of the PMSM.

Parameters	Value	Units
Output Power ($P_{out}$)	5.1	kW
Input Power ($P_{in}$)	6.0	kW
Nominal Current ($I_{n}$)	10	A
Nominal Torque ($T_{n}$)	350	Nm
Nominal Speed (n)	160	rpm
Nominal Voltage ($V_{n}$)	380	V
Number of Poles (p)	12	-
Frequency (f)	16	Hz
Efficiency (η)	85	%
Power Factor (cosφ)	0.95	-
Moment of Inertia (J)	0.35	$kg \cdot m^{2}$
Stator Outer Diameter ($D_{o}$)	220	mm
Stator Inner Diameter ($D_{s}$)	126	mm
Rotor Outer Diameter ($D_{r}$)	124	mm
Rotor Inner Diameter ($D_{ir}$)	100	mm
Axial Length (L)	350	mm
Shaft Diameter ($D_{shaft}$)	60	mm
Airgap Length ($L_{g}$)	1	mm
Slot Opening Width ($b_{so}$)	3	mm
Slot Width at the Base ($b_{s1}$)	8.6	mm
Slot Width at the Top ($b_{s2}$)	12.5	mm
Stator Tooth Shoe Height ($H_{so}$)	1.5	mm
Stator Curvature Height ($H_{s1}$)	2.5	mm
Slot Total Height ($H_{s2}$)	18	mm
Magnet Thickness ($l_{m}$)	4.5	mm
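
Two of the nominal values in Table 2 can be cross-checked from the others with standard synchronous-machine relations: the electrical frequency f = p·n/120 for p poles at n rpm, and the efficiency as the ratio of output to input power. The following is a minimal sketch of that check; the function names are illustrative and are not part of the implemented framework.

```python
# Cross-check of Table 2 nominal values using standard relations.

def electrical_frequency_hz(poles: int, speed_rpm: float) -> float:
    """Synchronous electrical frequency: f = p * n / 120 (p = number of poles)."""
    return poles * speed_rpm / 120.0


def efficiency(p_out_kw: float, p_in_kw: float) -> float:
    """Efficiency as the ratio of output power to input power."""
    return p_out_kw / p_in_kw


freq_hz = electrical_frequency_hz(poles=12, speed_rpm=160.0)
eta = efficiency(p_out_kw=5.1, p_in_kw=6.0)

print(f"electrical frequency = {freq_hz:.1f} Hz")
print(f"efficiency = {100 * eta:.1f} %")
```

Both results agree with the tabulated frequency of 16 Hz and efficiency of 85%.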

Table 3. Implemented framework.

Software	Version	Objective
Python	3.10.11	Main Programming Language
Numpy	1.23.5	Array Computations
Pandas	1.5.2	Data Analysis
Matplotlib	3.6.2	Graph Visualization
Scipy	1.9.3	Signal Processing
Scikit-learn	1.2.2	Preprocessing and Classification
PyKalman	0.9.5	Kalman Filtering
TensorFlow/Keras	2.12.0	Deep Learning

Table 4. Main characteristics from healthy current signal analysis.

Parameters	Kurtosis	Skewness	$F_{mean}$	CF	Entropy
Upward 0 kg	−1.178693	−0.079805	174.641483	2.092751	22.513715
Downward 0 kg	−1.449820	−0.022865	128.423439	1.698632	8.598839
Upward 150 kg	−1.368116	−0.038797	143.970040	1.842108	16.954872
Downward 150 kg	−1.474748	−0.009041	121.203588	1.714493	9.874350
Upward 300 kg	−1.378286	−0.028730	136.443628	1.738689	10.857418
Downward 300 kg	−1.202166	−0.068855	163.865171	1.949762	17.181473
Upward 415 kg	−1.440483	−0.022311	127.162694	1.694734	8.427130
Downward 415 kg	−1.043277	0.032834	185.328061	2.140981	22.979208
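
The kurtosis values in Table 4 lie between about −1.0 and −1.5, which is the range expected for the excess kurtosis of a nearly sinusoidal current (a pure sine wave gives exactly −1.5). The sketch below shows one way such statistical features could be computed with NumPy. It is not the authors' implementation: the mean-frequency definition (amplitude-weighted spectral mean) is an assumption, the entropy column is omitted because its definition is not given here, and the 16 Hz test signal and 1 kHz sampling rate are illustrative choices.

```python
import numpy as np


def signal_features(x, fs):
    """Skewness, excess kurtosis, crest factor and mean frequency of a 1-D signal."""
    x = np.asarray(x, dtype=float)
    centered = x - x.mean()
    std = centered.std()

    skewness = np.mean(centered**3) / std**3
    kurtosis = np.mean(centered**4) / std**4 - 3.0   # excess (Fisher) kurtosis

    rms = np.sqrt(np.mean(x**2))
    crest_factor = np.max(np.abs(x)) / rms           # peak value over RMS value

    # ASSUMED definition: amplitude-weighted mean of the one-sided spectrum.
    spectrum = np.abs(np.fft.rfft(x))
    freqs = np.fft.rfftfreq(x.size, d=1.0 / fs)
    f_mean = np.sum(freqs * spectrum) / np.sum(spectrum)

    return {
        "kurtosis": kurtosis,
        "skewness": skewness,
        "f_mean": f_mean,
        "crest_factor": crest_factor,
    }


# Illustrative input: one second of a pure 16 Hz sine sampled at 1 kHz.
fs = 1000.0
t = np.arange(0.0, 1.0, 1.0 / fs)
sine = np.sin(2.0 * np.pi * 16.0 * t)

features = signal_features(sine, fs)
for name, value in features.items():
    print(f"{name}: {value:.4f}")
```

For this ideal sine the excess kurtosis is −1.5 and the crest factor is close to √2; the measured currents in Table 4 deviate from these reference values as harmonics and load-dependent transients are added.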

Table 5. Main characteristics from faulty current signal analysis for different switching frequencies.

Parameters	Kurtosis	Skewness	$F_{mean}$	CF	Entropy
$f_{1}$ = 5 kHz	−0.518260	0.037931	169.027522	3.721357	5.707496
$f_{2}$ = 12 kHz	−0.768821	−0.071542	152.274270	2.929318	11.589026

Table 6. Vibration signal characteristics for each operation.

Parameters	Kurtosis	Skewness	$F_{mean}$	CF	Entropy
Healthy	−0.088098	0.038074	206.423823	3.735503	−23,915.8115
Faulty	61.059997	7.037883	180.372562	8.565845	−17,575.1876

Table 7. Calculated key features of machine learning algorithms.

Algorithms	Accuracy	Precision	Recall	F1-Score	ROC-AUC
Random Forest	99.50%	99.60%	99.40%	99.50%	99.50%
DNN	90.40%	92.24%	90.40%	91.31%	91.40%
SVM	89.50%	84.96%	96.00%	90.14%	89.50%
Ensemble	98.10%	100.00%	96.20%	98.06%	98.10%
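
The F1-scores in Table 7 can be reproduced from the tabulated precision and recall through the harmonic mean F1 = 2PR/(P + R). The short check below uses only the values reported in the table; the dictionary layout and function name are illustrative.

```python
# Recompute F1 from the precision and recall reported in Table 7 (all in percent).

reported = {
    # algorithm: (precision, recall, reported F1)
    "Random Forest": (99.60, 99.40, 99.50),
    "DNN": (92.24, 90.40, 91.31),
    "SVM": (84.96, 96.00, 90.14),
    "Ensemble": (100.00, 96.20, 98.06),
}


def f1_percent(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (inputs and output in percent)."""
    return 2.0 * precision * recall / (precision + recall)


recomputed = {name: f1_percent(p, r) for name, (p, r, _) in reported.items()}
max_gap = max(abs(recomputed[name] - reported[name][2]) for name in reported)

for name, value in recomputed.items():
    print(f"{name}: recomputed F1 = {value:.2f}%, reported = {reported[name][2]:.2f}%")
print(f"largest difference = {max_gap:.4f} percentage points")
```

All four recomputed values agree with the reported F1-scores to within rounding, which indicates that the precision, recall, and F1 columns are internally consistent.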

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
