1. Introduction
Ultra-high voltage (UHV) transmission line construction is often carried out in mountainous and spatially constrained environments, where tower erection involves high-altitude assembly, heavy lifting, and complex safety risks [
1]. In such contexts, mobile gin poles are widely used because they can be coupled with the tower structure without requiring large external working space [
2]. Compared with conventional gin poles, mobile gin poles integrate the lifting structure with a tracked chassis, thereby improving mobility and reducing relocation time. Owing to these advantages, mobile gin poles have shown strong potential for tower erection in complex terrain.
Despite these benefits, the operation of mobile gin poles still faces significant challenges in safety and efficiency [
3]. Their lifting tasks are executed in limited workspaces and often near surrounding structures or energized components, which increases the risks of collision, overload, and structural failure. In current practice, these risks are still mainly managed through manual observation and experience-based decision-making, which makes it difficult to achieve timely perception, consistent load control, and efficient operation under changing construction conditions [
4].
Building on research in technologies such as sensors, communications, virtual platforms, and artificial intelligence, many studies have addressed the challenges of automated lifting operations using methods such as computer vision [
5], path planning [
6], finite-element analysis [
7], and digital twins [
8]. For example, digital twin technology has been applied across multiple industries, achieving significant success by creating virtual scenarios consistent with the real world to enable real-time monitoring and control of the entire system [
9]. The rapid development of artificial intelligence technology has provided a fast and feasible method for establishing associations between different data sets [
10]. In practice, efficiency and safety often have conflicting improvement directions, making comprehensive optimization of the construction process challenging. Therefore, scholars have proposed multi-objective optimization algorithms, with notable examples including genetic algorithms and particle swarm algorithms [
11]. Parallel control theory provides an efficient path for the simulation and control of complex systems [
12]. However, most of these studies focus on conventional cranes rather than mobile gin poles. Although mobile gin poles share certain structural and operational similarities with cranes, they differ in terms of tower-coupled configuration, working space constraints, and multi-mode motion characteristics. As a result, existing crane-oriented approaches cannot be directly transferred. More importantly, there is still a lack of an intelligent control framework for mobile gin poles that can integrate real-time perception, structural response prediction, multi-objective decision-making, and uncertainty-aware virtual–physical interaction within a unified system [
13].
To address this gap, this study proposes a digital twin-enabled robust parallel control framework for the mobile gin pole. The framework combines finite-element-based kinetic analysis, a TROA-ET surrogate prediction model, and NSGA-III-based multi-objective optimization to support real-time safety assessment and operational decision-making under uncertain conditions. Specifically, this study addresses three questions: (1) how to realize real-time spatial attitude monitoring for collision-risk reduction; (2) how to balance operational efficiency and structural safety during lifting; and (3) how to achieve robust parallel control under uncertainty. Accordingly, the main contributions of this study are as follows: (1) a digital twin based intelligent control framework of the mobile gin pole is established; (2) a behavior–mechanics surrogate model is developed by combining finite-element analysis with a meta-heuristic optimized ET model for accurate structural response prediction; and (3) a multi-objective optimization strategy is introduced to improve both safety and efficiency in practical lifting scenarios. Among these, the key methodological contribution is the construction of a computationally efficient surrogate model that translates high-fidelity finite-element results into real-time structural response prediction for control purposes. On this basis, multi-objective optimization and digital twin integration are used to support safety–efficiency co-optimization and closed-loop virtual–physical interaction. Therefore, the novelty of this study should be understood as an integrated control framework tailored to mobile gin poles.
The subsequent sections of this paper are arranged as follows.
Section 2 reviews the relevant literature on crane equipment control methods.
Section 3 elaborates on the proposed framework and explains the mechanisms of the involved algorithms, including the TROA-ET and the NSGA-III.
Section 4 further illustrates the effectiveness of the proposed framework through real-world examples.
Section 5 discusses the performance of different optimization algorithms in this case study. Finally,
Section 6 summarizes the conclusions of this study, along with limitations and future recommendations.
2. Literature Review on Crane Equipment Control
Regarding the control issues of mobile gin pole systems, a type of large-scale crane equipment, numerous scholars have conducted relevant research to enhance the safety and performance of such equipment. In this section, we will review the research on the control of crane equipment, which can be categorized into several areas: (1) rule-based control using computational models, (2) intelligent control based on artificial intelligence, and (3) system control based on digital twins. While these research streams have been successfully applied in crane control, their integration into a unified intelligent control framework for mobile gin poles remains largely unexplored.
- (1)
Rule-based control using computational models
Rule-based control using mathematical models. This type of research aims to quantitatively calculate the forces and motion of equipment using mathematical and physical theories, with most methods based on the lumped mass approach. This includes Lagrange modeling and bond graph modeling [
14,
15]. Mustafa Tinkir et al. [
16] used the Lagrange formula to obtain a dynamic model of a scaled crane system. Yingguang et al. modeled marine cranes using the bond graph method [
15]. On the other hand, finite-element analysis and computer-based model analysis are also used for equipment mechanical analysis [
17]. For example, Gerdemeli et al. [
18] used the finite-element method to perform stress analysis on crane components, considering the crane’s self-weight, effective load, hook weight, trolley weight, and dynamic load, to investigate the damage caused by heavy loads to the equipment. Research on crane mechanical modeling has already reached a relatively mature stage.
- (2)
Intelligent control based on artificial intelligence
Currently, equipment controller designs are mainly divided into open-loop control and closed-loop control. Compared to open-loop control, closed-loop controllers can adjust their performance based on the required output response. Feedback schemes utilize measurements and estimates of system state to reduce oscillations and achieve accurate system positioning. Therefore, feedback loops or closed-loop control schemes are less sensitive to disturbances and parameter changes [
19]. To measure system state, necessary sensors must be added, which increases the additional costs. One drawback of closed-loop systems is their slow response due to input delay in the feedback loop [
20].
Closed-loop controllers can be categorized into linear control [
21], optimal control [
22], adaptive control [
23], intelligent control [
24], sliding mode control [
25], etc. Among these, intelligent control has garnered significant attention due to the rapid development of artificial intelligence technology. Intelligent control can achieve direct or indirect adaptive control under highly nonlinear systems or model uncertainty conditions, leveraging its powerful learning and adaptive capabilities. Neural networks possess excellent nonlinear processing capabilities and inherent robustness due to their parallel architecture, making them significant for solving mathematical modeling problems [
26]. Lee et al. [
27] proposed a combination of neural networks and sliding mode controllers to achieve precise vehicle positioning and eliminate the sway angle of the payload. Duong et al. [
28] proposed a hybrid evolutionary algorithm to control an underactuated three-dimensional tower crane system using a recurrent neural network. The hybrid evolutionary algorithm (HEA) was designed by embedding genetic operators (crossover and mutation) from genetic algorithms (GA) into particle swarm optimization (PSO) to create offspring that consider parent selection based on adaptive results. The hybrid algorithm was used to construct an RNN-based controller. It demonstrated good performance as it could drive the system to the desired point. Fuzzy logic controllers (FLC) have also been widely used in many crane control systems, which perform well in handling unstable machines, nonlinear systems, and optimal point control problems [
29]. Benhellal et al. [
30]. combined NN with FLC to serve as a neuro-fuzzy controller for crane systems. The main method of the proposed control strategy is to use sliding mode theory as the learning algorithm to adjust the neuro-fuzzy parameters. Li et al. [
31] proposed a method combining NN and fuzzy logic on a specific crane system. The learning algorithm adopted by the neuro-fuzzy controller is based on ant colony optimization, demonstrating faster convergence performance compared to the backpropagation algorithm.
In recent years, swarm intelligence optimization and deep learning have been widely applied in engineering fields such as structural health monitoring and damage identification. Meta-heuristic algorithms effectively solve nonlinear, multi-local-optimum inverse and parameter identification problems, while deep learning supports damage detection, data recovery and physics-informed analysis, and their hybrid strategies further improve data reconstruction and response estimation. Both approaches excel in handling nonlinear mappings, noisy data and incomplete measurements. However, deep learning demands large datasets, high computational costs and extensive parameter tuning, whereas swarm intelligence-optimized ensemble learning is more suitable for medium-scale datasets and scenarios requiring efficiency and robustness. Given this study uses 3350 finite-element simulation samples to build an accurate and efficient surrogate model for real-time digital twin decision support, the TROA-ET strategy is adopted to balance accuracy, robustness and efficiency, with ongoing advances in swarm intelligence and deep learning offering promising directions for future work.
- (3)
System control based on digital twins
The concept of digital twins was first proposed by Grieves as a digital virtual representation of a physical entity. A digital twin primarily consists of a physical entity, a virtual model, and the connection between the physical and virtual components [
32,
33]. It is updated through modeling, simulation, and self-optimization of the physical entity. NASA and the U.S. Air Force have applied digital twin technology to various types of aircraft for condition monitoring, fault diagnosis, lifespan prediction, and design optimization by integrating diverse heterogeneous information sources [
34,
35,
36]. Additionally, extensive research has demonstrated its applications in various equipment, including tunnel boring machines [
37], wind turbines [
38], and computer numerical control (CNC) machine tools [
39]. Digital twins have evolved from a descriptive concept into a practical technology.
In particular, structural safety monitoring is one of the most promising applications of digital twins, as it provides real-time reflections of the physical world and facilitates decision-making based on rapid analysis algorithms [
40]. Haag and Anderl [
41] developed a digital twin that uses the finite-element method (FEM) for structural analysis of a simple bending beam, with actual forces or displacements obtained from sensors as input. Guivarch et al. [
42] proposed a method for constructing a digital twin of a helicopter’s tilting rotor assembly using multibody simulation. Fotland et al. [
43] performed various simulation methods and effectively constructed a digital twin for crane pulleys and cables. Moi et al. [
44] used strain gauges as load sensors and implemented a digital twin for state monitoring of folding-arm cranes based on an inverse method, which can determine stress, strain, and load at an unlimited number of points in real time. The focus of these studies is to combine mechanistic models, such as structural statics and dynamics with numerical methods to establish digital twins. The fundamental advantage of constructing digital twins using this approach is that the model parameters have practical physical significance, which facilitates scientific interpretation of the results. However, due to the large computational load, numerical methods are typically difficult to use in real-time applications.
To overcome this limitation, Rasheed et al. [
45] proposed to use artificial intelligence (AI) to construct digital twins. Both mechanical models and statistical models have their advantages and disadvantages. Therefore, a more reasonable approach should be adopted to combine simulation with AI to construct digital twins for structural analysis [
46]. Willcox et al. [
47] proposed a promising method that combines a component-based reduced-order model library with Bayesian state estimation or optimal tree algorithms to create data-driven, physics-based digital twins. Therefore, if computational workload is reduced and prediction accuracy is improved, digital twins composed of mechanistic and statistical models will become lighter, more efficient, and more reliable.
Overall, the reviewed studies provide important foundations for intelligent lifting control, but they also reveal clear limitations when considered from the perspective of mobile gin pole applications. Rule-based approaches offer good interpretability and physical consistency, yet they usually rely on simplified assumptions and are difficult to deploy in real time under complex and uncertain construction conditions. AI-based approaches improve adaptability and nonlinear mapping capability, but many of them are weakly connected to structural mechanics and often focus on control performance without explicitly coupling safety-related structural responses with operational efficiency. Digital twin-based approaches provide a promising way to connect physical assets with virtual models for monitoring and decision support; however, existing studies mainly emphasize state monitoring or structural analysis, and an integrated framework that combines real-time perception, physics-informed surrogate prediction, multi-objective optimization, and virtual–physical parallel control is still lacking, especially for mobile gin poles.
Compared with these existing studies, the proposed framework advances the literature in three aspects. First, it combines finite-element-based kinetic analysis with a TROA-ET surrogate model to establish a behavior–mechanics mapping, thereby preserving physical relevance while improving computational efficiency. Second, it moves beyond pure prediction or monitoring by introducing NSGA-III to jointly optimize structural safety and operational efficiency under different operating conditions. Third, it embeds the above models into a digital twin-enabled parallel control architecture with perception, prediction, and guidance subsystems, so that the framework supports not only offline analysis but also virtual–physical interaction and robust decision-making under uncertainty. Therefore, the proposed method can be understood as an integrated extension beyond existing rule-based, AI-based, and digital twin approaches, rather than a simple application of any single research stream.
4. Case Study
To demonstrate the practical performance and operational effectiveness of the framework proposed in this study, and to test the reliability of the prediction and optimization methods, this study uses actual engineering cases as samples for method validation. This section provides a comprehensive overview of the engineering background, variable prediction, and target parameter optimization, and analyzes the results.
4.1. Data and Resources
The real-world scenario in this study is derived from the 110 kV transmission project of a 220 kV substation in a certain city, with the MT2DO6 model tracked tower erection equipment as the research subject. All sensor monitoring data in the system is sourced from the equipment’s original or newly added sensors, as shown in
Figure 8. The equipment primarily performs three operational modes: lifting, luffing, and slewing. The mobile gin pole rotation speed is ≤0.3 r/min, with a rotation angle of ±170°, a lifting speed of 0–15 m/min, a lifting height ranging from 9 m to 34.6 m, a rated lifting moment of 5.98 t·m, and a luffing speed of 0–7.5 m/min.
A high-precision sensor system records the working status of the gin pole in real time, as shown in
Figure 8, thus providing data support for safety risk control. For better monitoring and risk control, a bidirectional data communication system has been developed, capable of bi-directional transmission of information. Its data is derived from a wide range of installed sensors. The upgraded equipment includes jib tilt sensors, hook tilt sensors, hook weight sensors, wind speed sensors, and track vehicle horizontal angle sensors. By calculating sensor monitoring data, information such as the current working amplitude of the mobile gin pole, torque values, and torque difference values can be obtained. After the sensors are fixedly installed, address binding and calibration operations must be performed to ensure that monitoring data is displayed normally. The equipment employs SPC-SDIO-S1212 intelligent distributed I/O across its various systems, utilizing the CANopen bus to collect data and transmit control commands. Additionally, the research equipment is equipped with a Data Transfer Unit (DTU) to facilitate the conversion between serial data and IP data, establishing a data connection with the cloud platform via 4G signals. The DTU module supports the MQTT IoT protocol and is currently connected to the hydraulic system controller, sensor system controller, and main controller within the equipment. The equipment connection interface uses an RS485 interface, enabling real-time transmission of equipment sensing information to a pre-configured MQTT server via the 4G network, as well as subscription to relevant information.
The dataset in this study comes from finite-element calculation results, collecting force data of the luffing, lifting, and slewing of the gin pole under different load conditions. One-hot encoding was used to process the categorical variables. Considering the structural safety and efficiency performance during operation, the maximum axial stress and maximum shear stress of the structure were selected as the target parameters in this study. Axial stress and shear stress are the direct causes of structural failure. At the same time, this study used the product of the load weight and lifting speed to characterize the operating efficiency of the mobile gin pole, i.e., the change in gravitational potential energy of the lifted object per unit time. These three target parameters can well represent the comprehensive operating status of the mobile gin pole. In this study, the main control and state parameters of the gin pole were selected as decision variables for subsequent control optimization, as shown in
Table 1. The relevant parameter values distribution is shown in
Figure 9.
To clarify the representativeness of the dataset, the 3530 samples were not intended to exhaust all possible field scenarios, but to cover the main operational envelope of the studied mobile gin pole in a structured manner. Specifically, the dataset includes the three primary operating modes of the equipment, i.e., lifting, slewing, and luffing, and spans the principal decision and state variables that govern its mechanical response, including load, elevation angle, acceleration, velocity, and time history. The simulated conditions were designed around the actual operating characteristics of the MT2DO6 equipment and the engineering constraints adopted in this study, so that the generated samples represent typical combinations of working states within the practical operating range rather than isolated idealized cases. Therefore, the dataset should be understood as a representative finite-element-based sampling of realistic operating conditions for the target equipment.
4.2. Model Development
This study uses the TROA-ET hybrid algorithm to map the relationships between different parameters. ET constructs multiple decision trees and then averages them to obtain the final prediction results, thereby reducing the risk of overfitting in a single decision tree and improving generalization ability. To obtain a more reasonable combination of hyperparameters, TROA is used to find potential combinations of hyperparameters. In ET, the core hyperparameters include the size of decision trees (n_estimators), minimum sample size for splitting (min_samples_split), and minimum sample size of leaf (min_samples_leaf). In TROA, the Tyrannosaurus rex position corresponds to a set of randomly generated hyperparameter lists. By calculating the loss function on the test set and using it as a fitness function, the quality of each individual (i.e., hyperparameter combination) in TROA is evaluated.
The TROA-ET model is trained on a dataset of 3530 samples, of which 70% is used as the training set and 30% as the test set. The dataset is derived from finite-element kinetic analysis models based on the physical behavior of an actual gin pole to ensure the robustness of the training process. After multiple trial calculations, the hyperparameters of TROA are determined as shown in
Table 2. The ET models for the two target parameters
and
were optimized using TROA for hyperparameter combination, and the results are shown in
Table 3. This target parameter
does not require prediction because it can be calculated directly.
To improve the reproducibility of the case study, the main implementation settings are summarized here. The dataset used for surrogate model development contained 3530 samples generated from finite-element simulations of the mobile gin pole under different lifting, slewing, and luffing conditions. The categorical operation modes were transformed using one-hot encoding, and the dataset was divided into training and test subsets with a ratio of 70% and 30%, respectively. To avoid using the test set during model selection, TROA-based hyperparameter optimization was conducted only on the training set. Specifically, for each candidate–hyperparameter combination, 5-fold cross-validation was performed on the training set, and the average validation loss was used as the fitness criterion of TROA. After the optimal hyperparameters were identified, the ET model was retrained using the full training set, and its final performance was evaluated on the independent test set. All baseline models followed the same train–test split for fair comparison. This protocol ensures that hyperparameter tuning and final model assessment were separated, thereby improving the rigor and reproducibility of the reported results. The predicted results are shown in
Table 4. The prediction targets of the TROA-ET model were the maximum axial stress and maximum shear stress, while the efficiency objective was directly calculated as the product of load and lifting velocity.
For hyperparameter optimization, TROA was adopted to search for the optimal ET settings. The number of iterations and population size were both set to 50, the convergence threshold was set to 1 × 10
−6, and the mutation probability was set to 0.1. The search ranges of the ET hyperparameters were [30, 250] for n_estimators, [2, 4] for min_samples_split, and [1, 3] for min_samples_leaf. After optimization, the best hyperparameter combinations were obtained as (75, 2, 1) for maximum axial stress prediction and (93, 2, 1) for maximum shear stress prediction. For multi-objective optimization, the three operating conditions, i.e., lifting, slewing, and luffing, were optimized separately according to the engineering constraints listed in
Table 5. The objective set included maximum axial stress, maximum shear stress, and operational efficiency, and the final solution was selected from the Pareto set using the weighted sum method. To evaluate computational efficiency, the runtime of each optimization algorithm was recorded under the same implementation conditions. NSGA-III achieved the lowest runtime of 8.13 s, compared with 10.44 s for ACO, 9.20 s for PSO, and 12.71 s for SA. In addition, the finite-element model was used for offline data generation and post-optimization validation, whereas the online decision-making stage relied on the surrogate model rather than repeated high-fidelity simulations, which reduced the computational burden and improved practical applicability.
All finite-element simulations and data-driven computations were conducted on a workstation equipped with Inter Core i7-12700F, 32 GB, running a Windows 10 system. The finite-element analysis was implemented in ANSYS 2024 R2, while the surrogate modeling and optimization procedures were implemented in Python 3.10. These settings are reported to facilitate replication and future comparative studies.
To validate the reliability of the selected decision parameters, further correlation analysis was conducted.
Figure 9 shows that the correlation between the decision variables and the objective is weak, with no redundant variables. For the TROA-ET prediction model used in this study, the main hyperparameters were selected according to
Section 4.2. The results of the prediction model are shown in
Figure 10. All models used the same external train–test split, while hyperparameter tuning for the proposed TROA-ET model was performed only within the training set via cross-validation. The corresponding comparison analysis results will be detailed in the discussion section.
Figure 11 and
Table 4 present the performance of different baseline models under the same dataset.
Figure 12 shows the performance changes in the TROA-ET model and other selected baseline models under different Gaussian noise influences and different sampling ratios.
To minimize safety risks during operation while balancing operational efficiency, this study employs NSGA-III to address the multi-objective optimization problem. The objective parameters are set as the objective function, and the Pareto optimal solution with the highest optimization degree is identified using the weighted sum method within the Pareto solution set. This solution includes the specific values of seven decision variables, including three operating mode variables represented by unique heat coding, which can guide operators or directly control the mobile gin pole in the parallel control system to enhance safety performance and efficiency.
After establishing a predictive model for decision variables and objective parameters, it is necessary to optimize the main control parameters of the mobile gin pole to achieve better operating conditions. To distinguish the working conditions under different operations, three operating conditions were defined based on actual conditions, categorized according to different operations. During the optimization process, the value of each variable must align with engineering realities and be controlled within reasonable limits. The optimization algorithm requires an objective function as shown in Equation (17). In this study, it is the maximum minimum value of three objective parameters, defined as Equation (18).
where
is the fitness function for the
ith objective,
are the constraint functions, and
is the required value for the constraint. Based on the engineering experience, the constraint ranges of the control parameters are given in
Table 5.
- (1)
The TROA-ET prediction model proposed in this study achieved R
2 values of 0.964 and 0.943 for the two prediction targets, respectively. As shown in
Figure 11, the Tyrannosaurus algorithm was used to optimize the performance of the ET model, achieving excellent results, the best among all models. The model generated RMSE of 19.6 and 7.42 for the two prediction targets, with MSE results as low as 385 and 5.51, respectively. Although there are some differences in the magnitude of these results, they remain small compared to the scale of the original data. As shown in
Figure 10, the evaluation metric values for the training set are slightly better than those for the test set. Overall, the model achieved satisfactory predictive performance on both the training set and the independent test set. Since hyperparameter tuning was conducted through cross-validation within the training set, the test results provide a more reliable estimate of the generalization capability of the proposed model. In the parallel control system, the predicted values and actual historical data are shown in
Figure 10. Compared with machine learning baseline methods, the accuracy of TROA-ET used in this study improved by 1.5% to 21.5%. As shown in
Figure 11 and
Table 4, the R
2 scores for TROA-ET, DT, XGB, CB, and WRF are 0.9642, 0.9458, 0.9299, 0.9170, and 0.9504, respectively. Compared to other methods, the prediction accuracy of TROA-ET improved by 21.5%, 1.5%, 5.1%, and 1.5%, respectively. Other accuracy regression metrics, such as RMSE and MSE, also showed similar performance. The results indicate that this method outperforms other machine learning algorithms in terms of accuracy.
- (2)
The proposed TROA-ET model maintains an average R
2 value of 0.583 when the noise level rises to 0.7 and achieves an average R
2 value of 0.697 at an ultra-low sampling rate (0.3), performing best among all models and demonstrating excellent robustness. As shown in
Figure 12, the TROA-ET model demonstrates optimal prediction performance and stability under different noise levels and sampling ratios. Its R
2 score increases steadily with increasing sampling ratio, and the decrease in R
2 score under noise interference is the smallest. In contrast, traditional models like DT are highly sensitive to data volume and noise, with performance deteriorating sharply under insufficient sampling or high noise conditions. While ensemble methods like XGB and CB perform well, their stability falls short of TROA-ET. Additionally, the CB model exhibits abnormally low values in small-sample scenarios (sampling ratio of 0.4), indicating that its randomness may affect reliability. Therefore, the TROA-ET model has significant advantages in terms of data efficiency (low sampling requirements) and robustness (noise resistance), making it particularly suitable for scenarios with limited data quality in practical engineering applications.
- (3)
An optimal decision model was established based on NSGA-III, which reduced safety indicators by an average of 8.40%, 22.40%, and 32.30% under the three operating conditions of lifting, slewing, and luffing, respectively, while improving operational efficiency by an average of 42.30%, 40.57%, and 32.73%, respectively. The average optimization rates are 18.90%, 29.31%, and 32.28%, respectively, significantly improving the safety metrics and operational efficiency of the mobile gin pole. The optimization results were validated using finite-element analysis, confirming the practical effectiveness of this optimization approach.
Figure 13 and
Table 6 in the reference study clearly illustrate the progress achieved in these objectives. As shown in
Figure 13 of the manuscript, the entire solution set is relatively uniformly distributed in three-dimensional space, forming a Pareto front, with no tendency to collapse into a two-dimensional plane or a one-dimensional point, indicating a conflicting relationship among the three objective parameters. NSGA-III demonstrates sustained performance improvements in safety control, proving its versatility and adaptability.
Table 6 reinforces this by showing the algorithm’s consistent effectiveness in enhancing all three objective parameters under different conditions. The maximum optimization rates for these objectives are impressive, at 31.19%, 71.24%, and 10.34%, respectively. To validate the practical effectiveness of this method, the optimized results were subjected to finite-element dynamic analysis, as shown in
Figure 14. Both the maximum axial stress and shear stress of the structure were significantly reduced. These results not only highlight the algorithm’s high adaptability but also its potential to significantly extend the service life of the mobile gin pole and reduce maintenance requirements. Overall, NSGA-III reduces specific safety risks during equipment operation.
- (4)
This study presents the development of a virtual–physical integrated mobile gin pole control system, as illustrated in
Figure 15. The system achieves real-time perception mapping and closed-loop control between physical assets and their corresponding digital twins. A sensor network is used to collect real-time operational data from the equipment. The data are preprocessed through edge computing and then sent to a finite-element-based digital twin model. The model dynamically simulates structural responses and the evolution of the construction process. This supports an intelligent closed-loop workflow of “perception–simulation–decision–execution.” In addition, the integration of agent-based models and multi-objective optimization improves both safety and operational efficiency in complex environments. Furthermore, the layered architecture and core technologies developed can be extended to other equipment types, such as tower cranes, offering a scalable technical framework for the intelligent control of construction machinery.
5. Discussions
In this study, meta-heuristic algorithms are employed for model hyperparameter optimization and the recommendation of optimal operational parameters. Meta-heuristic algorithms combine random algorithms with local search algorithms, gradually emerging as powerful tools in the field of decision optimization. This approach simulates natural mechanisms such as evolution and group behavior to identify optimal solutions in complex problem spaces, finding widespread application in engineering optimization, artificial intelligence, and other fields [
54]. Classic meta-heuristic algorithms include: genetic algorithms (GA) that simulate biological evolution, particle swarm optimization algorithms (PSO) that mimic bird flocks foraging, and ant colony algorithms (ACO) that draw inspiration from ants’ path selection when searching for food. These algorithms not only exhibit theoretical innovation but also demonstrate robust performance in practical applications.
To evaluate the optimization performance of optimization algorithms, corresponding algorithm evaluation metrics must be introduced. This study has three optimization objectives, which are high-dimensional optimization objectives. When selecting evaluation metrics, it is important to consider that the solution set in high-dimensional objective spaces becomes extremely sparse, and a limited solution set is unlikely to cover the entire Pareto front. In high-dimensional objective spaces, the adaptability of various commonly used evaluation metrics decreases. Specifically, this study selected three evaluation metrics: time (
T), generation distance (
GD), and inverse generation distance (
IGD). These three evaluation metrics exhibit good adaptability in high-dimensional objective spaces, with calculations as follows:
where
S represents the Pareto approximate optimal solution set.
P represents the Pareto optimal solution set.
represents the Euclidean distance between two elements.
represents fitness function.
GD denotes the convergence of the algorithm.
IGD can denote both the convergence and diversity of the algorithm. Both
GD and
IGD are dimensionless.
In this study, several mainstream meta-heuristic algorithms were selected, including NSGA-III, PSO, ant colony, and simulated annealing algorithms (SAs). Under the same proxy model and the same optimal solution evaluation criteria, the optimization performance was compared, as shown in
Table 7.
The experimental results indicate that the NSGA-III algorithm demonstrates significant advantages over ACO, PSO, and SAs in terms of convergence and computational efficiency. As shown in
Table 7, the lowest GD value achieved by NSGA-III is 0.25, which is notably superior to ACO, PSO, and SA. A lower GD value indicates better convergence to the optimal solution set. In terms of computational speed, NSGA-III stands out with a computational time of only 8.13 s, demonstrating its processing efficiency and lower computational complexity. Another important aspect is the distribution and scalability of the solution set. Both NSGA-III and ACO exhibit outstanding performance in this regard, with more uniform solution set distribution and stronger scalability compared to the SA and PSO algorithms. Additionally, the IGD values for NSGA-III and ACO are both 0.24, significantly lower than PSO’s 2.11. The lower IGD values of NSGA-III algorithms indicate that they not only converge to the true Pareto front but also cover it more uniformly. This makes them more robust and comprehensive when handling multiple objectives, a common scenario in complex engineering problems. In terms of average multi-objective optimization rates, NSGA-III and ACO exhibit similar performance, while PSO and SA perform slightly worse. The results indicate that NSGA-III performs better in terms of convergence efficiency, computational cost, and accuracy.
It is worth noting that the present TROA-ET framework belongs to a broader family of intelligent engineering methods that combine optimization algorithms and data-driven models. Recent studies have shown that swarm intelligence algorithms are effective for solving inverse identification and parameter optimization problems in engineering, while deep learning models are increasingly used for damage detection, structural response estimation, and monitoring data reconstruction. These methods are particularly attractive when the target problem involves strong nonlinearity, high-dimensional search spaces, or incomplete sensor observations. Nevertheless, deep learning models often rely on larger datasets and higher computational resources, which may limit their direct use in engineering scenarios where data volume is moderate and online deployment efficiency is important. In contrast, the present study focuses on a medium-scale finite-element-derived dataset and a real-time digital twin control scenario. Under this setting, the TROA-ET model provides a more lightweight surrogate modeling strategy while still achieving strong predictive accuracy, noise resistance, and low-sampling robustness, as demonstrated in the case study.
Construction projects are not homogeneous processes; instead, they involve multiple stages with substantially different site environments, operational objectives, and data forms. For example, Wang’s domain-adaptive Faster R-CNN study addressed non-PPE identification on construction sites from body-worn and general images, whereas Wang’s graph-neural-network-driven text classification study focused on fire-door defect inspection in pre-completion construction, indicating that construction intelligence problems can differ markedly between safety monitoring and defect inspection tasks across project phases [
55,
56]. In this context, the present study focuses on the tower-erection stage of UHV construction and proposes an intelligent control framework for mobile gin poles, while broader deployment in construction practice should further account for phase-dependent environmental variability and task heterogeneity.
6. Conclusions and Future Work
This study addresses the issue of safety and efficiency optimization for mobile gin poles in complex construction environments by proposing a robust parallel control framework based on digital twins. To achieve accurate prediction of the target parameters, this study proposes a TROA-ET hybrid model. By combining the TROA-ET hybrid prediction model with the NSGA-III optimization algorithm, the parallel control system facilitates accurate regression prediction and improvement of key control parameters. The validity of this integrated approach is assessed through extensive computational analysis.
To verify the feasibility of the framework, this method is validated through practical cases. The results indicate that: (1) The TROA-ET model can forecast the values of target parameters with high precision, as evidenced by an average R2 of 0.9534. This model has good robustness, demonstrating efficient utilization of data and strong noise resistance. (2) Thanks to the agency relationship established by TROA-ET, NSGA-III is used to convert complex dynamic constraints into quantifiable objective functions. This optimization algorithm reduces safety metrics by an average of 8.40%, 22.40%, and 32.30% under lifting, slewing, and luffing conditions, respectively, while improving operational efficiency by an average of 42.30%, 40.57%, and 32.73%, respectively. (3) The gin pole parallel control system achieved mapping and closed-loop control between physical entities and digital twins. The system provides a visual interface, physics-based collision monitoring, and optimized decision-making, which helps engineers understand the complexity of engineering. The case shows that the system can ensure the safe operation of equipment and improve efficiency, proving its progressiveness.
Despite the promising results, several limitations of the present study should be acknowledged. First, the proposed framework was validated on a specific engineering case and a specific type of mobile gin pole, and its scalability to other lifting equipment, such as tower cranes or other crane-like construction machinery, has not yet been fully verified. Although the proposed architecture shows potential for extension, differences in structural configuration, working mechanisms, motion patterns, and control objectives may require additional model adaptation and parameter recalibration before direct transfer can be achieved. Second, the current study mainly demonstrates the effectiveness of the framework under the developed digital twin environment and case-based validation, whereas large-scale real-time deployment in field conditions still faces practical challenges. These challenges include computational latency, communication delay between sensors, edge devices, and cloud platforms, synchronization between the physical system and the virtual system, as well as the stability of online optimization under continuously changing outdoor construction conditions. Third, the framework is highly dependent on the quality and reliability of sensor data. Since the digital twin mapping, state perception, and optimization decisions all rely on sensor measurements, issues such as sensor noise, missing data, calibration drift, communication interruption, or individual sensor failure may reduce the accuracy of structural response prediction and affect the robustness of control decisions. Fourth, the present dataset and influencing factors are still relatively limited. Although the selected parameters can represent the major operational characteristics of the mobile gin pole, additional environmental, structural, and operational factors may further improve the generalization and interpretability of the model.
Future research should therefore focus on several concrete directions. First, multi-project and multi-equipment datasets should be established to evaluate the transferability of the framework across different types of lifting equipment and construction scenarios. In particular, transfer learning, domain adaptation, or parameter-sharing strategies may be explored to extend the proposed method from mobile gin poles to structurally similar equipment such as tower cranes. Second, to enhance field applicability, future studies should investigate lightweight surrogate modeling, reduced-order mechanical models, and cloud–edge collaborative computing architectures, so that prediction, optimization, and control can be executed with lower latency and higher real-time responsiveness. Third, more robust sensing and perception strategies should be developed, including sensor redundancy design, multi-source data fusion, fault diagnosis, missing-data recovery, and uncertainty-aware control mechanisms, in order to reduce the framework’s dependence on individual sensor quality. Fourth, long-term field deployment and hardware-in-the-loop validation should be conducted to assess the stability of the framework under real construction disturbances, varying weather conditions, and communication uncertainties. Finally, the digital twin control concept may be extended beyond tower erection to other stages of UHV transmission line construction, such as foundation construction, hoisting coordination, and integrated site-level equipment management, thereby promoting the broader development of intelligent and resilient construction systems in power infrastructure engineering.