1. Introduction
In recent years, the scale and intensity of power grid transmission project construction have continued to increase, which has led to stricter requirements for project costs [
1]. The efficient and accurate prediction of transmission project costs is of great significance for fine control of power grid costs and improvement of infrastructure investment efficiency [
2]. Especially in high-altitude regions, the construction of power transmission and transformation projects is of strategic significance and presents a challenge. These regions typically contain abundant mineral, hydro, wind, and solar resources, making them important bases for national energy strategies and key channels for exporting clean energy [
3]. China’s high-altitude regions cover a vast area, spanning multiple provinces, and the number of related power transmission and transformation projects has been increasing year by year [
4]. The construction of such projects is directly related to providing a reliable power supply for millions of people, as well as promoting social stability and economic development in border areas, and the implementation of major energy strategies such as the “Power Transmission From West to East” of the country. The harsh natural environment in high-altitude regions (low temperature, frozen soil, strong winds, snow, etc.) not only greatly increases the technical difficulty of engineering construction but also significantly pushes up its cost [
5]. Accurately predicting such costs is of great practical significance for optimizing huge investment decisions, ensuring project economic feasibility, promoting coordinated regional economic development, and achieving efficient development and utilization of energy resources. However, the cost of grid projects is closely related to complex factors such as sample indicator selection, intelligent algorithm performance, and prediction model structure. A large number of uncertain factors make it difficult to accurately predict the cost of grid projects. Therefore, new methods are urgently needed to solve the prediction problem and improve prediction accuracy.
A large number of scholars have conducted research on the influencing factors involved in the cost prediction for power transmission and transformation projects. On the one hand, early research mainly focused on the impact of factors such as human resources and machinery costs; for instance, Ref. [
6] analyzed the impact of human transportation on the cost scale and proportion of various parts of power transmission and transformation projects from the perspective of human transportation. Ref. [
7] constructed a cost index for power transmission projects, taking into account the dynamic changes in personnel and machine prices, but it did not take into account the impact of construction levels on cost. With the increasing diversity of construction scenarios for power transmission and transformation projects, some scholars have begun to pay attention to the impact of environmental factors or special construction requirements on the cost of power transmission and transformation projects. For example, Ref. [
8] correlated geological divisions with transmission engineering quotas and conducted an analysis of the impact of geological divisions on cost. Ref. [
9] theoretically analyzed the cost-sensitive factors of overhead transmission projects from the perspectives of wind speed, icing, etc., but did not consider the differences in engineering quantities under different engineering parameters. Ref. [
10] examined the impact of environmental investment in power transmission and transformation projects on project cost based on project environmental impact assessment. Ref. [
11] provided a detailed analysis of the composition of operation, maintenance, and repair costs for ultra-high voltage equipment, and identified and quantified the factors affecting cost. Ref. [
12] conducted a correlation analysis on the cost-influencing factors of ultra-high-voltage transmission lines and screened key indicators that affect cost, but did not conduct in-depth research on the nonlinear relationship between indicators.
For the cost-prediction methods of power transmission and transformation projects, early research mainly used statistical methods for prediction. For example, Ref. [
13] used the ARIMA method to construct a cost-index prediction model for power transmission and transformation projects, and Ref. [
14] used decomposition–integration combination prediction and introduced a multiple regression method to predict the cost level of power transmission and transformation projects. This type of method has good applicability in the face of conventional cost prediction for power transmission and transformation projects. However, as the influencing factors on power transmission and transformation project costs become increasingly complex and the application scenarios of power transmission and transformation projects become more diverse, statistical methods gradually show their inapplicability in power transmission and transformation project cost prediction. With the development of computer technology, machine-learning algorithms have gradually been introduced into the field of prediction due to their ability to better fit the complex interrelationships between factors. Relevant scholars have conducted cost predictions for power transmission and transformation projects using machine-learning algorithms. For instance, Ref. [
15] used the random forest algorithm to predict small sample engineering cost data and established a high-dimensional small sample engineering cost estimation model. Ref. [
16] combined the wolf pack algorithm with an improved least squares support vector product to predict the cost of substation engineering, reducing prediction errors. Refs. [
17] and [
18], respectively, use an improved ant colony algorithm and an improved NSGA2 algorithm to optimize the cost of power engineering. The backpropagation (BP) neural network, as a widely used machine-learning algorithm, has also received attention in the field of cost prediction for power transmission and transformation engineering. Ref. [
19] used a BP neural network for engineering cost prediction, but there were problems involving falling into local minima, resulting in reduced prediction accuracy during prediction. Therefore, relevant scholars have proposed heuristic optimization algorithms for parameter optimization of BP neural network algorithms to improve prediction performance. Ref. [
20] used the sparrow search algorithm to optimize the BP neural network model for predicting the cost of substation engineering. Ref. [
21] proposed a fault diagnosis method for power transformers based on an improved BP neural network. Ref. [
22] applied the particle swarm optimization algorithm (PSO) and BP neural network algorithm to predict residential construction costs, improving the prediction performance of the BP neural network. Ref. [
23] used the genetic algorithm BP neural network method to predict engineering costs, which improved the prediction accuracy of the BP neural network.
However, the cost of power transmission and transformation projects is influenced by numerous factors, and there are complex nonlinear relationships among these factors. Traditional machine-learning algorithms often face difficulties in ensuring the goodness of fit when fitting such high-dimensional nonlinear relationships. As a further extension of machine-learning algorithms, deep-learning algorithms have shown better applicability in the field of cost prediction for power transmission and transformation projects. Ref. [
24] applied the Levenberg–Marquardt convolutional neural network (LM-CNN) algorithm to identify anomalies in the cost of power transmission and transformation projects, identifying abnormal information and enhancing the ability to monitor and analyze cost data. Ref. [
25] applied the LM-CNN algorithm to establish a cost model for power transmission and transformation projects, achieving a balance between prediction accuracy and stability. Ref. [
26] used a multi-layer perceptron neural network to predict the cost of public construction projects. Ref. [
27] established a construction period prediction model for transmission line engineering based on a Bayesian regression algorithm and a long short-term memory neural network (LSTM) and used modern statistical theory to complete nonlinear modeling, improving prediction efficiency. Ref. [
28] combined data mining and a support vector machine algorithm to predict the cost of transmission engineering.
In summary, the existing research on cost prediction for power transmission and transformation projects is relatively rich, presenting diverse research results in identifying influencing factors and cost-prediction methods, providing useful references for this study. However, there is still room for further deepening: Firstly, current research focuses on the analysis of the impact of a single parameter, without considering the construction of a comprehensive index system for the factors affecting the cost of power transmission and transformation projects. Specifically, high-altitude regions are not conducive to the normal construction of power transmission and transformation projects due to their long, low-temperature periods, frequent winds, high wind speeds, and deep permafrost layers [
29,
30]. In high-altitude regions, the frozen soil area is large and the freezing time is long; it freezes and expands under lower temperatures in winter and melts and settles under rising temperatures in summer. The existence of these frozen soil areas poses great challenges to the construction of power transmission and transformation projects. The high-altitude regions also have adverse conditions such as large interannual temperature differences and large changes in air humidity, and their winter temperatures are lower and last longer [
31,
32]. Due to their unique low temperatures, frozen soil, and other conditions, the construction costs of power transmission and transformation projects in high-altitude areas are much higher than those in general areas. However, there have been few studies on the cost of power transmission and transformation projects that consider the special conditions of high-altitude areas. Secondly, regarding the handling of overlapping information in the indicator system of influencing factors, existing research has mostly considered linearly reducing the dimensionality of indicators but has not fully considered how to simultaneously handle the problem of overlapping information in linear and nonlinear engineering quantity indicators.
Based on this, this paper focuses on the cost-prediction problem of power transmission and transformation projects in the special scenario of high-altitude regions. Aiming at the nonlinear relationships among multiple influencing factors, a hybrid deep-learning algorithm framework based on cost-prediction technology for power transmission and transformation projects is proposed. Firstly, the influencing factors on the cost of power transmission and transformation projects in high-altitude regions are analyzed, and a targeted set of influencing factors is constructed. On this basis, the improved whale optimization algorithm (WOA) is combined with the improved convolutional neural network (CNN) to optimize the learning rate of the CNN, thereby effectively improving the prediction accuracy. The effectiveness of the proposed model is verified through case analysis. The main contributions of this paper are as follows:
- (1)
An analysis was conducted on the impact of climate conditions in high-altitude regions on the cost of power transmission and transformation projects, and a factor system for the cost of high-altitude region power transmission and transformation projects was constructed, which includes four dimensions: climate environment, project scale, material consumption, and technical economy. Compared with existing research, the influencing factor system constructed for this paper can better reflect the characteristics of power transmission and transformation projects in high-altitude regions and expand the set of influencing factors for the cost of conventional power transmission and transformation projects.
- (2)
A hybrid deep-learning algorithm is proposed to predict the cost of power transmission and transformation projects in high-altitude regions by combining the improved WOA and CNN, with the learning rate of CNN as the link. Compared with existing research, the constructed IWOA-CNN algorithm improves the training accuracy of the CNN and avoids falling into local optima through the SGDM optimizer, L2 regularization method, and Bayesian optimization method. By introducing the nonlinear convergence factor α and adaptive weights, it enhances the ability of the whale algorithm to perform random global and local searches, thereby obtaining the optimal CNN learning rate. Finally, the case analysis verified the effectiveness of the proposed hybrid deep-learning algorithm in the field of cost prediction for power transmission and transformation projects in high-altitude regions.
The remaining chapters of this paper are arranged as follows:
Section 2 analyzes the factors affecting the cost of power transmission and transformation projects in high-altitude regions and constructs a system of influencing factors.
Section 3 introduces the constructed cost-prediction model for power transmission and transformation projects in high-altitude regions based on hybrid deep-learning algorithms.
Section 4 presents the results of the case analysis.
Section 5 summarizes this paper.
2. Cost-Influencing-Factors System for Power Transmission and Transformation Projects in High-Altitude Regions
Climate environment is an important consideration factor for the cost of power transmission and transformation projects in high-altitude regions. The extreme low temperatures, heavy snowfall, frequent freeze–thaw cycles, and strong winds in this region have increased the requirements for the material selection, construction techniques, and post-maintenance of power transmission and transformation projects. For example, in low-temperature environments, conventional wires may become fragile and prone to breakage [
33], so special low-temperature resistant wires need to be selected, which undoubtedly increases material costs. At the same time, in order to prevent damage to transmission lines caused by snow and ice accumulation, it may be necessary to install de-icing devices or adopt special line designs [
34], which will significantly increase the project cost.
The scale of the project plays a crucial role in the cost of power transmission and transformation projects. The length of the line, the capacity of the substation, and the voltage level of the transmission are important parameters that determine the scale of the project. Taking line length as an example, long-distance transmission lines require more towers, conductors, and other auxiliary materials, which not only increases material procurement costs but may also lead to an increase in transportation and installation costs. For example, the increase in substation capacity means the need for larger transformers, switchgear, and more complex control systems, and the high cost of these devices directly affects the overall cost of the project.
The amount of material used directly affects the cost of power transmission and transformation projects in high-altitude regions. Due to the special environment in high-altitude regions, engineering often requires more materials to enhance the stability and durability of the structure. For example, in permafrost regions, in order to ensure the stability of tower foundations, it may be necessary to use a large amount of concrete and steel bars to reinforce the foundation, or to adopt expensive special foundation treatment techniques such as precast piles and deep mixing. Although these measures can improve the quality and safety of the project, they also significantly increase material costs, thereby raising the overall cost of the project.
Technical and economic analysis is an indispensable part of evaluating the cost of power transmission and transformation projects in high-altitude regions. The choice of construction technology not only affects the smooth implementation of the project but also directly impacts the project cost. In high-altitude regions, winter construction techniques and foundation treatment techniques for frozen soil areas may be required [
30,
35], which often necessitate special equipment and additional construction measures, such as heating equipment and insulation materials, thereby increasing construction costs. At the same time, from the perspective of long-term operation and maintenance, power transmission and transformation projects in high-altitude regions require more frequent inspections and higher-level maintenance to ensure the stable operation of equipment in harsh environments. These long-term maintenance costs are also an important part of the project cost that cannot be ignored.
Based on the above analysis, this paper constructs a system of influencing factors on the cost of power transmission and transformation projects for high-altitude regions, as shown in
Table 1.
3. Model Construction
3.1. Basic Ideas of the Model
The constructed cost-prediction model for power transmission and transformation projects in high-altitude regions is used as follows to determine the nonlinear relationship between project cost
and
influencing factors:
Because
indicators are difficult to unify on a unit basis and cannot directly calculate the cost from
indicators, a hybrid deep-learning model is developed to fit the project cost. The relationship expressed by this prediction model is represented by
, and the input–output relationship is shown in
Figure 1.
The constructed hybrid deep-learning model consists of two modules: a nonlinear input–output relationship approximation based on a CNN (Module 1) and an optimization of the optimal learning rate of the CNN based on a WOA (Module 2). The core logic of the constructed hybrid model lies in combining the powerful feature extraction and nonlinear fitting capabilities of a CNN with the global search ability of a WOA to synergistically improve model performance.
In Module 1, the CNN is designed to directly learn complex nonlinear input–output relationship mappings. In order to effectively train the CNN, the model uses a stochastic gradient descent method (SGDM) with driving factors as the core optimizer. By introducing momentum terms, the SGDM not only considers the direction of the current gradient when updating weight and bias parameters but also accumulates the trend of historical gradients. This significantly accelerates the training process and helps the model smoothly cross flat areas or small valleys of the loss function, thereby improving the stability of parameter updates when randomly selecting small batches of samples for training. In order to further enhance the robustness and generalization ability of the CNN in small samples or noisy data, this module introduces the Bayesian optimization method. Bayesian optimization constructs a probabilistic surrogate model of the objective function (such as validation set error) and intelligently explores and utilizes the hyperparameter space (such as convolution kernel size, number of layers, regularization strength, etc.) based on the acquisition function in order to systematically find the optimal set of hyperparameter configurations. The combination of SGDM and Bayesian optimization effectively alleviates the problems of poor fitting performance and slow convergence caused by poor hyperparameters or gradient oscillations in traditional CNNs during random small batch training.
In Module 2, the WOA is innovatively applied to optimize the crucial hyperparameter in CNN training, namely the learning rate. The model takes the mean absolute percentage error (MAPE) calculated by the CNN on the training/validation set as the target fitness value of the WOA, and sets the learning rate that needs to be optimized as the individual population of the WOA (i.e., each “whale” position represents a candidate learning rate value). The unique advantage of the WOA lies in its search mechanism, which simulates whale predation (bubble-net attack) behavior, with a good balance between global exploration and local development. In order to overcome the drawbacks of slow convergence speed or susceptibility to local optima that may exist in conventional WOAs, this model has made two key improvements to the algorithm itself: firstly, the convergence factor (usually linearly decreasing with iteration) that controls the algorithm from exploration to development has been modified to a non-linear reduction strategy (such as exponential decay), which enables the algorithm to maintain stronger global exploration ability in the initial stage to cover a wider search space and to focus more quickly on potential areas for fine search in the later stage; secondly, the fixed weights used for position updates in the algorithm will be replaced with adaptive weights, which can dynamically adjust based on the current iteration progress or individual fitness, thus endowing the algorithm with more flexible adjustment capabilities in different search stages (global exploration or local development). Through the optimization of the WOA, the model can ultimately automatically find the optimal learning rate for a specific dataset and network structure.
The overall advantage of the IWOA-CNN hybrid architecture is that the WOA’s global optimization capability effectively avoids the blindness and time-consuming nature of manual parameter tuning, and its improvement strategy (nonlinear convergence factor and adaptive weight) significantly improves search efficiency and accuracy; the synergistic effect of Bayesian optimization hyperparameters and the WOA’s optimization learning rate ensures that the CNN model can converge quickly and stably while achieving excellent fitting accuracy and generalization performance. Ultimately, a hybrid deep-learning prediction method with high automation, strong robustness, and superior prediction accuracy is constructed.
3.2. Module 1: Nonlinear Relationship Fitting Based on an Improved CNN
- (1)
Conventional CNN
Figure 2 shows a schematic diagram of the CNN used to predict the cost of power transmission and transformation projects in high-altitude regions. Use the
indicators in Equation (1) to predict the cost
, and sequentially pass through the input layer (with
neurons), the convolution layer (with
convolution kernels, with a size of 3 × 1), the activation layer (using ReLU function), fully connected layer 1 (
connections), fully connected layer 2 (
connections), and the output layer.
The formulas for each layer of the CNN are as follows:
(1) Input layer, convolutional layer, and activation layer
Firstly,
indicators are input into the convolutional layer of the convolutional neural network, followed by an activation layer. Equation (2) represents merging the functions of the convolutional layer and the activation layer, that is, performing convolution operations first and then activating.
is the 3 × 1 convolution kernel,
is the bias, and
represents the output of the ReLU function used for the convolutional layer. Equation (3) explains the ReLU function [
57].
(2) Two-layer, fully connected layer and output layer
Input the output value of the activation layer into the two-layer fully connected layer. In Equation (4), is the weight of the fully connected layer 1, is the bias of the fully connected layer 1, is the activation function, and is the output of the fully connected layer 1. In Equation (5), represents the weight of the fully connected layer 2, and is the bias of the fully connected layer 2.
- (2)
Improved CNN
The improved CNN adopts the SGDM optimizer, which selects a small batch of samples instead of the entire sample and uses gradient descent to update the weight and bias parameters of the CNN, aiming to solve the problem of poor fitting performance of random small batch samples [
58,
59]. Introduce the momentum factor when updating weight and bias parameters, subtract the weighted sum of the current and previous iteration gradients to reduce their impact on the weight and bias parameters, and add the L2 regularization term. The relevant formulas are as follows:
Equation (6) represents the loss function of the improved CNN with the addition of the L2 regularization term. Firstly, samples are selected from the training set, including , where corresponds to the target of transmission and transformation project cost , and and are the weights and biases of the improved convolutional neural network, respectively. is the total loss function, is the loss function without the regularization term, is the L2 regularization term, and is the weight decay coefficient or regularization coefficient.
Equations (7)–(9) show the process of using the SGDM optimizer with L2 regularization to update weights. In Equation (7), is the gradient of weight parameters, is the number of iterations, and is the gradient of the total loss function with respect to the weight parameters (including L2 regularization); represents the gradient of weight parameters for a loss function without regularization terms, and represents the gradient of regularization terms with respect to weight parameters. In Equation (8), is the number of iterations, and is the cumulative momentum of the t-th iteration of the weight parameters; is the momentum parameter, is the cumulative momentum of the t − 1-st iteration of the weight parameter, and is the learning rate. In Equation (9), is the weight parameter for the t + 1th iteration, and is the weight parameter for the t-th iteration.
Equations (10)–(12) show the process of updating biases using the SGDM optimizer with L2 regularization. L2 regularization is not required for bias, so no L2 regularization term is set in Equation (10). In Equation (10), is the gradient of the bias parameter, is the number of iterations, is the total loss function for the gradient of the bias parameter (excluding L2 regularization), and represents the gradient of the loss function without the regularization term for the bias parameter. In Equation (11), is the cumulative momentum of the t-th iteration of the bias parameter, and is the number of iterations; is the momentum parameter, and is the cumulative momentum of the t − 1-st iteration of the bias parameter. In Equation (12), is the bias parameter for the t + 1th iteration, and is the bias parameter for the t-th iteration.
In order to optimize the hyperparameters in the CNN, a Bayesian optimization method [
60] is adopted. The learning rate
varies within the range of [0.01, 1], the momentum of stochastic gradient descent
varies within the range of [0.8, 0.98], and the L2 regularization strength varies within the range of [1 × 10
−10, 0.01].
3.3. Module 2: Optimizing CNN Learning Rate Based on Improved WOA
- (1)
Conventional WOA
The MAPE () obtained by the improved CNN is used as the target fitness value of the WOA, and the learning rate of the improved CNN is used as the population variable of the WOA. This combines the improved CNN with the improved WOA. During the process of updating the fitness value in the WOA, the learning rate of the CNN changes accordingly, thereby altering the prediction accuracy of the CNN.
The fitness index (
) is as follows:
In the formula, is the measured value, is the true value, and is the number of training samples.
The whale position in the WOA is defined as follows:
In the formula, represents the position of the whale. The whale position update in the WOA is an update that improves the learning rate of the CNN.
The WOA treats each whale as a particle and simulates the process of whales surrounding, hunting, and searching for prey. Throughout the process, the whale’s position needs to be continuously updated to obtain the global optimal solution.
- (2)
Improved WOA
The improvement of the WOA involves the addition of convergence factors and adaptive weights for nonlinear algorithms. The description and related formulas for the improved WOA (IWOA) are as follows:
(1) Surrounding process
To complete the encirclement of prey, it is necessary to know the position of the prey and use the current optimal solution as the prey position. Other whales will approach this position, and the position update formula is as follows:
In the formula, is the weight of the IWOA, is the number of iterations, is the optimal solution position at the T-th iteration, is the distance between the whale individual and the optimal solution position, and are coefficient vectors, is the convergence factor, and is a random vector with a value range of [0, 1].
For the weight
, changing it from a fixed value to a non-linear cosine function can comply with the setting of global search in the early stage and local search in the later stage. Therefore, the following formula is proposed [
61]:
For the convergence factor
, change the way it linearly decreases and decay it as follows:
(2) Predatory process
This type of whale usually uses two methods of predation: bubble-net predation and encirclement predation. When using bubble nets for hunting, the process of updating the position of whales and prey satisfies the logarithmic spiral equation, which is as follows [
62]:
In the formula, is the distance between the individual whale and the optimal solution position, is the parameter describing the spiral shape, , and is a random number.
For bubble-net predation or encirclement predation, the probability
is used as the selection criterion, and the position update equations for the two probabilities are as follows [
63]:
In the formula, is the probability distribution of adopting a certain hunting method, with a range of [0, 1]. The above is the update of the position of the whale when the coefficient vector . The whale is in the local search stage, and the next whale position is required to be between the current whale position and the prey.
(3) Exploration process
In the case where
, the global search process begins [
64]. First, a whale in the population is randomly selected to update its coordinates, and at this point, other particles will move away from the whale to complete the global search process.
In the formula, is a randomly selected individual whale, and is the distance between the whale and a previously randomly selected whale. The final obtained is the optimal solution of the IWOA, which is the optimal learning rate, and is substituted into the CNN.
In summary, a new cost-prediction model for power transmission and transformation projects in high-altitude regions will be constructed by combining the convergence factor and adaptive weight improved WOA with a Bayesian optimization regularization algorithm and an SGDM optimizer CNN. This model will accelerate convergence and make predictions more accurate.
4. Case Analysis
4.1. Case Setting
The case analysis adopts 75 sets of 10kV overhead line project cost samples provided by a provincial-level power grid enterprise in the high-altitude region of northern China. Fifty sets of samples are used as the training set, and 25 sets of samples are used as the test set. To verify the effectiveness of the prediction method proposed in this paper, four comparative models are set up, with specific parameter settings as follows:
Model 1: IWOA-CNN (model proposed in this paper). The number of whales is 40, and the maximum number of iterations for whales is 100. The SGDM optimizer is used, with 25 inputs and 1 output, a convolution kernel size of 3 × 1, a total of 25 convolution kernels, and 2 fully connected layers, each with 625 connections. The maximum number of iterations is 3000, and the number of traversal samples is 8. Bayesian optimization method is used for the CNN, with a learning rate variation range of [0.01, 1], a random gradient descent momentum variation range of [0.8, 0.98], and an L2 regularization strength variation range of [1 × 10−10, 0.01].
Model 2: GA-BP neural network. The network structure is the same as above, with a crossover probability of 0.7 and a mutation probability of 0.1 [
65]. The number of training iterations and learning rate are the same as above, and the hidden layer excitation function is tansig [
66].
Model 3: Wolf pack algorithm-improved CNN (WPA-CNN). The total number of artificial wolves is 40, the maximum number of iterations is 100, the distance judgment factor is 800 [
67], the wolf detection scale factor is 4 [
67,
68], the maximum number of walks is 30, the step size factor is 1000, the update scale factor is 10 [
67,
69], the maximum number of walks [
70] is 6, and the minimum number of walks is 3. The Adam optimizer is used, with 25 inputs and 1 output, a convolution kernel size of 3 × 1, a total of 25 convolution kernels, 2 fully connected layers, and 625 connections in each layer.
Model 4: CNN. Using the Adam optimizer, set 25 inputs and 1 output, a convolution kernel size of 3 × 1, a total of 25 convolution kernels, and the rest are the same as above.
Model 5: PSO-BP. The network structure is 26-6-1, the learning factor is c1 = 2.2, c2 = 0.8; the maximum number of evolutions is 100, and the population size is 40; the initial fitness value is 600, the training times are 50,000, and the learning rate is 0.1. According to Ref. [
71], the transfer function from the input layer to the hidden layer is set as a logsig function, the transfer function from the hidden layer to the output layer is a purelin function, and the training function is a traingdm function.
Using MATLAB R2018b as the experimental platform, five representative predictive performance indicators are selected, and their calculation formulas are as follows:
Mean relative error (MRE):
Mean absolute percentage error (MAPE):
Root mean square error (RMSE):
Sum of squares error (SSE):
Mean absolute error (MAE):
4.2. Analysis of Prediction Results
4.2.1. Convergence Performance
Using the MAPE (
) as the target fitness value for each model, the convergence of the fitness values for each model is calculated based on the training samples, as shown in
Figure 3.
According to
Figure 3, the IWOA-CNN algorithm proposed in this paper demonstrates significant advantages in cost prediction for power transmission and transformation projects in high-altitude regions. From the iterative process, it can be seen that the IWOA-CNN achieved a breakthrough decrease in MAPE value from 0.6583 to 0.0291 in the 17th generation, entering the fine optimization stage about 20 iteration cycles earlier than the comparative algorithms. Its unique WOA mechanism effectively overcomes the low efficiency of traditional CNNs in parameter space search. The final convergence value remains stable at 0.0158, which is 28.3% lower than the optimal comparison model WPA-CNN (0.0219) and 30.7% lower than the CNN (0.0228). In the comparison algorithm, GA-BP and PSO-BP are limited by the local search ability of traditional optimization algorithms, and their convergence speed slows down significantly in the later stage; although the WPA-CNN shows a decreasing competitive trend in the mid-term, the global optimization depth is insufficient. In summary, the IWOA-CNN achieves a balance between fast convergence and high-precision modeling in engineering cost prediction in complex geographical environments by integrating intelligent optimization algorithms and deep-learning collaborative mechanisms. Its steep initial decline curve and stable final value characteristics can effectively adapt to the nonlinear characteristics of engineering cost data in high-altitude regions.
4.2.2. Fitting Performance
The fitting results of the above five models for 50 training samples are shown in
Figure 4.
Figure 4 shows that the IWOA-CNN model proposed in this paper demonstrates excellent fitting performance in cost predictions for power transmission and transformation projects in high-altitude regions. From the fitting results of 50 training samples, it can be seen that the predicted values of the IWOA-CNN generally closely follow the actual cost curve, and the fluctuation range of its prediction deviation is significantly smaller than that of the comparison model, especially in samples with obvious extreme climate characteristics (such as sample 19, with its actual value of 367.87 and IWOA-CNN prediction of 363.12, an error of only 1.29%), it shows stronger environmental adaptability. The comparative model shows obvious limitations: the GA-BP systematically overestimates in complex terrain samples (such as sample 1, with an error rate of 17.36%), the WPA-CNN has insufficient sensitivity to mutation cost data (such as sample 23, with an error rate of 4.96%), the CNN shows lagging response in time-series fluctuation samples (such as sample 30, with an error rate of 3.88%), while the PSO-BP exhibits prediction instability (sample 4, with an error rate of 17.08%). The IWOA-CNN optimizes the convolution kernel parameters through WOA, achieving high-precision capture of nonlinear features such as frozen soil depth and material transportation costs in high-altitude regions.
4.2.3. Prediction Performance
For the 25 test samples, the prediction results of the five models mentioned above are shown in
Figure 5, and the prediction performance of each model is shown in
Table 2.
The IWOA-CNN model proposed in this paper demonstrates significant predictive advantages on the test set. As shown in
Table 2, its MAPE is only 1.51%, which is 10.1% lower than the optimal comparison model WPA-CNN (1.68%) and 39.8% lower than the traditional CNN (2.51%), confirming the effectiveness of the IWOA in parameter tuning. From specific sample observations, the IWOA-CNN performs particularly well in complex working conditions, with a prediction error of only 1.68% for sample 6 (actual value 210.59), while both the GA-BP and PSO-BP produce significant deviations of 22.87% and 16.15%, respectively, exposing the limitations of traditional intelligent algorithms in capturing characteristics of high-altitude frozen soil environments. Although the WPA-CNN performs suboptimally in most samples (such as sample 17, with an error of 0.68%), its RMSE (5.45) still lags behind that of the IWOA-CNN (5.07) by 7.5%, reflecting the accuracy bottleneck of the original CNN in nonlinear regression.
Further analysis of the error distribution shows that the SSE of the IWOA-CNN (643.37) is 13.4% lower than that of the WPA-CNN (742.76), indicating that its prediction results have better stability, especially in high volatility samples (such as sample 14, with an actual value of 332.27 and a prediction error of 2.54%), effectively suppressing outlier interference. The comparative model exhibits significant flaws: the GA-BP falls into local optima due to premature convergence, resulting in a 20.15% overfitting bias in sample 4; the PSO-BP experiences 18.95% directional prediction errors in sample 18 due to the influence of particle swarm inertia mechanism; and the CNN generates a lag response error of 4.3% in sample 23 due to the lack of optimization strategies. The IWOA-CNN successfully achieves accurate modeling of complex engineering factors in high-altitude regions by integrating the global search capability of a WOA with the local feature extraction advantage of a CNN. Its MAE index (4.23) is 85.5% lower than the commonly used PSO-BP (29.29), providing more reliable quantitative support for engineering cost decisions in extreme environments.
4.3. Further Discussion
To further analyze the effectiveness of the cost-prediction system for power transmission and transformation projects in high-altitude regions proposed in this paper, two scenarios are designed:
Scenario 1 (scenario of this paper): The IWOA-CNN algorithm is used for prediction, considering the influencing factors of four dimensions: climate environment, engineering scale, material consumption, and technical economy.
Scenario 2 (comparative scenario): The IWOA-CNN algorithm is used for prediction, and the influencing factors (i.e., climate environment dimensions) reflecting the characteristics of high-altitude regions in
Table 1 are removed, only considering the influencing factors of engineering scale, material consumption, and technical economy.
Using the aforementioned 50 sets of samples as the training set and 25 sets of samples as the testing set, the prediction performance of power transmission and transformation projects in the two scenarios is shown in
Figure 6 and
Table 3.
This verifies the key role of the climate environment in predicting the cost of power transmission and transformation projects in high-altitude regions through scenario comparison. As shown in
Table 3, Scenario 1, which includes climate environment factors, has significantly improved prediction accuracy compared to Scenario 2. The MAPE decreases from 3.12% to 1.51%, a decrease of 51.6%, and the RMSE is reduced from 9.65 to 5.07, a decrease of 47.5%. This performance difference is particularly evident in specific samples, such as Scenario 1 in sample 3 (actual value 289.21), with a prediction error of only 1.22%, while Scenario 2, ignoring climate characteristics, has an error rate of 3.31%, indicating that the thermal expansion and contraction effects of wires in low-temperature environments have not been accurately modeled. The predicted value (343.26) of Scenario 2 in sample 14 (actual value 332.27) shows a 71.4% increase in deviation compared to Scenario 1 (340.71), reflecting the estimation bias of structural reinforcement cost caused by the lack of snow depth and frozen soil characteristics.
The inclusion of climate environment factors significantly improves the prediction stability under extreme working conditions. The SSE in Scenario 1 (643.37) decreases by 72.4% compared to Scenario 2 (2327.97), and its advantage is more prominent in typical high-altitude feature samples: the prediction error of Scenario 2 in sample 6 (actual value 210.59) reaches 3.0%, while Scenario 1 is only 1.68%. The difference is due to the underestimation of basic processing costs caused by the lack of consideration for frozen-soil depth; The predicted value (241.53) of Scenario 2 in sample 19 (actual value 234.93) shows a significant increase in deviation compared to Scenario 1 (231.35), indicating that the tower reinforcement demand under strong wind loads has not been effectively captured. This accuracy difference confirms the strong nonlinear correlation between climate parameters and engineering costs in high-altitude regions. The IWOA-CNN achieves precise quantification of additional costs for special environments (such as premiums for low-temperature-resistant materials and investment in de-icing devices) by integrating characteristics such as temperature and permafrost.
5. Conclusions
With the accelerated construction of power transmission and transformation projects in high-altitude regions, special climatic conditions such as extreme low temperatures, deep permafrost, strong winds, and snow accumulation have significantly increased project costs. However, traditional prediction methods have insufficient accuracy due to neglecting climatic factors. In response to this issue, this paper constructs a four-dimensional influencing factor system covering climate environment, engineering scale, material consumption, and technical economy. Then, a hybrid deep-learning model (IWOA-CNN) is proposed, which combines the IWOA with a CNN. The model optimizes parameter search efficiency through nonlinear convergence factors and adaptive weights and includes Bayesian hyperparameter optimization to improve model robustness. The applicability and effectiveness of the model are verified using actual engineering data.
The results show that the IWOA-CNN is significantly better than traditional models in terms of prediction accuracy and stability. The MAPE of the test set is only 1.51%, which is 10.1% lower than the optimal comparison model (WPA-CNN). The RMSE is reduced to 5.07, and the SSE is reduced by 72.4%. The comparative analysis of scenarios further confirms the key role of climate environment factors: the MAPE reduction of the model with climate dimension (Scenario 1) compared to the model without climate dimension (Scenario 2) is 51.6%, indicating that the nonlinear effects of low temperature, frozen soil, and other features on costs cannot be ignored. Research has confirmed that optimizing the CNN’s learning rate through an IWOA can effectively capture complex engineering features such as material performance degradation and increased ground treatment costs in high-altitude regions, providing reliable technical support for cost control in extreme environments.
Overall, this paper enriches the research on cost prediction for power transmission and transformation projects in high-altitude regions by integrating climate factors and deep-learning techniques, providing a scientific basis for engineering investment decision-making and cost management. However, there are still limitations in the research, such as limited data sample size and lack of coverage of multi-voltage-level engineering. Future work can be extended to transmission and transformation projects at higher voltage levels, and dynamic climate models can be introduced to enhance temporal prediction capabilities. Meanwhile, transfer learning frameworks can be explored to address generalization issues in small sample scenarios, thereby further enhancing the applicability of the model in complex geographic environments.