Damage Probability Assessment of Transmission Line-Tower System Under Typhoon Disaster , Based on Model-Driven and Data-Driven Views

Under the typhoon disaster, the power grid often has serious accidents caused by falling power towers and breaking lines. It is of great significance to analyze and predict the damage probability of a transmission line-tower system for disaster prevention and reduction. However, some problems existing in current models, such as complicated calculation, few factors, and so on, affect the accuracy of the prediction. Therefore, a damage probability assessment method of a transmission line-tower system under a typhoon disaster is proposed. Firstly, considering the actual wind load and the design wind load, physical models for calculating the damage probability of the transmission line and power tower are established, respectively based on model-driven thought. Then, the damage probability of the transmission line-tower system is obtained, combining the transmission line and power tower damage probability. Secondly, in order to improve prediction accuracy, this paper analyzes the historical sample data containing multiple influencing factors, such as geographic information, meteorological information, and power grid information, and then obtains the correction coefficient based on data-driven thought. Thirdly, the comprehensive damage probability of the transmission line-tower system is calculated considering the results of model-driven and data-driven thought. Ultimately, the proposed method is verified to be effective, taking typhoon ‘Mangkhut’ in 2018 as a case study.


Introduction
In recent years, the intensity and frequency of extreme weather events (especially typhoon disasters) have increased [1].The typhoon disaster is devastating.Under a typhoon disaster, transmission lines, power towers, and poles might be damaged, which may cause large area blackouts [2].In order to improve the ability of the power grid to withstand typhoons, it is of great significance to predict the damage probability of transmission lines and power towers under typhoon disasters.
In view of the damage prediction of transmission lines and power towers under typhoon disasters, according to the model-driven view, some scholars studied the mechanical characteristics and damage mechanisms and pointed out that the damage of transmission lines and power towers is mainly due to the fact that the typhoon wind speeds exceed the designed wind speeds.Then, different physical models of transmission lines and power towers were proposed to predict the damage probability.In [3], based on the fuzzy mathematics principle, the membership function of a typhoon disaster was obtained by analyzing the relationship between typhoon wind speed and design wind speed and the failure rate of a transmission line was simulated by an exponential function.The mechanical stability of towers and fixings plays an important role in windproofing, but the size of the effect depends on the size of the designed wind speed and the environment situation.In [4,5], in order to analyze the failure probability of the transmission line and power tower under a typhoon disaster, the piecewise function was used to simulate the damage probability, according to the predicted typhoon wind speed and the design wind speed of the transmission line and power tower.In addition to considering the relationship between typhoon wind speed and design wind speed, the importance of non-electrical quantity data to the power system in disaster prevention was pointed out and a space-time early warning framework model for power grid failure rate under typhoons and heavy rainfall was proposed in [6].In [7,8], considering the micro-topographic features to correct the actual wind speed of the transmission line, a damage early warning model of the typhoon disaster, based on the design wind speed and the actual wind speed, was proposed.In [9], the geographic elevation information was integrated with the geographic information of the transmission channel path and then the failure probability evaluation model of the short-term transmission channel was constructed, however, only the geographic information was considered to correct the failure model.
In summary, the model-driven thought mainly established the damaged physical model by analyzing the damage mechanism of the power grid and the calculation process was relatively simple.Most models only considered the most important influencing factors.When there were many factors to be considered, the model would be more complicated and difficult to solve.
With the development of data analysis and data mining technology, the power grid damage data analysis has been increased.Many scholars have extracted the historical typhoon sample data and have proposed data-driven power grid fault models based on the analysis and mining of sample data.
Based on data-driven thought [10], many statistical models were established to predict power outages caused by natural disasters, such as hurricanes, typhoons, and severe storms [11][12][13][14][15][16].In [17], considering the component running time, the pollution level of the transmission line and the lightning density, based on the support vector machine and gray prediction technology, the reliability prediction model of the transmission line was proposed.In [18,19], using the relevant public data affecting the power system, power outage prediction models were established under the hurricane through data mining.In [20], linear regression, exponential regression, linear multiple regression, and neural network association models reflecting the relationship between precipitation, maximum temperature, minimum temperature, and line failure rate were established to predict the line failure rate caused by plant growth.In [21], considering weather factors such as storms, heavy rain, and high temperatures, the original parameter estimation model of a power system, based on fuzzy clustering and similarity, was proposed.The model took into account most of the climatic factors, but did not further assess the damage to the power grid.The data-driven method will be more accurate when the sample size is large and the data quality is high.However, the current collection and collation of power grid sample data is still in the initial stage and there are still problems, such as insufficient sample sizes and low data quality.
In the study of damage assessment of transmission lines and power towers under typhoon disasters, the model-driven methods generally have problems, such as insufficient accuracy or complicated solutions due to consideration of too few or too many variables.Meanwhile, data-driven methods have problems such as insufficient sample sizes or low data quality.This paper mainly aims at the problem of the prediction method of transmission line and power tower damage under typhoon disasters.It overcomes the conflict of previous research and proposes a damage predicting method of transmission lines and power towers under a typhoon disaster, combined with the model-driven and data-driven views.In this method, considering that typhoon wind speed is the main cause affecting the damage of the transmission line and power tower, the model-driven part mainly considers the typhoon wind speed and the designed parameters of the transmission line and power tower and then the damage probability physical model is established.The data-driven part mainly uses the multi-factor sample data from under the previous typhoon disaster.Through data mining analysis, the correction coefficient is obtained to correct the result of the physical model and then the final comprehensive damage probability is obtained.The simulation results based on the typhoon 'Mangkhut' in 2018 have verified its high efficiency and accuracy.

Transmission Line-Tower System Damage Prediction Framework
Under the typhoon disaster, the main reason for the damage of the transmission line and power tower is that the actual wind speed exceeds the design wind speed.At the same time, geographic information (such as altitude, slope, underlying surface, etc.), meteorological information (such as wind speed, wind direction, rainfall, etc.) and power grid information (such as the running time) will also have a certain impact on the damage of the transmission line and power tower.This paper intends to use the combination of model-driven and data-driven thought to predict the damage probability of a transmission line-tower system under typhoon disasters.
In this method, the model-driven part only needs to consider the most important influencing factors (gust wind speed and design wind speed) to establish a model, without considering all the influencing factors, controlling the complexity of the model.As a consequence of the different structure of the transmission line and power tower, the damage analysis method is different.Therefore, this paper first establishes the damage probability model of the transmission line and the power tower, respectively.Then, the transmission line and power tower are considered as an overall system, called the 'transmission line-tower system'.Then the damage probability of the transmission line-tower system is calculated based on the damage probability of the transmission line and power tower.
In the data-driven part, this paper considers the possible influencing factors (altitude, slope, underlying surface, etc.), and collects and sorts out the historical typhoon information that has affected the power grid in China's coastal area and obtains the correction coefficient, through the data analysis method, to correct the model-driven results.Finally, the probability of comprehensive damage is obtained.The advantage of this approach, compared with existing methods, is that it not only analyzes the physical mechanism of the typhoon disaster on the transmission line and power tower, but also controls the complexity of the model.The historical data analysis transforms the factors affecting the damage of the transmission line and power tower into a correction coefficient.The prediction framework of the transmission line and power tower damage probability is shown in Figure 1.
Energies 2019, 12, x FOR PEER REVIEW 3 of 17 mining analysis, the correction coefficient is obtained to correct the result of the physical model and then the final comprehensive damage probability is obtained.The simulation results based on the typhoon 'Mangkhut' in 2018 have verified its high efficiency and accuracy.

Transmission Line-Tower System Damage Prediction Framework
Under the typhoon disaster, the main reason for the damage of the transmission line and power tower is that the actual wind speed exceeds the design wind speed.At the same time, geographic information (such as altitude, slope, underlying surface, etc.), meteorological information (such as wind speed, wind direction, rainfall, etc.) and power grid information (such as the running time) will also have a certain impact on the damage of the transmission line and power tower.This paper intends to use the combination of model-driven and data-driven thought to predict the damage probability of a transmission line-tower system under typhoon disasters.
In this method, the model-driven part only needs to consider the most important influencing factors (gust wind speed and design wind speed) to establish a model, without considering all the influencing factors, controlling the complexity of the model.As a consequence of the different structure of the transmission line and power tower, the damage analysis method is different.
Therefore, this paper first establishes the damage probability model of the transmission line and the power tower, respectively.Then, the transmission line and power tower are considered as an overall system, called the 'transmission line-tower system'.Then the damage probability of the transmission line-tower system is calculated based on the damage probability of the transmission line and power tower.
In the data-driven part, this paper considers the possible influencing factors (altitude, slope, underlying surface, etc.), and collects and sorts out the historical typhoon information that has affected the power grid in China's coastal area and obtains the correction coefficient, through the data analysis method, to correct the model-driven results.Finally, the probability of comprehensive damage is obtained.The advantage of this approach, compared with existing methods, is that it not only analyzes the physical mechanism of the typhoon disaster on the transmission line and power tower, but also controls the complexity of the model.The historical data analysis transforms the factors affecting the damage of the transmission line and power tower into a correction coefficient.The prediction framework of the transmission line and power tower damage probability is shown in Figure 1.

Data-Driven Correction Coefficient Calculation
In [22], when the damage prediction of the transmission line and power tower under typhoon disaster is carried out, only the relationship between the actual wind speed and the design wind speed is considered to establish the model.Under actual circumstances, the transmission line and power tower damage should also consider other factors, such as the elevation, the slope, and the underlying surface of the geographic information, all of which will affect the damage to the transmission line and power tower.
To this end, based on the data-driven approach, this paper comprehensively considers the influencing factors, such as geographic information, meteorological information, and power grid information, to correct the damage probability of the transmission line-tower system.

Influence Factor Weight Calculation
In the calculation of variable weights, the typhoon data that caused damage to the Guangdong power grid in China over the years are selected as samples and the weights of the variables are obtained by analyzing and processing the sample data.
The feature information contained in the sample is as follows: Geographic information: Elevation, slope direction, slope, slope position, underlying surface, roughness, etc.; Meteorological information: Gust wind speed; Power grid information: Design wind speed, running time.
The correction coefficient calculation framework is shown in Figure 2.

Data-Driven Correction Coefficient Calculation
In [22], when the damage prediction of the transmission line and power tower under typhoon disaster is carried out, only the relationship between the actual wind speed and the design wind speed is considered to establish the model.Under actual circumstances, the transmission line and power tower damage should also consider other factors, such as the elevation, the slope, and the underlying surface of the geographic information, all of which will affect the damage to the transmission line and power tower.
To this end, based on the data-driven approach, this paper comprehensively considers the influencing factors, such as geographic information, meteorological information, and power grid information, to correct the damage probability of the transmission line-tower system.

Influence Factor Weight Calculation
In the calculation of variable weights, the typhoon data that caused damage to the Guangdong power grid in China over the years are selected as samples and the weights of the variables are obtained by analyzing and processing the sample data.
The feature information contained in the sample is as follows: Geographic information: Elevation, slope direction, slope, slope position, underlying surface, roughness, etc.; Meteorological information: Gust wind speed; Power grid information: Design wind speed, running time.
The correction coefficient calculation framework is shown in Figure 2.  As shown in Figure 2, the geographic information, meteorological information, and power grid information from previous typhoon disasters are selected as sample data.The weights of each variable are first obtained by analyzing and calculating the sample data and then the correction coefficient is obtained according to the predicted variable data.In this paper, three kinds of analysis methods are used to evaluate the importance of each variable.Finally, the reasonable weights of the three kinds of analysis results are used to calculate the correction coefficient.As shown in Figure 2, the geographic information, meteorological information, and power grid information from previous typhoon disasters are selected as sample data.The weights of each variable are first obtained by analyzing and calculating the sample data and then the correction coefficient is obtained according to the predicted variable data.In this paper, three kinds of analysis methods are used to evaluate the importance of each variable.Finally, the reasonable weights of the three kinds of analysis results are used to calculate the correction coefficient.

1.
Variable importance evaluation based on the Gini index.
The random forest (RF) was proposed by Breiman et al. in 2001 and has now become one of the most commonly used tools in data mining and bioinformatics [23].RF can effectively analyze nonlinear, collinear, and interactive data and give variable importance scores while analyzing the data.
In the process of generating a decision tree, the RF algorithm divides each node based on the Gini index (one of the strategies).The Gini index can characterize the importance of a node and thus the importance of the variable [24].Therefore, the importance of the variables can be evaluated accordingly, based on the Gini index.
Assuming that there are m variables, this paper intends to calculate the weight of the variable X j (j = 1, 2, . . ., m) according to the Gini index, based on the random forest algorithm.
In this paper, the damage of the transmission line-tower system is regarded as the two-category variable, that is, damaged and not damaged, then the Gini index is calculated as follows: where p m is a probability estimate of the sample belonging to any class at node m.The importance of the variable X j at the node m is VIM jm , that is, the Gini index change before and after the node m branch is as follows: where GI l and GI r represent the Gini indices of the two new nodes split by node m, respectively.If the variable X j appears M times in the ith tree, the importance of the variable X j in the ith tree is as follows: The Gini importance of the variable X j in RF is defined as follows: where, n is the number of classification numbers in the RF.

2.
Variable importance assessment based on mean decrease accuracy.
Mean decrease accuracy is a commonly used feature selection method that directly measures the impact of each feature on the accuracy of the model [25].The main idea is to disrupt the order of the eigenvalues of each feature and measure the impact of sequence changes on the accuracy of the model.For variables that are not important, the scrambling order does not affect the accuracy of the model too much.For important variables, the disordered order will reduce the accuracy of the model.
Assuming that there are m variables, this paper intends to calculate the weight of the variable X j (j = 1, 2, . . ., m) according to the mean decrease accuracy based on the random forest algorithm.
The specific steps are as follows: Step 1: The sample data is divided into a training set and a test set, of which 80% are training samples and 20% are test samples.
Step 2: Train and adjust the RF model to obtain the accuracy rate acc.
Step 3: Calculate the effect of the variable X j on the test accuracy.The impact score is represented by score j .The feature data corresponding to the variable X j is randomly shuffled n times and, at the same time, the corresponding test accuracy rate, shuff_acc ij (i = 1, 2, . . ., n), will be obtained when the feature data is be shuffled at ith time.score ij = (acc − shuff_acc ij )/acc ij . (5) The effect of variable X on the test accuracy is represented by score j .
Step 4: Calculate the importance weight w j of each variable.
Variable importance assessment based on the entropy weight method.
The entropy weight method is commonly used for multi-factor weight analysis [26], because its conclusion is more objective and the calculation process is simple.In the entropy weight method, the information entropy of each index is negatively correlated with the degree of numerical difference.The larger the numerical difference degree, the smaller the information entropy, and the larger the information amount, the final weight will be larger [27].
The specific steps of the entropy weight method are as follows: Step 1: Assuming there are n damaged samples, each sample has m variables.The evaluation matrix X was obtained according to Reference [28].
Step 2: Calculate the specific gravity size P ij of the ith sample in the variable X j .Calculate the entropy, e j , of the variable X j .
Step 3: Calculate the entropy weight a j .

Optimal Weight Determination Method
Since the above three variables importance weight determination methods have advantages and disadvantages, considering the actual situation and human subjective judgment, this paper intends to use the fuzzy multi-criteria decision-making method [29] to evaluate the variable importance determination method and select the relatively superior determination method.
Supposing there are n (x 1 , x 2 , . . ., x n ) schemes for determining weights, considering m variable factors, specific steps are as follows: Step 1: Establishing the decision matrix Y based on the results of n kinds of weight determination.
where X mn represents the weight of the mth variable under the nth weight calculation method.
Step 2: Normalizing the decision matrix Y base on the minimum value, and the relative membership degree matrix R is obtained.
where r ij = min j (x ij )/x ij .
Step 3: Calculating the weight vector of R.
In this paper, p = m is taken and all the weights are to be determined, and α = 1.
Step 4: Calculating the decision vector D.
To this end, the scheme corresponding to the maximum value in the decision vector D is selected as a relatively optimal scheme.According to the actual situation, the range of values for each variable is shown in Table 1.

Correction Coefficient Calculation
It is assumed that the weight of each variable (m variables) is w = w 1 w 2 • • • w m and the predicted value corresponding to each variable is x = x 1 x 2 • • • x m .In this paper, the correction coefficient is calculated based on the value range of the variables and the prediction data.The specific steps are as follows: Step 1: Comprehensive scoring benchmark.
Considering that the numerical growth of each variable has different effects on the damage situation, the gust wind speed, running time, slope and roughness are positively correlated with the damage of the transmission line-tower system and the design wind speed, altitude, underlying surface, slope direction, and slope position are negatively correlated with the damage of the transmission line-tower system [30].To this end, in the calculation of the score, the positive correlation variable front symbol is 'positive' and the negative correlation variable front symbol is 'negative'.
where W B max and W B min are the maximum and minimum values of the comprehensive scoring benchmark, respectively.The value w i is the weight of each variable and x B i,max , x B i,min are the upper and lower bounds of the variable range, respectively.
where W is the comprehensive forecast score and x i is the forecast value of the ith variable.
Based on reference [7,31], this paper maps the comprehensive prediction score to the interval (0.9, 1.3) and obtains the correction coefficient k as follows: In this paper, the correction coefficient k is used to correct the damage probability of the transmission line-tower system under a typhoon disaster and the final comprehensive damage probability prediction result is obtained.At the same time, since some of the variables that can be collected are based on a 1 km × 1 km mesh, in order to match the data of each variable and ensure the accuracy of the data, this paper performs a 1 km × 1 km mesh division on the prediction area and calculates the correction coefficients of each grid separately.

Model-Driven Damage Probability Calculation
This paper mainly analyzes the transmission line and power tower and proposes a damage prediction model of the transmission line-tower system.The flow chart for the damage prediction of the transmission line-tower system is shown in Figure 3. Step3: Correction coefficient.
Based on reference [7,31], this paper maps the comprehensive prediction score to the interval (0.9, 1.3) and obtains the correction coefficient k as follows: min max min 0.4 0.9 In this paper, the correction coefficient k is used to correct the damage probability of the transmission line-tower system under a typhoon disaster and the final comprehensive damage probability prediction result is obtained.At the same time, since some of the variables that can be collected are based on a 1 km × 1 km mesh, in order to match the data of each variable and ensure the accuracy of the data, this paper performs a 1 km × 1 km mesh division on the prediction area and calculates the correction coefficients of each grid separately.

Model-Driven Damage Probability Calculation
This paper mainly analyzes the transmission line and power tower and proposes a damage prediction model of the transmission line-tower system.The flow chart for the damage prediction of the transmission line-tower system is shown in Figure 3.

Calculation of Damage Probability of the Transmission Line
According to the reliability theory of the engineering structure [32], the damage probability of an overhead transmission line can be calculated based on the theory of stress strength interference.
Usually, the design wind load of overhead transmission lines obeys normal distribution [22] and the probability density function of design wind load is as follows:

Calculation of Damage Probability of the Transmission Line
According to the reliability theory of the engineering structure [32], the damage probability of an overhead transmission line can be calculated based on the theory of stress strength interference.
Usually, the design wind load of overhead transmission lines obeys normal distribution [22] and the probability density function of design wind load is as follows: where w d is the design wind load of the overhead transmission line and u d and σ d are the mean and standard deviation of design wind load, respectively.In addition, the probability distribution function of the actual wind load can be fitted with an extreme I distribution function [33] as follows: where w x is the actual wind load of the overhead transmission line, a is the scale parameter of distribution, and u is the location parameter of distribution.
Based on the stress intensity interference model, the reliable probability of an overhead transmission line is as follows: where f 1 (w x ) is the probability density function of the actual wind load.Then, the damage probability of overhead transmission line is as follows: where a and u are the scale parameters and positional parameters of the extreme value I distribution, respectively.In Equation (24), the relationship between the mean, u d , and the standard deviation, σ d , is defined according to reference [34].
where Z is called the coefficient of variation of the transmission line and its general selection range is 0.05-0.2[34].In this paper, the coefficient of variation Z was chosen to be 0.18 when calculating the damage probability of the transmission line.
In Equation ( 24), the wind load w d can be calculated as follows [31]: where a is the wind pressure asymmetrical coefficient, µ z is the wind pressure height coefficient, µ sc is the shape coefficient of the transmission line, d is the outer diameter of the transmission line, L p is the span of the transmission line, and θ is the angle between the wind and the transmission line.

Calculation of the Damage Probability of the Power Tower
Considering the exponential growth characteristics of the power tower when deformed by external force, based on the relationship between the design wind load and the actual wind load of the transmission tower, the prediction model of the damage probability of the power tower under a typhoon disaster is established.The damage probability p t is calculated as follows [35]: where w d and w x are the design wind load and the actual wind load of the power tower, respectively.

Calculation of the Damage Probability of the Transmission Line-Tower System
In the power grid, each transmission line is formed by a series of transmission lines and multi-base transmission towers.Therefore, the entire transmission line can be regarded as a series structure when calculating the damage probability of the transmission line.Under the typhoon disaster, the entire line will not be broken.Therefore, this paper selects a base transmission tower and transmission line on both sides, as analysis and calculation units, and calculates the damage probability p a in the unit.The result has not been corrected by multiple factors, called the pre-correction damage probability in this paper.

Calculation of the Comprehensive Damage Probability of the Transmission Line-Tower System
Based on the pre-correction damage probability and the data-driven correction coefficient calculation stated above, the calculation equation of the transmission line-tower system's comprehensive damage probability is as follows: where k is the correction coefficient calculated by data-driven thought.As it is the damage probability corrected by coefficient factors, it is called the post-correction damage probability in this paper.

Influence of the Line Span on the Damage Probability
Under the typhoon disaster, the line span will affect the wind load of the transmission line, as well as the damage probability.Therefore, analyzing the relationship between the line span and the damage probability can provide some guidance for the transmission line windproof work.
According to Equations ( 24)-( 26), the relationship between the line span and the damage probability is shown in Figure 4.

Calculation of the Comprehensive Damage Probability of the Transmission Line-Tower System
Based on the pre-correction damage probability and the data-driven correction coefficient calculation stated above, the calculation equation of the transmission line-tower system's comprehensive damage probability is as follows: where k is the correction coefficient calculated by data-driven thought.As it is the damage probability corrected by coefficient factors, it is called the post-correction damage probability in this paper.

Influence of the Line Span on the Damage Probability
Under the typhoon disaster, the line span will affect the wind load of the transmission line, as well as the damage probability.Therefore, analyzing the relationship between the line span and the damage probability can provide some guidance for the transmission line windproof work.
According to Equations ( 24), (25), and ( 26), the relationship between the line span and the damage probability is shown in Figure 4.As shown in Figure 4, as the transmission line span increases, the probability of damage increases.Therefore, in order to reduce the risk of damage to the transmission line under a typhoon disaster, when designing the transmission line the line span can be appropriately reduced or, before the typhoon disaster, there should be a focus on the reinforcement of long-span lines to improve their ability to withstand typhoons.As shown in Figure 4, as the transmission line span increases, the probability of damage increases.Therefore, in order to reduce the risk of damage to the transmission line under a typhoon disaster, when designing the transmission line the line span can be appropriately reduced or, before the typhoon disaster, there should be a focus on the reinforcement of long-span lines to improve their ability to withstand typhoons.

Influence of the Coefficient of Variation on the Damage Probability
In the damage prediction model of the transmission line under typhoon disaster, there is a coefficient of variation Z reflecting the relationship between the mean and the variance of the transmission line's designed wind load.According to Equations ( 24)-( 26), the relationship between the coefficient of variation and the damage probability of the transmission line is shown in Figure 5.As shown in Figure 5, as the coefficient of variation increases, the damage probability increases.
Therefore, when planning and designing transmission lines, the appropriate coefficient of variation can be selected according to the wind speed division of the line location to reduce its damage probability under windstorms and to enhance its disaster prevention capability.

Results
Taking the typhoon 'Mangkhut', No. 22 of 2018, as an example, it was formed in the northwestern Pacific on the afternoon of 7 September.After moving into the northeastern part of the South China Sea on the morning of 15September, it quickly moved westward, attacking the China Pearl River Delta and causing serious impact on the Guangdong Province in China.'Mangkhut' is the typhoon that has the widest range of land surface winds, the longest duration of strong winds, and the highest gust wind speed.It caused damage to a line of 35 kV and above, with 10 base towers and 13 broken (dropped) lines, which had a certain impact on Guangdong Province in China [36].
In this paper, the actual meteorological and power data of the typhoon 'Mangkhut' is taken as an example to predict the transmission line-tower system damage situation of a city in Guangdong Province, China.The simulation results verify the effectiveness and acurracy of the proposed method.

Correction Coefficient Calculation Based on Data-Driven Thought
This paper collects and records 630 samples from the typhoon ('Rammasun', 'Mujigae', and 'Hato') affected Guangdong Power Grid in China over the years.The nine characteristics of the sample data mainly include geographic information, meteorological information, and power grid information.Among them, geographic information includes elevation, slope, slope direction, slope position, roughness, underlying surface; meteorological information includes gust wind speed; and power grid information includes design wind speed and running time.
Step 1: Influence factor weight calculation.As shown in Figure 5, as the coefficient of variation increases, the damage probability increases.Therefore, when planning and designing transmission lines, the appropriate coefficient of variation can be selected according to the wind speed division of the line location to reduce its damage probability under windstorms and to enhance its disaster prevention capability.

Results
Taking the typhoon 'Mangkhut', No. 22 of 2018, as an example, it was formed in the northwestern Pacific on the afternoon of 7 September.After moving into the northeastern part of the South China Sea on the morning of 15September, it quickly moved westward, attacking the China Pearl River Delta and causing serious impact on the Guangdong Province in China.'Mangkhut' is the typhoon that has the widest range of land surface winds, the longest duration of strong winds, and the highest gust wind speed.It caused damage to a line of 35 kV and above, with 10 base towers and 13 broken (dropped) lines, which had a certain impact on Guangdong Province in China [36].
In this paper, the actual meteorological and power data of the typhoon 'Mangkhut' is taken as an example to predict the transmission line-tower system damage situation of a city in Guangdong Province, China.The simulation results verify the effectiveness and acurracy of the proposed method.

Correction Coefficient Calculation Based on Data-Driven Thought
This paper collects and records 630 samples from the typhoon ('Rammasun', 'Mujigae', and 'Hato') affected Guangdong Power Grid in China over the years.The nine characteristics of the sample data mainly include geographic information, meteorological information, and power grid information.
Among them, geographic information includes elevation, slope, slope direction, slope position, roughness, underlying surface; meteorological information includes gust wind speed; and power grid information includes design wind speed and running time.
Step 1: Influence factor weight calculation.The importance weights of each variable based on the Gini index (represented by the scheme A in the table), the mean decrease accuracy (represented by the scheme B in the table), and the entropy weight method (represented by the scheme C in the table) are calculated.The results are shown in Table 2.In order to visualize the importance relationship of each variable, the weight relationship diagram is shown in Figure 6.In order to visualize the importance relationship of each variable, the weight relationship diagram is shown in Figure 6.It can be seen from Figure 6a to Figure 6c that gust wind speed, the running time, and the altitude are more important based on the Gini index method and the mean decrease accuracy method.In the importance evaluation of variables based on the entropy weight method, the design of wind speed, the underlying surface, and the slope position is of great importance.The evaluation mechanism of the three methods is inconsistent, resulting in different final evaluation results.In order to choose a relatively reasonable evaluation mechanism, this paper intends to use fuzzy multi-criteria decision-making to select the optimal evaluation scheme.
According to the evaluation results of the three schemes in Step 1, the decision weights of each scheme are calculated based on fuzzy multi-criteria decision making.The results are shown in Table 3.It can be seen from Figure 6a to Figure 6c that gust wind speed, the running time, and the altitude are more important based on the Gini index method and the mean decrease accuracy method.In the importance evaluation of variables based on the entropy weight method, the design of wind speed, the underlying surface, and the slope position is of great importance.The evaluation mechanism of the three methods is inconsistent, resulting in different final evaluation results.In order to choose a relatively reasonable evaluation mechanism, this paper intends to use fuzzy multi-criteria decision-making to select the optimal evaluation scheme.
Step 2: Evaluation scheme selection.According to the evaluation results of the three schemes in Step 1, the decision weights of each scheme are calculated based on fuzzy multi-criteria decision making.The results are shown in Table 3.It can be seen from Table 3 that the weighting scheme based on the Gini index method has the largest decision weight and its evaluation results are more consistent with the subjective experience.Therefore, the Gini index method is used to evaluate the importance of variables in this paper.
Step 3: Comprehensive scoring benchmark.This paper analyzes the probability of damage to the transmission line-tower system under the typhoon "Mangkhut" in a city in Guangdong Province.Since the data of each variable provided is in a 1 km × 1 km grid, in order to facilitate the collection and sorting of each variable data, this example performs 1 km × 1 km mesh division for the city and obtains the correction coefficient of each grid.Part of the calculation results are shown in Table 4 below.In Table 4, No. indicates the serial number of the mesh, Lon indicates the longitude of the mesh center, lat indicates the latitude of the mesh center, and k indicates the correction coefficient of the corresponding mesh.Due to the limited space, Table 4 only lists partial results (1st to 9th and 8840th to 8846th grid data).
It can be seen from Table 4 that, due to the influence of various variables, the correction coefficient of each grid in the city is greater than 1, the final comprehensive damage probability will be greater than the damage probability based on model-driven thought, and the damage risk is higher than the risk before the correction.

Damage Probability Calculation Based on Model-Driven Thought
The damage probability model of the transmission line-tower system in each mesh is established, and their damage probability is calculated.Part of the results are shown in Table 5.
In Table 5, p a represents the basic damage probability based on model-driven thought.In order to visually reflect the prediction results of the damage probability of Table 5 and make the emergency command of the relevant departments of the power grid more intuitive and convenient, we visualize the results based on ArcGIS software.

Comprehensive Damage Probability Calculation
According to Equation ( 28), the comprehensive damage probability of post-correction of each grid in the city is calculated.Part of the results are shown in Table 6.In Table 6, p represents the comprehensive damage probability of post-correction in order to visually reflect the damage probability results of Table 6 and better guide the relevant departments of the grid for emergency command.

Discussions
In order to visually represent the results of the analysis, this paper visualizes the results based on ArcGIS software [37].Under typhoon 'Mangkhut', the actual breaking line and tower failure in the city was mainly distributed in the southeast coastal areas.The actual damage distribution is shown in Figure 7c.According to the damage probability results of pre-correction in Table 5, the result is divided into four levels and plotted on the map in different colors, as shown in Figure 7a.
It can be seen from Figure 7a that the larger damage probability is mainly concentrated in the southeast direction of the city and the distribution area is small.Under the typhoon "Mangkhut", damaged transmission line-tower systems based on the model-driven view are distributed in different areas of damage probability.
According to the damage probability results of post-correction in Table 6, the distribution of damage probability of post-correction is shown in Figure 7b.
It can be seen from Figures 7a and 7b that the damage probability of the transmission line-tower system in each mesh has an increasing tendency, after considering the correction coefficient.All the accidents are distributed in the maximum damage probability area after considering the correction coefficient.
At the same time, as shown in Figure 7b, the maximum damage probability distribution area of post-correction is consistent with the actual damage distribution area (as shown in Figure 7c) and the prediction result of post-correction is better than the result of pre-correction (as shown in Figure 7a).This illustrates the necessity of calculating the correction coefficient considering multiple influencing factors and it shows the rationality of the method proposed in this paper.

Figure 1 .
Figure 1.Damage probability prediction framework of the transmission line-tower system.

Figure 1 .
Figure 1.Damage probability prediction framework of the transmission line-tower system.

1 .
Variable importance evaluation based on the Gini index.The random forest (RF) was proposed by Breiman et al. in 2001 and has now become one of the most commonly used tools in data mining and bioinformatics [23].RF can effectively analyze

Figure 3 .
Figure 3. Damage prediction flow chart of the transmission line-tower system.

Figure 3 .
Figure 3. Damage prediction flow chart of the transmission line-tower system.

Figure 4 .
Figure 4. Relationship curve between line span and damage probability.

Figure 4 .
Figure 4. Relationship curve between line span and damage probability.

Energies 2019 , 17 Figure 5 .
Figure 5. Relationship curve between the coefficient of variation and the damage probability.

Figure 5 .
Figure 5. Relationship curve between the coefficient of variation and the damage probability.

Figure 6 .
Figure 6.Variable weight relation graph.(a) Gini index method; (b) Mean decrease accuracy method; and (c) Entropy weight method.

Figure 6 .
Figure 6.Variable weight relation graph.(a) Gini index method; (b) Mean decrease accuracy method; and (c) Entropy weight method.

Table 1 .
Value range of each variable.

Table 2 .
Each variable importance weight evaluation result.

Table 2 .
Each variable importance weight evaluation result.

Table 3 .
Decision weights for each scheme.

Table 4 .
Partial results of correction coefficient.

Table 5 .
Partial damage probability results of pre-correction.

Table 6 .
Partial damage probability results of post-correction.