In the initial analysis, we studied a system with a non-ideal three-phase source connected to the transmission line, and at the end, an unbalanced three-phase load. The corresponding system is shown in
Figure 3. The load was given as a percentage of the surge impedance loading (SIL) of the transmission line [
18]. Using the lossless case, the SIL is given by:
where
is the equivalent one-phase SIL, or the three-phase SIL divided by three, and
is the phase–ground voltage of phase A.
The presented system was simulated using MATLAB/Simulink using a notebook equipped with an Intel Core i7-1165G7 2.8 GHz CPU and 8GB of memory. In the initial analysis, the dataset size was considered to be 10,000 samples, which were split into training and test sets in an 80/20 proportion.
Numerical Analysis
The model is built using a training database and evaluated on a test database that is completely excluded from the training process. To present the performance, it is necessary to use some metrics. For this purpose, consider the actual values of the target
and the estimated values
, where
k is the size of the test database. In this work, we have adopted the root mean squared error (RMSE) and mean absolute percentage error (MAPE), defined as follows:
It is important to note that the RMSE is given in the unit of the parameter, e.g., for resistance, the RMSE is in
, whereas the MAPE is given in % using the presented configurations. Additionally, it is relevant to note that the metric is taken considering different
k values for the target in the test database. The result for the noiseless case is shown in
Table 1. Regarding the computational time required to obtain the solution, we conducted an average time analysis based on running 100 realizations to train and test the algorithm, resulting in an average time of 0.0397 s. As a benchmark, we implemented a simple linear regression solution to that commonly employed in the literature [
7,
10,
31], and we found an average computational time of 0.0213 s. Thus, our method exhibits a computational burden that is very similar to the standard methods used in the literature.
The use of a Bayesian method to estimate the parameters presents another advantage compared to other machine learning-based solutions or even common maximum likelihood estimators. In (
3), the developed method for prediction provides the estimator with the distribution of the target
in the test database. Consequently, the mean value and the standard deviation of each sample in the test set are calculated. The mean value corresponds to the prediction used for a given set of features. Additionally, the standard deviations can be used to illustrate the possible variations in the estimation process. To demonstrate this behavior, we have performed the estimation for the parameters
and
, along with their error bars, considering a variation of
. The obtained results are shown in
Figure 4.
The results depicted in
Figure 4 reveal some important aspects of the solution. Firstly, the estimation performance is remarkably high, as the true values closely align with the estimated ones. Another noteworthy observation is the reliability of the estimations, evidenced by the small standard deviations. Consequently, the proposed method generates stable predictions with a high performance for estimating three-phase transmission line parameters.
An important influencing factor in a machine learning solution is the size of the dataset, i.e., the number of samples used to build the statistical model and obtain results. Some papers in the literature have employed various dataset sizes. In the context of data-based solutions in power systems, [
32] used 5000 samples to train the model, whereas [
33] employed 32,000 samples to develop the model used to address the problem. Generally, non-machine learning solutions utilize a day of PMU measurements considering a certain reporting rate for the device. For instance, in [
7], a reporting rate of 50 samples per second was utilized, resulting in approximately
samples.
The presented work initially employed 10,000 samples to obtain the results shown in
Table 1. However, as shown in
Figure 5, the required number of samples to achieve the best performance is considerably lower. By analyzing this figure, it is possible to conclude that, for the noiseless case, 200 samples are sufficient to produce results with a high performance.
Another relevant aspect of the solution is the necessity of measurements at both ends. In real-world applications, obtaining PMUs to monitor each transmission line of the system is challenging due to the high cost of implementation [
16]. Thus, it is expected that some non-critical lines may only be monitored using a single PMU. Consequently, we tested the proposed algorithm considering measurements from only one side. This process involves removing either the receiver or sender end measurement from the feature matrix
and applying the method as described in
Section 3.2. The results of this analysis are shown in
Table 2.
The results presented in
Table 2 confirm that, as expected, the method’s performance decreases when using measurements from only one side. However, the obtained values still demonstrate an acceptable performance [
6], as it is desirable to achieve parameter values that are within 10% of the maximum values. It is worth highlighting that the worst-performing parameter is the zero-sequence susceptance. Generally, estimating zero-sequence parameters is more challenging due to their smaller magnitude compared to the positive sequence, which poses an additional barrier to achieving numerical stability in the estimation process.
Another relevant parameter in the estimation process is the active power carried through the line. In the initial analysis, we assumed that the active power was 1 SIL, equivalent to 100% of its value. To consider the influence of this parameter, the active power was varied in a range of
of the 1 SIL. This range is a commonly observed value for the variation in the active power in a period of 24 h [
34]. The results of this analysis are shown in
Figure 6.
The results presented in
Figure 6 show that the proposed methodology is robust concerning the active power carried through the line. This way, the variation in the load did not have a significant influence on the performance of the method, since there was only a slight difference among the results for the studied loads. This is an interesting behavior of the proposed approach due to the fact that common methods in the literature [
11], based on nonlinear optimization, present extreme variation in performance when the load is modified, especially for extreme values of load.
Another relevant scenario to test the performance of the algorithm is to estimate the parameters in a complete power system with the presence of other components, such as transformers, generators, loads, and transmission lines. To study the performance of the proposed method under this situation, we implemented the methodology to estimate the transmission line parameters in the IEEE 39-bus system. The IEEE 39-bus standard system is a widely used power network in the New England area of the United States [
35], providing an extensive scope to evaluate power system solutions. The system consists of 10 generators, 39 bus bars, and 12 transformers, as illustrated in
Figure 7. The methodology employed to generate the data is consistent with that presented in
Section 3.1. It is important to note that the simulation was carried out for positive sequence analysis, which is the standard case for these test systems [
36]. Thus, the results for several transmission lines of the system are presented in
Table 3.
As observed for the simplified 2-bus system,
Table 3 demonstrates that the method exhibits a high performance in estimating the positive sequence parameters of a three-phase transmission line. Particularly noteworthy is the fact that the worst parameter yielded a mean absolute percentage error (MAPE) smaller than 1%. Furthermore, it is relevant to emphasize that the parameters with the worst performances shared the characteristic of having one of their buses (sender or receiver) controlled, meaning that the voltage was constant in practical applications. This characteristic poses a challenge for conventional methods, which rely on algebraic solutions and often neglect such restrictions in their solutions. In contrast, the proposed methodology, based on statistical learning, is capable of handling this characteristic simply by ignoring this feature in the regression.
To directly compare different methods, it is necessary to perform the comparison using scenarios similar to those used to build and test the proposed methodology. In general, papers in the literature develop methods to address specific problems, such as considering the simplified model for the line in the dq domain [
38] or in three-phase untransposed short transmission lines [
7], which makes it difficult to propose a fair comparison. Additionally, the data models employed are generally different due to the use of other measurement resources, such as power measurements, or even due to the methods adopted to model the inaccuracies present in the data. However, it is possible to use references that present results using noiseless cases and some IEEE test systems. In [
31,
36], a method was implemented to estimate the parameters based on the extended vector of states in a power system. The comparison of the proposed solution with such references is shown in
Table 4. Note that the works have calculated only the admittance branch parameters, which are simply obtained by inverting the elements
.
By observing the results presented in
Table 4, it is possible to conclude that the developed methodology achieved better or at least comparable performance with the analyzed previous works in the literature. Furthermore, it is important to highlight that the methods developed in [
31,
36] may not work in some situations depending on the detection method used to evaluate the parameters. For example, for the branch 9–10, it was not possible to detect such a parameter as erroneous using the approach used in [
39].