A Study and Application Analysis Exploring Pythagorean Fuzzy Set Distance Metrics in Decision Making

: The ever-increasing demand for high-quality solutions drives research toward more sophisticated decision-making solutions. In the field of decision making, the ability to solve complex real-world problems is of paramount importance. To this end, fuzzy sets are used, which offer the possibility of incorporating uncertainty into the values describing decision options. This study focuses on Pythagorean fuzzy sets, an extension of classical fuzzy sets, providing even more tools for modeling real-world problems by presenting a distance measure for these specific sets. A verification of the characteristics of the proposed distance measure has been carried out, proving its validity. The proposed measure is characterized by a more straightforward formula and thus simplifies the calculations. Furthermore, to confirm its usability, a multi-criteria decision-making methodology is presented, the results of which are compared with two multi-criteria decision-making methods, namely, PF-TOPSIS and PF-VIKOR, and another distance measure previously presented in the literature. The comparative analysis highlights lower variability in terms of preference values calculated using the proposed distance measure, which confirms the stability and reliability of the newly proposed distance measure while maintaining low computational complexity. Moreover, a high correlation with rankings calculated using PF-TOPSIS ensures its utility in terms of decision making.


Introduction
In order to make better use of the available information, different approaches are used to analyze the data.In the case of decision-making situations, such a tool is multi-criteria decision analysis (MCDA), which allows for a comprehensive analysis of the decision options considered in a given problem [1].Over the years, several multi-criteria decisionmaking methods have been presented and applied to problems, such as the assessment of the construction process [2], infrastructure management [3], evaluation of the value of healthcare interventions [4], or sustainable manufacturing process management [5].This approach to problem solving is vital because it allows for an initial analysis of the alternatives considered in the problem under consideration and the identification of the best and worst possible options.
Given the increasing complexity of decision-making problems, where making a critical decision can significantly impact the course of subsequent events, it is essential to ensure the best possible outcome.To this end, it is important to use the available information as accurately as possible.The measures used in multi-criteria decision-making problems often include distance [6], standard deviation [7], or entropy [8].They make it possible to describe or characterize the data on which the problem is defined.In addition, measures such as rank similarity coefficients make it possible to compare results obtained using different multi-criteria decision-making methods [9].In the MCDA literature, many researchers have used other measures, such as the Shannon-Spearman measure, to compare the results of different aggregation methods [10] or measure the congruence or incongruence of the relative significance of alternatives [11].
However, classical decision making, in which the problem is described using crisp values, may not be sufficient in many real-world cases.In such cases, uncertainty often arises that cannot be adequately accounted for by the classical approach.For this purpose, a fuzzy approach is used to describe the data using fuzzy sets (FSs).Zadeh introduced these sets [12] in 1978 so that decision-makers could take uncertainty into account by means of membership functions and express its degree of membership and non-membership.Despite the passage of years, these sets are still in use, and a growing number of solutions appear in the literature to solve certain problems, as presented in many reviews [13][14][15], as well as to present new approaches to deal with problems that are defined in an uncertain environment [16][17][18].
However, classical fuzzy sets also have some drawbacks, which have been addressed by the introduction of intuitionistic fuzzy sets (IFSs), which allow for an even better representation of the decision problem, enabling more informed and accurate decision making.However, they were presented by Atanassov in 1986 [19], and many new challenges have emerged since then.To address them, Yager in 2013 [20] presented Pythagorean fuzzy sets (PFSs), which were quickly popularized and have found many applications in decisionmaking processes for real-world problem solving [21][22][23].For example, El-Morsy presented stock portfolio optimization in the environment of PFSs [24], while Rani et al. explored the assessment of sustainable bioenergy technologies for agricultural residues [25].Another interesting line of research is aggregation operators, the application of which was presented by Tapas Kumar et al., where they used Hamacher aggregation operators to improve the quality of decisions made in problems described on PFSs [26].Moreover, further research in the Pythagorean fuzzy environment led to the emergence of new measures, like the similarity measure of Pythagorean fuzzy sets based on the Jaccard index [27] or aggregation operators for PFSs based on confidence levels [28].
The distance measure is widely used in multi-criteria decision-making problems.Its use has been demonstrated, for example, in cases such as the evaluation of a flexible manufacturing system [29] or for the development of new multi-criteria decision-making methods to assess alternatives [30].Ejegwa applied the Hamming distance, Euclidean distance, normalized distance, and similarity measure in the context of interval type-2 fuzzy sets and their application in medical diagnosis and pattern recognition [31].In the case of Pythagorean fuzzy sets, research has been conducted in various directions [32].Over the years, many examples of the application of the distance measure to Pythagorean fuzzy sets have been presented.For instance, Chen presented an application of the Minkowski measure to, among other things, internet stocks and investment in research and development projects [33].Hussian et al. presented distance and similarity measures of Pythagorean fuzzy sets based on the Hausdorff metric with application to the fuzzy Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) [34].Agheli et al. explored the relationship between the similarity measure, distance measure, inclusion measure, and entropy for Pythagorean fuzzy sets (PFSs) [35].However, there are very few research reports on distance measures designed explicitly for Pythagorean fuzzy sets [36].
In 2021, Ohlan proposed a novel distance measure between Pythagorean fuzzy sets based on exponential function [37].Yin et al. explored an idea of the distance measure based on the earth mover's distance in the Pythagorean fuzzy environment [38].Zhou et al. proposed a new distance measure based on the characteristics of Pythagorean fuzzy sets and incorporated it into the TOPSIS method [39].Moreover, Wang et al. presented the geometric distance measure-based approach to solve Pythagorean cubic fuzzy multi-criteria decision-making problems [40], and Sun et al. presented the centroid distance measure application in Pythagorean fuzzy information processing [41].
It has been observed that numerous distance measures for PFSs available in the literature can be considered as extensions of the measures presented for IFSs.Moreover, it is worth noting that some distance measures lack normalization and have complex mathematical expressions, which limits their practical utility.To this end, this paper proposes a new distance measure characterized by a simple mathematical expression to address the challenges of solving complex real-world problems accurately and efficiently.The research addressed the research gap under consideration, proposed a metric that satisfied the characteristics required for PFS, and validated the proposed measure as a distance measure.In addition, the procedure utilizing the proposed measure was conducted on two multi-criteria decision-making problems, demonstrating its applicability, stability, and reliability in solving decision problems.In summary, the motivation and objectives of this work are as follows: • Developing a new distance measure with a simple mathematical expression to address the challenges of solving complex real-world problems accurately and efficiently.

•
Demonstrating the proposed measure's applicability, stability, and reliability in solving multi-criteria decision-making problems.

•
Comparing the proposed measure with the existing distance measure and other multicriteria decision-making methods, emphasizing its advantages and superiority.
The rest of this paper is structured as follows: Section 2 presents an introductory discussion that covers the basic concepts of Pythagorean fuzzy sets and introduces a new distance measure for PFSs.Section 3 presents a procedure that, using the newly proposed measure, allows us to obtain a ranking of the alternatives in the decision problem under consideration.Section 4 presents the application of the proposed methodology to solve two multi-criteria decision-making problems, one smaller one that considers the problem of energy microgeneration and the other larger one that presents the solution for selecting a renewable energy source.In addition, this section compares the results with another distance measure presented previously in the literature.Finally, Section 5 concludes this paper and discusses future directions.

Preliminaries
According to Yager [20], PFSs can be defined in the following way.Let us take a set S of nominal elements, which is non-empty and has a finite number of elements, denoted by S = {s 1 , s 2 , . . . ,s n }.The representation of the PFS M that is contained inside the set of a crisp set S is as follows: where µ M (s The µ M (s) and ϑ M (s) are the degree of membership and non-membership of the element s ∈ S. The complement of the Pythagorean fuzzy set is π(s), which is determined by the following equation.

Distance for Pythagorean Fuzzy Set
Let S be a non-empty set and A, B, C ∈ PFS(S).The distance measure D between A and B is a function D : PFS × PFS → [0, 1] that satisfies four characteristics, boundedness, reflexivity, symmetry, and separability, in the following way.
where e is the number e, which is also known as Euler's number.
Theorem 1.The measure defined in Equation ( 3) is a reliable distance indicator.
Proof.To demonstrate that the proposed measure is a valid distance measure, it is necessary to confirm the four characteristics set out in Section 2.1 for the given measure.For this purpose, let us consider the following characteristics in turn: i Boundedness: From the definition of the Pythagorean fuzzy set, we know that: Therefore, we have: ii Reflexivity: iii Symmetry: iv Separability: This proves that the proposed measure can be considered a valid distance measure.

PF-TOPSIS
The Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is one of the most popular methods for solving multi-criteria decision-making problems.It was introduced in 1994 by Hwang and Yoon [42] and has become very popular over the years, with many examples in the literature, such as ranking evolutionary algorithms [43], green low-carbon port evaluation [44], and many others [45].Due to its repeated confirmation of its usefulness in a wide variety of decision-making problems, it has lived to see many developments using, among others, spherical fuzzy information presented by Akram et al. [46] or interval-valued Fermatean fuzzy sets presented by Ilieva and Yankova [47].In the context of this work, the most relevant extension is the one that considers problems using Pythagorean fuzzy sets.This extension was presented by Zhang and Xu [48] and will be used following the procedure established by Zhang and Xu in this work for comparative purposes.

PF-VIKOR
VIseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR) is another prevalent method that has gained a lot of trust in the field of multi-criteria decision making.It was introduced by Opricovic, who later extended it to the fuzzy environment [49].Like the TOPSIS method, it has been widely used in the literature to solve problems in various fields, such as, for example, assessing regional security [50], evaluating the current situation of smart healthcare [51], and selecting sustainable hydrogen production [52].In addition, it has also seen many extensions, among others, to group decision making [53] or to accommodate the use of bipolar fuzzy sets [54].In the case of the VIKOR method, like TOPSIS, in the research conducted in this article, the most relevant extension is the one using Pythagorean fuzzy sets presented by Bakioglu Atahan [55].The distinguishing feature of this method is that it offers three final rankings, namely, S, R, and a compromise ranking of Q.In this study, the Q ranking will be used, and the compromise value will be set to 0.5.This means the S and R rankings will be equally important in the final evaluation.

Similarity Coefficients
For research in the field of multi-criteria decision making, it is important to compare a newly proposed solution with other solutions that have been presented in the literature over the years.For this purpose, ranking similarity coefficients are most often used to find out how different the proposed solutions are [9].The most commonly used ranking similarity coefficients in the literature are the weighted Spearman correlation coefficient, which was proposed by Pinto da Costa and Soares [56], and the weighted similarity coefficient, which was proposed by Salabun [57].Equation ( 9) represents the weighted Spearman coefficient, while Equation (10) represents the weighted similarity coefficient.
where R i -position in the reference ranking, Q i -position in the second ranking, and n-number of alternatives.
where R i -position of reference ranking, Q i -position of second ranking, and n-amount of alternatives.

Methodology
In multi-criteria decision-making problems, a decision option is selected from a set of considered alternatives that obtains the highest preference value.In the case considered in this study, the alternatives are described by Pythagorean fuzzy sets, and each criterion has a weight that corresponds to how the criterion will affect the outcome.In this procedure, a measure of distance and similarity is used to obtain a final preference value that allows for the considered alternatives to be further ranked, thus obtaining a solution in which one of the considered decision alternatives is regarded as the best available option.The procedure can be presented in the following detailed steps.
Step 1: The first step to consider a problem in multi-criteria decision making in the Pythagorean fuzzy environment is to establish a decision matrix.Considering that, establish a Pythagorean fuzzy decision matrix N = (A i , C j ) mxn where A i denotes alternatives (i = 1, 2, 3, . . .m) and C j criteria (j = 1, 2, 3, . . .n) where each element contains the value of membership and non-membership: Step 2: The next step requires the calculation of a positive ideal PFS (M + ) and a negative ideal PFS (M − ).This determination can be made using the following equations.
Step 3: Next, determine the distances from the positive D + and negative D − ideal solution.
The distance measure is a weighted distance calculated with a given weight vector w = {w i , 1 ≤ i ≤ n} using the following equation.
where e is the number e, which is also known as Euler's number.
Step 4: The similarity measure S is then calculated using the distance determined in the previous step.In this context, different functional types of similarity measures can be explored.However, the following equation is used in this study to calculate the similarity.
Step 5: In the fifth step, the relative similarity S(R) is calculated using the following formula.
where D + shows the positive distance and D − shows the negative distance for each alternative.
Step 6: Lastly, rank the alternatives A i in descending order based on their relative similarity measure value.

Numerical Examples
This section presents the application of the newly proposed distance measure to acquire rankings for two theoretical multi-criteria decision-making problems, namely, for one smaller problem that considers the selection of a microgeneration technology, considering five alternatives described by four criteria, and for a second larger renewable energy source selection problem that considers ten alternatives described by ten criteria.

Microgeneration Technology Selection
The first problem that will be considered is a smaller problem where four criteria describe five alternatives.As microgeneration is an important topic that should be further researched, let us consider such a problem.The importance of this topic is shown through numerous studies, such as the one presented by Kamali Saraji et al. [58], where they proposed a framework utilizing the combined Stepwise Weight Assessment Ratio Analysis and Technique for Order of Preference by Similarity to Ideal Solution (SWARA-TOPSIS) approach to solve a similar problem, or Zhang et al. [59], where another integrated MCDA approach was used for the microgeneration problem.However, not many researchers provide an example that utilizes a distance measure, which is why providing such an example is essential.
When it comes to the problem of energy microgeneration technology, a key consideration is the cost-effectiveness of investing in this type of solution.As with any newly developed technology, costs are expected to fall as the industry develops.However, if the adaptation of a technology encounters significant difficulties, this can significantly affect the subsequent ability to use it effectively.Therefore, in addition to the costs associated with using the chosen technology, the simplicity of its application with sufficient performance should be considered.In this regard, it should be borne in mind that alternative solutions that are difficult to implement will result in a decrease in productivity or the resulting process efficiency and an increase in costs.Environmental compatibility is another essential aspect in this situation, as not all technologies may be suitable for given geographical areas.As a result, it would be prudent to choose an alternative that is acceptable for the climatic conditions of the region in which the microgeneration projects are located.
In this problem, let us consider solar photovoltaics, small-scale wind turbines, microhydroelectric systems, biomass energy, and combined heat and power systems (CHPs) as alternatives and operational and maintenance cost, energy conversion efficiency, environmental impact, and technology maturity as criteria.The considered problem is presented on the chart in Figure 1.In the considered problem, a decision matrix presented in Table 1 will be used for further calculations.Then, the positive ideal PFS (M + ) and negative ideal PFS (M − ) must be established.It is performed using Equation ( 12) for the positive and Equation ( 13) for the negative ideal PFS.Table 2 presents the according values.We then need to calculate the distance using Equation (3).The weights for the criteria need to be set to use the equation mentioned earlier.In this example, the weights were chosen arbitrarily to provide an example where differences in the results from different approaches would be more apparent and are defined as follows: w = [0.25, 0.25, 0.25, 0.25].We can then calculate the distances between Mi and M + /M − , which can ultimately be used to measure the similarity using Equation (15), which lets us calculate the relative similarity using Equation (16).Table 3 shows the computed distance and relative similarity measures.Those results are considered preference values that should be ranked in descending order and will be further discussed in next section.

Comparison
It is essential to check whether the results offered by the newly proposed distance measure are stable and allow for multi-criteria decision-making problems to be solved reliably.For this purpose, another distance measure was chosen, presented by Mahanta and Panda in 2021 [60], where they provided proof that their proposed distance is valid and showed how it performs compared to other distance measures.This ensures that the distance proposed in this paper can be compared to the one proposed by Mahanta and Panda, providing insightful information.The following Equation ( 17) describes the distance presented by Mahanta and Panda.
The above measure was applied in the same way as the newly proposed measure, namely, in the procedure described in Section 3. In addition, the PF-TOPSIS and PF-VIKOR methods were used in the comparison carried out.The preference values obtained from the evaluation of the different alternatives are summarized in Table 4. Considering that the same problem is further analyzed, the same weights vector was used, namely, w = [0.25, 0.25, 0.25, 0.25], and all the criteria were considered to be of the profit type.The obtained preference values for the individual alternatives look similar when distance measures are used in multi-criteria decision making.For PF-TOPSIS, the preference values are negative, and in the case of PF-VIKOR, the values are in the range [0, 1].To better illustrate the distribution of the preference values, Table 5 shows the measures of the minimum, mean, maximum, and standard deviation.It can be seen that the values obtained by PF-TOPSIS are characterized by the highest standard deviation and the proposed approach by the lowest.Such consistency in the case of the proposed approach ensures that the results cluster closely around the mean, enhancing the reliability of the data interpretation.Additionally, a smaller standard deviation solidifies the mean value as a more representative indicator of the overall trends, making it a more dependable metric for further analysis.In addition, because a multi-criteria decision-making problem is being solved, it would be useful to look at the resulting rankings.These are visualized in Figure 2, where it can be seen that the proposed approach offers similar results to PF-TOPSIS despite significant differences in the statistical measures that describe the preference values.Compared with the distance measure presented by Mahanta, we can see that changes appeared in the first positions of the rankings.The podium has significant discrepancies, especially looking at alternative three, which was considered the best using the distance presented by Mahanta.However, this alternative would be regarded as the best with the distance measure presented in this work.
A further comparison of the ranking was conducted using rank similarity coefficients.Table 6 presents the comparison of the ranking calculated using the proposed approach and other methods.The weighted Spearman coefficient and the WS rank correlation coefficient provided similar results, allowing us to conclude that the proposed distance measure offers different results than the measure presented by Mahanta while still being valid.In addition, it can be said that the proposed approach utilizing the proposed distance measure has a basis to be used in multi-criteria decision-making problems due to the fact that it offers precisely the same ranking as the PF-TOPSIS method while providing a more straightforward procedure requiring fewer mathematical operations.

Renewable Energy Source Selection
Another prevalent and crucial issue from a socio-economic point of view is the production of energy from renewable sources.This topic is widely discussed due to the current emphasis on sustainability.It is also a complex topic because using renewable energy sources for energy production must consider costs, the least possible impact on the environment, and the existing architectural order in the area.This topic has been addressed many times in the literature, whether in problems using crisp values, such as the presented comparison of classical multi-criteria decision-making methods in ranking renewable energy sources in Taiwan by Lee and Chang [61]; or the ones that were considered in a fuzzy environment, as in the case presented by Xu et al. [62] where they showed the application of decision-making methods in a renewable energy evaluation problem; or even Ezbakhe and Pérez-Foguet [63], where they considered renewable energy planning under uncertainty.
Therefore, it is worth looking into this problem, and in this numerical example, we will consider a larger theoretical decision problem containing ten alternatives and ten criteria.These will be nuclear energy, tidal energy, geothermal energy, wind energy, hydropower energy, bioenergy, solar energy, hydrogen fuel cells powerplant, hydroelectric energy, and photovoltaic energy as the alternatives.Energy storage capacity, land use requirements, aesthetic impact, noise levels, material availability and environmental impact, adaptability to climate variability, technology flexibility, maintenance requirements, job creation and economic impact, and supply chain resilience will be the criteria.This is also additionally shown as a scheme in Figure 3. Table 7 presents a decision matrix that will be taken under further consideration.Similarly to the previous numerical example, the positive and negative ideal PFS must be determined using Equations ( 12) and ( 13) accordingly.The resulting values are presented in Table 8.In the case of this problem, an equal weighting approach was used so that the further vector of weights considered is as follows w = [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1].We then need to calculate the distance using Equation (3), which can ultimately be used to measure the similarity using Equation (15), which lets us calculate the relative similarity using Equation (16).The resulting values are presented in Table 9, which will be further discussed in next section.

Comparison
The comparison of the results in the renewable energy source selection problem was carried out in the same way as for the earlier, smaller problem, but in this case, equal weights were used.The preference values obtained using each solution are shown in Table 10.The values offered by PF-TOPSIS stand out the most due to their negative value.Also, the results obtained using the proposed distance measure are most similar to those obtained using the measure presented by Mahanta, followed by PF-VIKOR and PF-TOPSIS.Notably, the preference values obtained using the proposed distance measure are close to each other and thus would be more difficult for a human to draw conclusions from quickly.The statistical measures are then presented in Table 11.These reaffirm how preference values are distributed by the different approaches to solve multi-criteria decision-making problems.In this case, as well as in the microgeneration technology selection problem, we can see that the results obtained using the proposed measure have the smallest standard deviation.This provides stability and reliability when solving decision-making problems.It is also worth noting the PF-VIKOR values, which, despite the large range, are characterized by a relatively low standard deviation.The preference values were then ranked and visualized in Figure 4.It can be seen that there are no significant changes on the podium, but in the case of alternative three, PF-VIKOR rated it extremely less favorably than the other approaches, as in this case it was placed ninth instead of second place.The situation is similar for alternative two, whose position was assessed more favorably by PF-VIKOR than the rest of the approaches.However, it is worth noting that the distance measure proposed in this paper offers very similar results to the distance measure presented by Mahanta as well as to PF-TOPSIS.
3)7236,6 3)9,.25The rankings were then compared using rank similarity coefficients (see Table 12).For this problem, we can observe a very high correlation with the approach using the distance proposed by Mahanta and with PF-TOPSIS.This confirms the usefulness of the proposed measure and methodology for decision-making problems.In multi-criteria decision-making problems, the values of the weights, which determine the importance of the individual criteria and the input data itself, might significantly impact the outcome.Simulation studies would need to be conducted to draw more insightful conclusions.It is also important to bear in mind the low variance of the preference values obtained after using the proposed distance measure, which may prove to be a limitation in problems containing a very large number of alternatives.

Conclusions and Future Directions
Many decisions made in life can significantly influence the course of subsequent events, whether in one's personal, professional, or social life.Some decisions, which are global or national in scale, can affect many people as well as the economic state of an entire country.Therefore, studying this field and seeking new solutions to solve multi-criteria decision-making problems by considering various aspects is worthwhile.
This paper proposes a new formula for measuring the distance between Pythagorean fuzzy sets.The proof is carried out to demonstrate that the proposed measure is valid, and its use to obtain rankings in two example theoretical decision problems is presented.Furthermore, for both cases of multi-criteria decision-making problems, the results are compared with another distance measure and two multi-criteria decision-making methods, namely, PF-TOPSIS and PF-VIKOR.The analysis allows us to conclude that the results obtained by the approach using the proposed measure are valid and yield results similar to the PF-TOPSIS method while maintaining less complicated calculations and a more straightforward procedure.The preference results have the smallest standard deviation, which allows for higher stability and reliability, leading to more accurate and reliable outcomes in decision-making processes.However, the low variability in the data can become a limitation when solving large decision-making problems, i.e., those in which the number of alternatives would be at least several hundred.
Future studies would benefit from a simulation approach to provide a more in-depth analysis considering the sensitivity analysis of the proposed approach.Moreover, it would be worthwhile to lean into real-world problems from different domains to reassert the stability of the proposed measure.Furthermore, it would be worth comparing the results obtained by the presented procedure with other multi-criteria decision-making methods designed for an uncertain environment.

Figure 2 .
Figure 2. Rankings visualization for the problem of microgeneration technology selection.

Figure 4 .
Figure 4. Rankings visualization for the problem of renewable energy source selection.

Table 1 .
Pythagorean fuzzy matrix for microgeneration technology selection problem.

Table 2 .
Positive and negative ideal PFS for a given criterion in the problem of microgeneration technology selection.

Table 3 .
Distance and similarity for alternatives considered in the problem of microgeneration technology selection.

Table 4 .
Preference values for the assessed alternatives in the problem of microgeneration technology selection.

Table 5 .
Statistical measures for the values of preferences from different approaches in the problem of microgeneration technology selection.

Table 6 .
Similarity of rankings of the proposed approach with other available approaches in the problem of microgeneration technology selection.

Table 7 .
Pythagorean fuzzy matrix for renewable energy source selection problem.

Table 8 .
Positive and negative ideal PFS for a given criterion in the problem of renewable energy source selection.

Table 9 .
Distance and similarity for alternatives considered in the problem of renewable energy source selection.

Table 10 .
Preference values for the assessed alternatives in the problem of renewable energy source selection.

Table 11 .
Statistical measures for the values of preferences from different approaches in the problem of renewable energy source selection.

Table 12 .
Similarity of rankings of the proposed approach with other available approaches in the problem of renewable energy source selection.