Next Article in Journal
Discussion on the Dominant Factors Affecting the Main-Channel Morphological Evolution in the Wandering Reach of the Yellow River
Previous Article in Journal
Physically Consistent Runoff Simulation in Mountainous Catchments Using a Time-Varying Gated Hybrid XAJ–LSTM Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Improved Entropy Weight Method Mitigating Grade Distortion in Water Quality Assessment

1
School of Civil and Environmental Engineering, Nanchang Institute of Science and Technology, Nanchang 330108, China
2
School of Civil Engineering and Architecture, Nanchang University, Nanchang 330031, China
*
Author to whom correspondence should be addressed.
Water 2025, 17(24), 3508; https://doi.org/10.3390/w17243508
Submission received: 19 November 2025 / Revised: 4 December 2025 / Accepted: 8 December 2025 / Published: 11 December 2025
(This article belongs to the Section Water Resources Management, Policy and Governance)

Abstract

The Entropy Weight Method (EWM) is a prevalent and objective technique for assigning weights in water quality assessment. However, engineering practice has shown that distortion phenomena occur in water quality assessment results based on the EWM. This study reveals EWM’s distortion in grade discrimination via theory and case studies. To address this, we developed an improved entropy weight model (I-EWM) based on fuzzy variable set theory, which determines weights by incorporating both pollution degree and grade discrimination capacity. We quantify an indicator’s grade discrimination level and pollution degree using the fuzzy entropy and first-order moment of its average membership vector, respectively. A specific water quality assessment example was used to demonstrate the effectiveness of the I-EWM. The I-EWM significantly altered the weight allocation: the weights for CODMn, NH3-N, and TP shifted from (0.687, 0.185, 0.127) to (0.191, 0.428, 0.381). When applied to the five monitoring points, the I-EWM produced markedly different results from the EWM. The water quality grades shifted from a pattern of (“Good”, “Good”, “Good”, “Good”, “Medium”) to a more conservative and realistic assessment of (“Medium”, “Medium”, “Poor”, “Poor”, “Poor”). Theoretical analysis demonstrates that the I-EWM provides more reasonable water quality assessments and effectively addresses the grade discrimination distortion inherent in the EWM.

1. Introduction

Water is a natural resource that humans rely on for survival and development. Due to rapid industrialization and continuous population growth, water pollution has become increasingly serious, and water environmental problems have become a widely concerning issue worldwide [1]. To ensure the rational development and utilization of water resources, it is necessary to scientifically plan and manage the water environment [2]. Water quality assessment is an important means of water environment supervision and management. In recent years, the scientific effectiveness of water quality assessment methods has been a hot topic in the field of the water environment.
Water quality assessment encompasses a variety of approaches, with widely used methods including the comprehensive index method, fuzzy comprehensive evaluation method, principal component analysis, machine learning techniques, and multi-criteria decision-making [3,4,5]. The comprehensive index method evaluates water quality through an integrated water quality index, offering straightforward implementation and interpretable results, yet its accuracy is heavily influenced by the weighting of pollutant indicators [6,7]. Fuzzy comprehensive evaluation introduces membership functions to address the inherent ambiguity in water quality classification boundaries, though the definition of these functions involves a degree of subjectivity. Principal component analysis is suitable for handling high-dimensional pollution data, but the resulting principal components often lack clear physical interpretation [8]. Machine learning methods such as artificial neural networks can establish predictive black-box models, yet their decision-making processes remain opaque and lack physical explicability [9]. Thus, while these diverse methods provide adaptable solutions for different scenarios, each carries inherent limitations that require careful selection and adjustment based on specific practical needs.
In recent years, water quality assessment models have shown a trend of shifting from single methods to hybrid models. By coupling models based on different principles, their strengths can be combined to achieve more robust and accurate comprehensive evaluations. For instance, Wang Wenchuan et al. integrated the Fuzzy Binary Comparison Method (FBCM) with Variable Fuzzy Sets (VFS) theory to propose a Variable Fuzzy Evaluation Model (VFEM) for river water quality assessment [4]. Applied to the Three Gorges Reservoir and Tseng-Wen River, this model effectively handles grading standards in interval form. Wu Guozheng et al. employed a coupled Principal Component Analysis (PCA) and Self-Organizing Map (SOM) artificial neural network model to evaluate the eutrophication status of Ulansuhai Lake [10]. This approach objectively classified samples and calculated evaluation weights, avoiding the subjectivity and arbitrariness inherent in selecting evaluation indicators. Peng Qi et al. combined an optimized TOPSIS model with Hasse diagram technique to analyze groundwater samples from villages around an industrial site in Beijing. The results demonstrated a more refined assessment compared to those obtained using the traditional TOPSIS model for water quality evaluation [11].
These hybrid approaches underscore a fundamental principle: whether in a single water quality assessment model or a coupled water quality assessment model, the determination of weights plays a decisive role in the accuracy of evaluation results. The precision of weight assignment directly governs the model’s effectiveness—even within complex coupled water quality assessment frameworks, this challenge remains a central methodological concern [12].
In comprehensive water quality assessment, weights reflect the importance of indicators and are one of the most important parameters [1]. The earliest weighting methods were subjective methods, such as expert scoring and the Analytic Hierarchy Process [13]. The subjective weighting method can intuitively explain the differences in the importance of indicators, but it cannot guarantee the uniqueness of the weighting results, which can cause uncertainty in the water quality assessment results [14].
To ensure the uniqueness of water quality assessment results, objective weighting methods have been developed, among which the Entropy Weight Method (EWM) proposed by is a prominent representative. EWM assigns weights based on the information content of pollutant indicators. The higher the information content of the indicators, the greater the corresponding weight; vice versa [15]. Compared with subjective methods, EWM eliminates human interference and ensures the uniqueness of the weighted results. Therefore, it has been widely used in water quality assessment [16]. For example, Agossou and Yang [17] used EWM to evaluate the vulnerability and contamination of groundwater in the south of Benin, West Africa. Md. Yousuf Mia et al. [18] applied a combined approach of explainable artificial intelligence (XAI) and the entropy weight method to identify the key influencing factors of drinking water quality from a hydrochemical perspective in the Haor region. Soumya S. Singha et al. [19] evaluated the groundwater quality status in the Arang Block of Chhattisgarh, India, through the integration of the entropy weight method and the PROMETHEE II (Preference Ranking Organization METHod for Enrichment of Evaluations II) method, achieving results with high accuracy.
However, the wide application of EWM has also revealed several limitations. Scholars have found that the weighting results of EWM may not always align with environmental realities. For example, Yan et al. [20] pointed out that when monitoring data are highly concentrated, EWM fails to reflect the true importance of pollutants. Bao, Qian et al. [21] found that data with excessive zero values can lead to distorted results. Additionally, Wang et al. [14] reported that in dynamic assessments, EWM might even cause the evaluation trend to contradict the actual pollution trend. These issues indicate that the purely data-driven nature of EWM can sometimes lead to biased assessments in practice.
Particularly, through extensive practice, in addition to the problems discovered by Feng, Bao, and Zhe, the authors found another abnormal phenomenon in the EWM in water quality assessment. When there is a significant difference in the threshold division of different indicators, the weighting results of the EWM cannot reflect the difference in indicator grades correctly, resulting in distorted evaluation results.
The objectives of this study are threefold: (i) to intuitively reveal the distortion phenomenon of the EWM in grade discrimination through theoretical derivation and evaluation examples; (ii) to develop an improved entropy weight model (I-EWM) based on fuzzy set theory to effectively mitigate this issue; and (iii) to validate the effectiveness and superiority of the proposed I-EWM through a concrete case study in water quality assessment.

2. Materials and Methods

2.1. Water Quality Index (WQI)

The Water Quality Index (WQI) serves as a structured approach for assessing the overall quality of water [22]. Since Horton established the first WQI in 1965, numerous variants have been developed to suit specific environmental contexts, including those by Bhargava, the Oregon Index, and the Weighted Arithmetic WQI [23]. Among these, the Weighted Arithmetic WQI is the most widely adopted method for comprehensive water quality evaluation in China [24].
According to the Chinese national criteria for surface water environmental quality (China 2002) [25], the standard limits for basic parameters are categorized into five grades: excellent, good, medium, poor, and bad. The specific thresholds are presented in Table 1. In this section, we selected three pollutant indicators, including permanganate index (CODMn), ammonia nitrogen (NH3-N), and total phosphorus (TP), as the research subjects, with a total of five monitoring stations.
The monitoring value of the i-th indicator of the j-th station is recorded as xij. There are m indicators and n stations. The WQI of the i-th indicator of the j-th station is recorded as qij.
Following the method of Yan et al. [26], who established that for these indicators, smaller values correspond to better water quality, qij is calculated using the following formula:
q i j = 100 x i j l i 1 80 + 20 · s i 1 x i j s i 1 l i 1 l i 1 < x i j s i 1 60 + 20 · s i 2 x i j s i 2 l i 2 l i 2 < x i j s i 2 40 + 20 · s i 3 x i j s i 3 l i 3 l i 3 < x i j s i 3 20 + 20 · s i 4 x i j s i 4 l i 4 l i 4 < x i j s i 4 20 · s i 5 x i j s i 5 l i 5 l i 5 < x i j s i 5 0 s i 5 < x i j
Accordingly, qij is calculated with the following formula (for the case where a larger value is better):
q i j = 100 x i j > s i 1 80 + 20 · x i j l i 1 s i 1 l i 1 l i 1 < x i j s i 1 60 + 20 · x i j l i 2 s i 2 l i 2 l i 2 < x i j s i 2 40 + 20 · x i j l i 3 s i 3 l i 3 l i 3 < x i j s i 3 20 + 20 · x i j l i 4 s i 4 l i 4 l i 4 < x i j s i 4 20 · x i j l i 5 s i 5 l i 5 l i 5 < x i j s i 5 0 x i j l i 5
where sik and lik are the maximum and minimum of the k-th grade of the i-th indicator, respectively. The comprehensive water quality index (CWQIj) of the j-th station can be generated as follows:
C W Q I j = i = 1 m w i · q i j
where wi represents the weight assigned to the i-th indicator, with its assessment categories of “excellent”, “good”, “medium”, “poor”, and “bad” corresponding to the ranges [80, 100], [60, 80), [40, 60), [20, 40), and [0, 20), respectively.

2.2. Entropy Weight Method (EWM)

The Entropy Weight Method (EWM) is an objective technique for assigning weights based on information entropy theory, and it is extensively applied in water quality assessment [15]. The EWM determines weights according to the information significance of each indicator. A greater degree of dispersion for an indicator corresponds to a higher information content and, consequently, a greater assigned weight; conversely, lower dispersion results in a lower weight. The standard procedure of the EWM is as follows:
Step 1: Data Standardization
The initial step involves standardizing the raw monitoring matrix. The specific formula is selected based on the pollutant indicator’s attribute:
For “the larger, the better” indicators:
y i j = x i j min j = 1 , 2 , , n x i j max j = 1 , 2 , , n x i j min j = 1 , 2 , , n x i j
For “the smaller, the better” indicators:
y i j = max j = 1 , 2 , , n x i j x i j max j = 1 , 2 , , n x i j min j = 1 , 2 , , n x i j
yij is the standardized value of the monitoring value of the i-th indicator at the j-th monitoring station, with a value range of [0, 1]; Although the domain of yij is [0, 1], the sum of yij is not 1. Therefore, in order to calculate the entropy value, it is necessary to normalize yij.
Step 1: Data Normalization
p i j = y i j j = 1 n y i j
where pij is the normalized value of yij, the domain of pij is [0, 1], and the sum of pij is 1.
Step 3: Calculate the Entropy Value
The entropy value Hi for the i-th indicator is calculated as follows:
H i = 1 ln n · j = 1 n p i j · ln p i j
where Hi ranges from 0 to 1. A larger Hi indicates less variability in the data for the i-th indicator and, consequently, a lower information content. Conversely, a smaller Hi signifies greater variability and higher information content. By convention, if pij = 0, the term pij⋅ln pij is defined as 0 to ensure the calculation remains valid [27].
Step 4: Determine the Indicator Weight
The weight wi is calculated as follows:
w i = 1 H i m i = 1 m H i
wi represents the weight value assigned to the i-th indicator in the water quality assessment model, and its value range is [0, 1].
It should be noted that the EWM assigns weights according to the degree of dispersion, namely the numerical discrimination of indicators but not the grade discrimination of indicators [5]. The grade discrimination and the numerical discrimination of indicators are not the same thing, the grade discrimination of indicators is the parameter that we truly want to reflect the information content of water quality. For example, when the monitoring values of CODMn and TP at the five stations are 0.015, 0.025, 0.15, 0.25, and 0.35, the numerical discrimination of the two indicators is the same. However, referencing Table 1, it can be found that the water quality grade of CODMn at all 5 stations is “excellent”, without any grade discrimination, while the grade of TP at the five stations is “excellent”, “good”, “medium”, “poor”, and “bad”. The grade discrimination of TP is much higher than that of CODMn, which means that the information content of TP is much higher, but the EWM cannot accurately reflect this difference.

2.3. Improved Entropy Weight Model (I-EWM) Based on Fuzzy Variable Set Theory

2.3.1. Fuzzy Variable Set Theory and the Relative Membership Degree

Fuzzy variable set theory, initially introduced by Chen as a method for uncertainty analysis, has been subsequently utilized in water resource management studies, as demonstrated by Chunqing, Duan et al. [28]. According to Section 2.1, it can be seen that each water quality grade indicator has an accurate threshold. However, when the monitoring value of the indicator falls within the threshold range of the k-th water quality grade, it cannot be guaranteed that the water quality definitely belongs to that water quality grade or other water quality grades [29]. In other words, when the monitoring value of the indicator falls within the threshold range of the k-th water quality grade, the water quality grade belonging to the k-th grade is a fuzzy concept [30,31].
The monitoring value of the i-th indicator at the j-th station is denoted as xij. The possibility that a water sample belongs to the k-th water quality grade is represented by the relative membership degree μ(xij)k, which ranges from 0 to 1 [32]. A higher value of μ(xij)k indicates a greater likelihood that the water quality grade is k. Specifically, when μ(xij)k = 1, it indicates definite membership to the k-th grade, while μ(xij)k = 0, indicates definite non-membership.
As established in Section 2.2, the information content of water quality is derived from the grade discrimination of indicators rather than their numerical values. To quantify this grade discrimination, the distribution of an indicator across the various quality grades must be determined [32].
For a specific monitoring value xij, its membership to the k-th water quality grade is a fuzzy concept; thus, the grade distribution cannot be directly inferred from xij alone. However, the relative membership degree μ(xij)k provides a precise quantitative measure of this distribution. The grade discrimination—and consequently, the information content of water quality—can be quantified by calculating the dispersion (e.g., entropy or standard deviation) of the membership vector μ(xij)k, across all grades k [33]. In other words, the grade discrimination of monitoring values can be indirectly quantified by calculating the information entropy of the relative membership degrees—that is, the fuzzy entropy.

2.3.2. Improved Entropy Weight Model (I-EWM)

The Improved Entropy Weight Method (I-EWM) attempts to calculate the information content of water quality grade distribution using fuzzy entropy. The basic approach of this method is to first compute the relative membership degree matrix and the fuzzy entropy of indicator monitoring values, and then determine the weights based on the informational significance of each indicator’s relative membership degrees. The greater the dispersion of an indicator’s monitoring values in terms of relative membership degrees, the higher the information content of that indicator, and thus the larger the weight assigned to it; conversely, the smaller the dispersion, the lower the weight.
Assuming there are m pollutant indicators and n monitoring stations, the thresholds for the i-th indicator are defined in Figure 1 and Figure 2, where the N*-th grade represents the highest water quality level. For the k-th grade:
aik denotes the lower limit of the threshold,
bik denotes the upper limit,
mik denotes the median value,
and general β = 1 [34,35].
The weighting procedure of the Improved Entropy Membership Weight (I-EMW) method comprises the following steps.
Step 1: Calculate the relative membership degree μ(xij)k:
The membership degree vector serves as the primary carrier of information. Therefore, a scientific analysis of this vector is a prerequisite for conducting grade evaluation and identifying limiting factors. Common principles for such evaluation include the maximum membership degree principle, the proximity principle, and the weighted average principle [36]. Building upon the weighted average principle, this paper calculates the relative membership degree and the average membership degree of the indicators.
For a “smaller-is-better” indicator, μ(xij)k is calculated as follows:
When k = 1;
μ x i j k = 1 , x i j < m i k μ x i j k = 0.5 × 1 + x i j b i k m i k b i k β ,   x i j m i k ,   b i k   μ x i j k = 0.5 × 1 x i j b i k m i k + 1 b i k β , x i j b i k ,   m i k + 1 μ x i j k = 0 , x i j m i k + 1
When N* > k > 1;
μ x i j k = 0 , x i j < m i k 1 μ x i j k = 0.5 × 1 x i j a i k m i k 1 a i k β ,   x i j m i k 1 ,   a i k μ x i j k = 0.5 × 1 + x i j a i k m i k a i k β ,   x i j a i k ,   m i k μ x i j k = 0.5 × 1 + x i j b i k m i k b i k β ,   x i j m i k ,   b i k   μ x i j k = 0.5 × 1 x i j b i k m i k + 1 b i k β ,   x i j b i k ,   m i k + 1 μ x i j k = 0 , x i j m i k + 1
When k = N*,
μ x i j k = 0 , x i j < m i k 1 μ x i j k = 0.5 × 1 x i j a i k m i k 1 a i k β ,   x i j m i k 1 ,   a i k μ x i j k = 0.5 × 1 + x i j a i k m i k a i k β ,   x i j a i k ,   m i k μ x i j k = 1 , x i j m i k
For a “larger-is-better” indicator, μ(xij)k is calculated as follows:
When k = 1;
μ x i j k = 1 , x i j > m i k μ x i j k = 0.5 × 1 + x i j a i k m i k a i k β ,   x i j a i k , m i k μ x i j k = 0.5 × 1 x i j a i k m i k + 1 a i k β ,   x i j m i k + 1 , a i k μ x i j k = 0 , x i j m i k + 1
When N* > k > 1;
μ x i j k = 0 , x i j > m i k 1 μ x i j k = 0.5 × 1 x i j b i k m i k 1 b i k β ,   x i j b i k , m i k 1 μ x i j k = 0.5 × 1 + x i j b i k m i k b i k β ,   x i j m i k , b i k μ x i j k = 0.5 × 1 + x i j a i k m i k a i k β ,   x i j a i k , m i k   μ x i j k = 0.5 × 1 x i j a i k m i k + 1 a i k β ,   x i j m i k + 1 , a i k μ x i j k = 0 , x i j m i k + 1
When k = N*,
μ x i j k = 0 , x i j > m i k 1 μ x i j k = 0.5 × 1 x i j b i k m i k 1 b i k β ,   x i j b i k , m i k 1 μ x i j k = 0.5 × 1 + x i j b i k m i k b i k β ,   x i j m i k , b i k μ x i j k = 1 , x i j m i k
Step 2: Calculate the average membership vector of indicators.
The relative membership degrees for all grades constitute the relative membership vector of the monitoring value xij, denoted as uij = {μ(xij)1, μ(xij)2, …, μ(xij)k, …, μ(xij)N*}. It is easy to find that each element of the relative membership vector uij satisfies the following conditions:
0 μ x i j k 1 k = 1 N * μ x i j k = 1
To comprehensively represent the environmental characteristics of the i-th indicator across all monitoring stations, we define its average membership vector vi = {vi1, vi2, …, vik, …, viN*}. This vector is obtained by averaging the membership vectors uij from all n stations:
v i k = j = 1 n μ x i j k n
where vik represents the average extent to which the i-th indicator belongs to the k-th grade across the entire study area.
0 v i k 1 k = 1 N * v i k = 1
Step 3: Calculate the fuzzy entropy and the first-order moments.
The basic calculation formula for fuzzy entropy is derived from Shannon’s information entropy theory. After more than a century of development, the concept of entropy has expanded beyond its narrow physical origins, with its definition, formulas, and laws undergoing varying degrees of transformation—particularly in the expressions used to calculate entropy. In this paper, the fuzzy entropy formula is computed by treating the average membership vector as a discrete variable [37].
The fuzzy entropy Qi for the i-th indicator is introduced to quantify the uncertainty (or information content) inherent in its grade discrimination. It is calculated based on the average membership vector vi as follows:
Q i = 1 ln N * · k = 1 N * v i k · ln v i k
The fuzzy entropy Qi reflects the grade discrimination level of the i-th indicator, with a value range of [0, 1]. A higher Qi value indicates a greater grade discrimination level and, consequently, a greater importance of the indicator in the evaluation system. Specifically:
When Qi = 1, the grade discrimination and information content of the indicator are maximized, signifying the highest importance.
When Qi = 0, the indicator possesses no grade discrimination or information content, rendering it unimportant.
However, in water quality assessment, the practical significance of heavily polluted indicators often outweighs that of clean indicators. To account for this, I-EMW incorporates the pollution degree into the weighting process. This is quantified by the first-order moment Pi, which is defined as the expected (average) water quality grade for the i-th indicator across all monitoring stations. It is calculated as follows:
P i = k = 1 N * v i k · k
where k is the grade number (e.g., 1, 2, …, N*), and vik is the average membership degree of the i-th indicator to the k-th grade from Equation (16). A higher Pi value indicates a worse overall pollution state for the indicator.
As established, Pi reflects the pollution degree, while Qi quantifies the grade discrimination level of the i-th indicator. The I-EMW integrates these two dimensions to assign weights, adhering to the following principles [14]:
For indicators with an identical pollution degree (Pi), a higher grade discrimination level (Qi) confers greater importance and thus a higher weight.
For indicators with an identical grade discrimination level (Qi), a higher pollution degree (Pi) confers greater importance and thus a higher weight.
Step 4: Calculate the importance scale.
To integrate the pollution degree Pi and the grade discrimination level Qi into a composite measure, the importance scale Si for the i-th indicator is defined by their weighted sum:
S i = Q i + P i N *
This additive form ensures that both dimensions contribute independently and additively to the overall importance, preventing the scenario where a low value in one dimension nullifies the contribution of the other.
The value range of Si is [1/N*, 2], which is derived from the lower and upper bounds of its components (Qi ∈ [1/N*, 2], Pi/N* ∈ [1/N*, 1]). This signifies:
When Si = 2, both the pollution degree and the grade discrimination level are maximized (Qi = 1, Pi = N*), signifying the greatest importance.
When Si = 1/N*, both dimensions are minimized (Qi = 0, Pi = 1), signifying the least importance.
Step 5: Calculate the final weights.
The final weight wi for the i-th indicator is obtained by normalizing the importance scale Si across all m indicators to ensure their sum equals unity, fulfilling the fundamental constraint of weight assignment:
w i = S i i m S i

2.4. Compared with the EWM and I-EWM

2.4.1. Benchmark Methods for Comparison

To more convincingly demonstrate the advantages and applicability of the I-EWM, this study selects two additional mainstream weighting methods—Principal Component Analysis (PCA) and the AHP-Entropy Weight combination method—as benchmarks for a comprehensive comparative analysis. As widely adopted approaches in integrated water quality assessment, they are frequently used in conjunction with comprehensive water quality indices and are considered well-represented in the field.

2.4.2. Comparative Analysis of Weighting Logic and Procedure

A comparison of the EWM and I-EWM reveals distinct weighting logics. The EWM assigns weights based on entropy values derived from the numerical discrimination of raw monitoring data. According to this logic, a lower entropy value results in a higher weight. Thus, the weight in EWM fundamentally reflects the statistical characteristics of the data. In contrast, the I-EWM determines weights by integrating fuzzy entropy and the first-order moment. Here, higher values of both fuzzy entropy and the first-order moment lead to a higher weight. The fuzzy entropy is calculated from the dispersion of the “grade membership distribution”, while the first-order moment is computed based on the expected (average) water quality grade for each indicator. Consequently, the weight in I-EWM essentially represents a measure of grade-discriminatory power.
As shown in Figure 3, a comparison of the calculation procedures for the Entropy Weight Method (EWM) and the Improved Entropy Weight Method (I-EWM) can be made based on the content in Section 2.2 and Section 2.3.
A procedural comparison reveals that I-EWM, while involving more steps than the classic EWM, does not incur a substantial computational penalty. The EWM process requires data standardization, normalization, entropy calculation, and weighting. In contrast, I-EWM replaces the preprocessing steps with a fuzzy transformation (calculating relative and average membership degrees) and introduces the calculation of a dual-parameter set (fuzzy entropy and first-order moment) along with an importance scale. Although this adds to the step count, each new operation is mathematically straightforward and common in fuzzy evaluation. The additional computational cost is marginal and justified, as it enables the model to capture the critical information needed for grade-based discrimination, which is the core improvement over EWM.
When comparing the applicability of the Entropy Weight Method (EWM) and the Improved Entropy Weight Method (I-EWM), EWM shows a strong dependence on the overall distribution of raw data. As a purely statistical analysis-based method, it can objectively and efficiently identify the most discriminative dimensions within a dataset and is therefore suitable for scenarios where evaluation indicators lack predefined grading standards or exhibit insignificant differences in decision thresholds. In contrast, the weights derived from I-EWM are more robust. This method relies on predefined and relatively stable grading standards, making it insensitive to occasional extreme values in the data, and is thus particularly applicable to situations where indicator thresholds differ significantly. From a theoretical perspective, I-EWM provides a more solid theoretical foundation and better aligns with the purpose of assigning weights based on the information content pertaining to water quality grades in water quality assessment.

3. Results and Discussion

3.1. Statistical Characteristics of Monitoring

To validate the effectiveness of the method proposed in this study, a water quality monitoring scenario was simulated for a typical section of a river. Five sampling points were set up along this section, and a one-time simultaneous sampling was conducted. The monitoring indicators selected were key pollution parameters from the Environmental Quality Standards for Surface Water (GB 3838-2002) [25]: permanganate index (CODMn), ammonia nitrogen (NH3-N), and total phosphorus (TP). The concentration data for each indicator are shown in Table 2, aiming to construct a typical scenario with significant differences in thresholds among indicators (e.g., wide thresholds for CODMn and narrow thresholds for NH3-N), thereby highlighting the limitations of the traditional entropy weight method.
As shown in Table 2, the monitoring values of CODMn for the five stations are 0.7 mg/L, 1.84 mg/L, 1.82 mg/L, 1.86 mg/L, and 1.84 mg/L; referencing Table 1, all monitoring values are in the “excellent” grade, indicating that the water quality at all five stations is good and not polluted. It is easy to see that the monitoring values of CODMn range from 0.7 to 1.84 mg/L, with water quality grades concentrated in the “excellent” grade, which does not have grade discrimination. From a grade perspective, the information content of CODMn is low.
The monitoring values of NH3-N for the five stations are 0.88 mg/L, 0.91 mg/L, 1.21 mg/L, 1.62 mg/L, and 1.80 mg/L; referencing Table 1, it can be seen that the monitoring values are in the “medium”, “medium”, “poor”, “bad”, and “bad” grades, respectively. This indicates that the water quality at all five stations has been polluted to varying degrees. It is easy to see that the monitoring values of NH3-N range from 0.88 to 1.80 mg/L, with water quality grades distributed in the “medium”, “poor”, and “bad” grades, indicating significant grade discrimination. From a grade perspective, the information content of NH3-N is high.
The monitoring values of TP for the five stations are 0.26 mg/L, 0.29 mg/L, 0.28 mg/L, 0.31 mg/L, and 0.38 mg/L; referencing Table 1, the monitoring values are in “poor”, “poor”, “poor”, “bad”, and “bad” grades, respectively. This indicates that the water quality at all five stations has been significantly polluted. It is easy to see that the monitoring values of TP range from 0.26 to 0.38 mg/L, with water quality grades distributed in the “poor” and “bad” grades, indicating some grade discrimination. From a grade perspective, the information content is relatively high.
It should be noted that, referencing Table 1 and Table 2, among the three indicators, CODMn has the largest numerical discrimination but the smallest grade discrimination. This is because all monitoring values of CODMn are distributed in one water quality grade, while all monitoring values of NH3-N are distributed in three water quality grades, and TP is distributed in two water quality grades. Although the numerical discrimination of monitoring values of CODMn is the largest, all monitoring values of CODMn are actually in the same grade. Although the numerical discrimination of monitoring values of NH3-N and TP is smaller, the monitoring values of NH3-N and TP are distributed in different grades. From the perspective of water resource management, NH3-N and TP provide more information than CODMn.
In addition, the monitoring values of indicators are not concentrated in seriously polluted grades, so there is no distortion phenomenon as described by [20]. There are also no monitoring values of 0, so there is no distortion phenomenon as described by Bao et al. [21]. This water quality assessment is not a dynamic evaluation, so there is no distortion phenomenon as described by Wang et al. [14].

3.2. Weight Results of the EWM

The weight results of the EWM introduced in Section 2.1 are shown in Table 3.
As shown in Table 3, the weight results of CODMn, NH3-N, and TP are 0.687, 0.185, and 0.127, respectively, indicating that the rank of the weight parameters generated by the EWM is CODMn > NH3-N > TP. Referencing Table 2, the difference in monitoring values between CODMn is the largest, NH3-N is the second largest, and TP is the smallest, indicating that the numerical discrimination of CODMn is the largest, followed by NH3-N and then TP. Therefore, the weight results of the EWM can correctly reflect the dispersion degree of indicators and data information content.
Referencing Table 2, the water quality grades of CODMn are all “excellent” grades, while the water quality grades of NH3-N are “medium”, “poor”, or “bad” grade, and the water quality grades of TP are “poor” or “bad”. According to the grade distribution of indicators, NH3-N has the largest information content, followed by TP, and CODMn has the smallest information content. Therefore, the rank of weight parameters should be NH3-N > TP > CODMn. However, based on the weight results of the EWM the weight parameters of CODMn, which has the smallest grade discrimination, are much larger than those of NH3-N and TP, which have higher grade discrimination. This is clearly unreasonable.
Furthermore, referencing Table 2, TP has the highest degree of pollution, NH3-N has the second highest, and CODMn has the lowest. Therefore, the rank of weight parameters should be TP > NH3-N > CODMn. However, the weight parameters of CODMn, which has a lower pollution degree, are much larger than those of NH3-N and TP, which have high pollution degrees. The weight results of the EWM are clearly unreasonable because the EWM does not consider the importance of heavy pollution indicators.
Based on the above content, the weight results of the EWM cannot correctly reflect the importance of indicators in water quality assessment. The reason for this phenomenon is that EWM is weighted based on the numerical discrimination of indicators, rather than the grade discrimination of indicators. The grade discrimination of indicators can only reflect the information content of indicators, and the numerical discrimination of indicators cannot. Furthermore, EWM neglects the importance of heavy pollution indicators in the weighting process, which leads to distorted weighting results of EWM. How to obtain information entropy based on the grade discrimination of indicators while considering the importance of heavily polluting indicators is the research content of this article.

3.3. Weight Results of I-EWM

The weight results of I-EWM introduced in Section 2.3 are shown in Table 4.
As shown in Table 4, the fuzzy entropies of CODMn, NH3-N, and TP are 0.397, 0.681, and 0.430, respectively. The larger the fuzzy entropy of the indicator is, the higher the level of grade discrimination of the indicator is, the greater the information content of the indicator is and the greater the importance of the indicator is. Therefore, NH3-N has the greatest level of grade discrimination, that of TP is second, and that of CODMn is the smallest. Referencing Table 2, CODMn does not have a level of grade discrimination, NH3-N is distributed in three water quality grades, and TP is distributed in two water quality grades, which is consistent with the analysis results in Section 3.1.
As shown in Table 4, the first-order moments of CODMn, NH3-N, and TP are 0.267, 0.810, and 0.896, respectively. The larger the first-order moment of the indicator is, the higher the degree of pollution of the indicator is and the greater the importance of the indicator is. Therefore, the degree of pollution and the importance of TP and NH3-N are greater than those of CODMn. Referencing Table 2, according to the TP and NH3-N of the five stations, the water quality has been polluted to varying degrees, while according to CODMn, the water quality is not polluted, which is consistent with the analysis results in Section 3.1.
The weights of CODMn, NH3-N, and TP are 0.191, 0.428, and 0.381, respectively, indicating that the rank of weight parameters generated by the I-EWM is NH3-N > TP > CODMn. The I-EWM assigns weights based on fuzzy entropy and the first-order moment of the indicator, where the larger the fuzzy entropy and the first-order moment are, the greater the weight is. Comparing CODMn, NH3-N, and TP, the first-order moments of NH3-N and TP are greater than that of CODMn, and the fuzzy entropies of NH3-N and TP are also greater than that of CODMn, so the weights of NH3-N and TP are greater than that of CODMn. Comparing NH3-N and TP, the first-order moments of NH3-N and TP are equivalent, and the fuzzy entropy of NH3-N is greater than that of TP, so the weight of NH3-N is greater than that of TP. Based on the above content, NH3-N has the highest weight, followed by TP, and CODMn has the lowest weight.
Comparing the weight results of the I-EWM and that of the EWM, the weight of CODMn is significantly reduced, while the weights of NH3-N and TP are significantly increased. This is because the numerical discrimination of CODMn is greater than that of NH3-N and TP, but the level of grade discrimination and the pollution degree of CODMn are smaller than those of NH3-N and TP. The EWM assigns weights based on the numerical discrimination of indicators; CODMn with high numerical discrimination is assigned larger weights, while NH3-N and TP, with low numerical discrimination, are assigned smaller weights. The I-EWM assigns weights based on the level of grade discrimination and pollution degree of indicators, CODMn, without grade discrimination and with a low pollution degree are assigned smaller weights, while NH3-N and TP with higher grade discrimination and higher pollution degree are assigned larger weights.

3.4. Comprehensive Evaluation Results of Water Quality

The comprehensive WQI results using the weights generated by the EWM are shown in Table 5, and the comprehensive WQI results using the weights generated by the I-EWM are shown in Table 6.
As shown in Table 5, according to the water quality assessment results of the EWM, the comprehensive WQI of stations #1, #2, #3, #4, and #5 are 75.79, 66.97, 65.14, 61.06, and 58.08, respectively. Correspondingly, the water quality grades of stations #1, #2, #3, and #4 are all “good”, while only that of station #5 is “medium”. According to the water quality assessment results of the EWM, the final water quality grades are mostly “good”, indicating that the overall water quality has not been polluted.
As shown in Table 6, according to the water quality assessment results of the I-EWM, the comprehensive WQI of stations #1, #2, #3, #4, and #5 are 47.59, 42.62, 38.28, 28.89, and 20.51, respectively. Correspondingly, the water quality grades of stations #1 and #2 are “medium”, while those of stations #3, #4, and #5 are “ poor”. According to the water quality assessment results of the I-EWM, the final water quality grades are “medium” and “poor”, indicating that the overall water quality pollution degree is high.
Comparing the water quality assessment results of the EWM and I-EWM, the water quality assessment results of the I-EWM are much worse than those of the EWM. Compared with the CWQIj values of the EWM, the CWQIj values of the I-EWM at stations #1, #2, #3, #4, and #5 are reduced by 28.20, 24.35, 26.86, 32.17, and 37.57, respectively. Compared with the water quality grades of the EWM, the water quality grades of the I-EWM at stations #1, #2, and #5 are reduced by one level, while the water quality grades of the I-EWM at stations #3 and #4 are reduced by two levels.
Why are the water quality assessment results of the EWM and I-EWM completely opposite? Referencing Table 5 and Table 6, this phenomenon is caused by the difference in weight assignment between the two methods. According to the EWM, the weight of CODMn is significantly higher than that of NH3-N and TP, and the fact that CODMn is in the “excellent” grade plays a dominant role in the water quality assessment results, leading to optimistic water quality assessment results. According to the I-EWM, the weights of NH3-N and TP were significantly higher than that of CODMn, with NH3-N concentrated in the “medium”, “poor”, and “bad” grades and TP concentrated in the “poor” and “bad” grades. NH3-N and TP played a dominant role in the water quality assessment results, resulting in poor water quality assessment results.
In summary, the water quality assignment results based on the I-EWM are more reasonable. The EWM assigns higher weights to CODMn with high numerical discrimination and low level of grade discrimination, while assigning smaller weights to NH3-N and TP with low numerical discrimination and high level of grade discrimination, resulting in overly optimistic water quality evaluation results. The I-EWM assigned higher weights to NH3-N and TP with high levels of grade discrimination and pollution degree, while smaller weights were assigned to CODMn with no grade discrimination and no pollution, resulting in more reasonable water quality assignment results.

3.5. Comparative Analysis of Weight Allocation

To more effectively validate the superiority of the Improved Entropy Weight Method (I-EWM), this study simultaneously introduced the Analytic Hierarchy Process-Entropy combination method (AHP-Entropy) and Principal Component Analysis (PCA) to compare their weighting results with those of I-EWM. As shown in Table 7, although the AHP-Entropy method incorporates subjective adjustments, its weighting results remain predominantly influenced by CODMn. PCA assigns weights to the permanganate index (CODMn), ammonia nitrogen, and total phosphorus that are relatively equal in magnitude. As established in the preceding analysis, the weighting results of both methods fail to accurately reflect the information content pertaining to grade discriminability among the indicators. Consequently, they cannot serve as substitutes for I-EWM in addressing the weighting distortion related to grade discrimination inherent in the traditional EWM.
The water quality assessment results of the four weighting methods presented in Table 8 demonstrate the following: The results of the AHP-Entropy method are similar to those of the conventional EWM, both exhibiting an overly optimistic bias. Although the results of the PCA method are relatively closer to those of I-EWM, its water quality rating for Site 3# remains one grade higher than that of I-EWM, being rated as “Medium”. This discrepancy arises because the PCA weighting process assigns a comparatively high weight to the permanganate index, which consequently exerts a dominant influence on the water quality evaluation result for Site 3#.

4. Conclusions

4.1. Main Findings and Contributions

The weight results of the EWM cannot accurately reflect the importance of pollutants in actual water environment management. The EWM assigns weights based on the numerical discrimination of indicators rather than the grade discrimination of indicators. The numerical discrimination of indicators cannot reflect the information content of indicators, and grade discrimination can reflect the information content of indicators. When there are significant differences in the threshold division of different indicators, the EWM will give too much weight to indicators with high numerical discrimination and low level of grade discrimination and give too little weight to indicators with low numerical discrimination and high level of grade discrimination, resulting in distorted water quality evaluation results.
The I-EWM introduces fuzzy variable set theory and assigns weights based on the fuzzy entropy and first-order moment of the average membership vector of indicators. The fuzzy entropy reflects the water quality grade discrimination, and the first-order moment reflects the pollution degree of the indicator. The basic principle of weights assigned to the I-EWM is as follows: When the indicators have the same degree of pollution, the higher the level of grade discrimination of the indicators is, the greater the importance of the indicators is, and the higher the weights assigned are, and vice versa. When there is the same level of grade discrimination of indicators, the higher the degree of pollution is, the greater the importance of the indicators is, and the higher the weights assigned are, and vice versa. Through a specific water quality evaluation example, it is proven that the I-EWM can effectively improve the distortion phenomenon mentioned above in the EWM, and the water quality evaluation results are more reasonable.

4.2. Limitations and Applicability Discussion

In practical applications, the I-EWM weighting model has the following two limitations. Firstly, the method assumes equal importance between grade discrimination (fuzzy entropy) and pollution degree (first-order moment) in weight allocation. However, in practice, the influence of these two factors may differ significantly, requiring adjustments to the weighting formula based on specific conditions. Secondly, the advantage of I-EWM is most prominent in scenarios where there are significant differences in threshold divisions among evaluation indicators and where grade discrimination holds clear management significance. For ordinary datasets with uniformly distributed thresholds or ambiguous grade distinctions, its improvement may be limited, and the computational cost is slightly higher compared to the traditional EWM.
Nevertheless, I-EWM remains valuable in addressing the critical issue of traditional entropy weight methods neglecting grade discrimination. It is particularly suitable for hierarchical evaluation systems with significant threshold differences, thus demonstrating clear practical significance and potential for broader application.

4.3. Future Research Directions

To address the limitations of I-EWM in practical applications, future research should focus on integrating machine learning techniques to enhance the model’s adaptability and intelligence:
When significant differences exist in how grade discrimination and pollution degree influence weights, reinforcement learning or multi-objective optimization algorithms can be introduced. This enables the model to automatically adjust the proportional relationship between fuzzy entropy and first-order moment in weight calculations based on historical evaluation outcomes or managerial preferences, thereby improving the model’s adaptability across different application scenarios.
For conventional datasets with uniformly distributed thresholds or unclear grade discrimination, a lightweight hybrid model can be developed to balance performance and efficiency. Specifically, simple classifiers (e.g., decision trees) or rule-based criteria can be employed to pre-assess whether a dataset exhibits the characteristic of “significant threshold differences”. If this feature is present, the complete I-EWM model is activated; if the dataset is conventional, the system automatically switches to the more computationally efficient traditional EWM or other suitable models.

Author Contributions

Conceptualization, F.Y.; Methodology, J.H.; Writing—Original Draft Preparation, Q.Y.; Data Curation, L.X.; Formal Analysis and Visualization, R.C.; Investigation, Writing—Review and Editing, F.Y.; Supervision, X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science and Technology Research Project of Jiangxi Provincial Department of Education (Youth Project), via grant number GJJ2402821; National Natural Science Foundation of China (Grant No. 52469001); and Jiangxi Provincial Natural Science Foundation (20242BAB25313).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
EWMEntropy Weight Method
I-EWMImproved Entropy Weight Model
WQIWater Quality Index
CWQIComprehensive Water Quality Index

References

  1. Yan, T.; Shen, S.L.; Zhou, A. Indices and models of surface water quality assessment: Review and perspectives. Environ. Pollut. 2022, 308, 119611. [Google Scholar] [CrossRef]
  2. Jing, P.; Sheng, J.; Hu, T.; Mahmoud, A.; Guo, L.; Liu, Y.; Wu, Y. Spatiotemporal evolution of sustainable utilization of water resources in the Yangtze River Economic Belt based on an integrated water ecological footprint model. J. Clean. Prod. 2022, 358, 132035. [Google Scholar] [CrossRef]
  3. Liu, X.; Li, G.; Liu, Z.; Guo, W.; Gao, N. Water Pollution Characteristics and Assessment of Lower Reaches in Haihe River Basin. Procedia Environ. Sci. 2010, 2, 199–206. [Google Scholar] [CrossRef]
  4. Wang, W.-c.; Xu, D.-m.; Chau, K.-w.; Lei, G.-j. Assessment of River Water Quality Based on Theory of Variable Fuzzy Sets and Fuzzy Binary Comparison Method. Water Resour. Manag. 2014, 28, 4183–4200. [Google Scholar] [CrossRef]
  5. Xi, L.; Qin, Z.; Feng, Y. The improved entropy weighting model in water quality evaluation based on the compound function. Environ. Monit. Assess. 2022, 194, 662. [Google Scholar] [CrossRef]
  6. Tang, M.; Zeng, H.; Wang, K. Bayesian Water Quality Evaluation Model Based on Generalized Triangular Fuzzy Number and its Application. Environ. Process. 2022, 9, 6. [Google Scholar] [CrossRef]
  7. Abbas, F.; Cai, Z.; Shoaib, M.; Iqbal, J.; Ismail, M.; Arifullah; Alrefaei, A.F.; Albeshr, M.F. Machine Learning Models for Water Quality Prediction: A Comprehensive Analysis and Uncertainty Assessment in Mirpurkhas, Sindh, Pakistan. Water 2024, 16, 941. [Google Scholar] [CrossRef]
  8. Ren, J.; Han, G.; Gao, X.; Zhang, S. Water quality assessment in a megacity river: Water chemical analysis integrated with AHP-Entropy weighted fuzzy comprehensive evaluation model. J. Environ. Sci. 2025; in press. [Google Scholar] [CrossRef]
  9. Luo, L.; Zhang, Y.; Dong, W.; Zhang, J.; Zhang, L. Ensemble Empirical Mode Decomposition and a Long Short-Term Memory Neural Network for Surface Water Quality Prediction of the Xiaofu River, China. Water 2023, 15, 1625. [Google Scholar] [CrossRef]
  10. Wu, G.-z.; Li, C.-y.; Zhang, S.; Jia, K.-l.; Zhou, L.-w. Applying PAC-SOM Combined Model to Assess Lake Water Quality. Environ. Sci. Technol. 2009, 32, 173–177. [Google Scholar]
  11. Peng, Q.; Zhang, Y.; Zhang, Q.; Zeng, K.; Tang, J. Model of Water Quality Assessment Based on Improved TOPSIS Coupled with Hasse Diagram. Yellow River 2021, 43, 93–96+128. [Google Scholar]
  12. Jin, S.; Zhang, K.; Zhang, T.; Yang, Y.; Shuai, Y.; Hu, T. Establishment of an innovative machine learning-driven drinking water quality assessment model with health considerations. J. Environ. Sci. 2025, 161, 697–706. [Google Scholar] [CrossRef]
  13. Zeng, Q.; Luo, X.; Yan, F. The pollution scale weighting model in water quality evaluation based on the improved fuzzy variable theory. Ecol. Indic. 2022, 135, 108562. [Google Scholar] [CrossRef]
  14. Wang, Z.; Xu, X.; Feng, Y. An abnormal phenomenon in entropy weight method in the dynamic evaluation of water quality index. Ecol. Indic. 2021, 131, 108137. [Google Scholar] [CrossRef]
  15. Mahammad, S.; Islam, A. Surface water quality assessment for drinking and irrigation using DEMATEL, entropy-based models and irrigation hazard indices. Environ. Earth Sci. 2024, 83, 332. [Google Scholar] [CrossRef]
  16. Wang, D.; Zeng, D.; Singh, V.P.; Xu, P.; Liu, D.; Wang, Y.; Zeng, X.; Wu, J.; Wang, L. A multidimension cloud model-based approach for water quality assessment. Environ. Res. 2016, 149, 113–121. [Google Scholar] [CrossRef]
  17. Agossou, A.; Yang, J.S. Comparative study of groundwater vulnerability to contamination assessment methods applied to the southern coastal sedimentary basin of Benin. J. Hydrol. Reg. Stud. 2021, 35, 100803. [Google Scholar] [CrossRef]
  18. Mia, M.Y.; Haque, M.E.; Islam, A.R.M.T.; Jannat, J.N.; Jion, M.M.M.F.; Islam, M.S.; Siddique, M.A.B.; Idris, A.M.; Senapathi, V.; Talukdar, S.; et al. Analysis of self-organizing maps and explainable artificial intelligence to identify hydrochemical factors that drive drinking water quality in Haor region. Sci. Total Environ. 2023, 904, 166927. [Google Scholar] [CrossRef] [PubMed]
  19. Singha, S.S.; Singha, S. Comparative assessment of groundwater quality using subjective and objective weighting methods in a multi-criteria decision analysis framework. Groundw. Sustain. Dev. 2025, 31, 101553. [Google Scholar] [CrossRef]
  20. Yan, F.; Yi, F.; Li, C. Improved Entropy Weighting Model in Water Quality Evaluation. Water Resour. Manag. 2019, 33, 2049–2056. [Google Scholar] [CrossRef]
  21. Bao, Q.; Zhu, Y.; Wang, Y.; Yan, F. Can Entropy Weight Method Correctly Reflect the Distinction of Water Quality Indices? Water Resour. Manag. 2020, 34, 3667–3674. [Google Scholar] [CrossRef]
  22. Mo, Y.; Xu, J.; Liu, C.; Wu, J.; Chen, D. Assessment and prediction of Water Quality Index (WQI) by seasonal key water parameters in a coastal city: Application of machine learning models. Environ. Monit. Assess. 2024, 196, 1008. [Google Scholar] [CrossRef] [PubMed]
  23. Zotou, I.; Tsihrintzis, V.A.; Gikas, G.D. Water quality evaluation of a lacustrine water body in the Mediterranean based on different water quality index (WQI) methodologies. J. Environ. Sci. Health A Tox. Hazard. Subst. Environ. Eng. 2020, 55, 537–548. [Google Scholar] [CrossRef] [PubMed]
  24. Zhao, E.; Kuo, Y.-M.; Chen, N. Assessment of water quality under various environmental features using a site-specific weighting water quality index. Sci. Total Environ. 2021, 783, 146868. [Google Scholar] [CrossRef] [PubMed]
  25. GB 3838-2002; Environmental Quality Standards for Surface Water. Ministry of Ecology and Environment of the People’s Republic of China: Beijing, China, 2002.
  26. Yan, F.; Yan, F.; Liu, L.; Li, Y.; Zhang, Y.; Chen, M.; Xing, X. A dynamic water quality index model based on functional data analysis. Ecol. Indic. 2015, 57, 249–258. [Google Scholar] [CrossRef]
  27. Islam, M.M.; Rahman, M.; Nargis, S.; Ahamed, M.R.; Mollik, R.S.; Boby, N.J.K.; Raju, M.R.; Proshad, R.; Aziz, M.A. Elevating Health of the Turag River: A Synergistic Water Quality Assessment Approach. Earth Syst. Environ. 2023, 7, 761–780. [Google Scholar] [CrossRef]
  28. Duan, C.; Liu, C.; Cao, L.; Chen, X.; Qiu, L. Evaluation of water resources-carrying capacity by a variable fuzzy set method. J. Beijing Norm. Univ. (Nat. Sci.) 2009, 45, 582–584. [Google Scholar]
  29. Chen, F.; Huang, G.H.; Fan, Y.R. Inexact Multistage Fuzzy-Stochastic Programming Model for Water Resources Management. J. Water Resour. Plan. Manag. 2015, 141, 04015027. [Google Scholar] [CrossRef]
  30. Yan, F.; Bao, Q.; Liu, C.; Wei, B.; Zhang, Y. Introducing Biological Indicators into CCME WQI Using Variable Fuzzy Set Method. Water Resour. Manag. 2018, 32, 2901–2915. [Google Scholar] [CrossRef]
  31. Wang, Y.; Sheng, D.; Wang, D.; Ma, H.; Wu, J.; Xu, F. Variable Fuzzy Set Theory to Assess Water Quality of the Meiliang Bay in Taihu Lake Basin. Water Resour. Manag. 2014, 28, 867–880. [Google Scholar] [CrossRef]
  32. Sun, X.; Peng, A.; Hu, S.; Shi, Y.; Lu, L.; Bi, A. Dynamic Successive Assessment of Water Resource Carrying Capacity Based on System Dynamics Model and Variable Fuzzy Pattern Recognition Method. Water 2024, 16, 304. [Google Scholar] [CrossRef]
  33. Zuo, Q.; Guo, J.; Ma, J.; Cui, G.; Yang, R.; Yu, L. Assessment of regional-scale water resources carrying capacity based on fuzzy multiple attribute decision-making and scenario simulation. Ecol. Indic. 2021, 130, 108034. [Google Scholar] [CrossRef]
  34. Chen, S.-y.; Guo, Y. Variable Fuzzy Sets and its Application in Comprehensive Risk Evaluation for Flood-control Engineering System. Fuzzy Optim. Decis. Mak. 2006, 5, 153–162. [Google Scholar] [CrossRef]
  35. Fang, Y.; Zheng, X.; Peng, H.; Wang, H.; Xin, J. A new method of the relative membership degree calculation in variable fuzzy sets for water quality assessment. Ecol. Indic. 2019, 98, 515–522. [Google Scholar] [CrossRef]
  36. Yan, F.; Liu, L.; Xu, L.; Yao, X. Application of Membership Vector Analysis in River Health Assessment. Water Resour. Power 2012, 30, 30–32+214. [Google Scholar]
  37. Qiu, W. Entropy Theory in Management Decision and Its Applications, 3rd ed.; Science Press: Beijing, China, 2021; pp. 28–35. [Google Scholar]
Figure 1. Schematic diagram of the water quality grade thresholds for the i-th indicator (a “smaller-is-better” metric).
Figure 1. Schematic diagram of the water quality grade thresholds for the i-th indicator (a “smaller-is-better” metric).
Water 17 03508 g001
Figure 2. Schematic diagram of the water quality grade thresholds for the i-th indicator (a “larger-is-better” metric).
Figure 2. Schematic diagram of the water quality grade thresholds for the i-th indicator (a “larger-is-better” metric).
Water 17 03508 g002
Figure 3. Comparing the EWM and I-EWM.
Figure 3. Comparing the EWM and I-EWM.
Water 17 03508 g003
Table 1. Threshold values of pollutants (mg/L).
Table 1. Threshold values of pollutants (mg/L).
IndicatorsExcellentGoodMediumPoorBad
CODMn[0, 2](2, 4](4, 6](6, 10](10, 15]
NH3-N[0, 0.15](0.15, 0.5](0.5, 1](1, 1.5](1.5, 2]
TP[0, 0.02](0.02, 0.1](0.1, 0.2](0.2, 0.3](0.3, 0.4]
Table 2. Monitoring values of indicators.
Table 2. Monitoring values of indicators.
StationsConcentrations (mg/L)Grades
CODMnNH3-NTPCODMnNH3-NTP
1#0.70.880.26excellentmediumpoor
2#1.840.910.29excellentmediumpoor
3#1.821.210.28excellentpoorpoor
4#1.861.620.31excellentbadbad
5#1.841.800.38excellentbadbad
Note: # denotes the monitoring site location (or sampling site number).
Table 3. Weight results of the EWM.
Table 3. Weight results of the EWM.
IndicatorsCODMnNH3-NTP
StationsNormalized Values
1#0.9360.3560.316
2#0.0160.3450.237
3#0.0320.2290.263
4#0.0000.0700.184
5#0.0160.0000.000
Hj0.1900.7820.850
1 − Hj0.8100.2180.150
wi0.6870.1850.127
Note: # denotes the monitoring site location (or sampling site number).
Table 4. Weight results of I-EWM.
Table 4. Weight results of I-EWM.
IndicatorsCODMnNH3-NTP
Stationsuij
1#(1.00, 0.00, 0.00, 0.00, 0.00) (0.00, 0.00, 0.74, 0.26, 0.00)(0.00, 0.00, 0.00, 0.90, 0.10)
2#(0.58, 0.42, 0.00, 0.00, 0.00) (0.00, 0.00, 0.68, 0.32, 0.00)(0.00, 0.00, 0.00, 0.60, 0.40)
3#(0.59, 0.41, 0.00, 0.00, 0.00) (0.00, 0.00, 0.08, 0.92, 0.00)(0.00, 0.00, 0.00, 0.70, 0.30)
4#(0.57, 0.43, 0.00, 0.00, 0.00) (0.00, 0.00, 0.00, 0.26, 0.74)(0.00, 0.00, 0.00, 0.40, 0.60)
5#(0.58, 0.42, 0.00, 0.00, 0.00) (0.00, 0.00, 0.00, 0.00, 1.00)(0.00, 0.00, 0.00, 0.00, 1.00)
vi(0.664, 0.336, 0.000, 0.000, 0.000) (0.00, 0.00, 0.300, 0.352, 0.348)(0.00, 0.00, 0.00, 0.520, 0.480)
Qi0.3970.6810.430
Pi1.3364.0484.480
wi0.1910.4280.381
Note: # denotes the monitoring site location (or sampling site number).
Table 5. Comprehensive WQI results based on EWM weights.
Table 5. Comprehensive WQI results based on EWM weights.
IndicatorsCODMnNH3-NTPCWQIjGrades
wi0.6870.1850.127
1#93.044.828.075.79good
2#81.643.622.066.97good
3#81.831.624.065.14good
4#81.415.218.061.06good
5#81.68.04.058.08medium
Note: # denotes the monitoring site location (or sampling site number).
Table 6. Comprehensive WQI results using the weights generated by I-EWM.
Table 6. Comprehensive WQI results using the weights generated by I-EWM.
IndicatorsCODMnNH3-NTPCWQIjGrades
wi0.1910.4280.381
1#93.044.828.047.59medium
2#81.643.622.042.62medium
3#81.831.624.038.28poor
4#81.415.218.028.89poor
5#81.68.04.020.51poor
Note: # denotes the monitoring site location (or sampling site number).
Table 7. Comparison of Weights from Four Weighting Methods.
Table 7. Comparison of Weights from Four Weighting Methods.
MethodsCODMnNH3-NTP
EWM0.6870.1850.127
PCA0.3320.3360.332
AHP-Entropy0.6390.2150.146
I-EWM0.1910.4280.381
Table 8. Comparison of Water Quality Assessment Results Using Four Weighting Methods.
Table 8. Comparison of Water Quality Assessment Results Using Four Weighting Methods.
MethodsEWMPCAAHP-EntropyI-EWM
1#goodmediumgoodmedium
2#goodmediumgoodmedium
3#goodmediumgoodpoor
4#goodpoormediumpoor
5#mediumpoormediumpoor
Note: # denotes the monitoring site location (or sampling site number).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yan, Q.; Yan, F.; Xie, L.; Huang, J.; Chen, R.; Liu, X. An Improved Entropy Weight Method Mitigating Grade Distortion in Water Quality Assessment. Water 2025, 17, 3508. https://doi.org/10.3390/w17243508

AMA Style

Yan Q, Yan F, Xie L, Huang J, Chen R, Liu X. An Improved Entropy Weight Method Mitigating Grade Distortion in Water Quality Assessment. Water. 2025; 17(24):3508. https://doi.org/10.3390/w17243508

Chicago/Turabian Style

Yan, Qin, Feng Yan, Lingping Xie, Jiang Huang, Rui Chen, and Xinxin Liu. 2025. "An Improved Entropy Weight Method Mitigating Grade Distortion in Water Quality Assessment" Water 17, no. 24: 3508. https://doi.org/10.3390/w17243508

APA Style

Yan, Q., Yan, F., Xie, L., Huang, J., Chen, R., & Liu, X. (2025). An Improved Entropy Weight Method Mitigating Grade Distortion in Water Quality Assessment. Water, 17(24), 3508. https://doi.org/10.3390/w17243508

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop