The Development of a River Quality Prediction Model That Is Based on the Water Quality Index via Machine Learning: A Review
Abstract
:1. Introduction
1.1. Complexity and Cost of Manual Sampling
1.2. Limited Spatial and Temporal Coverage
1.3. Lack of Real-Time Monitoring
1.4. Data Quality and Consistency
1.5. Potential of Machine Learning and Advanced Technologies
1.6. Key Machine Algorithms
- ❖
- Conventional neural networks and gated recurrent units: These models achieved a validation accuracy of 97.86% when used to monitor the Vaigai River, which was more accurate than other state-of-the-art models. It incorporates real-time data collection and feedback and gives immediate alarms, which help in early intercession and decision-making in sound management [10].
- ❖
- Long short-term memory (LSTM) has been shown to be superior to other models, such as support vector regression and random forest, for estimating the British Columbia water quality index without some parameters, with a coefficient of determination of 0.91 [11].
- ❖
- Gradient boosting and polynomial regression: these algorithms are helpful in predicting the water quality index (WQI), with the gradient boosting model having been shown to have a mean absolute error of 1.8074 [12].
- ❖
- Random forest and extreme gradient boosting: these models were used for classification tasks and gradient boosting, with an accuracy level of 99.50% for water quality class prediction [13].
- ❖
- Performance comparison of key algorithms: Reviews of the performance comparison of key algorithms in previous trials indicate that machine learning models have considerable potential in the estimation of water quality index parameters, including pH, turbidity, and dissolved oxygen. The prognosis models used in previous studies include CNNs, LSTMs, and RF, which differ in their suitability for demonstrating predictive capabilities.
- ❖
- Convolutional neural networks (CNNs): convolutional neural networks (CNNs) can be applied to situations where spatial and image-based analyses of water quality can also be conducted and can learn complex patterns in data; hence, they may be effective if they are used with large training datasets [14].
- ❖
- Long short-term memory networks (LSTMs): These are ideal for use with time series data, which makes them perfect when trying to identify changes in water quality over time. They perform well in capturing long-term dependency information and can also be computationally expensive [14].
- ❖
- Random forest (RF): This is a strong ensemble method that works proficiently even when the relationships between the IVs and the DV are nonlinear. The anticipation of WQI parameters was proven to be accurate with published research suggesting R2 values equal to 0.98 (Khoi et al., 2022b [14]) and (Asadollah et al., 2021 [25]).
1.7. Advantages and Limitations
- ❖
- A comparison of different ML algorithms for real-time water quality predictions occurs within this study.
- ❖
- This study analyzes four significant challenges including the lack of data availability along with difficulty in interpreting models and high computational load and ethical issues.
- ❖
- This research examines machine learning applications within practical scenarios while showing their cost-saving and large data-handling abilities.
- ❖
- The authors propose future recommendations that combine ML with climate change data while improving data representation and handling privacy concerns.
2. Summary of Original Article
2.1. Machine Learning Models for Water Quality Prediction
2.2. Advanced Monitoring Systems
2.3. Feature Selection and Model Optimization
2.4. Key Themes in the Literature
2.5. Machine Learning Integration with WQI
2.6. Real-Time Monitoring
3. Challenges and Considerations
- ❖
- Model interpretability: The problem with many ML models, however, is that they may perform accurately but their results cannot be easily explained. This has been accomplished by applying methods such as Shapley additive explanation (SHAP), which explains the effects of different parameters on the WQI [29].
- ❖
- Parameter variability: The quality of water from different sources is not constant; thus, source-specific treatments are needed. For example, the levels of pH and total dissolved solids (TDSs) are extremely different in different water sources, which calls for specific strategies [30].
- ❖
- Ethical considerations and data privacy: The application of the IoT and big data in water quality management presents several ethical questions, whose responses are crucial in the advancement of this technology, in ownership and privacy, and in equity in the same technology. These issues must not be taken lightly to gain the confidence of communities regarding the proper implementation of the said measures. In the IoT case, ownership of data gathered through IoT devices is sometimes unclear, particularly when the data are personal or sensitive. Some questions arise regarding who owns these rights and how these data can be utilized [18]. In WBE, for example, a question is who owns the genetic data obtained from wastewater, which is still a concern for ethical implications and suggests that guidelines and policies should be more developed [18]. The problem of privacy violation arises from the massive flow of information from the IoT, which requires effective management. There is a need to adopt the right ethical practices in the development of machine learning to protect such information and make the utilization of the data more transparent.
- ❖
- Data representation and bias: Machine learning that involves the use of nonsample datasets can lead to the generation of incorrect predictions, as discovered by studies on drinking water quality in California. Here, modeling decisions play a highly important role in the demographic features of false negatives [33]. A lack of diverse data hampers performance in certain areas or with certain groups, thus perpetuating current disparities in environmental surveillance and resource distribution [34].
- ❖
- Impact of fata drift: A critical issue with AI/ML models is that they are not robust to data shifts, where performance can degrade as a result of changes in the environment. When the training data are not current, the models carry forward this bias, making the disparities in environmental monitoring even more entrenched [33].
- ❖
- ❖
- Frameworks: Frameworks can be used to recognize environmental AI problems and how AI can reduce dataset bias [35].
- ❖
- Robustness to outlier data: Most of the papers researched do not pay proper attention to the effect of outliers on models, which distorts the results [19]. Sophisticated methods such as isolation forest and the kernel density estimation algorithm have received positive reception in the identification of outliers, but they have not received extensive use [19]. Model performance resilience to noisy inputs is important since they are present in real applications.
- ❖
- Limitations od applicability of models to developing areas: The reviewed models tend to use large datasets, which are difficult to obtain in the developing part of the world [37]. New strategies, including the use of sparse dataset modeling, must be introduced to improve the predictiveness of models in these domains. This appears to be the case for regions whose data are severely constrained since closing this gap could enhance water quality management.
4. Challenges in Traditional River Water Monitoring
4.1. Limitation of Traditional Monitoring Methods
- ❖
- Inability to Predict Trends: these methods do not have prognostic value because they are based on human assessment rather than on sophisticated models [38].
- ❖
- Sample Alteration During Transport: this is why the properties of water samples may change during the transportation process, which affects results [38].
- ❖
- Inadequate real-time monitoring: conventional approaches are not capable of generating accurate data in real time, which is paramount to decision-making [39].
4.2. Addressing the Limitations
- ❖
- Machine learning models: models can be implemented to improve predictive capabilities through the use of on-site parameters to generate predictions with improved water quality index (WQI) predictions [40].
- ❖
- Geospatial frameworks: making use of geospatial frameworks can enhance the transparency and effectiveness of monitoring through providing near real-time data and pollutant control.
- ❖
- Advance sensing technologies: traditional methods can be complex; however, by incorporating more modern sensing methods, such as the IoT, virtual sensing, and cyber–physical systems, real-time detection can be achieved [39].
- ❖
- Biomarker analyses: a combination of classical chemical biological monitoring with cutting-edge techniques such as biomarker analysis can aid in more sensitive assessments of environmental contaminants [41].
- ❖
- Wireless sensor networks: autonomous microsensors could be developed to create networks that can continuously monitor at appropriate time scales, reducing costs [42].
4.3. Role of Machine Learning in Environmental Monitoring
- ❖
- Deep learning ensembles: Two deep learning models, TNX and STNX, also use temporal and spatial–temporal attention mechanisms. The improvement in prediction accuracy for short-step and long-step predictions over baseline models is 2.1% to 6.1% and 4.3% to 22.0%, respectively. To further enhance performance, the STNX model results in 0.5–2.4% and 2.3–5.7% improvements over TNX and effectively mitigates prediction shifts in long-step forecasts [20].
- ❖
- Convolutional neural networks and gated recurrent units: CNGRU-WQM is an improvement over CNGRUs, which modeled water quality along the Vaigai River via a convolutional neural network and gated recurrent units. The validation accuracy of this model was 97.86%, which was better than that of other state-of-the-art approaches and provided warnings of real-time water quality breaches [10].
- ❖
- Hybrid machine learning models: To improve prediction accuracy, the most representative data subset is selected, and hybrid models, including the adaptive neurofuzzy inference system (ANFIS), artificial neural networks (ANNs), and support vector machines (SVMs), are used. It has been shown that these models still perform with similar or better accuracy than traditional models, especially when trained on highly variable data [21].
- ❖
- Tree-Based and ensemble learning approaches: Water quality index prediction algorithms such as random forest, gradient boosting, and XGBoost are used. Furthermore, results show that these models, especially gradient boosting, have high accuracy, with R2 values of 0.88 and 0.85 for training and testing, respectively [22,43].
- ❖
- Recurrent neural networks and LSTM models: The Self-Attentive LSTM (SA-LSTM) model was combined with the Load Estimator (LOADEST) for water quality prediction in regions with sparse data. The method reduced the RMSE by 24.6% for COD Mn and 21.3% for NH3 N and preserved accuracy at longer data collection intervals [37].
4.4. Integration of WQI and ML for River Quality Prediction
- (1)
- Enhanced prediction accuracy:
- ❖
- Statistical methods, including the adaptive neurofuzzy inference system (ANFIS), artificial neural networks (ANNs) and support vector machines (SVMs), increase the accuracy of water quality prediction when trained on representative and variable data to decrease overfitting and increase model generalizability [20].
- ❖
- In general, the gradient boosting (GB) and random forest (RF) models have been able to yield high precision in estimating the WQI, with GB accuracies with an R2 of 0.88 during model training and 0.85 during model testing [22].
- ❖
- Ensemble models such as models that include more than one machine learning algorithm could be used to predict the WQI with high accuracy, as illustrated when data derived from the Johor River Basin are used [23].
- (2)
- Data-Driven insights:
- ❖
- Feature importance in machine learning is used to find and rank features that are important for model building, which, on the one hand, enables the addressing of key sources of water pollution and, on the other hand, enhances model interpretability (Ejaz et al., 2024 [22]) and (Aldrees, A. et al., 2024 [29]).
- ❖
- The combination of the modern enhancement of AI-based prediction and feature selection, such as SHAP, improves the ability to interpret models and provides insights into the individual characteristics of water that contribute to poor water quality (Ejaz et al., 2024 [22]) and (Aldrees, A. et al., 2024 [29]).
- (3)
- Real-Time monitoring and risk mittigation:
- ❖
- Real-time water quality monitoring is performed through machine learning models to prevent pollution by developing preventive measures as early as possible [23].
- ❖
- This implies that, with the help of predictive river health models, the management of resources and the reduction in the unfavorable effects of climate change and anthropogenic activities can be enhanced [40].
- ❖
- Sophisticated measures such as isolation forest and kernel density estimation are used to rectify the data discrepancy and to increase the strength of WQI forecasting [19].
4.5. Real-World Applications and Impacts
- ❖
- Climate change: Climate change has a large impact on river water quality because the hydrological regime becomes more sensitive to flow changes. It has been identified to account for more than 70% of the changes in water quality at some points, such as the Kanpur stretch of the Ganga River [45].
- ❖
- Land use and land cover (LULC): LULC conversion to urban land and increased agricultural activities are other ways through which nutrient loads and nonpoint source pollution are increased. Such changes modify the hydrological connectivity and delivery of pollutants to rivers [46].
- ❖
- Urbanization and industrialization: These activities add surfaces that do not allow evaporation and load water with pollutants, thus causing water pollution. The prevalent socioenvironmental situation in the Ganga River study further suggests that integrated and better sewage management needs to be adopted to control the different sources of pollution from urban and industrial areas [45].
- ❖
- Population growth: density results in the increased accumulation of waste and increases the availability of water for human use, thereby increasing pollution levels [44].
- ❖
- Machine learning models: ANNs, XGBoost, and random forest have been used for river water quality prediction. The optimization and simplification of these models involve considering a number of environmental and socioeconomic parameters that could increase precision (Camara et al., 2019 [46]) and (Satish et al., 2024 [47]).
- ❖
- Hybrid models: The SSA-VMD and BiGRU models also divide water quality into several parts to enhance the forecast results. These models have displayed high accuracy compared with other developed models, with prediction accuracies of 97.8% for dissolved oxygen and 96.1% for pH [48].
- ❖
4.6. Model Performance and Predictive Accuracy
4.7. Practical Implication of Findings
- ❖
- Enhancing policy analysis: When methods from ML are applied, it is possible to estimate various and different treatment effects for different groups. For example, when the decentralized social care system in the Netherlands was examined, ML revealed large differences in the effects of the policy between urban and rural areas [52]. The effectiveness of policies can be evaluated by using large datasets in policy-making since the latter is a key feature of ML [48].
- ❖
- Addressing fairness and bias: When dealing with causal ML models that are used to overcome unfair biases, designers should be careful with sensitive characteristics such as race or gender. Some conventional AI fairness approaches may not be adequate, primarily because most of them implicitly assume that the models make decisions rather than providing suggestions to people to make decisions, which is not always the case with ITS systems [53]. Policy-makers should understand the predisposing biases in outputs generated by ML models and avoid prejudicious policy consequences [54].
- ❖
- Technical difficulties and methodological developments: The task of aligning ML models with often highly complex policy objectives is not easy. Challenges such as inclusivity in training datasets and model configurations are strategic whenever they are integrated [53]. Some new approaches, such as causal ML and multiple objective optimizations, can be useful to fill the gap between the current capacity and the policy aims.
4.8. Stakeholder Roles
- ❖
- Governments: provide legal bodies and frameworks concerning the quality of water monitoring, funding, and technological advancement in the field.
- ❖
- NGOS: promote and support stakeholders’ involvement in monitoring water quality and encourage members of the public to participate in the same way.
- ❖
4.9. Data Management and Sharing
4.10. Continuous Improvement
4.11. Comparison with Existing Approaches
4.12. Limitation of Findings
4.13. Exploring Future Research Opportunities
- ❖
- Temperature rise and water quality: Warmer water has a low DO holding capacity; therefore, with increasing temperature, the DO concentration decreases in rivers, which harms fish and other aquatic organisms [31]. Higher temperatures can also influence the development of preferential blooms of toxic algae, further worsening nutrient pollution [58].
- ❖
- Altered precipitations patterns: Increased precipitation intensity results in increased flooding and variations in the process of transporting nutrients and setting sediments [16]. For example, precipitation modifications in the Qu’Appelle River also led to high fluctuations in DO and TDS values [59].
- ❖
- Challenges for prediction models: Climate change as a source of variability makes it challenging to adjust for the accuracy of predictive frameworks since flexibility is seen in the increased concentration of pollutants in the Ganga River, as envisaged in certain scenarios in the future. The fact that the constituents of water, such as nutrients and pathogens, result in mixed reactions makes predictions even more challenging since some involve counterbalancing mechanisms [31]. The effects of climate change on river water quality are apparent; however, some degree of protection from some of the negative effects of climate change is associated with increased stream flow and highlights the importance of sophisticated management initiatives. Recent advancements in applying ML techniques to address climate development and the projection of variable and tight environments have been undertaken, with a stronger emphasis on modeling accuracy and data fusion. These factors, of course, are necessary for predicting the effects of climate change on severe weather conditions.
- ❖
- Machine learning for extreme event attribution: Counteractions of extreme weather events have been developed in recent studies with the help of ML techniques, especially convolutional neural networks. For example, Trok and others have shown how ML can quickly identify that extreme heat is linked to climate change and have recently estimated that temperature increases caused by global warming [60]. It facilitates the identification of historical trends that should be attributed to respective classes and future trends that should be expected.
- ❖
- Data integration and accessibility: Preprocessing repositories of information or what the authors refer to as Deep Extreme Cubes are crucial to the ML process. This database compiles several types of Earth observation data to quantify the effects of climate extremes on ecosystems to enhance biosphere dynamics forecasting [61].
- ❖
- Bias correction in climate models: This method is also used in the improvement of the biases of climate models, which significantly improves the forecasting of intense climate systems. Trok, J.T. et al. proved that ML dramatically decreases systematic biases in large-scale environment simulations, contributing to enhancements in extreme weather perception [62]. However, there are still obstacles present in terms of data quality and model interpretability, which may slow the further development of ML applications in climatology. Solving these problems will be crucial for future work. Artificial intelligence (AI) models have been used routinely in climate-sensitive regions to solve diverse problems associated with climatic change. These models make use of complicated data relationships to improve patient prognosis and guide change plans. The following is a list of some of the remarkable uses of ML in various climate-sensitive sectors.
- ❖
- Precipitation downscaling in the colorado river basin: To downscale precipitation in the Colorado River Basin, a new generative diffusion model was used, which showed comparatively lower error with traditional schematics. This model was based on reanalysis precipitation data to identify the precipitation fields at a fairly high resolution, which benefited climate modeling work in this area [63].
- ❖
- Urban Climate Change Adaptation: AI and ML have been studied for their ability to adapt to climate change in urban areas, with a focus on their effectiveness on different continents. The research focuses on such success stories where AI-ML technologies are applied with the aim of increasing the climate trauma resilience of cities, as the findings indicate, and the application of the approaches differs depending on the characteristics of the urban environment [50].
- ❖
- Multiple risk assessments in the Veneto region for multiple risks from extreme climate events in the Veneto region of Italy were performed, and an AI framework was employed to analyze the scenarios. This approach applied supervised ML algorithms to identify risk vulnerability and exposure to various harms to inform risk management [64].
- ❖
- Agricultural impact assessment in Ethiopia: Climatology and climate change models were combined with ML techniques to study the effects of climate change on the Gilgel Gibe Watershed in Ethiopia. The study employed ML models to forecast future climate trends, thus depicting negative trends in rainfall favorable for agriculture and positive trends in temperature that are important in agricultural practices [60]. However, these applications indicate the ability of ML to support climate change vulnerability in climate-sensitive areas, and data integration issues still persist, as does model accuracy. Ongoing improvements in ML methods and their cooperation are necessary to enhance their potential to solve the challenges posed by climate change implications.
5. Conclusions
5.1. Future Research Directions
- (1)
- Integration of AI and the IoT: AI and the IoT, when integrated into water utility modernization, can significantly enhance closed monitoring and optimization procedures. Decision support systems that utilize artificial intelligence will advance the allocation of resources, overall organizational practices, and sustainability and efficiency in water resource management systems [65,66].
- (2)
- Interdisciplinary Approaches: Subsequent studies should attempt to look at the interrelated nature of the sector to address water insecurity issues, which have interdisciplinary features. This includes bringing perspectives outside hydrological systems, such as energy, food, and climate security, into water management strategies [66].
- (3)
- Smart Water Metering (SWM): Although the uptake of SWM technologies is slow, it is important in the proper management of water. Thus, positive water policies are helpful for increasing the efficiency of water assets and supporting sustainable development in the SWM market [67].
5.2. Interdisciplinary Approaches
- (1)
- Integrated water security solutions: Water security also involves competing and/or mutually reinforcing drivers, such as energy, food, and climate security. The structure and process that break down siloes of specialty can create proof-based policy remedies that are sensitive to local conditions, as shown in studies from the Middle East and Africa [66].
- (2)
- Comparative analysis and knowledge sharing: Benchmarking the water practices of the USA against the water practices of Africa can help in the general improvement of global water policy. The implications of this research thus indicate the necessity of collaboration at the international level and knowledge sharing as preconditions for context-adapted strategies as well as for the creation of sustainable water futures [68,69].
5.3. Policy and Governance
- (1)
- Comprehensive water policies: Water management should consider science, environmental, economic, and cultural policy frameworks for its optimum use. Water scarcity is triggered by population pressure, economic development, and climate change and therefore requires coordinated, intersectoral effort [70].
- (2)
- Considerations of broader impact: It is recommended that the broader impacts of water management research are studied systematically to help in policy formulation. This encompasses taking into account social, ethical, and cultural effects and ensuring that all decisions made are open [71]. Although the application of this study to various aspects of life is encouraged, several difficulties persist regarding the diffusion of knowledge into concrete practice. The proposed versus reported broader impacts in research projects reveal the absence of frameworks that capture and address these impacts, particularly for marginalized populations [72]. Moreover, the slow adoption of technologies such as SWM also implies that adequate policy support and cooperation from all stakeholders are still required for the technology to be adopted fully [67]. Thus, future research and policy can learn from these challenges and optimize technological and interdisciplinary adaptations toward sustainable water management.
5.4. Educational and Awareness Initiatives
5.5. Public Engagement as Significant
- (1)
- Democratic governance: public awareness of the topic of ML creates awareness and makes those developing the technology fully responsible to the public to produce systems that are acceptable to society [73].
- (2)
- Diverse perspectives: communalities enable the consideration of these various views, which, when followed, could result in proper and responsible algorithmic findings [74].
5.6. Educational Campaigns
- (1)
- Awareness and understanding: campaigns can fill the knowledge gaps existing in the minds of various stakeholders by providing explanatory information about the specific ML technologies needed for targeted work, as practice shows in theoretical and empirical studies of science engagement [75].
- (2)
- Skill development: educational interventions may provide people with proper knowledge that would enable them to manage and transition to the use of ML tools in innovation, such as digital marketing [76].
5.7. Case Studies and Examples
6. Implications for Future Technologies
6.1. Integration of AI and IoT
- ❖
- Water quality parameters include temperature, pH, and pollutant levels, with the IoT capable of continuously collecting data and informing appropriate actions when needed [77].
- ❖
- The IoT increases the geographical spread of the space where data are collected, and as a result, the datasets that are available for analysis are broader. This may enhance the realism of the models and, in turn, improve the allocation of resources and prognosis [78].
6.2. Deep Learning Architectures
- ❖
- ❖
- This approach can significantly minimize the need to gather large amounts of data by working from devices already trained, leading to cost-effectiveness and quick river quality prediction [48].
- ❖
- Owing to its ability to employ large databases, deep learning can be used in the prediction of the concentration of contaminants, including fecal indicator bacteria, through a set of sensor-collected physicochemical data [48].
7. Proposed Innovations
- ❖
- IoT and blockchain for real-time monitoring: IoT devices maintain constant observations and measurements, including of pH, turbidity, and the temperature of water, to obtain real-time data on water quality. Such data are protected, and through the use of blockchain, these data are made to be high-quality and permanent, created to increase the level of trust between the different stakeholders [81].
- ❖
- Machine learning for predictive analysis: Other ML techniques based on random forest have been used in previous works to improve water quality predictions with good accuracy [82]. When trained with ML and accompanied by blockchain, the prediction process can then be safeguarded, and only those authorized for data change can do so [82].
- ❖
- Enforcement and compliance: This shows that with blockchain, compliance can be managed by recording instant violations and hence promptly penalizing industries with water source pollution. Thus, this integration not only increases accountability but also helps in observing environmental legislation. Closely related to the main advantages of ML together with blockchain, the existing issues include data privacy and the difficulty of implementing such a system. To enable such integrated systems to take root in the management of water, these challenges must be addressed [83].
- ❖
- AI-Driven scenario modeling: Generative AI includes tools such as the python toolbox introduced for water distribution networks (WDNs), which helps generate the hydraulic and water quality scenarios required for researchers to simulate complexities such as contamination and leakage. Clinical hyporheic-zone biogeochemical activity has been successfully simulated via conditional deep convolutional generative adversarial networks (cDC-GANs) without the need for significant parameterization [84].
- ❖
- Climate change impact assessment: Machine learning and AI models such as random forest have been used to forecast climate change effects on water quality and quantity more than physically based models do. These models interpret weather patterns to forecast the actions of river flow and sediment load on the basis of climate, with demonstrations of the versatility of AI for such analyses [85].
- ❖
- Risk evaluation and decision support: Advanced AI-based solutions, including the s-WQI, form the basis for complex methodologies to assess water quality with leaching periods, taking into account the seasonal changes and estimating the existing risks via the Monte Carlo method. AI helps to improve decision-making related to water quality assessments, offering the optimization of water management and early signs of a negative shift.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Essien-Ibok, M.A.; George, U.U.; Titus, D.I. Assessing ecosystem health: A comparative study using water quality index analysis across ten lotic system in the cross river system. J. Geogr. Environ. Earth Sci. Int. 2024, 28, 96–113. [Google Scholar] [CrossRef]
- Li, H.; Xue, C.; Song, L.; Yu, Z.; Zhang, J.; Xiao, K. Application of comprehensive water quality labeling index method in water quality evaluation of Xiangjiang river. In Proceedings of the 2023 4th International Conference on Computer, Big Data and Artificial Intelligence (ICCBD+AI), Guiyang, China, 15–17 December 2023; IEEE: New York, NY, USA, 2023; pp. 147–151. [Google Scholar]
- Zaman, M.U.; Pandit, B.A.; Singh, R.; Zulfiya, M. Evaluating water health: A review of varied water quality index computation techniques. Int. J. Res. Appl. Sci. Eng. Technol. 2023, 11, 716–720. [Google Scholar] [CrossRef]
- Jeong, D.Y.; Choi, Y.H.; Kim, K.H. Evaluation of the water quality of Nakong river using the real-time water quality index. J. Korean Soc. Hazard Mitig. 2024, 24, 199–207. [Google Scholar] [CrossRef]
- Najafzadeh, M.; Basirian, S. Evaluation of river water quality index using remote sensing and artificial intelligence models. Remote Sens. 2023, 15, 2359. [Google Scholar] [CrossRef]
- Harmel, R.D.; Preisendanz, H.E.; King, K.W.; Busch, D.; Birgand, F.; Sahoo, D. A review of data quality and cost considerations for water quality monitoring at the field scale and in small watersheds. Water 2023, 15, 3110. [Google Scholar] [CrossRef]
- Simonoska, E.; Bogatinoska, D.C.; Dimitrievski, I.; Malekian, R. Sensor system for real-time water quality monitoring. In Proceedings of the 2023 46th MIPRO ICT and Electronics Convention (MIPRO), Opatija, Croatia, 22–26 May 2023; IEEE: New York, NY, USA, 2023; pp. 114–119. [Google Scholar]
- Khan, G.; Ali, W.; Qureshi, M.H.; Kumar Gola, K.; Chauhan, A. Water quality monitoring system using wireless sensor networks. In Proceedings of the 2023 6th International Conference on Contemporary Computing and Informatics (IC3I), Gautam Buddha Nagar, India, 14–16 September 2023; IEEE: New York, NY, USA, 2023; pp. 239–245. [Google Scholar]
- Thakur, A.; Devi, P. A comprehensive review on water quality monitoring devices: Materials advances, current status, and future perspective. Crit. Rev. Anal. Chem. 2024, 54, 193–218. [Google Scholar] [CrossRef]
- Geetha, T.S.; Gandhimathi, G.; Chellaswamy, C.; Thiruvalar Selvan, P. comprehensive river water quality monitoring using convolutional neural networks and gated recurrent units: A case study along the Vaigai river. J. Env. Manag. 2024, 365, 121567. [Google Scholar] [CrossRef]
- Il Kim, H.; Kim, D.; Mahdian, M.; Salamattalab, M.M.; Bateni, S.M.; Noori, R. Incorporation of water quality index models with machine learning-based techniques for real-time assessment of aquatic ecosystems. Environ. Pollut. 2024, 355, 124242. [Google Scholar] [CrossRef]
- Khaskheli, S.A.; Ahmed Rahu, M.; Siraj, S.; Jamshed, H.; Iqbal, S. Optimized water quality forecasting using machine learning. Int. J. Inf. Syst. Comput. Technol. 2024, 3, 46–60. [Google Scholar] [CrossRef]
- Nitya, N.J. Computational machine learning analytics for prediction of water quality. Commun. Appl. Nonlinear Anal. 2024, 31, 448–465. [Google Scholar] [CrossRef]
- Khoi, D.N.; Quan, N.T.; Linh, D.Q.; Nhi, P.T.T.; Thuy, N.T.D. Using machine learning models for predicting the water quality index in the La Buong River, Vietnam. Water 2022, 14, 1552. [Google Scholar] [CrossRef]
- Mokarram, M.; Pourghasemi, H.R.; Pham, T.M. Enhancing water quality monitoring through the integration of deep learning neural networks and fuzzy method. Mar. Pollut. Bull. 2024, 206, 116698. [Google Scholar] [CrossRef] [PubMed]
- Tefera, G.W.T.; Ray, R.L.; Singh, V.P. Simulating the effect of climate change scenarios on surface water quality in the bosque watershed, central texas, united states. Res. Sq. 2023. [Google Scholar] [CrossRef]
- Koleva, R.; Zaev, E.; Babunski, D.; Stefanovski, D.; Rath, G. Computational methods for water quality index calculation using real-time measurement system. In Proceedings of the 2024 13th Mediterranean Conference on Embedded Computing (MECO), Budva, Montenegro, 11–14 June 2024; IEEE: New York, NY, USA, 2024; pp. 1–4. [Google Scholar]
- Jacobs, D.; McDaniel, T.; Varsani, A.; Halden, R.U.; Forrest, S.; Lee, H. Wastewater monitoring raises privacy and ethical considerations. IEEE Trans. Technol. Soc. 2021, 2, 116–121. [Google Scholar] [CrossRef]
- Uddin, M.G.; Rahman, A.; Rosa Taghikhah, F.; Olbert, A.I. Data-driven evolution of water quality models: An in-depth investigation of innovative outlier detection approaches-a case study of irish water quality index (IEWQI) model. Water Res. 2024, 255, 121499. [Google Scholar] [CrossRef]
- Zheng, Y.; Wei, J.; Zhang, W.; Zhang, Y.; Zhang, T.; Zhou, Y. An ensemble model for accurate prediction of key water quality parameters in river based on deep learning methods. J. Env. Manag. 2024, 366, 121932. [Google Scholar] [CrossRef]
- del Castillo, A.F.; Garibay, M.V.; Díaz-Vázquez, D.; Yebra-Montes, C.; Brown, L.E.; Johnson, A.; Garcia-Gonzalez, A.; Gradilla-Hernández, M.S. Improving river water quality prediction with hybrid machine learning and temporal analysis. Ecol. Inf. 2024, 82, 102655. [Google Scholar] [CrossRef]
- Ejaz, U.; Khan, S.M.; Jehangir, S.; Ahmad, Z.; Abdullah, A.; Iqbal, M.; Khalid, N.; Nazir, A.; Svenning, J.C. Monitoring the industrial waste polluted stream—integrated analytics and machine learning for water quality index assessment. J. Clean. Prod. 2024, 450, 141877. [Google Scholar] [CrossRef]
- Sidek, L.M.; Mohiyaden, H.A.; Marufuzzaman, M.; Noh, N.S.M.; Heddam, S.; Ehteram, M.; Kisi, O.; Sammen, S.S. Developing an ensembled machine learning model for predicting water quality index in johor river basin. Env. Sci. Eur. 2024, 36, 67. [Google Scholar] [CrossRef]
- Lin, Z.; Lim, J.Y.; Oh, J.M. Innovative interpretable AI-guided water quality evaluation with risk adversarial analysis in river streams considering spatial-temporal effects. Environ. Pollut. 2024, 350, 124015. [Google Scholar] [CrossRef]
- Asadollah, S.B.H.S.; Sharafati, A.; Motta, D.; Yaseen, Z.M. River water quality index prediction and uncertainty analysis: A comparative study of machine learning models. J. Env. Chem. Eng. 2021, 9, 104599. [Google Scholar] [CrossRef]
- Tabassum, S.; Kotnala, C.B.; Masih, R.K.; Shuaib, M.; Alam, S.; Alar, T.M. Performance analysis of machine learning techniques for predicting water quality index using physiochemical parameters. In Proceedings of the 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS), Coimbatore, India, 14–16 June 2023; IEEE: New York, NY, USA, 2023; pp. 372–377. [Google Scholar]
- Nguyen, D.T.; Luong, T.L.; Nguyen, N.M.; Nguyen, T.Q.; Le, N.T.N.; Truong, C.D.; Nguyen, T.L.; Ho, V.L.; Dinh, Q.K.; Huynh, D.C.; et al. Monitoring water quality parameters in aquaculture using edge computing. Vietnam J. Catal. Adsorpt. 2024, 12, 135–140. [Google Scholar] [CrossRef]
- Ismail, R.; Rawashdeh, A.; Al-Mattarneh, H.; Hatamleh, R.; Telfah, D.B.; Jaradat, A. Artificial intelligence for application in water engineering: The use of ann to determine water quality index in rivers. Civ. Eng. J. 2024, 10, 2261–2274. [Google Scholar] [CrossRef]
- Aldrees, A.; Khan, M.; Taha, A.T.B.; Ali, M. Evaluation of water quality indexes with novel machine learning and SHapley additive ExPlanation (SHAP) approaches. J. Water Process Eng. 2024, 58, 104789. [Google Scholar] [CrossRef]
- Tetali, R.R.; Salomi, K.; Gope, E.R. Analysis of water quality parameters across diverse sources. J. Pharma Insights Res. 2024, 2, 210–216. [Google Scholar] [CrossRef]
- Khalil, M.; Al-Haija, Q.A. Ethical machine learning for internet of things network. In Ethical Artificial Intelligence in Power Electronics; CRC Press: New York, NY, USA, 2024; pp. 12–20. [Google Scholar]
- Amador-Castro, F.; González-López, M.E.; Lopez-Gonzalez, G.; Garcia-Gonzalez, A.; Díaz-Torres, O.; Carbajal-Espinosa, O.; Gradilla-Hernández, M.S. Internet of things and citizen science as alternative water quality monitoring approaches and the importance of effective water quality communication. J. Env. Manag. 2024, 352, 119959. [Google Scholar] [CrossRef]
- Kaswan, P.; Marzban, M.F.; Nam, W.; Akkarakaran, S.; Luo, T. Statistical AI/ML model monitoring for 5G/6G: Interference prediction case study. In Proceedings of the 2024 IEEE International Conference on Communications Workshops (ICC Workshops), Denver, CO, USA, 9–13 June 2024; IEEE: New York, NY, USA, 2024; pp. 638–643. [Google Scholar]
- Karasaki, S.; Morello-Frosch, R.; Callaway, D. Machine learning for environmental justice: Dissecting an algorithmic approach to predict drinking water quality in California. Sci. Total Environ. 2024, 951, 175730. [Google Scholar] [CrossRef]
- Chakraborty, S. Towards a comprehensive assessment of AI’s environmental impact. arXiv 2024. [Google Scholar] [CrossRef]
- Torres-Martínez, J.A.; Mahlknecht, J.; Kumar, M.; Loge, F.J.; Kaown, D. Advancing groundwater quality predictions: Machine learning challenges and solutions. Sci. Total Environ. 2024, 949, 174973. [Google Scholar] [CrossRef]
- Huang, S.; Xia, J.; Wang, Y.; Lei, J.; Wang, G. Water quality prediction based on sparse dataset using enhanced machine learning. Environ. Sci. Ecotechnol. 2024, 20, 100402. [Google Scholar] [CrossRef]
- Sani, S.A.; Ibrahim, A.; Musa, A.A.; Dahiru, M.; Baballe, M.A.; Sadiku, N.; Sani, A. Drawbacks of traditional environmental monitoring systems corresponding author. TMP Univers. J. Res. Rev. Arch. 2023, 2, 2583–7214. [Google Scholar]
- Zainurin, S.N.; Wan Ismail, W.Z.; Mahamud, S.N.I.; Ismail, I.; Jamaludin, J.; Ariffin, K.N.Z.; Wan Ahmad Kamil, W.M. Advancements in monitoring water quality based on various sensing methods: A systematic review. Int. J. Env. Res. Public Health 2022, 19, 14080. [Google Scholar] [CrossRef] [PubMed]
- Azha, S.F.; Sidek, L.M.; Ahmad, Z.; Zhang, J.; Basri, H.; Zawawi, M.H.; Noh, N.M.; Ahmed, A.N. Enhancing river health monitoring: Developing a reliable predictive model and mitigation plan. Ecol. Indic. 2023, 156, 111190. [Google Scholar] [CrossRef]
- Popescu, S.M.; Mansoor, S.; Wani, O.A.; Kumar, S.S.; Sharma, V.; Sharma, A.; Arya, V.M.; Kirkham, M.B.; Hou, D.; Bolan, N.; et al. Artificial intelligence and IoT driven technologies for environmental pollution monitoring and management. Front. Env. Sci 2024, 12, 1336088. [Google Scholar] [CrossRef]
- Namour, P.; Clement, Y.; Bernard, C.; Lyon, U.; Breil, P.; Lanteri, P. Environmental monitoring and water safety are claiming innovative tools. In Proceedings of the 8th International Conference Novatech 2013 (Event), Lyon, France, 23–27 June 2013; p. 8. [Google Scholar]
- Md. Islam, J.; Salekin, S.U.; Abdullah, M.S.; Zaman, N.; Khan, A.A.A. Evaluation of water quality assessment through machine learning: A water quality index-based approach. Res. Sq. 2024. [Google Scholar] [CrossRef]
- Shaheed, H.; Zawawi, M.H.; Hayder, G. Water quality index classification of southeast, south and west asia rivers using machine learning algorithms. J. Ecohumanism 2024, 3, 2752–6801. [Google Scholar] [CrossRef]
- Benzerra, A.; Cherrared, M.; Chocat, B.; Cherqui, F.; Zekiouk, T. Decision support for sustainable urban drainage system management: A case study of Jijel, Algeria. J. Env. Manag. 2012, 101, 46–53. [Google Scholar] [CrossRef]
- Camara, M.; Jamil, N.R.; Abdullah, A.F. Bin impact of land uses on water quality in malaysia: A review. Ecol. Process. 2019, 8, 10. [Google Scholar] [CrossRef]
- Satish, N.; Anmala, J.; Rajitha, K.; Varma, M.R.R. A stacking ANN ensemble model of ml models for stream water quality prediction of Godavari River basin, India. Ecol. Inf. 2024, 80, 102500. [Google Scholar] [CrossRef]
- Lv, M.; Niu, X.; Zhang, D.; Ding, H.; Lin, Z.; Zhou, S.; Zhu, Y. A data-driven framework for spatiotemporal analysis and prediction of river water quality: A case study in Pearl River, China. Water 2023, 15, 257. [Google Scholar] [CrossRef]
- Jiao, J.; Ma, Q.; Huang, S.; Liu, F.; Wan, Z. A hybrid water quality prediction model based on variational mode decomposition and bidirectional gated recursive unit. Water Sci. Technol. 2024, 89, 2273–2289. [Google Scholar] [CrossRef]
- Srivastava, A.; Maity, R. Assessing the potential of AI–ML in urban climate change adaptation and sustainable development. Sustainability 2023, 15, 16461. [Google Scholar] [CrossRef]
- Santy, S.; Mujumdar, P.; Bala, G. Projections of water quality along an industrialized stretch of Ganga River for the Mid-21st Century. In Proceedings of the AGU Fall Meeting 2021, New Orleans, LA, USA, 13–17 December 2021. [Google Scholar]
- Verhagen, M.D. Using Machine Learning to Study Effect Heterogeneity in Large-Scale Policy Interventions: The Dutch Decentralisation of the Social Domain. SocArXiv 2024. [Google Scholar] [CrossRef]
- Fischer-Abaigar, U.; Kern, C.; Barda, N.; Kreuter, F. Bridging the gap: Towards an expanded toolkit for AI-driven decision-making in the public sector. Gov. Inf. Q. 2024, 41, 101976. [Google Scholar] [CrossRef]
- Rehill, P.; Biddle, N. Fairness implications of heterogeneous treatment effect estimation with machine learning methods in policy-making. arXiv 2023. [Google Scholar] [CrossRef]
- Iancu, G.; Ciolofan, S.N.; Drăgoicea, M. Real-time IoT architecture for water management in smart cities. Discov. Appl. Sci. 2024, 6, 191. [Google Scholar] [CrossRef]
- Mohanasundaram, R.; Sagar, R.K.; Khandelwal, A.; Upadhyay, D.; Poddar, H.; Rajagopal, S. Water quality monitoring and control in urban areas in real-time via IoT and mobile applications. In Proceedings of the 2024 3rd International Conference on Artificial Intelligence for Internet of Things (AIIoT), Vellore, India, 3–4 May 2024; IEEE: New York, NY, USA, 2024; pp. 1–5. [Google Scholar]
- Nishan, R.K.; Akter, S.; Sony, R.I.; Hoque, M.M.; Anee, M.J.; Hossain, A. Development of an IoT-based multi-level system for real-time water quality monitoring in industrial wastewater. Discov. Water 2024, 4, 43. [Google Scholar] [CrossRef]
- Dorado-Guerra, D.Y.; Paredes-Arquiola, J.; Pérez-Martín, M.Á.; Corzo-Pérez, G.; Ríos-Rojas, L. Effect of climate change on the water quality of mediterranean rivers and alternatives to improve its status. J. Env. Manag. 2023, 348, 119069. [Google Scholar] [CrossRef]
- Hassanjabbar, A.; Nezaratian, H.; Wu, P. Climate change impacts on the flow regime and water quality indicators using an artificial neural network (ANN): A case study in Saskatchewan, Canada. J. Water Clim. Change 2022, 13, 3046–3060. [Google Scholar] [CrossRef]
- Bojer, A.K.; Woldetsadik, M.; Biru, B.H. Machine learning and CORDEX-Africa regional model for assessing the impact of climate change on the Gilgel Gibe Watershed, Ethiopia. J. Env. Manag. 2024, 363, 121394. [Google Scholar] [CrossRef]
- Ji, C.; Fincke, T.; Benson, V.; Camps-Valls, G.; Fernandez-Torres, M.A.; Gans, F.; Kraemer, G.; Martinuzzi, F.; Montero, D.; Mora, K.; et al. DeepExtremeCubes: Integrating Earth System Spatio-Temporal Data for Impact Assessment of Climate Extremes. arXiv 2024. [Google Scholar] [CrossRef]
- Trok, J.T.; Barnes, E.A.; Davenport, F.V.; Diffenbaugh, N.S. Machine learning–based extreme event attribution. Sci. Adv. 2024, 10, eadl3242. [Google Scholar] [CrossRef] [PubMed]
- Saoulis, A.; Moraga, S.; Lord, N.; Uhe, P.; Addor, N. Application of Novel Generative Diffusion Models to Precipitation Downscaling. In Proceedings of the European Geosciences Union General Assembly 2024 (EGU24), Vienna, Austria, 14–19 April 2024; p. 19266. [Google Scholar]
- Ferrario, D.M.; Sano, M.; Tiggeloven, T.; Claasen, J.; Petrovska, E.; Maraschini, M.; de Ruiter, M.; Torresan, S.; Critto, A. Artificial intelligence for climate change multi-risk assessment: A myriad-EU case study in the Veneto Region. In Proceedings of the European Geosciences Union General Assembly 2024 (EGU24), Vienna, Austria, 14–19 April 2024; p. EGU24-16585. [Google Scholar]
- Elshaikh, A.; Mabrouki, J.; Mohamed, A.A.O. The future of water management. In Advancements in Climate and Smart Environment Technology; IGI Global: Hershey, PA, USA, 2024; pp. 213–225. [Google Scholar]
- Ismail, S.; Dawoud, D.W.; Ismail, N.; Marsh, R.; Alshami, A.S. IoT-based water management systems: Survey and future research direction. IEEE Access 2022, 10, 35942–35952. [Google Scholar] [CrossRef]
- Msamadya, S.; Joo, J.C.; Lee, J.M.; Choi, J.S.; Lee, S.; Lee, D.J.; Go, H.W.; Jang, S.Y.; Lee, D.H. Role of water policies in the adoption of smart water metering and the future market. Water 2022, 14, 826. [Google Scholar] [CrossRef]
- Nwokediegwu, Z.Q.S.; Adefemi, A.; Ayorinde, O.B.; Ilojianya, V.I.; Etukudoh, E.A. Review of water policy and management: Comparing the USA and Africa. Eng. Sci. Technol. J. 2024, 5, 402–411. [Google Scholar] [CrossRef]
- Tóth, T.; Matias Silva, B. Development directions of water management by comparing Rio Grande do Norte to Hungary. Belügyi. Szle. 2021, 69, 53–67. [Google Scholar] [CrossRef]
- Bekele, D.A.; Bona, S.K.; Haji, S.H. Water policy for sustainable management: A review. J. Resour. Dev. Manag. 2021, 72, 2422–8397. [Google Scholar] [CrossRef]
- Dassler, T.; Myhr, A.I.; Lalyer, C.R.; Frieß, J.L.; Spök, A.; Liebert, W.; Hagen, K.; Engelhard, M.; Giese, B. Structured analysis of broader GMO impacts inspired by technology assessment to inform policy decisions. Agric. Hum. Values 2024, 41, 449–458. [Google Scholar] [CrossRef]
- Woodson, T.; Boutilier, S. From intent to impact—The decline of broader impacts throughout an NSF project life cycle. Res. Eval. 2023, 32, 348–355. [Google Scholar] [CrossRef]
- Gilman, M.E. Democratizing AI: Principles for meaningful public participation. SSRN Electron. J. 2023. [Google Scholar] [CrossRef]
- Pallett, H.; Price, C.; Chilvers, J.; Burall, S. Just public algorithms: Mapping public engagement with the use of algorithms in UK public services. Big Data Soc. 2024, 11, 20539517241235867. [Google Scholar] [CrossRef]
- Crettaz von Roten, F. Broadening the audience for science engagement with machine-learning techniques. Front. Commun. 2024, 9, 1382952. [Google Scholar] [CrossRef]
- Miklosik, A.; Kuchta, M.; Evans, N.; Zak, S. Towards the adoption of machine learning-based analytical tools in digital marketing. IEEE Access 2019, 7, 85705–85718. [Google Scholar] [CrossRef]
- Hanoon, S.K.; Abdullah, A.F.; Shafri, H.Z.M.; Wayayok, A. A novel approach based on machine learning and public engagement to predict water-scarcity risk in urban areas. ISPRS Int. J. Geoinf. 2022, 11, 606. [Google Scholar] [CrossRef]
- Balachandran, T.; Abreu, T.; Naloufi, M.; Souihi, S.; Lucas, F.; Janne, A. IoT and transfer learning based urban river quality prediction. In Proceedings of the 2022 IEEE Global Communications Conference (GLOBECOM 2022), Rio de Janeiro, Brazil, 4–8 December 2022; IEEE: New York, NY, USA, 2022; pp. 257–262. [Google Scholar]
- Al-Rawas, G.; Nikoo, M.R.; Al-Wardy, M.; Etri, T. A Critical review of emerging technologies for flash flood prediction: Examining artificial intelligence, machine learning, internet of things, cloud computing, and robotics techniques. Water 2024, 16, 2069. [Google Scholar] [CrossRef]
- Yang, Y. Current trends in deep learning. Adv. Eng. Technol. Res. 2023, 5, 422. [Google Scholar] [CrossRef]
- Samanta, S.; Sarkar, A. IoT and blockchain for smart water quality management in future cities: A Hyperledger fabric framework for smart water quality management and distribution. Res. Sq. 2023. [Google Scholar] [CrossRef]
- Sharma, A.; Sharma, R.; Rana, R.; Kalia, A. Water quality prediction using machine learning models. E3S Web Conf. 2024, 596, 01025. [Google Scholar] [CrossRef]
- Alharbi, N.; Althagafi, A.; Alshomrani, O.; Almotiry, A.; Alhazmi, S. A blockchain based secure IoT solution for water quality management. In Proceedings of the 2021 International Congress of Advanced Technology and Engineering (ICOTEN), Taiz, Yemen, 4–5 July 2021; IEEE: New York, NY, USA, 2021; pp. 1–8. [Google Scholar]
- Artelt, A.; Kyriakou, M.S.; Vrachimis, S.G.; Eliades, D.G.; Hammer, B.; Polycarpou, M.M. A toolbox for supporting research on ai in water distribution networks. arXiv 2024. [Google Scholar] [CrossRef]
- Moezzi, S.M.M.; Abousaeidi, A.; Khorashadi Zadeh, F.; Sheikholeslami, R.; Nkwasa, A.; van Griensven, A. Projecting Impacts of Climate Change on Water Quality and Quantity Using an AI-Based Model as an Innovative Alternative to Physically-Based Models. In Proceedings of the EGU General Assembly 2024, Vienna, Austria, 14–19 April 2024; p. EGU24-21915. [Google Scholar]
Reference | Method/Model Used | Results | |
---|---|---|---|
[6] | (Harmel et al., 2023) | Traditional water quality assessment. | Perceived weaknesses in non-auto sampling that included high costs, labor and time intensity, and inadequate coverage. |
[10] | (Geetha, T.S. et al., 2024) | Convolutional neural networks (CNNs) and gated recurrent units (GRUs). | Transaction and real-time monitoring accuracy on the Vaigai River was validated, achieving a level of 97.86%. |
[11] | (Kim et al., 2024) | In the presented work, monthly WQI values are predicted using the long short-term memory (LSTM) model. | Achieved a superior accuracy with an R2 of 0.91 which suggested more accuracy as compared to support vector regression (SVR) and random forest models. |
[12] | (Khaskheli et al., 2024) | The optimized machine learning models for water quality forecasting. | Applied gradient boosting to achieve a mean absolute error (MAE) = 1.8074 and proved that feature selection is highly relevant. |
[13] | (Nitya Nand Jha, 2024) | Generally, random forest for classification. | Observed a 99.50% accuracy in the classification of actual and potential water quality classes using physicochemical parameters. |
[14] | (Khoi et al., 2022a) | The machine learning algorithms that can be implemented for these data comprise a gradient boosting model and polynomial regression. | Showed good accuracy in estimating WQI values in river systems. |
[15] | (Mokarram et al., 2024) | Fuzzy neural networks. | Combined deep learning with fuzzy techniques to improve SWQRs for water quality in polluted areas. |
[16] | (Tefera et al., 2023) | In this paper, a deep learning model with feature selection is proposed for disease diagnosis Ghana as a case study. | The integration of random forest with AdaBoost improved interpretability and the accuracy of the chosen model in a real-time monitoring environment. |
[17] | (Koleva et al., 2024) | The Internet of Things (Internet of Things systems). | Facilitated the instant computation of the WQI in real time with sensors for turbidity, pH, and DO in water bodies. |
[18] | (Jacobs et al., 2021) | The fourth topic is ethical consideration in AI/ML models. | Sparred on data privacy and equality issues regarding the use of the IoT and machine learning with applications in water management. |
[19] | (Uddin et al., 2024) | This paper presents an approach for outlier detection with isolation forest. | Emphasized the need to address noisy data for improved estimates of water quality prediction models in particular. |
[20] | (Zheng et al., 2024) | Ensemble deep learning models. | Implemented a spatial–temporal attention mechanism for better prediction accuracy up to 22%. |
[21] | (del Castillo et al., 2024) | The ANFIS model as a combination of an ANFIS, an ANN, and an SVM. | Improved accuracy in estimating dissolved oxygen and pH in various conditions in water environments. |
[22] | (Ejaz et al., 2024) | As the names suggest, random forest and gradient boosting are also another two important algorithms for machine learning. | Quitted with high proficiency rates by achieving a training R2 of 0.88 and a testing R2 of 0.85. |
[23] | (Sidek et al., 2024) | The analysis of ensemble models for water quality prediction. | They also established the efficiency of predicting the WQI for the Johor River Basin through hybrid techniques. |
[24] | (Lin et al., 2024) | This paper delves into the exploration of Explainable AI using SHAP analysis. | Highlighted WQI prediction model features and contributed to understanding key parameters which affect the model’s results. |
Criterion | Traditional Methods | AI- and Machine Learning-Based Methods | Improvement Percentage Using AI | References |
---|---|---|---|---|
Annual Operating Cost (per monitoring point) | More expansive than machine learning-based methods | Fewer costs | 60–75% reduction in cost | (Zainurin, et al., 2022) [39] |
Geographical Coverage (monitoring stations per 100 km2) | 2–5 stations | 15–50 IoT sensors | 500–900% improvement in coverage | (Lv et al., 2023) [48] |
Response Time for Pollution Detection | 12–48 h (lab-based analysis) | 1–5 min (real-time AI analysis) | Over 99% improvement in detection speed | (Trok et al., 2024) [62] |
Quality Measurement Accuracy (pollutant measurement error rate) | ±15–25% | ±2–5% | 80–90% improvement in accuracy | (Amador-Castro, et al., 2024) [32] |
Monitoring Continuity | Intermittent (once or twice per month) | Continuous (24/7) | 100% improvement in data continuity | (Khoi et al., 2022) [14] |
Early Pollution Detection Capability | 30–50% of cases detected after damage occurs | 85–98% of cases detected in real time | 70–90% improvement in early detection | (Khoi Huang et al., 2024) [37] |
Equipment Cost per Monitoring Station | More expansive than machine learning-based methods | Fewer costs | 60–80% reduction in equipment cost | (Trok et al., 2024) [62] |
Logistical and Environmental Constraints | Difficult access to some areas | Deployable self-operating sensors and drones | Significant improvement in operational flow | (Nguyen et al., 2024) [27] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shaheed, H.; Zawawi, M.H.; Hayder, G. The Development of a River Quality Prediction Model That Is Based on the Water Quality Index via Machine Learning: A Review. Processes 2025, 13, 810. https://doi.org/10.3390/pr13030810
Shaheed H, Zawawi MH, Hayder G. The Development of a River Quality Prediction Model That Is Based on the Water Quality Index via Machine Learning: A Review. Processes. 2025; 13(3):810. https://doi.org/10.3390/pr13030810
Chicago/Turabian StyleShaheed, Hassan, Mohd Hafiz Zawawi, and Gasim Hayder. 2025. "The Development of a River Quality Prediction Model That Is Based on the Water Quality Index via Machine Learning: A Review" Processes 13, no. 3: 810. https://doi.org/10.3390/pr13030810
APA StyleShaheed, H., Zawawi, M. H., & Hayder, G. (2025). The Development of a River Quality Prediction Model That Is Based on the Water Quality Index via Machine Learning: A Review. Processes, 13(3), 810. https://doi.org/10.3390/pr13030810