Prediction, Uncertainty Quantification, and ANN-Assisted Operation of Anaerobic Digestion Guided by Entropy Using Machine Learning
Abstract
1. Introduction
- Industrial-scale dataset and reproducible evaluation pipeline. A six-month, 9823-sample dataset is constructed from a full-scale AD plant. A unified pipeline—including data cleaning, anomaly removal, normalization, temporal K–S splitting, five-fold cross-validation, and rolling window evaluation—ensures data reliability and model generalizability.
- Entropy-aware machine learning and interpretable validation. The ANN outperforms SVM and RF in predicting biogas yield, temperature, and VFA. Beyond accuracy metrics, error entropy is introduced to characterize predictive uncertainty. Feed solids, organic matter, and feed rate are consistently identified as the dominant variables through feature importance and entropy increase analysis.
- ANN-assisted operation deployment and process entropy reduction. The optimized ANN is embedded into a real-time feedback loop (“sensor → prediction → programmable logic controller (PLC) → feedback”), reducing gas-yield fluctuation from ±18% to ±5% and improving process stability by approximately 23%. This improvement is accompanied by a measurable reduction in process entropy, demonstrating enhanced system order, energy efficiency, and low-carbon potential.
2. Materials and Methods
2.1. Data Sources and Preprocessing
2.1.1. Data Acquisition and Anaerobic Digestion Process
2.1.2. Data Preprocessing Methods
2.2. Machine Learning Model Design and Evaluation Metrics
2.2.1. Support Vector Machine (SVM)
2.2.2. Random Forest (RF)
2.2.3. Artificial Neural Networks (ANNs)
2.3. Model Evaluation Metrics
- Classification metrics include Precision (Equation (11)), Recall (Equation (11)), and F1-score (Equation (11)) to balance prediction reliability under class imbalance; AUROC complements these by evaluating threshold-independent robustness.
- Regression performance was quantified by Root Mean Square Error (RMSE) (Equation (12)) for biogas yield (m3/t), temperature (°C), and VFA (g/L); lower RMSE indicates stronger predictive capability and generalization.
2.4. Entropy-Based Uncertainty Quantification Method
2.4.1. Error Entropy for Prediction Uncertainty
2.4.2. Entropy Increase for Feature Contribution
2.4.3. Process Entropy for Operational Stability
3. Results and Analysis
3.1. Model Performance Comparison
3.2. Feature Importance and Entropy-Based Uncertainty Analysis
3.3. Application of ANN-Based Intelligent Operation and Monitoring
3.4. Limitations and Outlook
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Jiang, Y.; Zhang, Y.; Li, H. Research progress and analysis on comprehensive utilization of livestock and poultry biogas slurry as agricultural resources. Agriculture 2023, 13, 2216. [Google Scholar] [CrossRef]
- Kumar, D.J.P.; Mishra, R.K.; Chinnam, S.; Binnal, P.; Dwivedi, N. A comprehensive study on anaerobic digestion of organic solid waste: A review on configurations, operating parameters, techno-economic analysis and current trends. Biotechnol. Notes 2024, 5, 33–49. [Google Scholar] [CrossRef]
- Zhao, J.; Ren, S.; Li, C.; Jiao, M.; Wu, G.; Chen, H. Research progress and perspectives of biogas production from municipal organic solid waste. Int. J. Chem. React. Eng. 2024, 22, 219–230. [Google Scholar] [CrossRef]
- Yuan, Q.; Lou, Y.; Wu, J.; Sun, Y. Long-term semi-continuous acidogenic fermentation for food wastes treatment: Effect of high organic loading rates at low hydraulic retention times and uncontrolled pH conditions. Bioresour. Technol. 2022, 357, 127356. [Google Scholar] [CrossRef]
- Wang, S.; Li, D.; Zhang, K.; Ma, Y.; Liu, F.; Li, Z.; Gao, X.; Gao, W.; Du, L. Effects of initial volatile fatty acid concentrations on process characteristics, microbial communities, and metabolic pathways on solid-state anaerobic digestion. Bioresour. Technol. 2023, 369, 128461. [Google Scholar] [CrossRef] [PubMed]
- Rutland, H.; You, J.; Liu, H.; Bull, L.; Reynolds, D. A systematic review of machine-learning solutions in anaerobic digestion. Bioengineering 2023, 10, 1410. [Google Scholar] [CrossRef] [PubMed]
- Zhai, S.; Chen, K.; Yang, L.; Li, Z.; Yu, T.; Chen, L.; Zhu, H. Applying machine learning to anaerobic fermentation of waste sludge using two targeted modeling strategies. Sci. Total Environ. 2024, 916, 170232. [Google Scholar] [CrossRef] [PubMed]
- Gan, E.Y.T.; Chan, Y.J.; Wan, Y.K.; Tiong, T.J.; Chong, W.C.; Lim, J.W. Examining the synergistic effects through machine learning prediction and optimisation in the anaerobic co-digestion of palm oil mill effluent and decanter cake with economic analysis. J. Clean. Prod. 2024, 437, 140666. [Google Scholar] [CrossRef]
- Delory, F.S.; Neubauer, P.; Weinrich, S. Uncertainty analysis of a simplified anaerobic digestion model applied to dynamic agricultural experimental data. Water Sci. Technol. 2025, 92, 610–634. [Google Scholar] [CrossRef]
- Yildirim, O.; Ozkaya, B. Prediction of biogas production of industrial-scale anaerobic digestion plant by machine learning algorithms. Chemosphere 2023, 335, 138976. [Google Scholar] [CrossRef]
- Ganeshan, P.; Bose, A.; Lee, J.; Barathi, S.; Rajendran, K. Machine learning for high solid anaerobic digestion: Performance prediction and optimization. Bioresour. Technol. 2024, 400, 130665. [Google Scholar] [CrossRef]
- Wen, C.; Li, R.; Zhao, C.; Chen, L.; Wang, M.; Yin, Y.; Meng, Z. An improved LSTM-based model for identifying high working-intensity load segments of the tractor load spectrum. Comput. Electron. Agric. 2023, 210, 107879. [Google Scholar] [CrossRef]
- Haas, C.; Budin, C.; d’Arcy, A. How to select oil price prediction models—The effect of statistical and financial performance metrics and sentiment scores. Energy Econ. 2024, 133, 107466. [Google Scholar] [CrossRef]
- Farzin, F.; Moghaddam, S.S.; Ehteshami, M. Auto-tuning data-driven model for biogas yield prediction from anaerobic digestion of sewage sludge at the South-Tehran wastewater treatment plant: Feature selection and hyperparameter population-based optimization. Renew. Energy 2024, 227, 120554. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Heaton, J. An empirical analysis of feature engineering for predictive modeling. In Proceedings of the IEEE SoutheastCon 2016, Norfolk, VA, USA, 30 March–3 April 2016; pp. 1–6. [Google Scholar]
- Hassoun, M.H. Fundamentals of Artificial Neural Networks; MIT Press: Cambridge, MA, USA, 1995. [Google Scholar]
- Pietrasik, M.; Wilbik, A.; Damoiseaux, Y.; Derks, T.; Karambiri, E.; de Koster, S.; van der Velde, D.; Ragaert, K.; Teng, S.Y. Capturing variability in material property predictions for plastics recycling via machine learning. Digit. Chem. Eng. 2025, 15, 100239. [Google Scholar] [CrossRef]
- Liu, Z.; Kan, H.; Zhang, T.; Li, Y. DUKMSVM: A framework of deep uniform kernel mapping support vector machine for short text classification. Appl. Sci. 2020, 10, 2348. [Google Scholar] [CrossRef]
- Bagnulo, E.; Strocchi, G.; Bicchi, C.; Liberto, E. Industrial food quality and consumer choice: Artificial-intelligence-based tools in the chemistry of sensory notes in comfort foods (coffee, cocoa and tea). Trends Food Sci. Technol. 2024, 147, 104415. [Google Scholar] [CrossRef]
- Srilakshmi, U.; David, D.B. Enhancing the quality of experience of online video service using support vector machine (SVM) in comparison with artificial neural networks (ANN). AIP Conf. Proc. 2025, 3267, 020282. [Google Scholar]
- Tocchi, G.; Misra, S.; Padgett, J.E.; Polese, M.; Di Ludovico, M. The use of machine-learning methods for post-earthquake building usability assessment: A predictive model for seismic-risk impact analyses. Int. J. Disaster Risk Reduct. 2023, 97, 104035. [Google Scholar] [CrossRef]
- Chen, L.; He, P.; Zou, J.; Zhang, H.; Peng, W.; Lü, F. Scalable and interpretable automated machine learning framework for biogas prediction, optimization, and stability monitoring in industrial-scale dry anaerobic digestion. Chem. Eng. J. 2025, 519, 165482. [Google Scholar] [CrossRef]
- Shen, R.; Sun, P.; Liu, J.; Luo, J.; Yao, Z.; Zhang, R.; Yu, J.; Zhao, L. Robust prediction for characteristics of digestion products in an industrial-scale biogas project via typical non-time-series and time-series machine learning algorithms. Chem. Eng. J. 2024, 498, 155582. [Google Scholar] [CrossRef]
- Tuğrul, T.; Hinis, M.A. Improvement of drought forecasting by means of various machine learning algorithms and wavelet transformation. Acta Geophys. 2025, 73, 855–874. [Google Scholar] [CrossRef]
- Gupta, R.; Zhang, L.; Hou, J.; Zhang, Z.; Liu, H.; You, S.; Ok, Y.S.; Li, W. Review of explainable machine learning for anaerobic digestion. Bioresour. Technol. 2023, 369, 128468. [Google Scholar] [CrossRef]
- Liu, L.; Tian, Y.; Zhao, J.; Xia, Z.; Wang, N.; Wang, D. Machine learning for predicting methane production and optimizing parameter in anaerobic digestion process. Fuel 2025, 396, 135206. [Google Scholar] [CrossRef]
- Sinha, S.; Vaidy, U. On Data-Driven Computation of Information Transfer for Causal Inference in Discrete-Time Dynamical Systems. J. Nonlinear Sci. 2020, 30, 1651–1676. [Google Scholar] [CrossRef]
- Deng, X.; Xu, Y.; Chen, L.; Zhong, W.; Jolfaei, A.; Zheng, X. Dynamic clustering method for imbalanced learning based on AdaBoost. J. Supercomput. 2020, 76, 9716–9738. [Google Scholar] [CrossRef]
- Zhong, Y.; Roman, M.B.; Zhong, Y.; Archer, S.; Chen, R.; Deitz, L.; Hochhalter, D.; Balaze, K.; Sperry, M.; Werner, E. Using anaerobic digestion of organic wastes to biochemically store solar thermal energy. Energy 2015, 83, 638–646. [Google Scholar] [CrossRef]
- Sarkar, O.; Rova, U.; Christakopoulos, P.; Matsakas, L. Influence of initial uncontrolled pH on acidogenic fermentation of brewery spent grains to biohydrogen and volatile fatty acids production: Optimization and scale-up. Bioresour. Technol. 2021, 319, 124233. [Google Scholar] [CrossRef]
- Schroer, H.W.; Just, C.L. Feature engineering and supervised machine learning to forecast biogas production during municipal anaerobic co-digestion. ACS EST Eng. 2024, 4, 660–672. [Google Scholar] [CrossRef]
- Abd, A.A.; Othman, M.R.; Helwani, Z.; Kim, J. An overview of biogas upgrading via pressure swing adsorption: Navigating through bibliometric insights towards a conceptual framework and future research pathways. Energy Convers. Manag. 2024, 306, 118268. [Google Scholar] [CrossRef]







| Variable | Unit | Operating Range (Mean ± SD) | Engineering Threshold | Data Source | n | Coverage |
|---|---|---|---|---|---|---|
| Feed solids | % | 26.5 ± 4.2 | 20–35 | Online sensor | 9820 | Steady, fluctuation, seasonal |
| Organic matter | % | 28.7 ± 5.1 | 15–40 | Experimental test | 9500 | Steady, feedstock change, seasonal |
| Feed rate | t/h | 1.2 ± 0.3 | 0.5–2.0 | Flow meter | 10,000 | Steady, fluctuation |
| pH | – | 7.2 ± 0.4 | 6.5–8.0 | Online sensor | 9700 | Steady, fluctuation |
| Dissolved O2 (mg L−1) | mg/L | 0.25 ± 0.10 (0.10–0.50) | ≤0.5 mg/L (<1% sat) | Online sensor | 9600 | Steady, fluctuation |
| Total solids | % | 32.4 ± 3.5 | 25–45 | Experimental test | 9300 | Steady, feedstock change |
| Biogas yield | m3/t | 79.5 ± 6.1 | ≥70 | Gas meter | 10,000 | Steady, fluctuation, seasonal |
| Temperature | °C | 34.2 ± 2.5 | 30–35 | Temperature sensor | 9950 | Steady, fluctuation, seasonal |
| VFA | g/L | 6.3 ± 1.5 | ≤8 | Experimental test | 9100 | Steady, feedstock change, fluctuation |
| Timestamp | Feed Solids (%) | Organic (%) | pH | Biogas (m3/t) | VFA (g/L) | Condition |
|---|---|---|---|---|---|---|
| 8-Mar | 27.8 | 29.4 | 7.2 | 81.3 | 6.1 | Steady state |
| 21-Mar | 33.2 | 34.7 | 7 | 77.6 | 7.4 | Load fluctuation |
| 12-May | 30.5 | 31.2 | 6.8 | 79.8 | 5.9 | Seasonal variation |
| Model | Hyperparameter | Search Range | Optimal Value | Optimization Method |
|---|---|---|---|---|
| SVM | C | 0.01–100 | 10 | Grid search |
| γ (RBF kernel width) | 1 × 10−4–1 × 100 | 0.05 | Grid search | |
| RF | Number of trees | 50–500 | 100 | Cross-validation |
| Maximum depth | 5–30 | 18 | Random search | |
| Number of features (mtry) | 1–6 | 2 | Random search | |
| ANN | Hidden layers | [32–256] × 1–3 | [128, 64] | Grid search |
| Activation function | ReLU/Tanh | ReLU | Empirical selection | |
| Learning rate | 1 × 10−4–1 × 10−2 | 0.001 | Adam optimizer | |
| Regularization λ | 0–0.01 | 0.001 | Grid search | |
| Batch size | 16–128 | 64 | Grid search |
| Model | Accuracy | Recall | F1 | AUROC | Biogas RMSE (m3/t) | Temp RMSE (°C) | VFA RMSE (g /L) | R2 (avg.) |
|---|---|---|---|---|---|---|---|---|
| SVM | 0.89 | 0.87 | 0.88 | 0.91 | 2.1 | 1.2 | 0.8 | 0.82 |
| RF | 0.91 | 0.9 | 0.9 | 0.94 | 1.8 | 0.9 | 0.6 | 0.88 |
| ANN | 0.96 | 0.95 | 0.95 | 0.98 | 1.2 | 0.5 | 0.3 | 0.94 |
| Scenario | Grid Emission Factor (kgCO2/kW/h) | Energy Savings Rate | Carbon Emission Reduction Rate |
|---|---|---|---|
| China | 0.65 | 8% | 5% |
| EU | 0.35 | 9% | 6% |
| United States | 0.45 | 10% | 7% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhuang, Z.; Liu, X.; Jin, J.; Li, Z.; Liu, Y.; Tavares, A.; Li, D. Prediction, Uncertainty Quantification, and ANN-Assisted Operation of Anaerobic Digestion Guided by Entropy Using Machine Learning. Entropy 2025, 27, 1233. https://doi.org/10.3390/e27121233
Zhuang Z, Liu X, Jin J, Li Z, Liu Y, Tavares A, Li D. Prediction, Uncertainty Quantification, and ANN-Assisted Operation of Anaerobic Digestion Guided by Entropy Using Machine Learning. Entropy. 2025; 27(12):1233. https://doi.org/10.3390/e27121233
Chicago/Turabian StyleZhuang, Zhipeng, Xiaoshan Liu, Jing Jin, Ziwen Li, Yanheng Liu, Adriano Tavares, and Dalin Li. 2025. "Prediction, Uncertainty Quantification, and ANN-Assisted Operation of Anaerobic Digestion Guided by Entropy Using Machine Learning" Entropy 27, no. 12: 1233. https://doi.org/10.3390/e27121233
APA StyleZhuang, Z., Liu, X., Jin, J., Li, Z., Liu, Y., Tavares, A., & Li, D. (2025). Prediction, Uncertainty Quantification, and ANN-Assisted Operation of Anaerobic Digestion Guided by Entropy Using Machine Learning. Entropy, 27(12), 1233. https://doi.org/10.3390/e27121233

