Artificial Intelligence in Anaerobic Digestion: A Review of Sensors, Modeling Approaches, and Optimization Strategies

Milena Marycz; Izabela Turowska; Szymon Glazik; Piotr Jasiński

doi:10.3390/s25226961

,

and

Faculty of Electronics, Telecommunications and Informatics, and Advanced Materials Centre, Gdańsk University of Technology, 80-233 Gdańsk, Poland

^*

Author to whom correspondence should be addressed.

Sensors2025, 25(22), 6961;https://doi.org/10.3390/s25226961

This article belongs to the Section Biosensors

Version Notes

Order Reprints

Highlights

What are the main findings?

Artificial intelligence (AI) and machine learning (ML) have the potential to significantly enhance the forecasting and optimization of anaerobic digestion (AD) processes. They enable efficient handling of nonlinear and multidimensional data, enhancing process control and predictive accuracy.
Soft sensors integrating electrochemical, microbial, optical, and hybrid systems provide adaptive, real-time process control and stability monitoring in anaerobic fermentation.

What is the implication of the main finding?

Further development toward autonomous and intelligent AD systems requires the creation of more reliable sensors, standardization of open-access datasets, and improvement of AI model interpretability.
Advancing these areas will enable more predictive, transparent, and efficient biogas production, supporting the circular economy and energy transition goals.

Abstract

Anaerobic digestion (AD) is increasingly recognized as a key technology for renewable energy generation and sustainable waste management within the circular economy. However, its performance is highly sensitive to feedstock variability and environmental fluctuations, making stable operation and high methane yields difficult to sustain. Conventional monitoring and control systems, based on limited sensors and mechanistic models, often fail to anticipate disturbances or optimize process performance. This review discusses recent progress in electrochemical, optical, spectroscopic, microbial, and hybrid sensors, highlighting their advantages and limitations in artificial intelligence (AI)-assisted monitoring. The role of soft sensors, data preprocessing, feature engineering, and explainable AI is emphasized to enable predictive and adaptive process control. Various machine learning (ML) techniques, including neural networks, support vector machines, ensemble methods, and hybrid gray-box models, are evaluated for yield forecasting, anomaly detection, and operational optimization. Persistent challenges include sensor fouling, calibration drift, and the lack of standardized open datasets. Emerging strategies such as digital twins, data augmentation, and automated optimization frameworks are proposed to address these issues. Future progress will rely on more robust sensors, shared datasets, and interpretable AI tools to achieve predictive, transparent, and efficient biogas production supporting the energy transition.

Keywords:

anaerobic digestion; artificial intelligence; machine learning; soft sensors; data preprocessing; explainable AI; digital twin; process optimization; feature engineering

1. Introduction

Energy policy in Europe increasingly focuses on climate change mitigation and energy security. In 2022, the European Commission launched the REPowerEU plan, which explicitly aims to reduce the European Union’s reliance on imported fossil fuels, particularly natural gas from Russia. A central component of this plan is the large-scale deployment of biomethane produced from renewable and waste-based sources, such as agricultural residues, industrial by-products, waste gases, and municipal wastewater [1,2,3]. Anaerobic digestion (AD) links waste management with the production of biogas, which can be used as a renewable energy carrier. AD is a multi-stage microbial process in which organic substrates are decomposed in the absence of oxygen, yielding primarily methane and carbon dioxide (CO₂), along with small fractions of other gases such as hydrogen sulfide and ammonia. The resulting biogas can be upgraded to biomethane and injected into natural gas grids, while the residual digestate constitutes a nutrient-rich soil amendment [4]. In addition to its environmental benefits, AD contributes to the circular economy by valorizing diverse organic residues such as food waste, wastewater sludge, and agricultural by-products [5,6].

In practice, AD operation often faces instability. The microbial consortia responsible for each metabolic stage are highly sensitive to environmental fluctuations, and process stability is easily disrupted by variations in feedstock composition, accumulation of inhibitors, or inadequate control of operational parameters such as temperature, pH, organic loading rate (OLR), and hydraulic retention time (HRT) [7,8]. Instability can result in reduced methane yields, process inhibition, or even complete reactor failure. Traditional control strategies, relying on a limited set of sensors and linear control loops, often lack the predictive power necessary to anticipate such disturbances [9]. Consequently, the optimization of AD requires new approaches that combine high-resolution monitoring with advanced analytical and decision-support tools.

Artificial intelligence (AI) and machine learning (ML) have been applied to AD in recent studies. These approaches use historical and real-time data to model nonlinear dynamics that conventional mechanistic models cannot capture. Neural networks, recurrent networks, adaptive neuro-fuzzy systems, and ensemble methods have all been applied to predict biogas yields, detect anomalies, and optimize parameters [5,10,11,12,13,14]. Hybrid gray-box frameworks combine the structure of models such as ADM1 with data-driven flexibility, offering more practical predictive capacity [15,16,17,18].

The modeling of AD processes has traditionally relied on three complementary approaches: mechanistically inspired, kinetic, and phenomenological models [19]. Mechanistic models, such as ADM1 and BioModel, provide a detailed biochemical and physicochemical description of system dynamics but require extensive parameterization and substrate characterization [20]. Kinetic models simplify these interactions using empirical rate equations, enabling faster computation but with limited generalizability [21]. Phenomenological models, in turn, focus on reproducing observed input–output relationships without explicit mechanistic detail, offering flexibility at the cost of interpretability [19]. Each class of models thus presents a trade-off between accuracy, complexity, and data requirements. Optimization of AD processes has similarly evolved from deterministic approaches, which rely on gradient-based algorithms and provide reproducible, single-point solutions, to stochastic or metaheuristic methods, such as genetic algorithms, particle swarm optimization, and simulated annealing [22,23,24]. The latter are particularly useful for nonconvex, multimodal problems typical of AD but may require high computational effort and careful tuning of algorithmic parameters. In this context, AI and ML serve as a bridge between traditional modeling paradigms. They combine the adaptability of data-driven learning with the explanatory power of mechanistic frameworks, as demonstrated by recent developments in hybrid modeling and digital twin systems [19].

Another important limitation of current mechanistic and kinetic models is their inability to adequately describe foaming phenomena in anaerobic digesters [19,25]. Foam formation and stability depend on complex interactions among surface-active compounds, gas–liquid mass transfer, hydrodynamics, and microbial community composition, which remain poorly understood at the mechanistic level [26]. As a result, existing process models often fail to anticipate foaming events that can cause operational disruptions and biomass losses [27]. AI-driven frameworks may help address this gap by extracting data-driven signatures of foaming behavior from high-frequency sensor data (e.g., pressure, gas flow, or image-based measurements) [28]. Such models could support early detection and mitigation of foaming, complementing mechanistic understanding with predictive capabilities derived from real-world process data.

The aim of this review is to examine current AI-based approaches for AD, with focus on monitoring, sensor integration, optimization algorithms, and decision support. The review considers developments in biotechnology, process engineering, and computational methods, with reference to their application in biogas and biomethane production under the REPowerEU plan.

2. Anaerobic Digestion and the Need for Intelligent Monitoring

2.1. Basics of Anaerobic Fermentation Processes

In AD, organic matter is broken down stepwise by microbial consortia, with different groups performing distinct metabolic conversions. The first stage, hydrolysis, involves extracellular enzymes that depolymerize carbohydrates, proteins, and lipids into soluble monomers. These smaller molecules are then fermented during acidogenesis, producing volatile fatty acids (VFAs), alcohols, hydrogen, and CO₂. In the subsequent step, acetogenesis, these intermediates are further oxidized to acetate, hydrogen, and CO₂, which finally serve as substrates for methanogens in the terminal stage of methanogenesis. The efficiency of each stage depends on the activity of microbial consortia, many of which are still not fully characterized [29,30].

The composition of biogas is governed by the carbon oxidation-reduction state of the organic matter present in the feedstock, and by the type of AD process. For example, biogas from landfills is a complex mixture of methane (35–65%), CO₂ (15–50%) and volatile organic pollutants (VOCs; <4500 mg/m³) with low hydrogen sulfide (H₂S) concentrations (<200 ppm_v). Biogas from enclosed digesters exhibits higher methane (50–70%) and H₂S (500–20,000 ppm_v) contents, but lower concentrations of VOCs [31,32]. The digestate contributes to nutrient recycling, which supports circular economy applications [33,34].

AD has been studied extensively with mathematical and computational models. The Anaerobic Digestion Model No. 1 (ADM1), developed by the International Water Association, represents the most widely adopted mechanistic framework. It describes the dynamics of 32 state variables through more than 100 parameters, encompassing hydrolysis kinetics, microbial interactions, and gas–liquid transfer processes [16]. Although ADM1 is widely used in research, its application at full scale is limited. It requires extensive substrate characterization, which is often impractical for heterogeneous feedstocks such as municipal solid waste or manure. Parameter calibration is highly demanding and site-specific, and the model does not easily accommodate the stochastic fluctuations inherent in real-world conditions [9,15]. Recently, more advanced mechanistic frameworks such as the BioModel have been developed, extending beyond ADM1 by incorporating biochemical reactions together with three-phase (gas–liquid–solid) physicochemical processes, as well as the influence of biological and inorganic additives and trace metal activities. These extensions allow a more comprehensive representation of substrate degradation and gas production dynamics, supported by experimental validation and parameter optimization on full-scale biogas plant data [35,36]. Such developments provide a valuable context for integrating AI, which can complement these detailed mechanistic frameworks by improving data-driven calibration, predictive accuracy, and adaptive process control.

Data-driven approaches such as ML are increasingly applied in AD. ML techniques range from artificial neural networks to ensemble methods. These techniques offer the ability to infer complex, nonlinear relationships between input parameters and process performance without requiring explicit biochemical equations [37,38]. These models have demonstrated strong predictive performance in estimating biogas yields, forecasting system instabilities, and even serving as soft sensors (i.e., inferential models emulating sensor functions; a formal definition and classification are provided in Section 3.3) for variables such as VFA concentration or ammonia, which are otherwise costly and laborious to measure directly [10,39,40]. Hybrid or so-called gray-box approaches that integrate mechanistic insights with data-driven adaptability further expand this potential, allowing simulations to benefit simultaneously from biochemical interpretability and statistical flexibility [17,18]. For instance, Ge et al. [41] proposed a modified ADM1 (M-ADM1) that integrates a support vector machine model to dynamically predict key kinetic parameters, thereby improving simulation accuracy for various biomass feedstocks.

In summary, AD technology is promising due to the ability to recover energy from spent biomass and recycle nutrients, but has strict operational constraints related to maintaining microbial communities and variability in properties of feedstock. Addressing these issues requires monitoring and control strategies that extend beyond conventional methods.

2.2. Challenges in Monitoring and Control

The stable operation of AD relies on monitoring parameters that are closely interrelated. Temperature plays a critical role, as microbial activity and methane production differ significantly under mesophilic and thermophilic conditions, typically around 35–40 °C for mesophilic and 55–60 °C for thermophilic operation, although the exact ranges may vary between systems and operators [42,43,44]. pH and alkalinity reflect microbial balance: methanogens grow within a narrow pH range, while the accumulation of VFAs can quickly disrupt digester stability [45,46,47]. The relationship between OLR and HRT determines whether microorganisms have enough time to process the substrate. Excessive loading or insufficient retention can lead to biomass washout or process inhibition [33,48]. VFA and ammonia concentrations serve as sensitive early indicators of imbalance, yet continuous real-time monitoring remains challenging [7,49,50]. In laboratory and pilot-scale studies, these parameters are still often measured manually. Samples are withdrawn from the reactor and analyzed in the laboratory using standard test kits or chromatography-based methods to determine VFA, ammonia, or chemical oxygen demand (COD). Although reliable, these laboratory methods are time-consuming and susceptible to operator mistakes [9]. They also miss short-term changes in digester conditions, which can lead to problems before corrective measures are applied.

Efforts to automate monitoring have increasingly turned to online sensors that can deliver continuous data streams. Their deployment in AD is, however, technically demanding. A persistent problem is biofouling: microbial films and organic deposits accumulate on sensor surfaces, distorting measurements and gradually lowering accuracy. This is particularly evident in electrochemical probes used for pH, redox potential, or ion-selective measurements, where signal drift often requires frequent recalibration [51,52,53]. Common countermeasures include self-cleaning housings, protective membranes, and automated calibration routines. These measures slow down fouling but do not remove it entirely. More recent trials with anti-adhesive or antimicrobial coatings show potential for longer stability, though their durability in digesters still needs to be confirmed [54,55,56].

Calibration of sensors poses another practical difficulty. In many studies, models are trained on relatively small datasets derived from manual sampling. Such limited input narrows the capacity of algorithms to generalize, so even minor shifts in feedstock composition may require retraining [13,57]. Sensor drift and environmental variability add further noise, reducing reproducibility [58]. A promising approach to mitigate these limitations involves the systematic creation and evaluation of large, heterogeneous training datasets, encompassing diverse sample types. Specifically, in the context of AD, samples including agricultural residues, manure, and industrial effluents are employed to capture the full range of substrate characteristics [13,59]. By providing a comprehensive representation of sample variability, such datasets enable ML models to learn the nonlinearities inherent in sensor responses and to adapt to new conditions without necessitating a complete redesign of the system [60]. These efforts are still in early stages, and their effectiveness for full-scale operation is being evaluated [13,60].

Control strategies face comparable limitations. Conventional feedback mechanisms, particularly proportional–integral–derivative (PID) controllers, are poorly suited to AD because of its strong nonlinearity, long response times, and interdependencies among parameters [61,62]. In practice, operators often rely on trial-and-error adjustments. This approach can keep reactors running but also increases the risk of inefficiency or breakdown [63].

Research has therefore moved toward data-driven control. Soft sensors estimate variables that are expensive or difficult to monitor directly, for example, VFA concentrations, by drawing on more readily available measurements [51,64]. Anomaly detection systems can recognize deviations in system behavior before they escalate into critical issues. Hybrid approaches that combine ML with optimization methods, such as genetic algorithms or desirability analysis, can then refine loading rates and retention times to maximize methane yield [18,65]. More recently, hybrid dynamic model predictive controllers incorporating Bayesian estimation have been proposed to handle inlet variability and improve methane generation efficiency in anaerobic digestion systems [66]. Moreover, the emergence of digital twins, which are virtual replicas of digesters that combine predictive modeling with real-time process data obtained from sensors, provides a powerful tool for scenario testing, early warning, and autonomous decision-making [67,68]. These tools remain under development, but they show how process control is moving beyond empirical adjustment. The reactor’s economic efficiency can be improved by implementing the SMART (Sensor-based Monitoring and Remote-control Technology) control system, whose fuzzy logic-based decision engine ensures increased biogas production and improved volume recovery rates. Using this technology allowed for a 53% increase in methane production after four months compared to the open-loop method, while methane production increased 1.6 times slower using the conventional method than the SMART system [69].

Although many monitoring and control concepts have been proposed, their validation under industrial conditions is scarce. Future work should focus on transferring laboratory-scale monitoring strategies to full-scale biogas plants and assessing their economic feasibility and robustness to environmental variability.

3. Sensors in Anaerobic Reactors: State of the Art

3.1. Electrochemical Sensors

The operation of AD depends on continuous changes in pH, redox potential, and conductivity, which are tracked with electrochemical sensors [70]. Online monitoring provides real-time data that lead to faster detection of process disturbances and advance planning of calibration or cleaning [71]. However, sensor reliability depends on multiple factors, including equipment configuration, sampling strategies, measurement frequency, and maintenance requirements [72].

Amperometric electrochemical sensors have been developed to monitor VFAs, critical intermediates that reflect AD stability. These sensors detect current signals generated by electron transfer from electroactive bacteria such as Geobacter anodireducens, which colonize electrode surfaces with anodic biofilms [73]. However, sensor signals are sensitive to interference. For example, fumarate can act as an electron donor, increasing current density and distorting VFA quantification, while ions such as Cu (II) or elevated total ammonia nitrogen (TAN > 200 mg/L) reduce measurement reliability [74]. Nevertheless, electrodes composed of graphite anodes and stainless-steel cathodes have achieved accuracy comparable to gas chromatography (GC), demonstrating the promise of microbial electrochemical sensors (MESe) for online VFA monitoring [75]. For example, octane measurement using microbial biosensors yields comparable results to gas chromatography, with the market value within 24.2% in terms of gas chromatographic analysis [76]. Coupling electrochemical probes with physicochemical sensors such as temperature or conductivity electrodes can improve data quality [75]. In practice, long-term use remains constrained by microbial community shifts that affect reproducibility and by fouling of anion-exchange membranes during immersion in digestate [75,77].

Microbial potentiometric sensors (MPSs) use biofilm-mediated open-circuit potentials to track redox fluctuations in digesters [78]. MPS devices are more sensitive to changes in carbon removal than dissolved oxygen (DO) or oxidation–reduction potential (ORP) sensors. Their main advantages include the catalytic activity of biofilm-associated enzymes and the capacity for biofilm regeneration of electrode surfaces and can delay the need for electrode maintenance [79]. However, MPS systems are strongly influenced by environmental conditions such as nutrient levels, temperature, and the presence of biocides. The results of the bioelectrochemical cell solution analysis (MEC/MFC) lead to a linear correlation (R² = 0.99) between current fluxes (from 0.03 ± 0.01 to 2.43 ± 0.12 A/m²) and VFA effects (5–100 mM), further supporting the sensitivity of MPS-based systems to biochemical variations in anaerobic environments [80]. Increased internal resistance over time also degrades signal quality and reduces reliability [81]. Biofilms may harbor pathogenic microorganisms, but they can simultaneously support beneficial processes such as surface regeneration [82].

The field of bioelectrochemical sensing has grown considerably in recent years, particularly using microbial fuel cells (MFCs) and electrolytic systems that exploit electrons released during anaerobic metabolism [52]. In most cases, the sensing element consists of microbial biofilms, which directly couple biological activity to an electrical response. A notable example is their application in monitoring gas composition, including stable isotopes such as C¹³, which can shed light on the activity of methanogenic pathways [83]. Electroactive microorganisms can spontaneously colonize electrode surfaces, forming direct contact with conductive materials needed for a stable signal [84]. However, these sensors face limitations in selectivity and repeatability. Over time, microbial consortia consume multiple carbon sources, which decreases measurement precision. To counteract this effect, genetic modulation of auxotrophic strains has been proposed to enhance signal strength, improve current density, and maintain long-term sensitivity [85]. Despite such advances, reproducibility remains a major challenge, as it is difficult to consistently replicate biofilm thickness or microbial distribution under identical conditions. Environmental stressors such as antibiotics or heavy metals further compromise sensor stability, particularly in waste streams such as dairy effluents [52,86].

Recent innovations explore hybrid sensor systems, such as those combining MFCs with gas flow meters and pH probes, which expand monitoring capabilities for intermediates like acetate, an important indicator of methanogenesis stability. These hybrid designs have achieved acetate measurement deviations within 24.2% of GC reference values, highlighting their practical potential [76,87]. Additional advances, such as pH-activated dissolvable polymeric coatings, can mitigate biofouling on electrochemical electrodes, improving long-term stability [88].

Electrochemical and potentiometric sensors remain constrained by issues of longevity, reproducibility, and biofouling. Large-scale application is further limited by trade-offs between nutrient supply, microbial stability, and the effort required for maintenance [72,89].

3.2. Optical and Spectroscopic Sensors

In recent years, optical and spectroscopic approaches have become increasingly important tools for monitoring AD, offering an alternative to conventional electrochemical methods. These sensor systems rely on light–matter interactions such as absorption, fluorescence, or scattering to track relevant analytes, ranging from gases and solvents to microbial metabolites. Commonly applied techniques include isotope spectrophotometry, infrared spectroscopy (IR), and spectrofluorometry [90,91,92].

Optical sensors are used either at-line, where samples are taken to an external module, or on-line, where they are placed directly in the process stream. Continuous on-line monitoring is used for hydrogen concentrations, since elevated hydrogen levels not only affect CO₂ reduction but also accelerate VFA accumulation, destabilizing digestion. Correct sensor positioning within the reactor is therefore essential for reliable performance [93].

An illustrative example of early optical sensor innovations is the fiber-optic immersion sensor employing triphenylmethane dyes to detect organic solvents in wastewater. Its performance is strongly influenced by temperature and signal noise, as elevated temperatures accelerate solvent evaporation and alter analyte diffusion into the sensor matrix. This makes thermal control a critical factor for reliable operation. The sensor also exhibits limitations for poorly water-soluble solvents, which may produce delayed or inconsistent responses, while small donor molecules such as ammonia often show negligible absorption despite their strong reactivity. In contrast, highly volatile solvents with properties distinct from water (e.g., tetrahydrofuran, ethyl acetate) generate increased partial vapor pressures, which can artificially inflate the measured signal. Despite these constraints, the fiber-optic sensor has practical advantages: recalibration is simple and rapid, since the device only requires a water rinse between measurements [94].

Biosensors have also been coupled with optical transducers. In such systems, immobilized biocatalysts placed on single waveguides or fiber bundles catalyze analyte-specific reactions, while the resulting products are recorded via fluorescence or chemiluminescence detection [92,95]. The biocatalyst thus acts as a molecular bridge between the analyte and the transducer, enabling conversion of biochemical interactions into measurable optical signals. In this approach, chlorophyll fluorescence can be used to detect algal growth. Such measurements, often performed with laboratory spectrofluorimetry under laser excitation, provide valuable information on microbial community dynamics and help prevent excessive algal proliferation that could destabilize fermentation [92,96].

Total internal reflection fluorescence (TIRF) uses lipid membranes on planar optical waveguides to control the penetration depth of the evanescent field. The angle of light coupling to the waveguide depends on its refractive index, allowing miniaturized sensors suitable for quantifying gas concentrations, refractive indices of liquids, or humidity [92,97]. A complementary technology is surface plasmon resonance (SPR) sensing, in which incident light excites surface plasmons at the metal–dielectric interface. This enables trace-level detection of heavy metals, with reported sensitivities of 10 ppm Cu and 13.76 ppm Pb [98].

Optical methods are widely used for biogas quality assessment. For example, non-dispersive infrared (NDIR) modules equipped with photoacoustic detectors provide high-resolution, real-time quantification of methane and CO₂. In this configuration, mid-infrared light is converted into acoustic waves, with amplitudes directly proportional to analyte concentrations [90]. Additionally, NIR spectroscopy used in the AD reactor for TAN analysis showed strong model correlation (R² = 0.91) and low prediction error (RMSEP = 0.32 g N/L) for the range of 1.5–5.5 g N/L. These results confirm the high efficiency of the probe during measurements in filtration mode and in assessing the quality of substrates [99]. In parallel, isotope fractionation techniques have been applied to monitor microbial responses to environmental perturbations such as salt stress [91].

In wastewater treatment applications, where membrane fouling remains a persistent challenge, UV and IR spectroscopic scattering indicators are widely applied for real-time monitoring of fouling formation [100]. Despite their advantages, including versatility, rapid response, and relatively low operational costs, optical sensors continue to face practical challenges. These include probe biofouling, calibration drift, and optical artifacts caused by air bubbles during mixing or aeration, which distort absorption signals [72,101]. Moreover, their applicability is limited for certain compounds such as carbohydrates and saturated hydrocarbons, which lack significant UV absorption and therefore require complementary analytical techniques [92,102].

Optical and spectroscopic sensors offer high-resolution, non-invasive monitoring of AD. Their wider use depends on solving problems of fouling, calibration drift, and analyte-specific limitations.

3.3. Sensor Integration and Data Acquisition

Monitoring in AD involves both physical sensors and data acquisition frameworks that support modeling. Online measurements of flow rates, gas composition, and auxiliary variables provide rapid, continuous insights into process dynamics, in contrast to offline sampling, which introduces delays and limits temporal resolution [13]. Online systems capture short-term fluctuations in reactors that are missed by offline sampling.

A key element in this integration is the use of soft sensors, which link direct physical measurements with mathematical modeling to estimate process variables that are otherwise difficult or expensive to quantify. Indirect indicators, such as nutrient consumption or the release of metabolic intermediates, can provide useful proxies for biomass growth and substrate turnover. These indicators supply critical data for model-based predictions. Despite the rise of advanced sensing technologies, simple titration-based assays remain common in both research and industry, largely due to their low cost and adaptability [103,104,105].

Soft sensors can be classified according to the modeling strategies used to transform raw measurements and auxiliary variables into reliable process information. These strategies encompass mechanistic, regression-based, state-estimation, AI–based, and deep learning-based frameworks, as illustrated schematically in Figure 1.

Figure 1. Schematic representation of soft sensor modeling strategies applied to AD processes [103,106,107,108,109,110,111,112].

Mechanistic models construct mathematical relationships between target and auxiliary variables based on balance equations and detailed process analysis [106]. While this approach provides interpretability and clear links to underlying bioprocesses, it struggles to capture the inherent complexity, variability, and nonlinear behavior of AD [107]. To address these limitations, hybrid models have emerged that combine mechanistic foundations with data-driven flexibility. Such models integrate physical and mathematical correlations to provide both qualitative and quantitative insights into degradation-phase dynamics and associated gas production, while also accounting for microbial growth patterns [103,108].

Regression-based approaches use statistical correlations between measured inputs and predicted outputs. Multiple linear regression (MLR) and partial least squares regression (PLSR) are widely applied, as they effectively handle collinearity among auxiliary variables and establish linear mappings between measured inputs and predicted outputs [103,109]. However, such models often generalize results excessively, which can reduce accuracy, obscure meaningful signal components, or introduce residual noise.

State-estimation models infer target variables from auxiliary measurements by approximating system behavior [107]. Such models are particularly useful for real-time monitoring, yet their accuracy can decline when system complexity is reduced to ease computational load. Moreover, both the limited precision and the expense of sophisticated instrumentation continue to hinder their broader adoption in large-scale industrial AD processes [103,109].

A conceptual distinction exists between regression-based and AI-based frameworks. Regression models, such as MLR and PLSR, rely on predefined linear or polynomial relationships between inputs and outputs. In contrast, AI-based soft sensors include ML and deep learning algorithms, such as neural networks and support vector machines, which can also perform regression but learn nonlinear and adaptive relationships directly from data. This distinction clarifies the methodological scope of each group and ensures consistency with the discussion of regression and classification tasks in Section 4.1.

Recent research increasingly explores AI-based soft sensors. Neural networks, for example, can establish nonlinear mappings between auxiliary and target variables by learning directly from training datasets [110]. Similarly, kernel-based approaches transform variables into higher-dimensional feature spaces, where linear algorithms can then be applied to uncover nonlinear relationships [111]. Nevertheless, both approaches face challenges, as network topology, kernel selection, and the quality of training samples strongly influence performance. Neural networks are prone to overfitting or underfitting, which may compromise generalizability [103,112]. These issues are discussed in more detail in Section 4.2.

The most advanced developments involve deep learning–based frameworks, which extend neural network architectures with multiple hidden layers to automatically extract informative features from raw data and capture temporal dynamics inherent to AD processes. These methods have demonstrated improved robustness and predictive accuracy under fluctuating environmental and operational conditions [103].

Despite ongoing challenges such as delays in analysis, a tendency toward over-simplification, and high implementation costs, the integration of physical sensors with AI-based data acquisition remains limited. For clarity and comparison, Table 1 outlines the principal sensor types used in AD monitoring, along with their measurable parameters, advantages, limitations, potential for AI integration, and key literature references. While numerous sensor types have been tested for AD, few studies directly compare their performance across feedstocks or operational regimes. Establishing standardized testing protocols and open databases for sensor benchmarking would facilitate more objective assessment and model integration.

Table 1. Comparison of sensor types for AD monitoring.

4. AI for Anaerobic Fermentation

4.1. From Data to Insight: ML Paradigm

Modern sensor systems generate large datasets during AD operation. Analyzing these data requires methods able to identify patterns and predict system behavior. ML is increasingly applied in this context, as it can model nonlinear dependencies and dynamic fluctuations [65,113]. Unlike conventional control strategies, which often respond only after disturbances occur, ML-based tools support early interventions and contribute to more stable reactor performance. Nevertheless, their effectiveness depends critically on the amount, quality, and diversity of available data. This is a persistent challenge in AD, where measurements may be incomplete, affected by sensor drift, or constrained by limited sampling frequency [13,114].

Most ML work in AD uses either classification or regression tasks. Classification models are typically used to distinguish between stable and unstable operating states, detect inhibition events, or group substrates based on microbial community composition [115]. These tasks may take the form of binary (e.g., failure vs. stable operation), multiclass (different feedstock types), or multilabel classification. Regression models, by contrast, predict continuous outcomes such as methane yield, VFA concentration, or TAN. Their performance is generally evaluated using metrics such as root mean square error (RMSE), which provide a quantitative benchmark for predictive accuracy [114].

A broad range of machine learning algorithms has been applied to AD, including tree-based ensembles such as Random Forest (RF), evolutionary approaches such as Genetic Programming (GP), instance-based methods such as k-Nearest Neighbors (KNN), kernel-based methods such as Support Vector Machines (SVMs), neural architectures such as Artificial Neural Networks (ANNs) and Extreme Learning Machines (ELMs), as well as ensemble boosting frameworks such as Extreme Gradient Boosting (XGBoost). These approaches, summarized schematically in Figure 2, differ in data requirements, interpretability, and computational complexity, providing diverse capabilities for process prediction, optimization, and fault detection. RF is often applied because it can process many input variables without prior feature elimination. It performs well for methane yield prediction but less consistently for VFAs. In such cases, GP offers greater transparency and adaptability, as it generates explicit mathematical relationships that can be integrated into process control systems [116]. Comparative studies show that RF often outperforms KNN in regression tasks for methane prediction, although results may vary depending on the dataset and process parameters. KNN performance is highly dependent on the choice of the k parameter and tends to suffer from reduced precision [114,117].

Figure 2. Schematic representation of ML algorithms applied to AD, illustrating major methodological categories and their interrelations [110,114,116,117,118,119,120,121,122,123].

SVMs map nonlinear input data into higher-dimensional feature spaces, which enables linear separation. Beyond conventional algorithms, probabilistic approaches such as multi-task Gaussian processes (MTGPs) have recently demonstrated strong predictive performance for simultaneous estimation of biogas, soluble COD, and VFA dynamics, outperforming mechanistic AM2 models while retaining uncertainty quantification capabilities [124]. This makes them suitable for both classification and regression tasks. In AD studies, SVMs have shown high accuracy in predicting TAN values and biogas quality, often outperforming analytical methods and neural networks [118,119].

ANNs have also been explored, benefiting from their ability to model complex nonlinear relationships between process variables. They have been applied to predict methane yields, volatile solids concentrations, and inhibition events [110,120]. Their performance, however, depends strongly on network architecture and training strategy, and they are prone to overfitting and sensitivity to hyperparameter choices [118]. Variants such as ELMs set hidden layer weights randomly and calculate output weights analytically, which shortens training time. These models often lack robustness and have limited fault detection capacity, so their use in full-scale systems remains restricted despite positive laboratory results [121].

More recently, ensemble and boosting approaches have gained prominence. XGBoost has demonstrated industrial-scale applicability, offering improved prediction of methane production across heterogeneous feedstocks while reducing overfitting through regularization [122,123]. Reported normalized RMSE values of around 21% have been achieved, though performance still depends on feedstock heterogeneity and system conditions [123].

Algorithm choice depends on the task and data quality. Hybrid approaches that combine mechanistic models such as ADM1 with ML are being tested to balance interpretability with predictive flexibility [17,18].

Integration of AI models with sensor data is crucial for translating raw measurements into actionable insights. Different AI and ML architectures exhibit specific advantages depending on the nature of the data produced by various sensors [13,103]. For example, time-series models such as Long Short-Term Memory (LSTM) networks and Recurrent Neural Networks (RNNs) are particularly effective for continuous electrochemical or potentiometric sensor data, as they capture temporal dependencies and detect gradual signal drifts caused by fouling or process fluctuations [10,11]. In contrast, Convolutional Neural Networks (CNNs) excel in analyzing spectroscopic and optical sensor outputs, where spatial or spectral feature extraction is required to identify subtle patterns in absorbance, fluorescence or scattering data [14]. Ensemble and tree-based algorithms (e.g., Random Forest, XGBoost) are often preferred for hybrid sensor arrays combining multiple data modalities, offering robustness to noise and strong feature interpretability [125]. A concise summary of the relationship between sensor types and suitable AI/ML approaches is provided in Table 2. A more detailed comparative summary of ML algorithms applied to AD, including their specific inputs, outputs, and performance metrics, can be found in the recent comprehensive review by Murali et al. [126].

Table 2. Representative AI and ML models suited for different sensor data types in AD monitoring.

Despite the good results of hybrid and ensemble models, there are very few databases for training models in AD. Developing such databases for regions or countries, for example, would greatly facilitate model testing, training, and validation.

4.2. Training–Validation–Deployment: Practical Workflow

Applying ML in AD involves several stages: data collection, preprocessing, model training, validation, and deployment. Each stage affects the reliability of predictions. Moreover, they present specific challenges in the AD context and determines the reliability of final model predictions [13,58]. These sequential steps, illustrated schematically in Figure 3, form the practical workflow for developing, validating, and implementing ML models in AD.

Figure 3. Schematic workflow of ML model development and deployment for AD processes.

The workflow begins with data preparation. Raw measurements must be cleaned, normalized, and formatted to match ML model requirements. Missing values, common in both laboratory and industrial datasets due to sensor drift, transmission errors, or biofouling, must be carefully addressed. In practice, sensor biofouling and signal drift frequently result in data that are not missing at random (NMAR), complicating imputation and increasing the risk of biased model training if not properly handled. Their treatment depends on the mechanism of missingness: missing completely at random (MCAR), missing at random (MAR), or NMAR [130,131]. Two main strategies are applied: (i) imputation, which estimates missing values using statistical or ML methods (e.g., regression, hot-deck, or multiple imputation), and (ii) deletion, which removes affected entries either entirely (listwise) or selectively (pairwise) [130,131,132]. More specifically, imputation refers to estimating missing values based on observed data. Simple approaches include mean or median substitution [133]. More advanced strategies, such as regression imputation, involve first building a regression model on available data and then predicting missing values [58]. Regression-based approaches preserve dataset size but may require large samples for reliability. Another commonly used method is hot-deck imputation, in which donor values from similar cases are substituted for missing entries. Variants include random, nearest-neighbor, or sequential hot-deck procedures [134]. While this maintains correlations within the dataset, random donor selection may lead to inconsistent substitutions [135]. The alternative is deletion, i.e., explicit removal of incomplete records. Listwise deletion eliminates entire observations containing any missing values and is considered reliable only under MCAR conditions, as it risks discarding valuable information when missingness is systematic or datasets are small [136]. Pairwise deletion, in which only the missing values are excluded from specific analyses, is more accurate for MCAR or small amounts of MAR data, but it is time-consuming and may still reduce regression stability [58,137].

Additionally, some ML algorithms particularly decision tree models such as Random Forest can natively handle missing data without requiring explicit imputation or deletion. These models utilize internal mechanisms such as “surrogate splits”, which are alternative decision paths based on correlated features that are used in the case of a missing value. This preserves model consistency and reduces errors resulting from data deletion. Although this functionality is rarely used in AD modeling, it provides an interesting alternative to pairwise deletion, which can lead to inconsistent or biased statistical results [138,139].

Beyond handling missing data, preprocessing also involves data normalization and standardization, particularly important when combining measurements from different sources [114,140]. Common approaches include Z-score normalization, which rescales values based on deviation from the mean:

z = \frac{(X - μ)}{σ},

(1)

where X is the observed measurement, μ the dataset means, and σ the standard deviation. Z-score normalization is fast, reduces the influence of outliers, and facilitates comparison across variables, though it requires datasets to be first cleaned of missing values [134,141,142]. Another widely used method is min–max normalization, which linearly transforms data into the range 0–1:

v = \frac{v^{'} - m i n}{m a x - m i n} (1 - 0) + 0

(2)

where v′ is the raw value, and min and max are the minimum and maximum values of the dataset. This method is intuitive and transparent, and while it reduces the effect of outliers, it is sensitive to extreme values [130,142].

Once preprocessed, datasets are typically divided into training, validation, and test subsets. The training set is used to optimize model parameters, the validation set provides feedback during optimization to avoid overfitting, and the test set offers an unbiased assessment of generalization performance. Metrics such as RMSE, coefficient of determination (R²), or classification accuracy are commonly used to benchmark predictive performance [143].

Feature selection is equally important, as it identifies the most relevant subset of variables, thereby improving prediction accuracy, reducing noise, and simplifying computations. In AD, commonly selected features include pH, temperature, COD, oxidation–reduction potential (ORP), dissolved oxygen (DO), total solids (TS), ash content, volatile solids (VS), and TAN [131,132]. Moreover, process indicators in AD such as pH, COD, TAN, VFAs, or temperature are often highlighted as strong predictors of methane yield and system stability [114,136,140]. Feature selection reduces noise and the so-called “black box” problem, limits overfitting, and highlights variables that most affect model predictions [143,144]

The “black box” issue arises when models such as neural networks or random forests produce accurate predictions but lack transparent internal mechanisms, limiting reproducibility and user trust. To address this, feature importance evaluation methods (e.g., SHAP, LIME, and partial dependence plots (PDPs)) can be applied, enabling insights into how individual variables contribute to model output [122,144].

During model development, two algorithmic pitfalls must be addressed: overfitting, where models adapt too closely to the training data, and underfitting, where they fail to capture relevant patterns even in training. Strategies such as cross-validation, regularization, and ensemble learning are employed to mitigate these risks [142,145]. Neural networks and deep learning architectures, for example, can model highly nonlinear interactions, but they require careful tuning of network depth, neuron numbers, and stopping criteria to avoid overfitting [134,146].

In the context of AD, overfitting is a particularly significant issue when models are trained on data collected under narrowly defined conditions—for instance, using a limited number of substrates, reactor types, or data from a single facility operating under stable conditions. In such cases, the model learns only a restricted range of patterns and loses accuracy when applied to data outside the training domain. This is especially important because feedstock composition, temperature, and microbial community structure can vary significantly between biogas plants, leading to extrapolation errors. To mitigate these issues, robust validation strategies and advanced approaches such as reinforcement learning can be employed to improve model generalization and adaptability [13].

In deployment, trained models are integrated with operational platforms such as soft sensors, digital twins, or supervisory control systems. In this phase, continuous monitoring and recalibration are critical, since the statistical properties of incoming data often differ from those of training datasets due to feedstock variability, seasonal changes, or sensor degradation [135]. Deployment strategies must also balance accuracy with practicality: while deep learning models can achieve high precision, simpler methods such as RF or SVMs may be more suitable for real-time or on-site use because of lower computational requirements and higher interpretability [13,58,122]. Dynamic feed scheduling frameworks have also been developed, integrating predictive control and optimization to adjust feeding patterns in response to changing market and process conditions, leading to improved operational flexibility and economic returns [147].

Current ML workflows in AD still lack standardized pipelines for data preprocessing, feature selection, and validation across different systems. Developing harmonized procedures would improve reproducibility and enable fair comparison between algorithms used for AD monitoring and optimization.

5. Practical Considerations and Research Gaps

5.1. Sensor Limitations and Maintenance Issues

Monitoring AD processes depends on sensor accuracy, but the devices currently in use have multiple operational weaknesses. Electrochemical sensors, such as pH electrodes, are widely used but require frequent calibration and surface cleaning to maintain accuracy. Their long-term stability depends on implementing dedicated cleaning strategies, including hydraulic, mechanical, chemical, or ultrasonic methods, that extend the maintenance-free lifespan of electrodes. Integrating fully automated cleaning and recalibration interfaces can further reduce operational effort while improving data reliability [50,148,149]. Optical sensors degrade more slowly under fouling or surface contamination and usually require less maintenance. However, electrochemical configurations can be enhanced with advanced functionalities, such as automated impedance checks of membranes and glass electrodes, enabling earlier fault detection and minimizing downtime [50,150].

Normalization procedures are also critical in sensor calibration. For example, IR and colorimetric sensor arrays often employ normalization to predict component concentrations in previously unmeasured samples. A multidimensional VFA detection platform using a 23-dye matrix has been tested; it shortens sample interaction time and lowers operational costs [151].

Optical sensors are increasingly used in AI-based monitoring, as their signals can be integrated with automated data processing. Their uptake reflects limitations of electrochemical devices and the need for more stable input to soft-sensor models.

5.2. Data Scarcity and the Need for Public Datasets

Data availability is a major limitation for applying AI in AD, since obtaining sufficient high-quality samples is difficult. Collecting sufficient high-quality samples is particularly difficult, as invasive sampling can disturb fermentation processes. Online sensors partially mitigate this issue, but training robust soft-sensor models still requires large, diverse datasets. Generative adversarial networks (GANs) have been tested to artificially expand training datasets and compensate for limited sampling [103].

ML methods depend on large, reliable datasets to describe nonlinear relations between process variables and system behavior. However, the high costs and complexity of experimental work often result in sparse and noisy datasets. This limitation reduces model generalizability and increases retraining demands [152]. Open experimental and computational datasets can improve reproducibility and make it possible to compare results across studies [153].

Open datasets also introduce certain challenges. Studies show that neural network performance can be negatively affected by data preprocessing steps such as inconsistent formatting, scaling, or normalization procedures applied across different sources. Methods such as dropout regularization, sample normalization, transfer learning, and pretraining are used to improve model performance when data are limited [152]. Selecting the most effective strategy remains difficult due to the lack of systematic comparisons between available methods [154]. A recent study combined explainable ML with detailed microbial and chemical fingerprinting, identifying taxa such as Oscillibacter and Clostridium sensu stricto as potential biomarkers of reactor stability [155].

AI also holds potential beyond monitoring by supporting experiment planning and design. In highly dynamic and nonlinear AD processes, predictive models can inform both maintenance scheduling and process optimization. Achieving this, however, requires significantly larger datasets coupled with improved sensor technology, capable of capturing correlations between key variables. Larger datasets combined with improved sensors are needed to capture correlations between key variables and to develop more reliable predictive models [7].

Process stability depends on better data acquisition and improved sensors. Combined with statistical and AI-based methods, AD operations can shift toward predictive, resilient, and resource-efficient process management.

6. Conclusions and Future Perspectives

AI is gaining importance as a tool for advancing AD. Recent developments in ML show that data-driven models can forecast methane production, detect early signs of process disturbances, and support operational decisions in ways that exceed the capabilities of conventional approaches [7,65,122]. When combined with emerging sensor technologies and soft-sensor concepts, AI has the potential to make AD more predictive, resilient, and resource-efficient, in line with the principles of the circular economy and the ongoing energy transition.

Despite this promise, industrial application of AI in AD faces several hurdles. The main obstacles to large-scale implementation are the scarcity of high-quality, standardized datasets, the sensitivity of ML algorithms to hyperparameter selection, and the limited interpretability of most models. Harmonized protocols for data acquisition, formatting, and sharing would improve reproducibility and allow benchmarking across laboratories and scales [13,58]. Addressing hyperparameter sensitivity will require automated and adaptive optimization strategies that reduce reliance on expert knowledge [58,112]. Equally important, explainable AI approaches such as feature importance analyses and interpretable ensemble models could help overcome the “black-box” barrier to industrial and regulatory acceptance [58,118].

To move in this direction, several areas of research require attention. One concerns the hardware layer: more robust sensors and the integration of heterogeneous data streams are essential to establish a reliable foundation for AI-based monitoring. Another priority is algorithmic: hybrid gray-box models that merge mechanistic understanding with the flexibility of ML could help reconcile accuracy with interpretability [17,18]. At the system level, digital twins and autonomous control architectures offer a route toward self-optimizing, full-scale biogas plants. However, the practical implementation of Industrial-scale Digital Twins still faces challenges related to real-time data synchronization, computational scalability, and model validation under dynamic operating conditions. Establishing standardized interfaces between sensor networks, control systems, and simulation platforms will be essential for their large-scale deployment [156,157,158]. Another emerging trend is the use of Reinforcement Learning (RL) for developing adaptive and autonomous control strategies in AD. RL algorithms can continuously learn from plant feedback to optimize operational parameters such as feedstock loading or temperature, enabling truly self-optimizing systems [159,160]. Preliminary studies have shown that RL can enhance process stability while reducing the need for manual supervision [161]. In parallel, Federated Learning (FL) has emerged as a promising paradigm to address data scarcity and privacy concerns. By enabling collaborative model training across multiple facilities without sharing raw data, FL can help build more generalized and transferable AI models while preserving data confidentiality [162,163]. Advancing these developments will depend on close collaboration between biotechnology, computational science, and process engineering, as well as stronger partnerships between academia and industry.

AI should not be seen as a universal solution, yet it can play a decisive role in optimizing AD. Advancement in three areas, data standardization, model transparency, and sensor integration, will determine how quickly AI applications move from proof-of-concept studies to routine industrial practice. Meeting these requirements would make it possible to achieve more stable, efficient, and scalable biomethane production, directly supporting the goals of REPowerEU and the broader energy transition.

Author Contributions

Writing—original draft, M.M., I.T. and S.G.; Visualization, M.M., I.T. and S.G.; Supervision, M.M. and P.J.; Writing—review and editing, M.M., I.T., S.G. and P.J. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by National Science Centre Poland Miniatura 9 project number 2025/09/X/ST8/00630: “Investigation of the microbiological efficiency of biogas-to-biomethane conversion”. The APC was funded by Statutory Funds of the Faculty of Electronics, Telecommunications and Informatics Gdańsk University of Technology.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

AD	Anaerobic Digestion
AI	Artificial Intelligence
ML	Machine Learning
ANN	Artificial Neural Network
SVM	Support Vector Machine
RF	Random Forest
KNN	k-Nearest Neighbors
GP	Genetic Programming
OLR	Organic Loading Rate
HRT	Hydraulic Retention Time
VFAs	Volatile Fatty Acids
TAN	Total Ammonia Nitrogen
COD	Chemical Oxygen Demand
TS	Total Solids
VS	Volatile Solids
ORP	Oxidation–Reduction Potential
DO	Dissolved Oxygen
ADM	Anaerobic Digestion Model
PID	Proportional–Integral–Derivative Controller
MESe	Microbial Electrochemical Sensor
MPS	Microbial Potentiometric Sensor
MFC	Microbial Fuel Cell
NDIR	Non-Dispersive Infrared
IR	Infrared Spectroscopy
UV	Ultraviolet Spectroscopy
SPR	Surface Plasmon Resonance
TIRF	Total Internal Reflection Fluorescence
PLSR	Partial Least Squares Regression
MLR	Multiple Linear Regression
GAN	Generative Adversarial Network
XGBoost	Extreme Gradient Boosting
RMSE	Root Mean Square Error
R²	Coefficient of Determination
VOCs	Volatile Organic Compounds

References

European Commission REPowerEU Plan. Available online: https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=COM%3A2022%3A230%3AFIN&qid=1653033742483 (accessed on 28 August 2025).
European Commission. Directorate-General for Energy. Biomethane. Available online: https://energy.ec.europa.eu/topics/renewable-energy/bioenergy/biomethane_en (accessed on 28 August 2025).
Gas for Climate. European Biogas Association & Guidehouse. In Manual for National Biomethane Strategies; Gas for Climate: Utrecht, The Netherlands, 2022. [Google Scholar]
Nurherdiana, S.D.; Nugraha, R.E.; Iqbal, R.M.; Widyanto, A.R.; Nomura, M.; Jalil, M.J.; Fansuri, H. Biogas Upgrading to Biomethane. In Green Energy and Technology, Hydrogen and Low-Carbon Fuels in Circular Bio-Economy; Springer: Cham, Switzerland, 2025; pp. 163–186. ISBN 978-3-031-92894-9. [Google Scholar]
Andrade Cruz, I.; Chuenchart, W.; Long, F.; Surendra, K.C.; Renata Santos Andrade, L.; Bilal, M.; Liu, H.; Tavares Figueiredo, R.; Khanal, S.K.; Fernando Romanholo Ferreira, L. Application of Machine Learning in Anaerobic Digestion: Perspectives and Challenges. Bioresour. Technol. 2022, 345, 126433. [Google Scholar] [CrossRef]
Subbarao, P.M.V.; D’ Silva, T.C.; Adlak, K.; Kumar, S.; Chandra, R.; Vijay, V.K. Anaerobic Digestion as a Sustainable Technology for Efficiently Utilizing Biomass in the Context of Carbon Neutrality and Circular Economy. Environ. Res. 2023, 234, 116286. [Google Scholar] [CrossRef]
Cinar, S.; Cinar, S.O.; Wieczorek, N.; Sohoo, I.; Kuchta, K. Integration of Artificial Intelligence into Biogas Plant Operation. Processes 2021, 9, 85. [Google Scholar] [CrossRef]
Harirchi, S.; Wainaina, S.; Sar, T.; Nojoumi, S.A.; Parchami, M.; Parchami, M.; Varjani, S.; Khanal, S.K.; Wong, J.; Awasthi, M.K.; et al. Microbiological Insights into Anaerobic Digestion for Biogas, Hydrogen or Volatile Fatty Acids (VFAs): A Review. Bioengineered 2022, 13, 6521. [Google Scholar] [CrossRef] [PubMed]
Donoso-Bravo, A.; Mailier, J.; Martin, C.; Rodríguez, J.; Aceves-Lara, C.A.; Wouwer, A. Vande Model Selection, Identification and Validation in Anaerobic Digestion: A Review. Water Res. 2011, 45, 5347–5364. [Google Scholar] [CrossRef] [PubMed]
Seo, K.W.; Seo, J.; Kim, K.; Ji Lim, S.; Chung, J. Prediction of Biogas Production Rate from Dry Anaerobic Digestion of Food Waste: Process-Based Approach vs. Recurrent Neural Network Black-Box Model. Bioresour. Technol. 2021, 341, 125829. [Google Scholar] [CrossRef]
Jeong, K.; Abbas, A.; Shin, J.; Son, M.; Kim, Y.M.; Cho, K.H. Prediction of Biogas Production in Anaerobic Co-Digestion of Organic Wastes Using Deep Learning Models. Water Res. 2021, 205, 117697. [Google Scholar] [CrossRef]
Zaghloul, M.S.; Hamza, R.A.; Iorhemen, O.T.; Tay, J.H. Comparison of Adaptive Neuro-Fuzzy Inference Systems (ANFIS) and Support Vector Regression (SVR) for Data-Driven Modelling of Aerobic Granular Sludge Reactors. J. Environ. Chem. Eng. 2020, 8, 103742. [Google Scholar] [CrossRef]
Rutland, H.; You, J.; Liu, H.; Bull, L.; Reynolds, D. A Systematic Review of Machine-Learning Solutions in Anaerobic Digestion. Bioengineering 2023, 10, 1410. [Google Scholar] [CrossRef]
Ling, J.Y.X.; Chan, Y.J.; Chen, J.W.; Chong, D.J.S.; Tan, A.L.L.; Arumugasamy, S.K.; Lau, P.L. Machine Learning Methods for the Modelling and Optimisation of Biogas Production from Anaerobic Digestion: A Review. Environ. Sci. Pollut. Res. 2024, 31, 19085–19104. [Google Scholar] [CrossRef]
Batstone, D.J.; Hülsen, T.; Mehta, C.M.; Keller, J. Platforms for Energy and Nutrient Recovery from Domestic Wastewater: A Review. Chemosphere 2015, 140, 2–11. [Google Scholar] [CrossRef]
Batstone, D.J.; Keller, J.; Angelidaki, I.; Kalyuzhnyi, S.V.; Pavlostathis, S.G.; Rozzi, A.; Sanders, W.T.; Siegrist, H.; Vavilin, V.A. The IWA Anaerobic Digestion Model No 1 (ADM1). Water Sci. Technol. 2002, 45, 65–73. [Google Scholar] [CrossRef] [PubMed]
Mahmoodi-Eshkaftaki, M.; Ebrahimi, R. Integrated Deep Learning Neural Network and Desirability Analysis in Biogas Plants: A Powerful Tool to Optimize Biogas Purification. Energy 2021, 231, 121073. [Google Scholar] [CrossRef]
Barik, D.; Murugan, S. An Artificial Neural Network and Genetic Algorithm Optimized Model for Biogas Production from Co-Digestion of Seed Cake of Karanja and Cattle Dung. Waste Biomass Valorization 2015, 6, 1015–1027. [Google Scholar] [CrossRef]
Kegl, T.; Torres Jiménez, E.; Kegl, B.; Kovač Kralj, A.; Kegl, M. Modeling and Optimization of Anaerobic Digestion Technology: Current Status and Future Outlook. Prog. Energy Combust. Sci. 2025, 106, 101199. [Google Scholar] [CrossRef]
Lauwers, J.; Appels, L.; Thompson, I.P.; Degrève, J.; Van Impe, J.F.; Dewil, R. Mathematical Modelling of Anaerobic Digestion of Biomass and Waste: Power and Limitations. Prog. Energy Combust. Sci. 2013, 39, 383–402. [Google Scholar] [CrossRef]
Emebu, S.; Pecha, J.; Janáčová, D. Review on Anaerobic Digestion Models: Model Classification & Elaboration of Process Phenomena. Renew. Sustain. Energy Rev. 2022, 160, 112288. [Google Scholar] [CrossRef]
Ramachandran, A.; Rustum, R.; Adeloye, A.J. Review of Anaerobic Digestion Modeling and Optimization Using Nature-Inspired Techniques. Processes 2019, 7, 953. [Google Scholar] [CrossRef]
Siddique, N.; Adeli, H. Nature Inspired Computing: An Overview and Some Future Directions. Cogn. Comput. 2015, 7, 706–714. [Google Scholar] [CrossRef]
Kunatsa, T.; Xia, X. A Review on Anaerobic Digestion with Focus on the Role of Biomass Co-Digestion, Modelling and Optimisation on Biogas Production and Enhancement. Bioresour. Technol. 2022, 344, 126311. [Google Scholar] [CrossRef]
Jiang, C.; Qi, R.; Hao, L.; McIlroy, S.J.; Nielsen, P.H. Monitoring Foaming Potential in Anaerobic Digesters. Waste Manag. 2018, 75, 280–288. [Google Scholar] [CrossRef]
Tiso, T.; Demling, P.; Karmainski, T.; Oraby, A.; Eiken, J.; Liu, L.; Bongartz, P.; Wessling, M.; Desmond, P.; Schmitz, S.; et al. Foam Control in Biotechnological Processes—Challenges and Opportunities. Discov. Chem. Eng. 2024, 4, 2. [Google Scholar] [CrossRef]
Kegl, T.; Kravanja, G.; Knez, Ž.; Knez Hrnčič, M. Effect of Addition of Supercritical CO₂ on Transfer and Thermodynamic Properties of Biodegradable Polymers PEG 600 and Brij52. J. Supercrit. Fluids 2017, 122, 10–17. [Google Scholar] [CrossRef]
Daly, S.E.; Ni, J.Q. Machine Learning Prediction of Foaming in Anaerobic Co-Digestion from Six Key Process Parameters. Fermentation 2024, 10, 639. [Google Scholar] [CrossRef]
Stams, A.J.M.; Plugge, C.M. Electron Transfer in Syntrophic Communities of Anaerobic Bacteria and Archaea. Nat. Rev. Microbiol. 2009, 7, 568–577. [Google Scholar] [CrossRef]
Venkiteshwaran, K.; Milferstedt, K.; Hamelin, J.; Zitomer, D.H. Anaerobic Digester Bioaugmentation Influences Quasi Steady State Performance and Microbial Community. Water Res. 2016, 104, 128–136. [Google Scholar] [CrossRef]
Bailón, L. Report: Biogas and Bio-Syngas Upgrading; Danish Technological Institute: Taastrup, Denmark, 2012. [Google Scholar]
Xu, F.; Li, Y.; Ge, X.; Yang, L.; Li, Y. Anaerobic Digestion of Food Waste—Challenges and Opportunities. Bioresour. Technol. 2018, 247, 1047–1058. [Google Scholar] [CrossRef]
Appels, L.; Baeyens, J.; Degrève, J.; Dewil, R. Principles and Potential of the Anaerobic Digestion of Waste-Activated Sludge. Prog. Energy Combust. Sci. 2008, 34, 755–781. [Google Scholar] [CrossRef]
Cucina, M. Integrating Anaerobic Digestion and Composting to Boost Energy and Material Recovery from Organic Wastes in the Circular Economy Framework in Europe: A Review. Bioresour. Technol. Rep. 2023, 24, 101642. [Google Scholar] [CrossRef]
Kegl, T. Consideration of Biological and Inorganic Additives in Upgraded Anaerobic Digestion BioModel. Bioresour. Technol. 2022, 355, 127252. [Google Scholar] [CrossRef]
Kegl, T.; Kovač Kralj, A. An Enhanced Anaerobic Digestion BioModel Calibrated by Parameters Optimization Based on Measured Biogas Plant Data. Fuel 2022, 312, 122984. [Google Scholar] [CrossRef]
Khashaba, N.H.; Ettouney, R.S.; Abdelaal, M.M.; Ashour, F.H.; El-Rifai, M.A. Artificial Neural Network Modeling of Biochar Enhanced Anaerobic Sewage Sludge Digestion. J. Environ. Chem. Eng. 2022, 10, 107988. [Google Scholar] [CrossRef]
Zou, J.; Lü, F.; Chen, L.; Zhang, H.; He, P. Machine Learning for Enhancing Prediction of Biogas Production and Building a VFA/ALK Soft Sensor in Full-Scale Dry Anaerobic Digestion of Kitchen Food Waste. J. Environ. Manag. 2024, 371, 123190. [Google Scholar] [CrossRef]
Fajobi, M.O.; Lasode, O.A.; Adeleke, A.A.; Ikubanni, P.P.; Balogun, A.O.; Paramasivam, P. Prediction of Biogas Yield from Codigestion of Lignocellulosic Biomass Using Adaptive Neuro-Fuzzy Inference System (ANFIS) Model. J. Eng. 2023, 2023, 9335814. [Google Scholar] [CrossRef]
Yang, Y.; Zheng, S.; Ai, Z.; Jafari, M.M.M. On the Prediction of Biogas Production from Vegetables, Fruits, and Food Wastes by ANFIS- and LSSVM-Based Models. BioMed Res. Int. 2021, 2021, 9202127. [Google Scholar] [CrossRef] [PubMed]
Ge, Y.; Tao, J.; Wang, Z.; Chen, C.; Mu, L.; Ruan, H.; Rodríguez Yon, Y.; Su, H.; Yan, B.; Chen, G. Modification of Anaerobic Digestion Model No.1 with Machine Learning Models towards Applicable and Accurate Simulation of Biomass Anaerobic Digestion. Chem. Eng. J. 2023, 454, 140369. [Google Scholar] [CrossRef]
Westerholm, M.; Moestedt, J.; Schnürer, A. Biogas Production through Syntrophic Acetate Oxidation and Deliberate Operating Strategies for Improved Digester Performance. Appl. Energy 2016, 179, 124–135. [Google Scholar] [CrossRef]
Ziganshin, A.M.; Schmidt, T.; Scholwin, F.; Il’Inskaya, O.N.; Harms, H.; Kleinsteuber, S. Bacteria and Archaea Involved in Anaerobic Digestion of Distillers Grains with Solubles. Appl. Microbiol. Biotechnol. 2011, 89, 2039–2052. [Google Scholar] [CrossRef]
Labatut, R.A.; Angenent, L.T.; Scott, N.R. Conventional Mesophilic vs. Thermophilic Anaerobic Digestion: A Trade-off between Performance and Stability? Water Res. 2014, 53, 249–258. [Google Scholar] [CrossRef]
Chen, Y.; Cheng, J.J.; Creamer, K.S. Inhibition of Anaerobic Digestion Process: A Review. Bioresour. Technol. 2008, 99, 4044–4064. [Google Scholar] [CrossRef]
Björnsson, L.; Murto, M.; Jantsch, T.G.; Mattiasson, B. Evaluation of New Methods for the Monitoring of Alkalinity, Dissolved Hydrogen and the Microbial Community in Anaerobic Digestion. Water Res. 2001, 35, 2833–2840. [Google Scholar] [CrossRef]
Rabii, A.; Aldin, S.; Dahman, Y.; Elbeshbishy, E. A Review on Anaerobic Co-Digestion with a Focus on the Microbial Populations and the Effect of Multi-Stage Digester Configuration. Energies 2019, 12, 1106. [Google Scholar] [CrossRef]
Angelidaki, I.; Ellegaard, L. Codigestion of Manure and Organic Wastes in Centralized Biogas Plants: Status and Future Trends. Appl. Biochem. Biotechnol. Part A Enzym. Eng. Biotechnol. 2003, 109, 95–105. [Google Scholar] [CrossRef]
Wu, D.; Li, L.; Zhao, X.; Peng, Y.; Yang, P.; Peng, X. Anaerobic Digestion: A Review on Process Monitoring. Renew. Sustain. Energy Rev. 2019, 103, 1–12. [Google Scholar] [CrossRef]
Jimenez, J.; Latrille, E.; Harmand, J.; Robles, A.; Ferrer, J.; Gaida, D.; Wolf, C.; Mairet, F.; Bernard, O.; Alcaraz-Gonzalez, V.; et al. Instrumentation and Control of Anaerobic Digestion Processes: A Review and Some Research Challenges. Rev. Environ. Sci. Bio/Technol. 2015, 14, 615–648. [Google Scholar] [CrossRef]
Holubar, P.; Zani, L.; Hager, M.; Fröschl, W.; Radak, Z.; Braun, R. Advanced Controlling of Anaerobic Digestion by Means of Hierarchical Neural Networks. Water Res. 2002, 36, 2582–2588. [Google Scholar] [CrossRef] [PubMed]
Singh, A.; Kumar, V. Recent Developments in Monitoring Technology for Anaerobic Digesters: A Focus on Bio-Electrochemical Systems. Bioresour. Technol. 2021, 329, 124937. [Google Scholar] [CrossRef]
Jin, X.; Angelidaki, I.; Zhang, Y. Microbial Electrochemical Monitoring of Volatile Fatty Acids during Anaerobic Digestion. Environ. Sci. Technol. 2016, 50, 4422–4429. [Google Scholar] [CrossRef]
Hu, Y.; Cheng, H.; Ji, J.; Li, Y.Y. A Review of Anaerobic Membrane Bioreactors for Municipal Wastewater Treatment with a Focus on Multicomponent Biogas and Membrane Fouling Control. Environ. Sci. 2020, 6, 2641–2663. [Google Scholar] [CrossRef]
Sethunga, G.S.M.D.P.; Karahan, H.E.; Wang, R.; Bae, T.H. Wetting- and Fouling-Resistant Hollow Fiber Membranes for Dissolved Methane Recovery from Anaerobic Wastewater Treatment Effluents. J. Membr. Sci. 2021, 617, 118621. [Google Scholar] [CrossRef]
Wu, H.; Fang, K.; Shi, C.; Wang, K. Anti-Fouling Performance and Methane Potential in Coagulation-Adsorption Assisted Biogas-Spared Anaerobic Membrane Preconcentration Process. J. Clean. Prod. 2023, 414, 137606. [Google Scholar] [CrossRef]
Witkowska, E.; Buczkowska, A.; Zamojska, A.; Szewczyk, K.W.; Ciosek, P. Monitoring of Periodic Anaerobic Digestion with Flow-through Array of Miniaturized Ion-Selective Electrodes. Bioelectrochemistry 2010, 80, 87–93. [Google Scholar] [CrossRef]
Gupta, R.; Zhang, L.; Hou, J.; Zhang, Z.; Liu, H.; You, S.; Sik Ok, Y.; Li, W. Review of Explainable Machine Learning for Anaerobic Digestion. Bioresour. Technol. 2023, 369, 128468. [Google Scholar] [CrossRef]
Radočaj, D.; Jurišić, M. Comparative Evaluation of Ensemble Machine Learning Models for Methane Production from Anaerobic Digestion. Fermentation 2025, 11, 130. [Google Scholar] [CrossRef]
Kovačić, Đ.; Radočaj, D.; Jurišić, M. Ensemble Machine Learning Prediction of Anaerobic Co-Digestion of Manure and Thermally Pretreated Harvest Residues. Bioresour. Technol. 2024, 402, 130793. [Google Scholar] [CrossRef] [PubMed]
Borase, R.P.; Maghade, D.K.; Sondkar, S.Y.; Pawar, S.N. A Review of PID Control, Tuning Methods and Applications. Int. J. Dyn. Control 2021, 9, 818–827. [Google Scholar] [CrossRef]
George, T.; Ganesan, V. Optimal Tuning of PID Controller in Time Delay System: A Review on Various Optimization Techniques. Chem. Product. Process Model. 2022, 17, 1–28. [Google Scholar] [CrossRef]
Astals, S.; Nolla-Ardèvol, V.; Mata-Alvarez, J. Anaerobic Co-Digestion of Pig Manure and Crude Glycerol at Mesophilic Conditions: Biogas and Digestate. Bioresour. Technol. 2012, 110, 63–70. [Google Scholar] [CrossRef]
Bernard, O.; Hadj-Sadok, Z.; Dochain, D.; Genovesi, A.; Steyer, J.P. Dynamical Model Development and Parameter Identification for an Anaerobic Wastewater Treatment Process. Biotechnol. Bioeng. 2001, 75, 424–438. [Google Scholar] [CrossRef]
Long, F.; Wang, L.; Cai, W.; Lesnik, K.; Liu, H. Predicting the Performance of Anaerobic Digestion Using Machine Learning Algorithms and Genomic Data. Water Res. 2021, 199, 117182. [Google Scholar] [CrossRef]
Zhang, X.; Li, Y.; Guo, M. Hybrid Control Framework for the Anaerobic Digestion Process. Control Eng. Pract. 2025, 164, 106523. [Google Scholar] [CrossRef]
Attaran, M.; Celik, B.G. Digital Twin: Benefits, Use Cases, Challenges, and Opportunities. Decis. Anal. J. 2023, 6, 100165. [Google Scholar] [CrossRef]
Soori, M.; Arezoo, B.; Dastres, R. Digital Twin for Smart Manufacturing, A Review. Sustain. Manuf. Serv. Econ. 2023, 2, 100017. [Google Scholar] [CrossRef]
Abdallah, M.; Petriu, E.; Kennedy, K.; Narbaitz, R.; Warith, M. Intelligent Control of Bioreactor Landfills. In Proceedings of the IEEE International Conference on Computational Intelligence for Measurement Systems and Applications, Ottawa, ON, Canada, 19–21 September 2011; pp. 17–22. [Google Scholar] [CrossRef]
Scampini, A.C.; Belcher, A. Upflow Anaerobic Sludge Blanket Reactors for Treatment of Wastewater from the Brewery Industry. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2010. [Google Scholar]
Thomas, O.; Théraulaz, F.; Cerdà, V.; Constant, D.; Quevauviller, P. Wastewater Quality Monitoring. Trends Anal. Chem. 2009, 16, 419–424. [Google Scholar] [CrossRef]
Bourgeois, W.; Burgess, J.E.; Stuetz, R.M. On-Line Monitoring of Wastewater Quality: A Review. J. Chem. Technol. Biotechnol. 2001, 76, 337–348. [Google Scholar] [CrossRef]
Sun, D.; Wang, A.; Cheng, S.; Yates, M.; Logan, B.E. Geobacter anodireducens sp. Nov., an Exoelectrogenic Microbe in Bioelectrochemical Systems. Int. J. Syst. Evol. Microbiol. 2014, 64, 3485–3491. [Google Scholar] [CrossRef] [PubMed]
Kretzschmar, J.; Böhme, P.; Liebetrau, J.; Mertig, M.; Harnisch, F. Microbial Electrochemical Sensors for Anaerobic Digestion Process Control—Performance of Electroactive Biofilms under Real Conditions. Chem. Eng. Technol. 2018, 41, 687–695. [Google Scholar] [CrossRef]
Jiang, Y.; Chu, N.; Zeng, R.J. Submersible Probe Type Microbial Electrochemical Sensor for Volatile Fatty Acids Monitoring in the Anaerobic Digestion Process. J. Clean. Prod. 2019, 232, 1371–1378. [Google Scholar] [CrossRef]
Sun, H.; Angelidaki, I.; Wu, S.; Dong, R.; Zhang, Y. The Potential of Bioelectrochemical Sensor for Monitoring of Acetate during Anaerobic Digestion: Focusing on Novel Reactor Design. Front. Microbiol. 2019, 10, 420500. [Google Scholar] [CrossRef]
Sepehri, A.; Sarrafzadeh, M.H. Effect of Nitrifiers Community on Fouling Mitigation and Nitrification Efficiency in a Membrane Bioreactor. Chem. Eng. Process. Process Intensif. 2018, 128, 10–18. [Google Scholar] [CrossRef]
Burge, S.R.; Hristovski, K.D.; Burge, R.G.; Hoffman, D.A.; Saboe, D.; Chao, P.F.; Taylor, E.; Koenigsberg, S.S. Microbial Potentiometric Sensor: A New Approach to Longstanding Challenges. Sci. Total Environ. 2020, 742, 140528. [Google Scholar] [CrossRef] [PubMed]
Rice, D.; Westerhoff, P.; Perreault, F.; Garcia-Segura, S. Electrochemical Self-Cleaning Anodic Surfaces for Biofouling Control during Water Treatment. Electrochem. Commun. 2018, 96, 83–87. [Google Scholar] [CrossRef]
Jin, X.; Li, X.; Zhao, N.; Angelidaki, I.; Zhang, Y. Bio-Electrolytic Sensor for Rapid Monitoring of Volatile Fatty Acids in Anaerobic Digestion Process. Water Res. 2017, 111, 74–80. [Google Scholar] [CrossRef] [PubMed]
Hyde, K.; Peak, D.; Schebel, A.; Siciliano, S.D.; Burge, S.; Bradshaw, K. Using Passive Anode Cathode Technology to Assess Microbial Happiness and Boost Benzene Biodegradation Rates; University of Saskatchewan: Saskatchewan, SK, Canada, 2019. [Google Scholar]
Mansouri, J.; Harrisson, S.; Chen, V. Strategies for Controlling Biofouling in Membrane Filtration Systems: Challenges and Opportunities. J. Mater. Chem. 2010, 20, 4567–4586. [Google Scholar] [CrossRef]
Adam, G.; Lemaigre, S.; Goux, X.; Delfosse, P.; Romain, A.C. Upscaling of an Electronic Nose for Completely Stirred Tank Reactor Stability Monitoring from Pilot-Scale to Real-Scale Agricultural Co-Digestion Biogas Plant. Bioresour. Technol. 2015, 178, 285–296. [Google Scholar] [CrossRef]
Arends, J.B. The next Step towards Usable Microbial Bioelectrochemical Sensors? Microb. Biotechnol. 2017, 11, 20. [Google Scholar] [CrossRef]
Sweeney, J.B.; Murphy, C.D.; McDonnell, K. Development of a Bacterial Propionate-Biosensor for Anaerobic Digestion Monitoring. Enzym. Microb. Technol. 2018, 109, 51–57. [Google Scholar] [CrossRef]
Xiao, L.; Wang, Y.; Lichtfouse, E.; Li, Z.; Kumar, P.S.; Liu, J.; Feng, D.; Yang, Q.; Liu, F. Effect of Antibiotics on the Microbial Efficiency of Anaerobic Digestion of Wastewater: A Review. Front. Microbiol. 2021, 11, 611613. [Google Scholar] [CrossRef]
Liu, Z.; Liu, J.; Li, B.; Zhang, Y.; Xing, X.H. Focusing on the Process Diagnosis of Anaerobic Fermentation by a Novel Sensor System Combining Microbial Fuel Cell, Gas Flow Meter and PH Meter. Int. J. Hydrogen Energy 2014, 39, 13658–13664. [Google Scholar] [CrossRef]
Uçar, A.; González-Fernández, E.; Staderini, M.; Murray, A.F.; Mount, A.R.; Bradley, M. PH-Activated Dissolvable Polymeric Coatings to Reduce Biofouling on Electrochemical Sensors. J. Funct. Biomater. 2023, 14, 329. [Google Scholar] [CrossRef]
Osbild, D.; Vasseur, P. Microbiological Sensors for the Monitoring of Water Quality. In Monitoring of Water Quality; Elsevier Science BV: Amsterdam, The Netherlands, 1998; pp. 37–48. [Google Scholar]
Bierer, B.; Kress, P.; Nägele, H.J.; Lemmer, A.; Palzer, S. Investigating Flexible Feeding Effects on the Biogas Quality in Full-scale Anaerobic Digestion by High Resolution, Photoacoustic-based NDIR Sensing. Eng. Life Sci. 2019, 19, 700. [Google Scholar] [CrossRef]
De Vrieze, J.; De Waele, M.; Boeckx, P.; Boon, N. Isotope Fractionation in Biogas Allows Direct Microbial Community Stability Monitoring in Anaerobic Digestion. Environ. Sci. Technol. 2018, 52, 6704–6713. [Google Scholar] [CrossRef]
Scully, P. Optical Techniques for Water Monitoring. In Monitoring of Water Quality; Elsevier Science BV: Amsterdam, The Netherlands, 1998; pp. 15–35. [Google Scholar] [CrossRef]
Lamb, J.J.; Bernard, O.; Sarker, S.; Lien, K.M.; Hjelme, D.R. Perspectives of Surface Plasmon Resonance Sensors for Optimized Biogas Methanation. Eng. Life Sci. 2019, 19, 759. [Google Scholar] [CrossRef] [PubMed]
Dickert, F.L.; Schreiner, S.K.; Mages, G.R.; Kimmel, H. A Fiber-Optic Dipping Sensor for Organic Solvents in Wastewater. Anal. Chem. 1989, 61, 2306–2309. [Google Scholar] [CrossRef]
Robinson, G. The Commercial Development of Planar Optical Biosensors. Sens. Actuators B Chem. 1995, 29, 31–36. [Google Scholar] [CrossRef]
Chen, J.L.; Ortiz, R.; Xiao, Y.; Steele, T.W.J.; Stuckey, D.C. Rapid Fluorescence-Based Measurement of Toxicity in Anaerobic Digestion. Water Res. 2015, 75, 123–130. [Google Scholar] [CrossRef]
Brandenburg, A.; Gombert, A. Grating Couplers as Chemical Sensors: A New Optical Configuration. Sens. Actuators B Chem. 1993, 17, 35–40. [Google Scholar] [CrossRef]
Chen, Y.; Ming, H. Review of Surface Plasmon Resonance and Localized Surface Plasmon Resonance Sensor. Photonic Sens. 2012, 2, 37–49. [Google Scholar] [CrossRef]
Raju, C.S.; Løkke, M.M.; Sutaryo, S.; Ward, A.J.; Møller, H.B. NIR Monitoring of Ammonia in Anaerobic Digesters Using a Diffuse Reflectance Probe. Sensors 2012, 12, 2340–2350. [Google Scholar] [CrossRef]
Yu, J.; Xiao, K.; Xu, H.; Qi, T.; Li, Y.; Tan, J.; Wen, X.; Huang, X. Spectroscopic Sensing of Membrane Fouling Potential in a Long-Term Running Anaerobic Membrane Bioreactor. Chem. Eng. J. 2021, 426, 130799. [Google Scholar] [CrossRef]
Brookman, S.K.E. Estimation of Biochemical Oxygen Demand in Slurry and Effluents Using Ultra-Violet Spectrophotometry. Water Res. 1997, 31, 372–374. [Google Scholar] [CrossRef]
Bourgeois, W.; Hogben, P.; Pike, A.; Stuetz, R.M. Development of a Sensor Array Based Measurement System for Continuous Monitoring of Water and Wastewater. Sens. Actuators B Chem. 2003, 88, 312–319. [Google Scholar] [CrossRef]
Yan, P.; Gai, M.; Wang, Y.; Gao, X. Review of Soft Sensors in Anaerobic Digestion Process. Processes 2021, 9, 1434. [Google Scholar] [CrossRef]
Feitkenhauer, H.; Meyer, U. Software Sensors Based on Titrimetric Techniques for the Monitoring and Control of Aerobic and Anaerobic Bioreactors. Biochem. Eng. J. 2004, 17, 147–151. [Google Scholar] [CrossRef]
Steyer, J.P.; Bouvier, J.C.; Conte, T.; Gras, P.; Harmand, J.; Delgenes, J.P. On-Line Measurements of COD, TOC, VFA, Total and Partial Alkalinity in Anaerobic Digestion Processes Using Infra-Red Spectrometry. Water Sci. Technol. 2002, 45, 133–138. [Google Scholar] [CrossRef]
Zhu, X.; Rehman, K.U.; Wang, B.; Shahzad, M. Modern Soft-Sensing Modeling Methods for Fermentation Processes. Sensors 2020, 20, 1771. [Google Scholar] [CrossRef]
Moral, H.; Aksoy, A.; Gokcay, C.F. Modeling of the Activated Sludge Process by Using Artificial Neural Networks with Automated Architecture Screening. Comput. Chem. Eng. 2008, 32, 2471–2478. [Google Scholar] [CrossRef]
Deipser, A.; Körner, I. Simulation Software for Biological Processes in Waste Management (SimuCF); Universitätsbibliothek der Technischen Universität Hamburg: Hamburg, Germany, 2020. [Google Scholar]
Eberly, L.E. Multiple Linear Regression. In Topics in Biostatistics. Methods in Molecular Biology; Humana Press: Totowa, NJ, USA, 2007; Volume 404. [Google Scholar] [CrossRef]
Güçlü, D.; Yilmaz, N.; Ozkan-Yucel, U.G. Application of Neural Network Prediction Model to Full-Scale Anaerobic Sludge Digestion. J. Chem. Technol. Biotechnol. 2011, 86, 691–698. [Google Scholar] [CrossRef]
Abu Qdais, H.; Bani Hani, K.; Shatnawi, N. Modeling and Optimization of Biogas Production from a Waste Digester Using Artificial Neural Network and Genetic Algorithm. Resour. Conserv. Recycl. 2010, 54, 359–363. [Google Scholar] [CrossRef]
Huang, M.; Han, W.; Wan, J.; Ma, Y.; Chen, X. Multi-Objective Optimisation for Design and Operation of Anaerobic Digestion Using GA-ANN and NSGA-II. J. Chem. Technol. Biotechnol. 2016, 91, 226–233. [Google Scholar] [CrossRef]
Wang, L.; Long, F.; Liao, W.; Liu, H. Prediction of Anaerobic Digestion Performance and Identification of Critical Operational Parameters Using Machine Learning Algorithms. Bioresour. Technol. 2020, 298, 122495. [Google Scholar] [CrossRef] [PubMed]
Cinar, S.Ö.; Cinar, S.; Kuchta, K. Machine Learning Algorithms for Temperature Management in the Anaerobic Digestion Process. Fermentation 2022, 8, 65. [Google Scholar] [CrossRef]
Cai, W.; Lesnik, K.L.; Wade, M.J.; Heidrich, E.S.; Wang, Y.; Liu, H. Incorporating Microbial Community Data with Machine Learning Techniques to Predict Feed Substrates in Microbial Fuel Cells. Biosens. Bioelectron. 2019, 133, 64–71. [Google Scholar] [CrossRef] [PubMed]
Kazemi, P.; Steyer, J.P.; Bengoa, C.; Font, J.; Giralt, J. Robust Data-Driven Soft Sensors for Online Monitoring of Volatile Fatty Acids in Anaerobic Digestion Processes. Processes 2020, 8, 67. [Google Scholar] [CrossRef]
Wager, S.; Athey, S. Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests. J. Am. Stat. Assoc. 2018, 113, 1228–1242. [Google Scholar] [CrossRef]
Kazemi, P.; Bengoa, C.; Steyer, J.P.; Giralt, J. Data-Driven Techniques for Fault Detection in Anaerobic Digestion Process. Process Saf. Environ. Prot. 2021, 146, 905–915. [Google Scholar] [CrossRef]
Alejo, L.; Atkinson, J.; Guzmán-Fierro, V.; Roeckel, M. Effluent Composition Prediction of a Two-Stage Anaerobic Digestion Process: Machine Learning and Stoichiometry Techniques. Environ. Sci. Pollut. Res. 2018, 25, 21149–21163. [Google Scholar] [CrossRef]
Flores-Asis, R.; Méndez-Contreras, J.M.; Juárez-Martínez, U.; Alvarado-Lassman, A.; Villanueva-Vásquez, D.; Aguilar-Lasserre, A.A.; Aguilar, A.A.; M Endez-Contreras, J.M.; Ju Arez-Mart Inez, U.; Villanueva-V Asquez, D. Use of Artificial Neuronal Networks for Prediction of the Control Parameters in the Process of Anaerobic Digestion with Thermal Pretreatment. J. Environ. Sci. Health Part A 2018, 53, 883–890. [Google Scholar] [CrossRef]
Abdullah, S.S.; Malek, M.A.; Abdullah, N.S.; Kisi, O.; Yap, K.S. Extreme Learning Machines: A New Approach for Prediction of Reference Evapotranspiration. J. Hydrol. 2015, 527, 184–195. [Google Scholar] [CrossRef]
De Clercq, D.; Wen, Z.; Fei, F.; Caicedo, L.; Yuan, K.; Shang, R. Interpretable Machine Learning for Predicting Biomethane Production in Industrial-Scale Anaerobic Co-Digestion. Sci. Total Environ. 2020, 712, 134574. [Google Scholar] [CrossRef]
Xu, W.; Long, F.; Zhao, H.; Zhang, Y.; Liang, D.; Wang, L.; Lesnik, K.L.; Cao, H.; Zhang, Y.; Liu, H. Performance Prediction of ZVI-Based Anaerobic Digestion Reactor Using Machine Learning Algorithms. Waste Manag. 2021, 121, 59–66. [Google Scholar] [CrossRef] [PubMed]
Dekhici, B.; Short, M. Data-Driven Modelling of Biogas Production Using Multi-Task Gaussian Processes. Syst. Control Trans. 2025, 4, 26–32. [Google Scholar] [CrossRef]
Ahmad, A.; Yadav, A.K.; Singh, A.; Singh, D.K. A Comprehensive Machine Learning-Coupled Response Surface Methodology Approach for Predictive Modeling and Optimization of Biogas Potential in Anaerobic Co-Digestion of Organic Waste. Biomass Bioenergy 2024, 180, 106995. [Google Scholar] [CrossRef]
Murali, R.; Bywater, A.; Dolat, M.; Dekhici, B.; Zarei, M.; Hilton, L.; Sadhukhan, J.; Zhang, D.; Short, M. Anaerobic Digestion Site-Wide Optimisation and Decision-Making: An Industrial Perspective and Review. Renew. Sustain. Energy Rev. 2026, 226, 116402. [Google Scholar] [CrossRef]
Ahmad, A.; Yadav, A.K.; Singh, A.; Singh, D.K. Optimisation of Biogas Yield from Anaerobic Co-Digestion of Dual Waste for Environmental Sustainability: ANN, RSM and GA Approach. Int. J. Oil Gas Coal Technol. 2023, 33, 75–101. [Google Scholar] [CrossRef]
Onu, C.E.; Nweke, C.N.; Nwabanne, J.T. Modeling of Thermo-Chemical Pretreatment of Yam Peel Substrate for Biogas Energy Production: RSM, ANN, and ANFIS Comparative Approach. Appl. Surf. Sci. Adv. 2022, 11, 100299. [Google Scholar] [CrossRef]
Chong, D.J.S.; Chan, Y.J.; Arumugasamy, S.K.; Yazdi, S.K.; Lim, J.W. Optimisation and Performance Evaluation of Response Surface Methodology (RSM), Artificial Neural Network (ANN) and Adaptive Neuro-Fuzzy Inference System (ANFIS) in the Prediction of Biogas Production from Palm Oil Mill Effluent (POME). Energy 2023, 266, 126449. [Google Scholar] [CrossRef]
Emmanuel, T.; Maupong, T.; Mpoeleng, D.; Semong, T.; Mphago, B.; Tabona, O. A Survey on Missing Data in Machine Learning. J. Big Data 2021, 8, 140. [Google Scholar] [CrossRef]
Ren, L.; Wang, T.; Sekhari Seklouli, A.; Zhang, H.; Bouras, A. A Review on Missing Values for Main Challenges and Methods. Inf. Syst. 2023, 119, 102268. [Google Scholar] [CrossRef]
Jadhav, A.; Pramod, D.; Ramanathan, K. Comparison of Performance of Data Imputation Methods for Numeric Dataset. Appl. Artif. Intell. 2019, 33, 913–933. [Google Scholar] [CrossRef]
Schonlau, M.; Zou, R.Y. The Random Forest Algorithm for Statistical Learning. Stata J. 2020, 20, 3–29. [Google Scholar] [CrossRef]
Chen, J.W.; Chan, Y.J.; Arumugasamy, S.K.; Yazdi, S.K. Process Modelling and Optimisation of Methane Yield from Palm Oil Mill Effluent Using Response Surface Methodology and Artificial Neural Network. J. Water Process Eng. 2023, 52, 103493. [Google Scholar] [CrossRef]
Niu, C.; Zhang, Z.; Cai, T.; Pan, Y.; Lu, X.; Zhen, G. Sludge Bound-EPS Solubilization Enhance CH4 Bioconversion and Membrane Fouling Mitigation in Electrochemical Anaerobic Membrane Bioreactor: Insights from Continuous Operation and Interpretable Machine Learning Algorithms. Water Res. 2024, 264, 122243. [Google Scholar] [CrossRef] [PubMed]
Antwi, P.; Li, J.; Boadi, P.O.; Meng, J.; Shi, E.; Deng, K.; Bondinuba, F.K. Estimation of Biogas and Methane Yields in an UASB Treating Potato Starch Processing Wastewater with Backpropagation Artificial Neural Network. Bioresour. Technol. 2017, 228, 106–115. [Google Scholar] [CrossRef] [PubMed]
Min, S.; Lee, B.; Yoon, S. Deep Learning in Bioinformatics. Brief. Bioinform. 2017, 18, 851–869. [Google Scholar] [CrossRef]
Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Chapman and Hall/CRC: New York, NY, USA, 2017; pp. 1–358. [Google Scholar] [CrossRef]
Tang, F.; Ishwaran, H. Random Forest Missing Data Algorithms. Stat. Anal. Data Min. 2017, 10, 363. [Google Scholar] [CrossRef]
Pattanayak, A.S.; Pattnaik, B.S.; Udgata, S.K.; Panda, A.K. Development of Chemical Oxygen on Demand (COD) Soft Sensor Using Edge Intelligence. IEEE Sens. J. 2020, 20, 14892–14902. [Google Scholar] [CrossRef]
Zhai, S.; Chen, K.; Yang, L.; Li, Z.; Yu, T.; Chen, L.; Zhu, H. Applying Machine Learning to Anaerobic Fermentation of Waste Sludge Using Two Targeted Modeling Strategies. Sci. Total Environ. 2024, 916, 170232. [Google Scholar] [CrossRef]
Domingos, P. A Few Useful Things to Know about Machine Learning. Commun. ACM 2012, 55, 78–87. [Google Scholar] [CrossRef]
Cheon, A.; Sung, J.; Jun, H.; Jang, H.; Kim, M.; Park, J. Application of Various Machine Learning Models for Process Stability of Bio-Electrochemical Anaerobic Digestion. Processes 2022, 10, 158. [Google Scholar] [CrossRef]
Ma, H.; Liu, Y.; Zhao, J.; Fei, F.; Gao, M.; Wang, Q. Explainable Machine Learning-Driven Predictive Performance and Process Parameter Optimization for Caproic Acid Production. Bioresour. Technol. 2024, 410, 131311. [Google Scholar] [CrossRef]
James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning: With Applications in R; Springer: New York, NY, USA, 2013; Volume 7, ISBN 978-1-4614-7137-0. [Google Scholar]
Suzuki, K. Artificial Neural Networks: Methodological Advances and Biomedical Applications; BoD–Books on Demand; InTech: Rijeka, Croatia, 2011. [Google Scholar]
Dolat, M.; Murali, R.; Zarei, M.; Zhang, R.; Pincam, T.; Liu, Y.Q.; Sadhukhan, J.; Bywater, A.; Short, M. Dynamic Feed Scheduling for Optimised Anaerobic Digestion: An Optimisation Approach for Better Decision-Making to Enhance Revenue and Environmental Benefits. Digit. Chem. Eng. 2024, 13, 100191. [Google Scholar] [CrossRef]
Barhoum, A.; Hamimed, S.; Slimi, H.; Othmani, A.; Abdel-Haleem, F.M.; Bechelany, M. Modern Designs of Electrochemical Sensor Platforms for Environmental Analyses: Principles, Nanofabrication Opportunities, and Challenges. Trends Environ. Anal. Chem. 2023, 38, e00199. [Google Scholar] [CrossRef]
Li, S.; Zhou, B.; Shang, J.; Chen, X.; Yu, J. Review of Recent Applications and Future Perspectives on Process Monitoring Approaches in Industrial Processes. J. Manuf. Syst. 2025, 82, 509–530. [Google Scholar] [CrossRef]
Goumas, G.; Vlachothanasi, E.N.; Fradelos, E.C.; Mouliou, D.S. Biosensors, Artificial Intelligence Biosensors, False Results and Novel Future Perspectives. Diagnostics 2025, 15, 1037. [Google Scholar] [CrossRef]
Lamb, J.J.; Bernard, O.; Sarker, S.; Lien, K.M.; Hjelme, D.R. Perspectives of Optical Colourimetric Sensors for Anaerobic Digestion. Renew. Sustain. Energy Rev. 2019, 111, 87–96. [Google Scholar] [CrossRef]
Andriyanov, N.A.; Andriyanov, D.A. The Using of Data Augmentation in Machine Learning in Image Processing Tasks in the Face of Data Scarcity. J. Phys. Conf. Ser. 2020, 1661, 012018. [Google Scholar] [CrossRef]
Nandy, A.; Duan, C.; Kulik, H.J. Audacity of Huge: Overcoming Challenges of Data Scarcity and Data Quality for Machine Learning in Computational Materials Discovery. Curr. Opin. Chem. Eng. 2022, 36, 100778. [Google Scholar] [CrossRef]
Moradi, R.; Berangi, R.; Minaei, B. A Survey of Regularization Strategies for Deep Models. Artif. Intell. Rev. 2020, 53, 3947–3986. [Google Scholar] [CrossRef]
Piercy, E.; Sun, X.; Ellis, P.R.; Taylor, M.; Guo, M. Temporal Dynamics of Microbial Communities in Anaerobic Digestion: Influence of Temperature and Feedstock Composition on Reactor Performance and Stability. Water Res. 2025, 284, 123974. [Google Scholar] [CrossRef] [PubMed]
Vedurmudi, A.P.; Miličević, K.; Kok, G.; Yong, B.X.; Xu, L.; Zheng, G.; Brintrup, A.; Gruber, M.; Tabandeh, S.; Zaidan, M.A.; et al. Automation in Sensor Network Metrology: An Overview of Methods and Their Implementations. Meas. Sens. 2025, 38, 101799. [Google Scholar] [CrossRef]
Cho, Y.; Noh, S. Do Design and Implementation of Digital Twin Factory Synchronized in Real-Time Using MQTT. Machines 2024, 12, 759. [Google Scholar] [CrossRef]
Attaran, S.; Attaran, M.; Celik, B.G. Digital Twins and Industrial Internet of Things: Uncovering Operational Intelligence in Industry 4.0. Decis. Anal. J. 2024, 10, 100398. [Google Scholar] [CrossRef]
Aribisala, A.A.; Ghori, U.A.S.; Cavalcante, C.A.V. The Application of Reinforcement Learning to Pumps—A Systematic Literature Review. Machines 2025, 13, 480. [Google Scholar] [CrossRef]
Devarakonda, V.S.; Sun, W.; Tang, X.; Tian, Y. Recent Advances in Reinforcement Learning for Chemical Process Control. Processes 2025, 13, 1791. [Google Scholar] [CrossRef]
Yu, P.; Zhang, H.; Song, Y.; Wang, Z.; Dong, H.; Ji, L. Safe Reinforcement Learning for Power System Control: A Review. Renew. Sustain. Energy Rev. 2025, 223, 116022. [Google Scholar] [CrossRef]
Mohammadi, S.; Balador, A.; Sinaei, S.; Flammini, F. Balancing Privacy and Performance in Federated Learning: A Systematic Literature Review on Methods and Metrics. J. Parallel Distrib. Comput. 2024, 192, 104918. [Google Scholar] [CrossRef]
Zhan, S.; Huang, L.; Luo, G.; Zheng, S.; Gao, Z.; Chao, H.C. A Review on Federated Learning Architectures for Privacy-Preserving AI: Lightweight and Secure Cloud–Edge–End Collaboration. Electronics 2025, 14, 2512. [Google Scholar] [CrossRef]

Figure 1. Schematic representation of soft sensor modeling strategies applied to AD processes [103,106,107,108,109,110,111,112].

Figure 2. Schematic representation of ML algorithms applied to AD, illustrating major methodological categories and their interrelations [110,114,116,117,118,119,120,121,122,123].

Figure 3. Schematic workflow of ML model development and deployment for AD processes.

Table 1. Comparison of sensor types for AD monitoring.

Sensor Type	Main Parameters Measured	Advantages	Limitations	Potential for AI Integration	References
Electrochemical (pH, ORP, EC)	pH, redox potential, conductivity, VFAs	Low cost, continuous monitoring, established technology	Biofouling, calibration drift, interference from inhibitors	Moderate—requires preprocessing	[52,70]
Microbial Electrochemical (MESe)	VFAs, acetate, redox changes	High sensitivity, direct microbial interaction	Sensitive to ammonium, heavy metals, biofilm variability	High—nonlinear signals suitable for ML	[74,75]
Microbial Potentiometric (MPS)	Redox potential, organic load indicators	Real-time monitoring, low maintenance due to biofilm regeneration	Signal instability, internal resistance, variability in biofilms	High—dynamic signals useful for AI	[78,81]
Optical (IR, NDIR, fluorescence)	Gas composition (CH₄, CO₂, H₂), isotope fractionation, microbial activity	Non-invasive, rapid response, minimal sample preparation	Calibration instability, interference from bubbles/fouling	Very high—direct integration with ML models	[90,91,100]
Spectroscopic (SPR, TIRF)	Heavy metals, refractive index changes	High precision, compact sensors, wide application potential	Complex setup, sensitive to fouling and environmental noise	High—suitable for feature extraction	[97,98]
Hybrid and Integrated Systems	Multiparameter (pH, gas flow, acetate, VFAs)	Complementary measurements, improved accuracy	Longevity, complex maintenance, storage/nutrient conditions	Very high—multiparameter datasets support AI	[76,87,88]

Table 2. Representative AI and ML models suited for different sensor data types in AD monitoring.

Sensor Type	Typical Data Characteristics	Most Suitable AI/ML Models	Advantages of Integration	References
Electrochemical/Potentiometric	Continuous time-series signals (pH, ORP, conductivity, VFAs)	LSTM, RNN, GRU ¹	Capture temporal dependencies; effective for drift detection and anomaly prediction	[13]
Optical/Spectroscopic	High-dimensional spectral or image-like data (IR, NDIR, fluorescence)	CNN, Autoencoders	Extract spatial/spectral features; robust to signal noise and wavelength variability	[103]
Hybrid/Multi-sensor systems	Heterogeneous, multimodal datasets (electrochemical + optical + microbial)	Random Forest, XGBoost, Hybrid ANN–GA ²	Handle mixed data types; enable feature selection and model interpretability	[125,127]
Microbial/Bioelectrochemical	Nonlinear, low-frequency signals with stochastic variability	ANFIS ³, SVM, Hybrid ANN	Manage nonlinearities; suitable for pattern recognition in noisy environments	[128,129]

¹ GRU—Gated Recurrent Unit. ² Hybrid ANN–GA—Hybrid Artificial Neural Network combined with Genetic Algorithm. ³ ANFIS—Adaptive Neuro-Fuzzy Inference System.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Artificial Intelligence in Anaerobic Digestion: A Review of Sensors, Modeling Approaches, and Optimization Strategies

Highlights

Abstract

1. Introduction

2. Anaerobic Digestion and the Need for Intelligent Monitoring

2.1. Basics of Anaerobic Fermentation Processes

2.2. Challenges in Monitoring and Control

3. Sensors in Anaerobic Reactors: State of the Art

3.1. Electrochemical Sensors

3.2. Optical and Spectroscopic Sensors

3.3. Sensor Integration and Data Acquisition

4. AI for Anaerobic Fermentation

4.1. From Data to Insight: ML Paradigm

4.2. Training–Validation–Deployment: Practical Workflow

5. Practical Considerations and Research Gaps

5.1. Sensor Limitations and Maintenance Issues

5.2. Data Scarcity and the Need for Public Datasets

6. Conclusions and Future Perspectives

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

Article Metrics

Citations

Article Access Statistics