Next Article in Journal
Experiment and Simulation on the Influence of Fire Radiation on the Evaporation of Liquefied Natural Gas
Previous Article in Journal
Explanatory Model of the Material Removal Mechanisms and Grinding Wheel Wear During Grinding of PCD with Water-Based Cooling Lubricants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dynamic Restoration of Collapsed Anammox Biofilm Systems: Integrating Process Optimization, Microbial Community Succession, and Machine Learning-Based Prediction

School of Environment and Energy, South China University of Technology, Guangzhou 510006, China
*
Author to whom correspondence should be addressed.
Processes 2025, 13(6), 1672; https://doi.org/10.3390/pr13061672
Submission received: 5 May 2025 / Revised: 17 May 2025 / Accepted: 20 May 2025 / Published: 26 May 2025
(This article belongs to the Special Issue Applications of Microorganisms in Wastewater Treatment Processes)

Abstract

:
The majority of extant studies concentrate on the reactivation of dormant Anammox biomass or the recovery of activity under specific storage conditions. Research on rehabilitation strategies for anaerobic ammonium oxidation (Anammox) systems is limited, with the exception of research on inhibitory factors. The recovery characteristics of biofilm systems after collapse induced by varying degrees of ammonia-nitrogen and small-molecular organic compound composite shocks have not been thoroughly elucidated. This study addresses the collapse of Anammox biofilm systems caused by sodium acetate inhibition through multi-phase rehabilitation strategies, stoichiometric analysis, and microbial community succession dynamics. Two regression algorithms—Support Vector Regression (SVR) and eXtreme Gradient Boosting (XGBoost)—were employed to construct predictive models for Total Nitrogen Removal Efficiency (TNRE) and Total Nitrogen Removal Rate (TNRR) in the CANON system, with model performance evaluated via coefficient of determination (R2) and root mean square error (RMSE). Results demonstrated that after terminating moderate-to-high sodium acetate dosing (300 mg/L and 500 mg/L), reactors R300 and R500 achieved TNRE recovery to 57.98% and 58.86%, respectively, and TNRR of 0.281 and 0.275 kgN/m3·d within 60–100 days, indicating the reversibility of high-concentration sodium acetate inhibition but a positive correlation between recovery duration and inhibition intensity. Microbial community analysis revealed that Planctomycetota (including Candidatus_Kuenenia) rebounded to 46–49% relative abundance in R100, synchronized with TNRE improvement. In contrast, R300 and R500 exhibited ecological niche replacement of denitrifiers (Denitratisoma) and partial TNRE restoration despite enhanced performance. Model comparisons showed SVR outperformed XGBoost in TNRE prediction, whereas XGBoost demonstrated superior TNRR prediction accuracy with R2 approaching 1 and RMSE nearing 0, significantly surpassing SVR. This work provides critical insights into recovery mechanisms under organic inhibition stress and establishes a robust predictive framework for optimizing nitrogen removal performance in CANON systems.

1. Introduction

In recent years, the anaerobic ammonium oxidation (Anammox) technology based on autotrophic nitrogen removal principles has garnered widespread attention due to its revolutionary energy-saving and consumption-reducing characteristics. This technology enables direct conversion of ammonium and nitrite within a single-stage reactor through the synergistic action of ammonia-oxidizing bacteria (AOB) and anaerobic ammonia-oxidizing bacteria (AnAOB). Compared to traditional processes, it reduces aeration demand by 60% and eliminates the need for external carbon source addition [1], positioning it as a key solution for addressing high energy consumption and carbon emissions in high-ammonia wastewater treatment. However, the ubiquitous presence of organic matter in real wastewater poses a critical challenge to system stability, hindering its widespread application. Studies indicate that organic compounds trigger substrate competition between heterotrophic denitrifiers and autotrophic microbial consortia, leading to suppressed AnAOB activity or even functional collapse. In high-ammonia wastewater treatment scenarios, continuous input of small-molecule organics such as sodium acetate may disrupt microbial community equilibrium, causing fluctuations in nitrogen removal efficiency or even system failure [2].
The completely autotrophic nitrogen removal over nitrite (CANON) process, as a core representative of third-generation biological nitrogen removal technologies, traces its development to the early 21st century [3]. In engineering applications of CANON, the destabilizing effects of excessive small-molecule organics remain a central challenge. Current research predominantly focuses on microbial dormancy or activity recovery under specific storage conditions, while studies on post-collapse recovery strategies following inhibitory shocks remain limited [4]. The functional recovery mechanisms of collapsed systems, serving as emergency safeguards in engineering practices, still exhibit significant theoretical and operational knowledge gaps [5]. Existing investigations have addressed recovery from salinity, strong alkali, and metal ion inhibition in Anammox systems, yet strategies for system recovery after small-molecule organic-induced collapse are scarcely explored [6]. Peng et al. demonstrated that acetic acid (critical concentration: 270 mg/L) induces more severe Anammox system collapse compared to propionic acid. Through phased regulation of influent carbon-to-nitrogen ratio and hydraulic retention time (HRT) under anaerobic operational conditions (280 mg/L NH4+-N and 370 mg/L NO2-N), nitrogen removal efficiency (TNRE > 80%) could be restored, with Chloroflexi becoming the dominant phylum (relative abundance: 57%) while Planctomycetes failed to fully recover [5]. Nevertheless, the recovery characteristics of CANON biofilm systems following composite shocks from varying degrees of high ammonia and small-molecule organics remain poorly elucidated.
Analysis of existing recovery strategy studies reveals that dynamic regulation of process parameters demonstrates significant advantages in cost-effectiveness and operational efficiency. Although exogenous microbial inoculation or chemical additives (e.g., hydrazine, hydroxylamine, ferrous ions) can accelerate recovery processes, their economic costs and secondary pollution risks limit large-scale applications [7]. In contrast, optimizing core parameters such as substrate ratio, nitrogen loading rate, and HRT enables activity recovery while maintaining system homeostasis. Notably, precise identification of inhibition sources and multi-strategy integration are critical for enhancing recovery efficiency. For instance, systems subjected to small-molecule organic stress require combined in situ recovery (terminating organic input) and progressive load regulation to re-establish functional microbial dominance in phased stages.
Machine learning, as one of the fastest-growing interdisciplinary fields, serves as a low-cost computational method for processing extensive nonlinear data and establishing mathematical models. It has been widely applied across domains including manufacturing, education, healthcare, physical sciences, and wastewater treatment data modeling [8]. Wang et al. compared multiple machine learning algorithms to predict anaerobic digestion performance (e.g., methane production, biogas yield) and identify critical operational parameters, testing prediction accuracy across regression and classification models [9]. Yang et al. developed an Extreme Gradient Boosting (XGBoost) model to predict Total Nitrogen Removal Efficiency (TNRE) in Anammox systems treating iron oxide red wastewater, achieving 98% prediction accuracy in practical applications [10]. When addressing complex nonlinear relationships among variables and microbial population dynamics in biological nitrogen removal systems, machine learning can analyze and predict key variables through mathematical modeling, providing actionable insights for system commissioning and stable operation.
In this study, four parallel CANON biofilm reactors (R0, R100, R300, R500) were established under high-ammonia influent conditions (1000 ± 50 mg N/L). Diverging from traditional gradual COD dosing strategies, this configuration was specifically designed to simulate industrial transient organic shock scenarios and systematically reveal the recovery characteristics of CANON systems following acute acetate inhibition. Sodium acetate solutions with concentrations of 0, 100, 300, and 500 mg/L were continuously dosed into respective reactors. Using R0 and R100 as controls, this work investigates the recovery of Anammox biofilm systems after collapse induced by sustained sodium acetate stress at medium-high concentrations (300 and 500 mg/L). A recovery protocol was designed to gradually increase the Total Nitrogen Removal Rate (TNRR) of collapsed systems. High-throughput sequencing was employed to elucidate microbial community succession patterns during recovery, revealing restoration mechanisms of biofilm systems post inhibition at different sodium acetate concentrations. These findings provide theoretical foundations and strategic references for rapid recovery of analogous processes. Furthermore, two machine learning models (Support Vector Regression (SVR) and XGBoost) were developed to compare their predictive performance differences for nitrogen removal metrics (TNRE, TNRR) during the recovery phase of Anammox biofilm systems.

2. Materials and Methods

2.1. Experimental Setup and Inoculated Sludge

Four reactors (70 mm internal diameter × 700 mm height) with individual effective volumes of 2.5 L were employed. Each reactor was externally wrapped with heating jackets connected to temperature controllers to maintain the internal liquid phase at 30 ± 1 °C. Feedwater was delivered via peristaltic pumps (BT100L, Leici, Shanghai, China) through bottom inlets, while effluent exited from top outlets. A reflux port was installed below the effluent outlet of each reactor, enabling partial liquid recirculation from the opposite bottom side using mid-level peristaltic pumps (positioned at 1/2 reactor height). Sodium acetate solutions at varying concentrations were injected through bottom ports after mixing with recirculated streams. All reactors contained bottom-mounted aeration diffusers connected to individual flowmeters for precise airflow regulation (0.5–3.0 L/min). The biofilm sludge used in this experiment was inoculated from a large-scale partial nitritation-anammox (PN/A) plant treating high-ammonia wastewater (500 kgN/day) in Guangdong Province, China. This facility has been stably operated for nearly 4 years [11].
After inoculation, reactor start-up was initiated using synthetic wastewater configured for CANON operation. The synthetic wastewater consisted of NH4Cl, NaHCO3, KH2PO4 0.05 g/L, CaCl2 0.3 g/L, MgSO4·7H2O 0.3 g/L, FeSO4·7H2O 0.00625 g/L, and 1.25 mL/L of trace elements solution. Phosphorus served as a critical nutrient to support microbial metabolism. The trace element solution, formulated based on prior experimental studies and production applications, is comprehensively documented in Table 1, including its detailed composition and concentration parameters [10,11]. Sodium bicarbonate was used to adjust the pH value of influent water to 7.9 ± 0.2.

2.2. Experimental Design and Operational Protocol

The four parallel reactors were operated through five consecutive phases. Phase I (Days 1–55) represented the start-up period. Phase II (Days 56–92) involved stable reactor operation. During Phase III (Days 93–108), sodium acetate solutions with concentrations of 0, 100, 300, and 500 mg/L were continuously dosed into reactors R0, R100, R300, and R500, respectively. Sodium acetate addition was terminated on Day 109. Considering the economic constraints of practical applications [11], Phase IV and V (the exact time depends on different reactors) adopted in situ recovery strategies through operational parameter optimization. In Phase IV, all reactors were switched to an anaerobic low-load operation mode, with influent NH4+-N and NO2-N concentrations set at 100 mg/L to evaluate intrinsic Anammox activity. Reactors R0 and R100 underwent nitrogen loading rate (NLR) escalation based on performance, while the R300 and R500 are prioritized to replace the residual high concentrations of NH4+-N and NO2-N and reduce the NLR of the system for recovery. In Phase V, according to the actual performance recovery of each reactor, the influent water of R0, R100, R300, and R500 was successively switched to a lower influent NH4+-N concentration (160–200 mg N/L), and the reactor was operated again according to the CANON mode, and its subsequent recovery was carried out by adjusting the influent NH4+-N concentration, regulating the influent flow rate and aeration rate (50–100 mL/min) for its subsequent recovery.

2.3. Analytical Method

Daily water samples were collected from influent tanks and reactor effluent ports. After filtration through 0.45 μm membranes, samples were either immediately analyzed or preserved at 4 °C. Pollutant concentrations were systematically measured following the Standard Methods for the Examination of Water and Wastewater (2024 edition) published by the American Public Health Association (APHA) [12]. NH4+-N was quantified via Nessler’s reagent spectrophotometry: samples reacted with Nessler’s reagent to form reddish-brown complexes, with absorbance measured at 420 nm. The concentration of NO2-N was determined using the N-(1-naphthyl) ethylenediamine spectrophotometric method. Under acidic conditions, NO2-N reacts to form a purple-red azo dye, which was quantitatively analyzed by measuring its absorbance at 540 nm. NO3-N analysis employed ultraviolet spectrophotometry, utilizing characteristic absorption peaks at 220 nm with 275 nm absorbance correction to eliminate organic interference (UV-5200PC, YuanXi, Shanghai, China). The pH was measured by a pH meter (PHS-3C, Leici, Shanghai, China).

2.4. Calculation Method

The ammonium removal efficiency (ARE), TN removal efficiency (TNRE), NLR (Nitrogen loading rate), and TNRR (TN removal rate) concentration were calculated according to Equations (1)–(4):
A R E ( % ) = N H 4 + N i n f N H 4 + N e f f N H 4 + N i n f × 100 %
T N R E ( % ) = T N i n f T N e f f T N i n f × 100 %
N L R ( k g N / m 3 / d a y ) = T N i n f H R T × 1000 × 24
T N R R ( k g N / m 3 / d a y ) = T N i n f T N e f f H R T × 1000 × 24
where the subscript inf and eff represent the influent and effluent in the reactors in this study.

2.5. Analysis of Microbial Community Structure

Biofilm samples were collected from identical locations across of all reactors using flame-sterilized scalpels and forceps to ensure aseptic conditions then immediately cryopreserved at −20 °C of collection to prevent post-sampling microbial activity shifts. During the recovery phases (Phase IV and V), the renamed designations for R0, R100, R300, and R500 were H0_45, H100_60, H300 (H300_1, H300_2), and H500 (H500_1, H500_2), respectively. All sequencing services were contracted to Shanghai Personal Biotechnology Co., Ltd. (Shanghai, China), with detailed procedures as follows:
Total DNA was extracted from sludge samples using a DNA extraction kit, and the DNA solution was stored at −20 °C. DNA concentration was measured by ultraviolet spectrophotometry at wavelengths of 260 nm and 280 nm. The V4 region of bacterial 16S rRNA genes was amplified via PCR using forward primer 515F (5′-GTGYCAGCMGCCGCGGTAA-3′) and reverse primer 806R (5′-GGACTACNVGGTWTCTAAT-3′). Post-amplification, amplicons were pooled in equimolar concentrations, purified, and subjected to paired-end sequencing (250 bp) on an Illumina NovaSeq platform. Raw sequencing data were stored in FASTQ format. Data processing was performed using QIIME2 (version 2022.11). The DADA2 plugin was employed for quality control, noise filtering, sequence merging, and chimera removal. Feature sequences were integrated into an amplicon sequence variant (ASV) table, with singletons (ASVs with ≤1 read) filtered out. Taxonomic classification of ASVs was conducted using the SILVA database (v138). All sequencing datasets generated in this study are publicly available via the NCBI Sequence Read Archive under BioProject accession number PRJNA1258194.

2.6. Modeling and Development of Machine Learning Models

This study employed two mainstream regression algorithms (SVR and XGBoost) within a machine learning theoretical framework to construct data-driven predictive models. Comparative analysis of model performance was conducted for different nitrogen removal metrics (TNRE, TNRR). Input features were standardized using the StandardScaler module from scikit-learn. The normalized dataset was split into two subsets: 70% for machine learning model training and 30% for final model testing. Model predictive accuracy and quantitative performance were evaluated via coefficient of determination (R2) and root mean square error (RMSE), calculated using Equations (5) and (6), respectively. R2 reflects the goodness-of-fit, indicating the proportion of dependent variable variance explainable by the model, with values ranging from 0 to 1. A value closer to 1 denotes superior fitting performance [13]. RMSE quantifies prediction errors, where lower values indicate smaller model prediction deviations and enhanced performance.
R 2 = 1 i = 1 n ( y t i y p i ) 2 i = 1 n ( y t i y m ) 2
R M S E = i = 1 n ( y t i y p i ) 2 n
where the n is total number of data points in the training or test set; y t i represents the actual value of TNRE or TNRR in the system; y p i represents the predicted value of TNRE or TNRR in the system; y m represents the average of all the actual values of TNRE or TNRR.

3. Results and Discussion

3.1. Recovery Analysis of Nitrogen Removal Performance of System

Under high-ammonia conditions, four parallel Anammox biofilm systems were successfully initiated by gradually increasing influent flow velocity with corresponding aeration rate adjustments, achieving stable operation with influent ammonia of 1000 ± 50 mg N/L. The average Total Nitrogen Removal Rate (TNRR) and Efficiency (TNRE) reached 0.494 kg N/(m3·d) and 82.05%, respectively. Sodium acetate solutions at concentrations of 0, 100, 300, and 500 mg COD/L (prepared in 0.5 L amber bottles with deionized water) were continuously dosed into R0, R100, R300, and R500 via dedicated pumps through bottom influent ports. The corresponding organic loading rates (OLRs) were 0, 0.0144, 0.0432, and 0.0720 kg/(m3·d), while all reactors maintained a nitrogen loading rate (NLR) of 0.6725 kg N/(m3·d).
By Day 15 of sodium acetate dosing, R300 and R500 exhibited severe performance deterioration, with TNRR/TNRE declining to 0.1020 kg N/(m3·d)/15.42% and 0.0719 kg N/(m3·d)/10.87%, respectively. In contrast, R100 showed improved performance: TNRR increased from 0.5103 kg N/(m3·d) (Phase II: Days 72–92) to 0.5995 kg N/(m3·d), and TNRE rose from 81.35% to 90.61%. Due to catastrophic failure in R300 and R500, recovery protocols were uniformly implemented.
During Phase IV and V, NLR, TNRR, NH4+-Niff and NH4+-Neff concentrations are shown in Figure 1, while effluent NO2-N, NO3-N, ammonia removal efficiency (ARE), and TNRE are presented in Figure 2. In Phase IV, R0 and R100 under low NLR (0.231 kg N/(m3·d)) demonstrated robust activity, maintaining NH4+-Neff and NO2-Neff below 20 mg/L. During the initial two days, R0 achieved ARE/TNRE of 96.49%/80.74%, while R100 reached 97.92%/69.62%. Subsequent NLR escalation to 0.495 kg N/(m3·d) via flow velocity adjustment enabled rapid recovery. Both reactors resumed CANON-mode operation after only 7 (R0) and 11 days (R100), validating their recovery capacity (Phase V). Conversely, R300 and R500 exhibited sustained high effluent NO2-N (74.99 and 92.74 mg/L) with minimal NO3-N production (4.07 and 2.43 mg/L), indicating AnAOB functional impairment. After 16 days of anaerobic operation without significant recovery, both reactors were switched back to CANON mode at reduced NLR (0.231 kg N/(m3·d)) to align with practical operational constraints.
Phase V results demonstrated that R0 and R100 achieved average TNRR values of 0.414 and 0.329 kg N/(m3·d) after 40 and 60 days of CANON-mode operation, respectively. Compared to R0, R100 exhibited significant fluctuations in effluent NO3-N concentration, which twice exceeded 100 mg N/L due to transient over-aeration. These deviations were mitigated by enhancing free ammonia (FA) inhibition through alkalinity adjustments. Similar phenomena occurred in R300 and R500, with identical recovery strategies proving effective.
For R300 and R500, TNRR gradually increased to >0.281 and 0.275 kg N/(m3·d) by Day 100 of recovery (Day 220 of total operation; see Figure 1 and Figure 2), accompanied by TNRE recovery to 57.98% and 58.86%, respectively. These findings indicate that collapsed Anammox biofilm systems (R300, R500) can self-recover after sodium acetate cessation. Although high sodium acetate concentrations induced severe AnAOB inhibition, such inhibition was reversible. Post inhibitor removal, AnAOB activity recovered within 60–100 days, with recovery duration positively correlated with initial acetate concentration. This aligns with Yang et al.’s observations under anaerobic conditions [6] and corroborates Tian et al.’s report of 120-day recovery after methanol-induced inhibition [14].

3.2. Reaction Stoichiometric Ratio Analysis in System Recovery Phase

In the macroscopic analysis of anammox system operations, stoichiometric reaction analysis is widely adopted to evaluate the nitrogen removal characteristics of microorganisms within the reactor. This methodology was initially proposed by Strous et al. [15] and subsequently refined across different process configurations [16]. In this study, under the assumption of neglecting microbial assimilation and attributing the generated nitrogen gas (N2) entirely to total nitrogen removal [17], the two characteristic stoichiometric ratios of the anammox process in the CANON system—ΔTN/ΔNH4+-N and ΔNO3-N/ΔNH4+-N—were calculated as 0.88 and 0.11, respectively. The close alignment of these ratios with theoretical values, derived from effluent data analysis, further confirms the dominant contribution of the anammox process to system-wide nitrogen removal [18]. The reaction stoichiometric analysis of the system recovery phase are shown in Figure 3. In R0, the ΔTN/ΔNH4+-N and ΔNO3-N/ΔNH4+-N ratios remained stable throughout recovery, gradually approaching theoretical values in later phases, though ΔNO3-N/ΔNH4+-N ratios persistently fell below theoretical levels.
In R100, R300, and R500, both ratios exhibited marked fluctuations, consistent with observed effluent NO3-N variability. This indicates that sodium acetate exposure predisposed systems to complete nitrification, necessitating stringent aeration control during recovery. For R100 during Days 100–160 of CANON-mode operation, average ΔTN/ΔNH4+-N and ΔNO3-N/ΔNH4+-N ratios were 0.904 and 0.086, respectively, confirming Anammox as the dominant nitrogen removal pathway.
Conversely, R300 and R500 failed to approach theoretical stoichiometric ratios even after 120 days of CANON-mode recovery, suggesting coexistence of complete nitrification-denitrification pathways. Higher sodium acetate exposure (R500 > R300) prolonged recovery duration, emphasizing the need for immediate intervention during high-concentration acetate shocks to minimize time costs.

3.3. Microbial Community Structure Analysis in System Recovery Phase

3.3.1. Microbial Richness and Diversity Analysis

A total of six samples (initial inoculum and time-series samples from four reactors) were analyzed, yielding 676,566 high-quality sequences and 3644 operational taxonomic units (OTUs). Alpha diversity indices (Chao1, Observed_species, Shannon, and Simpson) are summarized in Table 2. Chao1 and Observed_species indices reflect community richness, while Shannon and Simpson indices represent community diversity [19,20].
Throughout the recovery period, all Alpha diversity indices (Chao1, Observed_species, Shannon, Simpson) for H300_1, H300_2, H500_1, and H500_2 gradually increased with recovery time, indicating partial restoration of microbial richness and diversity after inhibition removal. However, these indices remained consistently lower than those of H0_45 and H100_60, demonstrating that high-concentration sodium acetate-induced system collapse caused persistent damage to biofilm community richness and diversity, with incomplete recovery within the experimental timeframe.
Non-metric multidimensional scaling (NMDS) analysis was performed to assess beta-diversity differences among communities during recovery (Figure 4, Stress = 0.0001). H0_45 and H100_60 clustered closely, suggesting minimal community divergence. In contrast, H300_1/H300_2 and H500_1/H500_2 exhibited significant spatial separation, indicating distinct microbial community structures in R300 and R500 at different recovery phases (Day 60 vs. Day 120) [21]. The divergent trajectories of R300/R500 communities (H300_2 and H500_2) further corroborated the delayed TNRE recovery observed between Days 100–120 (57.98–58.86%).

3.3.2. Microbial Community Composition Analysis

A total of 40 phyla were detected across six samples, with 9 phyla exhibiting relative abundances > 1% (Figure 5). In R0 and R100, the dominant phyla were Planctomycetota, Proteobacteria, Bacteroidota, and Chloroflexi. Planctomycetota encompasses all known AnAOB, including Candidatus_Brocadia and Candidatus_Kuenenia [22], serving as core functional taxa in Anammox systems by directly mediating the anaerobic oxidation of ammonium and nitrite. Proteobacteria includes diverse nitrogen-cycling genera such as Nitrosomonas (key ammonia oxidizers) [23] and heterotrophic denitrifiers (Thauera, Limnobacter, Pseudomonas) [24,25].
Planctomycetota relative abundances in H0_45 and H100_60 were comparable (58.34% vs. 56.72%). In contrast, H300_1/H300_2 and H500_1/H500_2 exhibited gradual increases, reaching 46.47% and 46.34% by Day 120, respectively. This recovery trajectory aligned with restored nitrogen removal performance. Conversely, Proteobacteria abundances displayed an inverse trend. Notably, NB1-j proliferated significantly in R300 and R500 during recovery. Previous studies indicate strong co-occurrence relationships between NB1-j and microalgae (e.g., Bacillariophyta, Pyrenomonadales) [26]. All four reactors were uniformly equipped with identical and rigorous light-blocking measures, including wrapping with heating tape and affixing opaque aluminum foil. However, due to differences in their physical placement, reactors R300 and R500 experienced unintended exposure to direct sunlight. We hypothesize that inadequate light-shielding measures during recovery may have triggered algal proliferation, indirectly promoting NB1-j enrichment.
Genus-level analysis of the top 10 taxa (Figure 6) revealed Candidatus_Kuenenia as the sole Anammox functional genus across all samples. Combined with phylum-level data, 16S rDNA sequencing confirmed Candidatus_Kuenenia and Nitrosomonas as the primary drivers of nitrogen removal in CANON biofilm systems. In H300_2 and H500_2, Nitrosomonas abundances recovered to 3.30% and 3.07%, respectively, consistent with late-phase TNRR and TNRE recovery trends. Other denitrifiers (Limnobacter, Denitratisoma) also showed elevated abundances, with Denitratisoma reaching 4.64% (H300_2) and 5.69% (H500_2). These findings demonstrate that collapsed systems can be resuscitated post inhibition through operational parameter adjustments (NH4+-Ninf concentration, flow velocity, aeration rate). However, experimental results confirm incomplete functional recovery, with recovery duration positively correlated with initial sodium acetate exposure intensity.

3.4. Machine Learning Model Prediction for System Performance in Recovery Phase

To analyze and predict nitrogen removal performance dynamics during Anammox biofilm system recovery, this study employed Support Vector Regression (SVR) and XGBoost models using 12 operational features. Prediction performance for TNRE and TNRR is illustrated in Figure 7 and Figure 8, evaluated via coefficient of determination (R2) and root mean square error (RMSE). Comparative plots of predicted versus measured TNRE and TNRR are provided in Figure A1 and Figure A2. For machine learning algorithms, R2 values approaching 1.000 indicate superior model fitting, while lower RMSE values denote enhanced predictive accuracy. For TNRE prediction, SVR achieved R2 values closer to 1.000 and lower Test RMSE than XGBoost, demonstrating superior precision. For TNRR prediction, XGBoost exhibited R2 values approaching 1.000 and lower Test RMSE than SVR, indicating higher accuracy in biofilm system TNRR forecasting. Detailed prediction results for both models are summarized in Table 3.
Analysis of XGBoost feature importance revealed distinct predictors for TNRE and TNRR recovery patterns. For TNRE prediction, the ΔTN/ΔNH4+-N ratio dominated feature contributions across reactors (R100: 88.99%, R300: 93%, R500: 92%), demonstrating exceptional consistency (>88%) in nitrogen substrate stoichiometric ratio as the key recovery driver. Conversely, HRT emerged as the primary predictor for TNRR, particularly in R300 (72% contribution) and R500 (55%), which strongly underscores the pivotal role of operational strategy adjustments in restoring CANON biofilm system performance.
This study demonstrates that model-specific selection (SVR for TNRE vs. XGBoost for TNRR) can replace traditional time-consuming and costly experimental approaches (e.g., long-term continuous-flow reactor trials), providing an efficient predictive framework to assess the impacts of sustained medium-high sodium acetate dosing on Anammox biofilm processes.

4. Conclusions

This study elucidates the recovery characteristics of Anammox biofilm systems under medium-high sodium acetate stress through dynamic regulation strategies and establishes machine learning predictive models. The main conclusions are as follows:
(1) Validation of Recovery Strategy Effectiveness: gradient load regulation enables functional recovery across varying sodium acetate inhibition intensities. For low-concentration groups (R0, R100), TNRE recovered to >80% within 40–60 days via NLR optimization (0.216 → 0.495 kg N/(m3·d)) and aeration rate adjustments (50–200 mL/min). High-concentration groups (R300, R500) achieved TNRE recovery to 57.98–58.86% over 100–120 days through residual NH4+-N and NO2-N replacement and HRT control, with recovery duration positively correlated with acetate concentration.
(2) Nitrogen Removal Pathways and Microbial Synergy: stoichiometric analysis revealed reconstructed Anammox dominance in R100 (ΔTN/ΔNH4+-N = 0.882), approaching CANON theoretical values. Significant ΔNO3-N/ΔNH4+-N ratio fluctuations in R300 and R500 necessitated precise aeration control to suppress nitrification competition. Microbial succession showed Candidatus_Kuenenia abundance recovery to 16.06% (R300) and 15.85% (R500), synchronized with TNRE improvement. Denitratisoma predominance (4.64–5.69%) exceeded R0 and R100 levels.
(3) Predictive Model Development: based on 12 operational parameters (NH4+-N, NO2-N, NO3-N, COD concentrations, HRT, etc.), the XGBoost model demonstrated superior TNRR prediction accuracy, while SVR excelled in TNRE forecasting. This provides a real-time regulation tool for engineering recovery. The findings offer a theory-technology integrated solution for early warning and recovery strategies in high-ammonia-organic compound wastewater treatment processes.

Author Contributions

Conceptualization, L.W.; Methodology, L.W. and J.Y.; Formal analysis, L.W., J.L., Y.Z. and J.Y.; Investigation, L.W., Y.C., J.L., Y.Z. and J.Y.; Writing—Original Draft, L.W.; Writing—Review and Editing, L.W., Y.C., J.L., Y.Z. and X.W.; Supervision, Y.C. and X.W.; Resources, X.W.; Project Administration, X.W.; Funding Acquisition, X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was mainly funded by Key R&D Projects of Guangzhou Science and Technology Plan (2024B03J1278).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CANONCompletely Autotrophic Nitrogen Removal Over Nitrite
AnAOBAnaerobic ammonia-oxidizing Bacteria
AOBAmmonia-oxidizing Bacteria
NOBNitrite Oxidizing Bacteria
DNBDenitrifying Bacteria
TNRETotal Nitrogen Removal Efficiency
TNRRTotal Nitrogen Removal Rate
NLRNitrogen Loading Rate
OLROrganic Loading Rate
CODChemical Oxygen Demand
XGBoosteXtreme Gradient Boosting
SVMSupport Vector Machine
RMSERoot Mean Square Error
FAFree Ammonia
HRTHydraulic Retention Time
PN/Apartial nitritation-anammox

Appendix A

Comparative plots of predicted versus measured TNRE and TNRR are provided in Figure A1 and Figure A2.
Figure A1. Comparison of date-specific prediction results for Total Nitrogen Removal Efficiency (TNRE) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Figure A1. Comparison of date-specific prediction results for Total Nitrogen Removal Efficiency (TNRE) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Processes 13 01672 g0a1
Figure A2. Comparison of date-specific prediction results for Total Nitrogen Removal Rate (TNRR) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Figure A2. Comparison of date-specific prediction results for Total Nitrogen Removal Rate (TNRR) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Processes 13 01672 g0a2

References

  1. Zheng, P.; Xu, X.Y.; Hu, B.L. New Biological Nitrogen Removal Theory and Technology: Theory and Technology of Biological Nitrogen Removal; Science Press: Beijing, China, 2004. [Google Scholar]
  2. Zhang, X.; Zhang, H.; Ye, C.; Wei, M.; Du, J. Effect of COD/N Ratio on Nitrogen Removal and Microbial Communities of CANON Process in Membrane Bioreactors12. Bioresour. Technol. 2015, 189, 302–308. [Google Scholar] [CrossRef] [PubMed]
  3. Sliekers, A.O.; Third, K.A.; Abma, W.; Kuenen, J.G.; Jetten, M.S.M. CANON and Anammox in a Gas-Lift Reactor. FEMS Microbiol. Lett. 2003, 218, 339–344. [Google Scholar] [CrossRef] [PubMed]
  4. Hou, X.B.; Cao, J.S.; Zhou, K.W. Inhibition of ANAMMOX System and Subsequent Restoration Strategy. China Water Wastewater 2017, 33, 6. [Google Scholar] [CrossRef]
  5. Peng, B. Impact of Acetate and Propionate Acids on the Nitrogen Removal Performance of ANAMMOX System and Microbiological Mechanisms. Ph.D. Thesis, Northeast Forestry University, Harbin, China, 2023. [Google Scholar]
  6. Yang, R.L. Study on Start-Up, Suppression and Recovery Characteristics of Anammox and CANON Processes. Ph.D. Thesis, University of Chinese Academy of Sciences, Beijing, China, 2020. [Google Scholar]
  7. Cai, Q.; Ding, J.J. The Mechanism and Dynamic Characteristic of N2H4 Enhancing Anaerobic Ammonium Oxidation (Anammox) Process. Technol. Water Treat. 2015, 41, 5. [Google Scholar] [CrossRef]
  8. Jordan, M.I.; Mitchell, T.M. Machine Learning: Trends, Perspectives, and Prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef] [PubMed]
  9. Wang, L.; Long, F.; Liao, W.; Liu, H. Prediction of Anaerobic Digestion Performance and Identification of Critical Operational Parameters Using Machine Learning Algorithms. Bioresour. Technol. 2020, 298, 122495. [Google Scholar] [CrossRef] [PubMed]
  10. Yang, J.F.; Chen, Z.G.; Zheng, X.W.; Zhang, Y.; Li, J.Y.; Zhou, S.W.; Wang, X.J. Research on the engineering of anammox process for treating iron oxide red wastewater. J. Environ. Sci. 2025, 45, 158–166. [Google Scholar] [CrossRef]
  11. Chen, Z.; Wang, X.; Zhou, S.; Fan, J.; Chen, Y. Large-Scale (500 Kg N/Day) Two-Stage Partial Nitritation/Anammox (PN/A) Process for Liquid-Ammonia Mercerization Wastewater Treatment: Rapid Start-up and Long-Term Operational Performance. J. Environ. Manag. 2023, 326, 116404. [Google Scholar] [CrossRef] [PubMed]
  12. American Public Health Association; American Water Works Association; Water Environment Federation; Lipps, W.C.; Braun-Howland, E.B.; Baxter, T.E. Standard Methods for the Examination of Water and Wastewater, 24th ed.; APHA Press: Washington, DC, USA, 2023. [Google Scholar]
  13. Serefoglu Cabuk, K.; Cengiz, S.K.; Guler, M.G.; Topcu, H.; Cetin Efe, A.; Ulas, M.G.; Poslu Karademir, F. Chasing the Objective Upper Eyelid Symmetry Formula; R2, RMSE, POC, MAE, and MSE. Int. Ophthalmol. 2024, 44, 303. [Google Scholar] [CrossRef] [PubMed]
  14. Tian, Z.; Zhang, J.; Song, Y. Several Key Factors Influencing Nitrogen Removal Performance of Anammox Process in a Bio-Filter at Ambient Temperature. Environ. Earth Sci. 2015, 73, 5019–5026. [Google Scholar] [CrossRef]
  15. Strous, M.; Heijnen, J.J.; Kuenen, J.G.; Jetten, M.S.M. The Sequencing Batch Reactor as a Powerful Tool for the Study of Slowly Growing Anaerobic Ammonium-Oxidizing Microorganisms. Appl. Microbiol. Biotechnol. 1998, 50, 589–596. [Google Scholar] [CrossRef]
  16. Jin, R.C.; Yang, G.F.; Yu, J.J.; Zheng, P. The Inhibition of the Anammox Process: A Review. Chem. Eng. J. 2012, 197, 67–79. [Google Scholar] [CrossRef]
  17. Sun, Z.J. Startup and Denitrification Mechanism of a Novel Microaerobic Process Treating Manure-Free Piggery Wastewater. Ph.D. Thesis, Harbin Institute of Technology, Harbin, China, 2018. [Google Scholar]
  18. Tian, L.; Huang, R.; Du, D.Y.; Jie, C.J. Nitrogen removal mechanism analysis of CANON process at low C/N. Environ. Sci. Technol. 2023, 46, 104–111. [Google Scholar] [CrossRef]
  19. Chao, A. Nonparametric estimation of the number of classes in a population. Scand. J. Statist. 1984, 11, 265–270. [Google Scholar]
  20. Shannon, P. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef] [PubMed]
  21. Legendre, L.; Legendre, P. Numerical Ecology, 2nd English ed.; Elsevier: Amsterdam, The Netherlands, 1998; Volume 17–26. [Google Scholar]
  22. Liu, J.; Hu, M.; Hu, M.; Wang, J.; Zhang, T.; Wang, Y.; Wang, X. Responses of suspended sludge and biofilm in a SNAD system under C/N elevation: Microbial activity, nitrogen conversion flux and molecular ecological network. Sci. Total Environ. 2024, 954, 176236. [Google Scholar] [CrossRef] [PubMed]
  23. Lei, Y.F.; Yang, C.W.; Peng, Z.H.; Wan, X.; Pan, X.J.; Yang, B.Q. Effect of organic carbon source on the performance of partial nitrification system and its comprehensive regulation. Chin. J. Environ. Eng. 2021, 15, 3088–3100. [Google Scholar] [CrossRef]
  24. Scholten, E.; Lukow, T.; Auling, G.; Kroppenstedt, R.M.; Rainey, F.A.; Diekmann, H. Thauera mechernichensis sp. nov., an aerobic denitrifier from a leachate treatment plant. Int. J. Syst. Bacteriol. 1999, 49, 1045–1051. [Google Scholar] [CrossRef] [PubMed]
  25. Liu, B.; Mao, Y.; Bergaust, L.; Bakken, L.R.; Frostegrd, S. Strains in the genus Thauera exhibit remarkably different denitrification regulatory phenotypes. Environ. Microbiol. 2013, 15, 2816–2828. [Google Scholar] [CrossRef] [PubMed]
  26. Stefan, S.; Carmen, S.; Markus, G.; Hans-Peter, K. A Taxonomic Framework for Emerging Groups of Ecologically Important Marine Gammaproteobacteria Based on the Reconstruction of Evolutionary Relationships Using Genome-Scale Data. Front. Microbiol. 2015, 6, 281. [Google Scholar]
Figure 1. Nitrogen Loading Rate, Total Nitrogen Removal Rate, and changes in NH4+-Ninf and NH4+-Neff concentrations for the recovery phase in R0 (a), R100 (b), R300 (c), and R500 (d).
Figure 1. Nitrogen Loading Rate, Total Nitrogen Removal Rate, and changes in NH4+-Ninf and NH4+-Neff concentrations for the recovery phase in R0 (a), R100 (b), R300 (c), and R500 (d).
Processes 13 01672 g001
Figure 2. NO2Neff, NO3-Neff, ammonium removal efficiency and TN removal efficiency of R0 (a), R100 (b), R300 (c), and R500 (d) during the recovery phase.
Figure 2. NO2Neff, NO3-Neff, ammonium removal efficiency and TN removal efficiency of R0 (a), R100 (b), R300 (c), and R500 (d) during the recovery phase.
Processes 13 01672 g002
Figure 3. Stoichiometry ratio of R0 (a), R100 (b), R300 (c), and R500 (d) during the recovery phase of the system.
Figure 3. Stoichiometry ratio of R0 (a), R100 (b), R300 (c), and R500 (d) during the recovery phase of the system.
Processes 13 01672 g003
Figure 4. Non-metric multidimensional scaling analysis based on the Bray–Curtis distances on ASV level in R0, R100, R300, R500 during the recovery period.
Figure 4. Non-metric multidimensional scaling analysis based on the Bray–Curtis distances on ASV level in R0, R100, R300, R500 during the recovery period.
Processes 13 01672 g004
Figure 5. Relative abundance of species at the phylum level in R0, R100, R300 and R500 reactors in the system recovery phase (the top 20 phylum).
Figure 5. Relative abundance of species at the phylum level in R0, R100, R300 and R500 reactors in the system recovery phase (the top 20 phylum).
Processes 13 01672 g005
Figure 6. The relative abundance of functional bacterial community at the genus level during the sodium acetate dosing stages.
Figure 6. The relative abundance of functional bacterial community at the genus level during the sodium acetate dosing stages.
Processes 13 01672 g006
Figure 7. Comparison of prediction results for Total Nitrogen Removal Efficiency (TNRE) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Figure 7. Comparison of prediction results for Total Nitrogen Removal Efficiency (TNRE) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Processes 13 01672 g007
Figure 8. Comparison of prediction results for Total Nitrogen Removal Rate (TNRR) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Figure 8. Comparison of prediction results for Total Nitrogen Removal Rate (TNRR) in the CANON system (R0, R100, R300, R500) using Support Vector Regression (ad) and Extreme Gradient Boosting Models (eh).
Processes 13 01672 g008
Table 1. The dosage of trace elements of synthetic wastewater in reactors.
Table 1. The dosage of trace elements of synthetic wastewater in reactors.
CompoundConcentration (g/L)CompoundConcentration (g/L)
EDTA-2Na15NaMoO4·2H2O0.22
H3BO40.014NaSeO4·10H2O0.21
MnCl2·4H2O0.99CoCl2·6H2O0.24
CuSO4·5H2O0.25NiCl2·6H2O0.19
ZnSO4·7H2O0.43
Table 2. Alpha-diversity of bacterial communities in samples obtained from R0, R100, R300, and R500 reactors during the recovery period.
Table 2. Alpha-diversity of bacterial communities in samples obtained from R0, R100, R300, and R500 reactors during the recovery period.
Sample IDChao1Observed_SpeciesShannonSimpson
H0_45653.4386536.028910.95026
H100_60652.46496.029690.950595
H300_1594.8725905.346550.933856
H300_2614.56095.883920.956053
H500_1563.2795625.345570.933993
H500_2581.755815.544920.939002
Table 3. Summary of prediction accuracy of SVR and XGBoost models.
Table 3. Summary of prediction accuracy of SVR and XGBoost models.
ModelSVRXGBoost
Reactor Train R2Test R2Train RMSETest RMSETrain R2Test R2Train RMSETest RMSE
R0TNRE0.99980.99540.13720.69291.0000.95160.05112.2349
TNRR0.89070.89450.06730.07631.0000.85480.00050.0895
R100TNRE0.99970.99570.33261.16491.0000.96040.03823.5491
TNRR0.93870.93610.06770.07071.0000.94300.00060.0667
R300TNRE0.99920.99570.69761.44421.0000.98120.05083.0272
TNRR0.81550.59140.06910.07341.0000.90090.00060.0362
R500TNRE0.99890.99330.81101.94911.0000.97300.04003.9060
TNRR0.80930.53100.06740.07011.0000.85770.00060.0386
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, L.; Chen, Y.; Yang, J.; Li, J.; Zhang, Y.; Wang, X. Dynamic Restoration of Collapsed Anammox Biofilm Systems: Integrating Process Optimization, Microbial Community Succession, and Machine Learning-Based Prediction. Processes 2025, 13, 1672. https://doi.org/10.3390/pr13061672

AMA Style

Wang L, Chen Y, Yang J, Li J, Zhang Y, Wang X. Dynamic Restoration of Collapsed Anammox Biofilm Systems: Integrating Process Optimization, Microbial Community Succession, and Machine Learning-Based Prediction. Processes. 2025; 13(6):1672. https://doi.org/10.3390/pr13061672

Chicago/Turabian Style

Wang, Li, Yongxing Chen, Junfeng Yang, Jiayi Li, Yu Zhang, and Xiaojun Wang. 2025. "Dynamic Restoration of Collapsed Anammox Biofilm Systems: Integrating Process Optimization, Microbial Community Succession, and Machine Learning-Based Prediction" Processes 13, no. 6: 1672. https://doi.org/10.3390/pr13061672

APA Style

Wang, L., Chen, Y., Yang, J., Li, J., Zhang, Y., & Wang, X. (2025). Dynamic Restoration of Collapsed Anammox Biofilm Systems: Integrating Process Optimization, Microbial Community Succession, and Machine Learning-Based Prediction. Processes, 13(6), 1672. https://doi.org/10.3390/pr13061672

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop