Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects

Wu, Tong; Zhang, Jiawei; Yan, Qinghao; Wang, Jingxiang; Yang, Hao

doi:10.3390/membranes15060178

Open AccessReview

Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects

by

Tong Wu

^1,2

,

Jiawei Zhang

^1,2,

Qinghao Yan

^1,2,

Jingxiang Wang

^1,2 and

Hao Yang

^1,2,*

¹

State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, China

²

Institute for the Environment and Health, Nanjing University Suzhou Campus, Suzhou 215163, China

^*

Author to whom correspondence should be addressed.

Membranes 2025, 15(6), 178; https://doi.org/10.3390/membranes15060178

Submission received: 19 May 2025 / Revised: 8 June 2025 / Accepted: 8 June 2025 / Published: 11 June 2025

(This article belongs to the Special Issue Preparation, Characterization, and Application of Advanced Separation Membrane Materials)

Download

Browse Figures

Versions Notes

Abstract

Organic framework membranes (OFMs) have emerged as transformative materials for separation technologies due to their tunable porosity, structural diversity, and stability, yet their design and optimization face challenges in navigating vast chemical spaces and complex performance trade-offs. This review highlights the pivotal role of machine learning (ML) in overcoming these limitations by integrating multi-source data, constructing quantitative structure–property relationships, and enabling the cross-scale optimization of OFMs. Methodologically, ML workflows—spanning data construction, feature engineering, and model optimization—accelerate candidate screening, inverse design, and mechanistic interpretation, as demonstrated in gas separations and nascent liquid-phase applications. Key findings reveal that ML identifies critical structural descriptors and environmental parameters, guiding the development of high-performance membranes that surpass traditional selectivity–permeability limits. Challenges persist in liquid separations due to dynamic operational complexities and data scarcity, while emerging frameworks offer untapped potential. The integration of interpretable ML, in situ characterization, and industrial scalability strategies is essential to transition OFMs from laboratory innovations to sustainable, adaptive separation systems. This review underscores ML’s transformative capacity to bridge computational insights with experimental validation, fostering next-generation membranes for carbon neutrality, water security, and energy-efficient industrial processes.

Keywords:

machine learning; organic framework membranes; gas separation; liquid separation; metal–organic framework membranes; covalent organic framework membranes

1. Introduction

In recent years, organic framework membranes (OFMs) have garnered significant attention in the field of membrane separation due to their unique structural designability and performance advantages. These materials, constructed through covalent bonds, metal–organic coordination interactions, or intermolecular forces to form porous frameworks, exhibit high porosity, precisely tunable pore sizes, and tailorable microenvironments within their channels [1,2]. Such characteristics enable multiscale design flexibility to address diverse separation requirements, including molecular sieving, ion-selective rejection, and organic compound separation [3,4]. Representative OFMs include metal–organic framework (MOF) membranes [2,5,6], covalent organic framework (COF) membranes [2,7,8], conjugated microporous polymer (CMP) membranes [9], porous organic cage (POC) membranes [10], and porous aromatic framework (PAF) membranes [11,12]. Their core advantages stem from the chemical and topological diversity of building blocks: the selection and assembly of organic linkers allow for the precise regulation of pore size distributions at sub-nanometer to nanometer scales, while pore-wall functionalization optimizes interactions between substances and membrane interfaces. This synergy enhances both separation selectivity and permeation flux, overcoming the traditional “selectivity-permeability trade-off” inherent in conventional membrane materials [13]. Furthermore, the exceptional chemical and thermal stability of OFMs ensures robustness under extreme conditions such as high temperatures, pressures, and aggressive chemical environments (e.g., strong acids/bases), guaranteeing long-term operational reliability [14]. These attributes underscore their broad potential in gas separation, water purification, and energy storage applications.

However, the design and optimization of OFMs face substantial challenges. Despite their theoretically near-infinite chemical space (e.g., over 100,000 experimentally synthesized MOFs [15]), traditional trial-and-error approaches and high-throughput computational screening (HTCS) suffer from high resource consumption and prolonged cycles when evaluating vast candidate libraries. While molecular simulations can predict performance metrics like permeability and selectivity, their computational complexity escalates exponentially with system size, hindering large-scale material exploration. This bottleneck highlights the critical need for data-driven approaches. With the emergence of the “fourth paradigm” of science—Data-Intensive Scientific Discovery [16]—machine learning (ML) has become a pivotal tool for accelerating OFM development [17,18]. ML excels at extracting hidden patterns from existing datasets to establish quantitative structure–property relationships (QSPRs), linking structural descriptors (e.g., pore geometry, functional group distribution) to separation performance [18,19]. This capability enables efficient candidate screening, inverse design guidance, and the seamless integration of simulation with experimental validation. For instance, ML models integrating experimental data, molecular simulations, and theoretical databases can construct multidimensional feature systems, unveil intricate structure–performance correlations, and achieve cross-domain optimization across scenarios such as gas separation [20] and water treatment.

Despite remarkable progress in ML-guided OFM design, current research remains fragmented. On the one hand, the adaptability of ML methodologies to different separation systems (e.g., gas vs. liquid) has yet to be systematically summarized. On the other hand, universal strategies for cross-scale design (from molecular building blocks to macroscopic performance) and industrial translation are still lacking. Additionally, liquid-phase separations, which involve complex mechanisms such as multiphase flow dynamics and interfacial interactions, pose higher challenges in data acquisition and model construction, leaving ML applications in this domain largely nascent. To address these gaps, a comprehensive review is urgently needed to consolidate recent advances in ML-driven OFM design, unify methodologies for diverse scenarios, dissect critical challenges (e.g., data standardization, model interpretability), and outline future directions. This review aims to bridge this gap by providing the first holistic overview of ML innovations in OFM development. By focusing on core applications—gas separation, liquid processing, and molecular diffusion prediction—we elucidate the synergistic mechanisms of data construction, feature engineering, and algorithmic optimization, demonstrating how ML transcends traditional trial-and-error limitations to expedite the rational design and industrial deployment of high-performance OFMs. This work not only serves as a methodological guide for researchers but also injects new momentum into advancing green separation technologies.

2. Methodologies and Workflows for Machine Learning

The core value of ML in OFM design lies in its ability to systematically integrate multi-source data, resolve complex structural performance relationships, and achieve rational design optimization across scenarios. The process is data-driven and consists of three phases: data construction, feature engineering, and model training with validation (Figure 1).

Data construction is the cornerstone of the approach, and its essence lies in the integration of multidimensional information from experiments, simulations, and theoretical calculations. Experimental data provide actual separation performance parameters and synthesis conditions; molecular simulations generate indicators of microscopic mass transfer behavior; and theoretical databases provide the basis for crystal structure and pore topology analysis. Researchers can search for the required crystal structure files in the database, summarize the macroscopic performance indexes from the literature, and perform calculations using molecular simulations. Performance labels need to be dynamically adapted to different separation needs: gas separation focuses on the balance between selectivity and permeability, water treatment focuses on the synergy between retention and flux, while energy applications require the quantification of ion mobility efficiency and chemical stability. Subsequently, data cleaning is carried out, the core task of which is to repair and normalize the quality of the original data to ensure the consistency and reliability of the data. This includes dealing with missing values, outlier detection and correction, and standardizing data formats and units. The final integration of multi-source data and the verification of the distribution of the cleaned data is reasonable and domain-compliant in order to obtain high-quality, structured OFM data sets to ensure the reliability of the subsequent analysis, and to lay the foundation for the subsequent feature engineering and modeling.

Feature engineering aims to translate the physicochemical properties of OFMs into a machine-readable descriptor system. Geometrical descriptors (pore size distribution, porosity) dominate size sieving effects, chemical descriptors (functional group type, surface charge) regulate substance–membrane interfacial interactions, and topological descriptors (framework connectivity, interlayer stacking) influence mass transfer pathways and mechanical strength [21]. The key parameters are extracted and size differences are eliminated by dimensionality reduction algorithms and feature selection methods, resulting in a high information density input feature set [22]. On this basis, the introduction of environmental parameters (temperature, pressure, and pH) further enhances the dynamic adaptability of the model to actual working conditions.

The model training and validation phase focuses on algorithm selection, performance optimization, and cross-domain migration. The modular ML framework strategically selects algorithms by weighing their strengths and limitations (Table 1): eXtreme Gradient Boosting (XGBoost) achieves top prediction accuracy for large-scale screening but requires careful tuning [23]; Random Forest (RF) offers robust feature interpretability with moderate computational cost [24]; Artificial Neural Networks (ANNs) excel in modeling complex nonlinear relationships, yet demand massive data [25]; Support Vector Machines (SVMs) provide strong generalization for small datasets but scale poorly [26]; Decision Trees (DTs) deliver full transparency yet suffer from instability [24]; while the Tree-based Pipeline Optimization Tool (TPOT) automates pipeline optimization at high computational expense [27]. Beyond conventional data-driven models, Physics-Informed Machine Learning (PIML) embeds domain knowledge, such as governing equations of mass transport or thermodynamic constraints, directly into loss functions [28,29]. This paradigm enhances extrapolation reliability with limited data and ensures physical consistency, showing promise for simulating dynamic membrane processes like fouling evolution or multi-component diffusion. The original input data are first randomly divided into a training set, a validation set, and a test set. The fitted model is trained using the training and validation sets, and finally, the performance of the final model is evaluated on the test set. Hyperparameters are pre-set configuration options for machine learning models (e.g., the decision tree depth, the SVM kernel width, the neural network learning rate) which are not learned directly from data, yet crucially govern model complexity and learning behavior. Tuning them is vital because improper settings cause underfitting or overfitting. Common tuning methods include the following: Grid Search (exhaustively tests predefined combinations), Random Search (samples combinations randomly), and Bayesian Optimization (intelligently selects new parameters based on past evaluations) [30]. For instance, in a study on the accurate prediction of CO₂ separation performance of metal–organic framework mixed-matrix membranes based on machine learning, researchers conducted hyperparameter tuning of the BP neural network model for robust optimization and validation. By adjusting hyperparameters such as the hidden layer structure, activation function, epochs, and loss function, the predictive performance of the BP neural network model was enhanced [31].

The robust evaluation of model performance relies on rigorous quantitative metrics and validation strategies. For regression tasks predicting membrane properties (e.g., permeability, selectivity, and flux), researchers commonly use the Mean Squared Error (MSE) to quantify the average deviation between predicted and true values (lower is better), and the Coefficient of Determination (R²) to indicate how well the model captures data variation (closer to 1 is better) [32]. In classification tasks, precision (the proportion of correctly identified positives, avoiding false alarms) and recall (the proportion of all positives found, avoiding misses) are key evaluation criteria. Model validation relies on cross-validation and interpretable tools (e.g., SHAP analysis [33]) to ensure generalizability and reveal mechanisms for key structural parameters [34]. Crucially, cross-validation involves dividing data into several subsets and iteratively using one for validation, which is essential for assessing model generalization and preventing overfitting to training data.

Table 1. Comparison of major machine learning (ML) algorithms.

Algorithm	Advantages	Limitations	Applications	Refs.
eXtreme Gradient Boosting (XGBoost)	Highest accuracy for structured data with efficient computation and built-in feature importance analysis.	Moderate interpretability and sensitivity to noisy data, requiring careful hyperparameter tuning.	High-throughput CO₂ separation membrane screening; performance optimization.	[23,35,36]
Random Forest (RF)	Strong robustness against overfitting with excellent feature importance interpretability for high-dimensional data.	Computationally expensive for large forests and limited extrapolation capability beyond training ranges.	MOF/COF screening; structure–performance correlation analysis.	[24,37,38]
Artificial Neural Networks (ANNs)	Unmatched flexibility in modeling complex nonlinear relationships and high-dimensional patterns.	Black-box nature reduces interpretability; requires massive data and computational resources to avoid overfitting.	Water permeability/salt rejection modeling; multi-scale separation prediction.	[25,31]
Support Vector Machines (SVMs)	Effective high-dimensional handling with strong generalization via margin maximization and kernel tricks.	Poor scalability to big data and sensitive hyperparameter tuning for nonlinear kernels.	Limited-scale separation performance prediction.	[26,39]
Decision Trees (DTs)	Complete interpretability through intuitive decision rules and low computational cost for small datasets.	Severe overfitting susceptibility and instability with data variations limiting predictive power.	Baseline modeling or constituent learners within RF/XGBoost frameworks.	[24,40]
Tree-based Pipeline Optimization Tool (TPOT)	Automates optimal pipeline selection using genetic algorithms for tree-based methods.	Extremely resource-intensive with limited interpretability of final pipelines.	Efficient identification of top-performing materials from vast chemical spaces.	[27,41,42]

The advantages of this methodological framework are threefold: first, the data-driven alternative to the traditional trial-and-error approach significantly shortens the material development cycle; second, the constructive relationships revealed by the interpretable tools provide theoretical guidance for function-oriented design; and third, the generalized feature system and algorithmic architecture support seamless switching between multiple scenarios. In the future, it is necessary to further integrate in-situ characterization and dynamic working condition data, develop adaptive models, and promote OFMs from laboratory exploration to large-scale applications, so as to inject new impetus into the innovation of green separation technology.

3. Machine Learning-Guided Rational Design of OFMs

Against the dual background of global industrialization and energy transition, innovation in separation technology has become a key breakthrough in combating climate change and achieving efficient resource utilization. The multi-component complexity of industrial emission gases, the environmental sensitivity of liquid-phase separations, and the high energy demand for purification of rare gases have imposed demanding performance requirements on membrane materials. Against this background, ML opens up a new dimension for the rational design of OFMs. The optimal structure is selected from tens of thousands of membrane candidates, and the long-term stability under extreme operating conditions is simulated in a virtual environment. This design strategy provides an innovative paradigm for the development of the next generation of high-performance separation membranes.

3.1. OFMs for Gas Separations

3.1.1. Carbon Capture

Excessive CO₂ emissions from global fossil fuel consumption have intensified the greenhouse effect, leading to critical ecological challenges such as climate warming and ocean acidification [43]. Carbon capture and storage (CCS), which enables efficient CO₂ separation from industrial flue gases or direct air capture, is recognized as a critical pathway toward achieving carbon neutrality [44]. OFMs, with their tunable sub-nanometer channels and chemisorption sites, offer unique advantages for the precise sieving of CO₂/N₂ and CO₂/CH₄ gas mixtures. Nevertheless, optimizing OFMs for complex operational conditions, such as fluctuating flue gas humidity and multicomponent competitive adsorption, remains challenging. Traditional trial-and-error approaches struggle to navigate the vast chemical design space, while the computational costs of molecular simulations limit high-throughput screening scalability. ML addresses these challenges by integrating material structural encoding, separation performance databases, and cross-scale modeling, thereby establishing a data-driven paradigm for designing next-generation CO₂ separation membranes with enhanced stability and energy efficiency. Recent advancements in pure MOF membranes highlight the potential of ML-guided design. Situ et al. [35] analyzed 6013 experimental MOF membranes through grand canonical Monte Carlo and molecular dynamics (MD) simulations, revealing that gas permeability correlates strongly with structural descriptors such as the available permeation area. By employing XGBoost, they demonstrated the impact of the available permeation area once again and screened seven high-performance MOFs (Figure 2a). Similarly, Zhang et al. [45] developed a filler database of 8167 IL@MOF composites and utilized RF models to demonstrate that the accessible volume and gravimetric surface area dominate CO₂/N₂ separation performance. Their experimental validation of [NH₂-Pmim][Tf₂N]@ZIF-67 membranes showcases the synergy between computational predictions and empirical verification. Despite the promise of pure MOF membranes, their practical application is often hindered by mechanical fragility and scalability challenges. This has spurred growing interest in mixed-matrix membranes (MMMs), which integrate MOFs into polymer matrices to balance selectivity with processability [46]. Recent studies exemplify the role of ML in accelerating MMM development: Cheng et al. [25] combined molecular simulations with ANNs to optimize CO₂/CH₄ separation in IRMOF-1-based membranes, achieving high prediction accuracy (R² = 0.982) (Figure 2b); Guan et al. [37] employed RF to identify MOFs with pore sizes >1 nm and surface areas ~800 m²/g, leading to Cu-CAT-1 and Cu-THQ MMMs that exceeded the 2008 Robeson upper bound; Alizamir et al. [32] developed a hybrid extreme learning machine model based on the extreme learning machines algorithm optimized with the BAT optimization algorithm, revealing the MOF cage size and polymer type as critical factors for CO₂ permeability; Yao et al. [31] utilized genetic algorithm-optimized ANNs to predict the CO₂ permeability and CO₂/N₂ selectivity of MOF MMMs. The study derived from the SHAP algorithm that the MOF type and polymer type are the most important factors for membrane permeability and selectivity, respectively; and Wan et al. [47] screened 54,117 polymer–MOF combinations via ensemble models, emphasizing the pore limit diameter and fractional free volume as key parameters. Collectively, these efforts demonstrate ML’s capacity to decode structure–performance relationships, accelerate material discovery, and bridge computational insights with experimental validation, ultimately advancing industrially viable CO₂ separation technologies.

3.1.2. Hydrogen Separation

As a clean energy carrier, hydrogen plays a pivotal role in decarbonizing industries and enabling sustainable energy transitions. Efficient hydrogen recovery from gas mixtures (e.g., syngas, natural gas, or industrial byproducts) is critical for maximizing its utilization, yet conventional separation methods often suffer from high energy costs and limited selectivity [48]. OFMs, particularly MOFs, have emerged as promising candidates due to their tunable pore architectures and surface chemistries. Recent advances in ML further empower researchers to navigate the vast design space of MOF membranes, accelerating the discovery of high-performance materials tailored for hydrogen purification and recovery. Recent studies demonstrate ML’s efficacy in optimizing MOF membranes for hydrogen separation from multicomponent gas streams. For instance, Zhou et al. [39] integrated physics-based modeling with ML to screen 12,723 synthesizable MOFs for D₂/H₂ separation, identifying the pore limit diameter (PLD) and largest cavity diameter (LCD) as decisive structural features (Figure 3a). Similarly, Bai et al. [49] developed a novel variable, trade-off multiple selectivity and permeability (TMSP), to evaluate H₂/X (X = CH₄, N₂, CO₂) separation in computational-ready MOF membranes. Their Gaussian process regression and RF models achieved high predictive accuracy, with the top candidates surpassing Robeson’s upper bounds for polymer membranes. Extending this approach, Li et al. [40] investigated the performance prediction of MOFs as adsorbents or membranes for capturing hydrogen from air through large-scale computational screening and machine learning approaches. By comparing three ML algorithms (RF, DT, and TPOT), the RF model was identified as the most effective in predicting the performance of H₂/O₂ + N₂ systems. Furthermore, this algorithm revealed that the LCD exhibited the highest relative importance for hydrogen adsorbent performance in the H₂/O₂ + N₂ system. Additionally, five optimal MOF membranes were screened. Analysis of the top-performing MOFs further validated that an LCD close to the kinetic diameter of hydrogen is a critical prerequisite for achieving efficient separation from air (Figure 3b). Beyond multicomponent separation, ML also facilitates specialized hydrogen purification tasks, such as removing trace impurities like helium. Zhang et al. [38] combined molecular simulations with RF models to uncover the PLD and framework porosity as dominant factors in He/H₂ separation, where electronegative pore surfaces enhance selectivity via quantum sieving effects. He et al. [50] further advanced this field by screening 2873 fluorine-rich ionic liquid@APMOF composites, revealing the ionic liquid content (IL%) as the key driver of separation efficiency (Figure 3c). Their CatBoost-guided optimization yielded tunable IL@APMOF membranes with exceptional He/H₂ performance (Figure 3d), exemplifying ML’s potential in addressing niche yet high-value separation challenges. Together, these studies underscore ML’s transformative role in bridging computational insights with experimental validation, enabling the rapid identification of MOF membranes that balance selectivity, permeability, and scalability. By decoding structure–performance relationships and automating design iterations, ML-driven strategies are poised to unlock next-generation hydrogen recovery technologies with enhanced efficiency and industrial viability.

3.1.3. Natural Gas Purification

In the field of hydrocarbon separation and natural gas purification, researchers have revealed the deep correlation between the structure and performance of COFs through machine learning. Gulbalkan et al. [51] highlighted recent advances in combining high-throughput molecular simulations and machine learning to accurately identify the most promising MOF and COF membranes among thousands of candidates for the separation of methane from other gases (acetylene, carbon dioxide, helium, hydrogen, and nitrogen). Similarly, Qiu et al. [52] developed a machine learning framework based on density-functional theory to target 500 highly selective materials from 70,000 COFs, with a membrane selectivity as high as 248, and pointed out that the PLD and LCD are the core parameters for CH₄/H₂ screening (Figure 4a). In addition, Cao et al. [53] quantified the differential effects of porosity and the PLD on selectivity and permeability for the separation of isobutylene (i-C₄H₈) and 1,3-butadiene (C₄H₆) using an RF model. They screened out adsorption-dominated efficient separation membranes from 601 COFs, providing a theoretical basis for hydrocarbon purification. Beyond hydrocarbon separation, COF-based membranes also exhibit unique advantages in the efficient removal of acid gases from natural gas. Xin et al. [41] developed a new method for the rapid screening of high-performance COF-based membranes for the separation of acid gases (H₂S and CO₂) from natural gas through combining interpretable machine learning and molecular simulation (Figure 4b). Through molecular simulation and machine learning modeling, porosity was found to be a key factor in determining membrane performance. Using machine learning modeling, the rapid screening of potential high-performance materials from nearly 70,000 hypothetical COFs accelerated the development of COF-based membranes for sour gas separation applications.

3.1.4. Rare Gas Processing

Rare gases are widely used in medicine, lighting, electrical engineering, and aerospace. Xenon (Xe), one of the high-demand rare gases, is currently purified mainly through costly and energy-intensive distillation [36]. To reduce Xe purification costs, researchers have applied various organic framework membranes to Xe separation. Huang et al. [54] utilized HTCS and five ML algorithms (RF, DT, SVM, k-nearest neighbors, and XGBoost) to analyze the structure–performance relationships of 6013 MOF membranes for Kr/Xe separation. They identified key descriptors like PLD and proposed three design strategies to enhance separation performance (Figure 5a). The study combined HTCS, ML, and molecular fingerprints to offer new insights for developing high-performance membranes for Kr/Xe separation. Beyond noble gas separation (e.g., Kr/Xe), helium recovery from natural gas or industrial byproducts has emerged as a critical challenge due to its high economic value and scarcity. Lang et al. [42] developed a universal ML model to predict helium separation performance in COF membranes (Figure 5b). By integrating grand canonical Monte Carlo (GCMC) simulations with machine learning, the model extracted structural (e.g., pore volume, porosity) and chemical descriptors (e.g., isosteric heat of adsorption Qst) of COFs to predict helium adsorption capacity and selectivity. MD simulations further validated the diffusion selectivity of helium in candidate materials, with 3D-Sp-COF-1 exhibiting a threefold enhancement compared to conventional benchmarks. This work highlights the synergy between multiscale simulations and ML in decoding rare gas separation mechanisms, offering a template for accelerating the discovery of energy-efficient purification technologies.

3.2. OFMs for Liquid Separations

Compared with the gas separation field, the application of machine learning in liquid separation membranes is still relatively limited, and this gap stems from the complexity of the liquid separation process and the high cost of data acquisition. Liquid separation not only involves complex mechanisms such as multiphase flow dynamics, interfacial interactions, and pollutant adsorption, but also needs to cope with environmental variables such as solution ionic strength and pressure fluctuations, which puts higher requirements on the pore size distribution, chemical stability, and surface functionalization of membrane materials. In addition, the dispersion and lack of standardization of experimental data further limit the generalization ability of the model. Nevertheless, with the combination of machine learning algorithms and multi-scale simulation techniques, this field is gradually showing breakthrough potential.

Usman et al. [55] designed a polydopamine-modified UiO-66-NH₂ (PDA-s-UiO-66-NH₂) membrane to address the challenge of water-in-oil emulsion separation and used a Gaussian process regression (GPR) model to predict the permeate flux and oil retention (Figure 6a). It was found that GPR significantly outperformed the SVM and decision tree models in terms of prediction performance by virtue of its nonlinear fitting ability and uncertainty estimation advantages. This work not only validates the applicability of machine learning in complex liquid separations but also reveals the importance of data-driven design for membrane surface engineering. In another study on water treatment membranes, Zhang et al. [56] significantly improved the performance prediction accuracy of MOF-based composite membranes by optimizing a back-propagation (BP) neural network via a genetic algorithm (GA). The study determined the network structure through cross-validation and hyper-parameter tuning, and used a GA to optimize the initial weights and thresholds, which enabled the model to outperform the traditional BP model and algorithms such as RF and SVM in the prediction of water permeability (R² = 0.98) and salt retention (R² = 0.99). Grey relation analysis further indicated that the MOF size and the thickness of the polyamide layer were the dominant factors that affected performance (Figure 6b). This finding provides a quantitative basis for balancing the compatibility of MOF fillers with polymer matrices and promotes the rational design of reverse osmosis membranes. The above studies highlight the unique value of machine learning in resolving the multifactorial coupling of liquid separation. The advantages of the GPR model in nonlinear problems and the efficiency of the GA-BP model in parameter optimization reflect the diversity of algorithmic adaptation scenarios.

Membrane fouling, a pervasive and detrimental obstacle in liquid separations, refers to the deposition, adsorption, or growth of contaminants on the membrane surface or within its pores, leading to flux decline, increased energy consumption, and shortened membrane lifespan. Its primary forms include organic fouling, biofouling, and scaling. Machine learning is emerging as a powerful tool for predicting, monitoring, and mitigating fouling [57]. For instance, Garakani et al. explore how to use physical information neural network (PINN) models to enhance our understanding and predictive capabilities regarding membrane fouling phenomena [28]. Researchers combined physical laws with neural networks to develop a PINN model capable of dynamically allocating weights to different fouling mechanisms. This model not only quantifies the relative importance of each fouling mechanism in flux decay but also accurately predicts flux decay even with limited data, demonstrating higher predictive accuracy and adaptability than traditional machine learning models. This research provides new tools and methods for optimizing membrane filtration systems and improving the efficiency of membrane technology in practical applications. This integration of fouling prediction with cleaning strategy optimization through ML opens new avenues for developing smarter and more fouling-resistant liquid separation systems based on OFMs.

3.3. Industrial Translation Pathways

Machine learning is becoming the core engine that is accelerating OFM commercialization by bridging lab-to-factory gaps. Studies have shown that by optimizing CO₂/CH₄ separation in membranes with an ANN model, and through case simulations of separating CO₂/CH₄ mixtures in raw natural gas, landfill gas, and shale gas, IRMOF-1 membranes achieve higher recovery rates than commercial polymer PTMSP membranes with smaller membrane areas, reaching target purity [25]. This confirms the feasibility of the proposed integrated framework, offering guidance for applying MOF-based materials in industrial gas separation and insights for material design and process optimization. In water treatment, the integration of ML with fluorescence spectroscopy and mechanistic models has provided an innovative approach for predicting membrane flux decline [29]. This enables the effective handling of complex fouling phenomena, improves the efficiency of membrane filtration processes, reduces maintenance costs, and offers new development directions for water treatment technologies. Real-time monitoring and the dynamic adjustment of filtration parameters can optimize membrane filtration, prolong the membrane lifespan, and cut overall operational costs. In seawater desalination, the GA-BP neural network model offers crucial guidance for designing high-performance OFM-based reverse osmosis membranes and serves as a reference for developing other membrane materials [56]. This provides a more economical and feasible solution for freshwater supply in coastal water-scarce regions and helps mitigate global water scarcity.

In summary, ML has played a pivotal role in the industrialization of OFM membranes, accelerating the transition from lab research to industrial application. Through data-driven custom production, ML enables the optimized design and manufacturing of OFM membranes to meet specific industrial demands across different fields. This customization not only enhances the market competitiveness of OFM membranes but also delivers more precise and efficient solutions for related industries. Additionally, ML speeds up the R&D and innovation cycle, allowing OFM membranes to quickly adapt to market changes and continuously introduce high-performance products to maintain technological leadership.

4. Machine Learning-Enabled Multiscale Optimization of Membrane Systems

Optimizing membrane separation efficiency requires not only refining the structural properties of membranes but also decoding microscopic mechanisms such as molecular transport dynamics and material–environment interactions. Machine learning indirectly elevates membrane performance by uncovering molecular diffusion patterns, quantifying adsorption–desorption energy barriers, or predicting material responses under extreme operational conditions. For instance, dynamic pore fluctuations in flexible frameworks [54], interfacial modification effects of ionic liquids [50], and molecular diffusion pathways within channels [58]—complex phenomena once intractable—can now be systematically resolved through the integration of ML and multiscale simulations [59]. Notably, the application of machine learning in the field of solid-state electrolytes (SSEs) has also provided useful insights into the study of organic framework membranes. In SSE research, machine learning has successfully established a quantitative relationship between structural features and ionic conductivity properties by analyzing the crystal structure and ion transport paths of the materials, and this data-driven screening method significantly shortens the material development cycle and reduces the experimental cost [60]. Similarly, in the study of organic framework membranes, we can learn from this efficient screening strategy and use machine learning to quickly identify organic framework membrane materials with potentially high performance and optimize their separation performance. This cross-disciplinary borrowing and application fully demonstrates the potential and significant value of machine learning for a wide range of applications in materials science, marking a paradigm shift from phenomenal description to mechanism-driven optimization, and bridging the gap between basic science and industrial implementation.

The transformative potential of ML in advancing membrane systems lies not in the direct manipulation of membrane structures but in its unparalleled capacity to optimize the multiscale phenomena governing separation processes. By decoding the intricate relationships between molecular building blocks, dynamic transport behaviors, and macroscopic performance, ML acts as an invisible architect, reshaping membrane science from serendipitous discovery to mechanism-driven design. This paradigm shift transcends traditional boundaries, enabling membranes to evolve from static sieves into adaptive, self-optimizing systems tailored for industrial complexity. At the heart of this revolution is ML’s ability to accelerate material discovery—a critical first step toward high-performance membranes. For instance, a RF model trained on a database containing more than 12,000 COFs and MOFs by Li et al. [61] identified the top adsorbents for adsorption-driven heat pumps (AHPs) with 92% accuracy, which reduced the computational cost by two orders of magnitude. These ML-prioritized materials, such as MOFs with pore-limiting diameters of >1 nm, are subsequently integrated into MMMs, indirectly enhancing CO₂/CH₄ separation efficiency by optimizing filler–polymer interfaces (Figure 7a). This material-centric strategy not only bypasses trial-and-error synthesis but also establishes a foundation for designing membranes with built-in molecular recognition capabilities.

Beyond material screening, ML unravels the dynamic interplay between molecules and frameworks—a realm where traditional methods falter. Pan et al. [58] analyzed the effect of framework flexibility on the diffusion coefficients of C₃H₈ and C₃H₆ using MD simulations. The results show that it is difficult to intuitively judge the change of diffusion coefficients when the trends of the PLD and LCD are inconsistent, so the SISSO algorithm, which has strong interpretability, was used to build a model to accurately predict the effect of framework flexibility on diffusion. A prediction model for C₃H₈ and C₃H₆ diffusion coefficients was also developed. The results emphasize that the prediction of molecular diffusion coefficients in COFs requires a combination of factors such as pore structure, material density, and cell volume (Figure 7b).

From accelerating adsorbent discovery to engineering adaptive pores and resilient frameworks, ML operates as the silent catalyst of membrane innovation. It redefines membranes as intelligent ecosystems where selectivity, permeability, and longevity coexist through data-driven harmony. This convergence of computational intelligence and materials science not only addresses current challenges in carbon capture and water purification but also paves the way for next-generation membranes capable of evolving with the ever-shifting demands of sustainable industries.

5. Outlook

The integration of ML with OFMs heralds a transformative era in membrane science, yet challenges and opportunities coexist. While ML has revolutionized gas separation by decoding structure–performance relationships and accelerating material discovery, its application in liquid-phase separations remains nascent due to the inherent complexity of multiphase interactions, dynamic environmental variables, and scarce standardized datasets. Beyond the extensively studied MOFs and COFs, emerging frameworks such as CMPs, POCs, and PAFs offer untapped potential. These materials, characterized by their unique topological flexibility, modular assembly, and chemical diversity, present novel avenues for tailored separation processes. However, the ML-driven exploration of CMPs, POCs, and PAFs remains limited, hindered by insufficient experimental datasets and unclear structure–property correlations. Future advancements hinge on expanding ML applications to these systems and leveraging their distinct advantages, such as CMPs’ tunable π-conjugated networks for photoresponsive separations or POCs’ molecular recognition capabilities for selective ion transport. Enhanced collaboration between computational and experimental communities will be pivotal for establishing unified databases encompassing synthesis conditions, dynamic operational parameters, and multiscale performance metrics. Such efforts will not only improve model generalizability but will also enable the development of adaptive ML frameworks capable of addressing real-world complexities, such as fluctuating pH, pressure gradients, and competitive adsorption in industrial settings.

Interpretability and trust in ML models must be prioritized to translate computational insights into actionable design principles. Tools like SHAP analysis and physics-informed neural networks could unravel the mechanistic origins of membrane behavior, guiding the rational functionalization of pore walls or the stabilization of flexible frameworks under harsh conditions. For instance, ML could optimize the hierarchical porosity of PAFs for high-flux gas separation or decode the host–guest interactions in POCs to enhance chiral separation efficiency. Concurrently, the integration of in situ characterization techniques with ML workflows will unlock dynamic, real-time optimization of membrane systems, transitioning them from static sieves to responsive, self-adapting platforms.

Industrial scalability remains a critical frontier. While ML-driven high-throughput screening has identified promising OFM candidates, their translation into cost-effective, durable membranes demands interdisciplinary collaboration. Innovations in automated synthesis, defect engineering, and hybrid membrane architectures (e.g., mixed-matrix membranes) must align with ML predictions to balance selectivity, permeability, and mechanical robustness. Notably, CMPs and PAFs, with their exceptional thermal stability and processability, could serve as robust fillers or standalone membranes for harsh industrial environments. Furthermore, the convergence of ML with emerging technologies, such as digital twins for process simulation or AI-guided robotic labs for autonomous experimentation, could redefine the pace and precision of OFM development across diverse frameworks.

In addition, the emerging field of PIML holds great potential for bridging the gap between pure data fitting and fundamental physical interpretation in OFM design and simulation. First, it can enhance generalization and robustness. By adhering to underlying physical laws, models reduce the risk of overfitting sparse or noisy experimental data and can make more reliable extrapolations beyond the training data distribution. Second, it can reduce data dependency. Physical laws provide inherent regularization, potentially enabling models to make accurate predictions on smaller datasets—a critical advantage for complex scenarios such as liquid separation or dynamic operating conditions. Third, it can improve interpretability. The model’s solutions naturally follow physical principles, making prediction results more trustworthy and mechanistic insights easier to obtain. PIML represents a key frontier direction. Its development is crucial for achieving truly predictive, multi-scale OFM system digital twins, which will accelerate the rational design of next-generation separation membranes in real dynamic environments.

Ultimately, the fusion of ML and OFMs promises to transcend traditional trade-offs, enabling membranes tailored for sustainability-driven applications like carbon neutrality, water security, and clean energy. By expanding ML’s scope beyond MOFs and COFs to embrace CMPs, POCs, and PAFs, researchers can unlock a broader chemical space for next-generation separation technologies. Through transforming serendipity into strategy, this synergy will not only address global separation challenges but will also catalyze a paradigm shift toward intelligent, eco-efficient industrial systems.

Author Contributions

Conceptualization, T.W. and H.Y.; investigation, T.W., J.Z. and Q.Y.; writing—original draft preparation, T.W.; writing—review and editing, J.W. and H.Y.; supervision, H.Y.; funding acquisition, H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Nanjing University Talent Start-up Funds (16002244).

Conflicts of Interest

The authors declare no conflicts of interest.

References

Dou, H.; Xu, M.; Wang, B.; Zhang, Z.; Wen, G.; Zheng, Y.; Luo, D.; Zhao, L.; Yu, A.; Zhang, L.; et al. Microporous framework membranes for precise molecule/ion separations. Chem. Soc. Rev. 2021, 50, 986–1029. [Google Scholar] [CrossRef] [PubMed]
Hosseini Monjezi, B.; Kutonova, K.; Tsotsalas, M.; Henke, S.; Knebel, A. Current Trends in Metal–Organic and Covalent Organic Framework Membrane Materials. Angew. Chem. Int. Ed. 2021, 60, 15153–15164. [Google Scholar] [CrossRef] [PubMed]
Liu, X.; Liu, P.; Wang, H.; Khashab, N.M. Advanced Microporous Framework Membranes for Sustainable Separation. Adv. Mater. 2020, 2500310. [Google Scholar] [CrossRef] [PubMed]
Yang, L.; Wang, Z.; Wang, Z.L.; Wei, D. Porous Organic Framework Membranes with Nanostructures for Osmotic Power Conversion, Water Desalination, and Selective Separation. ChemNanoMat 2025, 2400629. [Google Scholar] [CrossRef]
Furukawa, H.; Cordova, K.E.; O’Keeffe, M.; Yaghi, O.M. The Chemistry and Applications of Metal-Organic Frameworks. Science 2013, 341, 1230444. [Google Scholar] [CrossRef]
Qiu, S.; Xue, M.; Zhu, G. Metal–organic framework membranes: From synthesis to separation application. Chem. Soc. Rev. 2014, 43, 6116–6140. [Google Scholar] [CrossRef]
Côté, A.P.; Benin, A.I.; Ockwig, N.W.; O’Keeffe, M.; Matzger, A.J.; Yaghi, O.M. Porous, Crystalline, Covalent Organic Frameworks. Science 2005, 310, 1166–1170. [Google Scholar] [CrossRef]
Fan, H.; Mundstock, A.; Feldhoff, A.; Knebel, A.; Gu, J.; Meng, H.; Caro, J. Covalent Organic Framework–Covalent Organic Framework Bilayer Membranes for Highly Selective Gas Separation. J. Am. Chem. Soc. 2018, 140, 10094–10098. [Google Scholar] [CrossRef]
Lindemann, P.; Tsotsalas, M.; Shishatskiy, S.; Abetz, V.; Krolla-Sidenstein, P.; Azucena, C.; Monnereau, L.; Beyer, A.; Gölzhäuser, A.; Mugnaini, V.; et al. Preparation of Freestanding Conjugated Microporous Polymer Nanomembranes for Gas Separation. Chem. Mater. 2014, 26, 7189–7193. [Google Scholar] [CrossRef]
Song, Q.; Jiang, S.; Hasell, T.; Liu, M.; Sun, S.; Cheetham, A.K.; Sivaniah, E.; Cooper, A.I. Porous Organic Cage Thin Films and Molecular-Sieving Membranes. Adv. Mater. 2016, 28, 2629–2637. [Google Scholar] [CrossRef]
Wang, L.; Jia, J.; Faheem, M.; Tian, Y.; Zhu, G. Fabrication of triazine-based Porous Aromatic Framework (PAF) membrane with structural flexibility for gas mixtures separation. J. Ind. Eng. Chem. 2018, 67, 373–379. [Google Scholar] [CrossRef]
Ben, T.; Ren, H.; Ma, S.; Cao, D.; Lan, J.; Jing, X.; Wang, W.; Xu, J.; Deng, F.; Simmons, J.M.; et al. Targeted Synthesis of a Porous Aromatic Framework with High Stability and Exceptionally High Surface Area. Angew. Chem. Int. Ed. 2009, 48, 9457–9460. [Google Scholar] [CrossRef] [PubMed]
Koros, W.J.; Zhang, C. Materials for next-generation molecularly selective synthetic membranes. Nat. Mater. 2017, 16, 289–297. [Google Scholar] [CrossRef]
Jian, M.; Qiu, R.; Xia, Y.; Lu, J.; Chen, Y.; Gu, Q.; Liu, R.; Hu, C.; Qu, J.; Wang, H.; et al. Ultrathin water-stable metal-organic framework membranes for ion separation. Sci. Adv. 2020, 6, eaay3998. [Google Scholar] [CrossRef]
Bobbitt, N.S.; Shi, K.; Bucior, B.J.; Chen, H.; Tracy-Amoroso, N.; Li, Z.; Sun, Y.; Merlin, J.H.; Siepmann, J.I.; Siderius, D.W.; et al. MOFX-DB: An Online Database of Computational Adsorption Data for Nanoporous Materials. J. Chem. Eng. Data 2023, 68, 483–498. [Google Scholar] [CrossRef]
Tolle, K.M.; Tansley, D.S.W.; Hey, A.J.G. The Fourth Paradigm: Data-Intensive Scientific Discovery [Point of View]. Proc. IEEE 2011, 99, 1334–1337. [Google Scholar] [CrossRef]
Sanchez-Lengeling, B.; Aspuru-Guzik, A. Inverse molecular design using machine learning: Generative models for matter engineering. Science 2018, 361, 360–365. [Google Scholar] [CrossRef]
Butler, K.T.; Davies, D.W.; Cartwright, H.; Isayev, O.; Walsh, A. Machine learning for molecular and materials science. Nature 2018, 559, 547–555. [Google Scholar] [CrossRef]
Chong, S.; Lee, S.; Kim, B.; Kim, J. Applications of machine learning in metal-organic frameworks. Coord. Chem. Rev. 2020, 423, 213487. [Google Scholar] [CrossRef]
Zhang, Y.; Yin, B.H.; Huang, L.; Ding, L.; Lei, S.; Telfer, S.G.; Caro, J.; Wang, H. MOF membranes for gas separations. Prog. Mater. Sci. 2025, 151, 101432. [Google Scholar] [CrossRef]
Raccuglia, P.; Elbert, K.C.; Adler, P.D.F.; Falk, C.; Wenny, M.B.; Mollo, A.; Zeller, M.; Friedler, S.A.; Schrier, J.; Norquist, A.J. Machine-learning-assisted materials discovery using failed experiments. Nature 2016, 533, 73–76. [Google Scholar] [CrossRef] [PubMed]
Lin, J.; Liu, Z.; Guo, Y.; Wang, S.; Tao, Z.; Xue, X.; Li, R.; Feng, S.; Wang, L.; Liu, J.; et al. Machine learning accelerates the investigation of targeted MOFs: Performance prediction, rational design and intelligent synthesis. Nano Today 2023, 49, 101802. [Google Scholar] [CrossRef]
Bentéjac, C.; Csörgő, A.; Martínez-Muñoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2021, 54, 1937–1967. [Google Scholar] [CrossRef]
Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef]
Cheng, X.; Liao, Y.; Lei, Z.; Li, J.; Fan, X.; Xiao, X. Multi-scale design of MOF-based membrane separation for CO₂/CH₄ mixture via integration of molecular simulation, machine learning and process modeling and simulation. J. Membr. Sci. 2023, 672, 121430. [Google Scholar] [CrossRef]
Cervantes, J.; Garcia-Lamont, F.; Rodríguez-Mazahua, L.; Lopez, A. A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing 2020, 408, 189–215. [Google Scholar] [CrossRef]
Olson, R.S.; Moore, J.H. TPOT: A Tree-Based Pipeline Optimization Tool for Automating Machine Learning. In Automated Machine Learning: Methods, Systems, Challenges; Hutter, F., Kotthoff, L., Vanschoren, J., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 151–160. [Google Scholar]
Garakani, S.S.; Chew, J.W. Development of physics-informed machine-learning models to enhance understanding and prediction of membrane fouling. J. Membr. Sci. 2025, 728, 124133. [Google Scholar] [CrossRef]
Tagliavini, M.; Snyder, S.A. Flux Decline Prediction in Dead-End Ultrafiltration Combining Fluorescence Spectroscopy and Mechanism-Informed Machine Learning. ACS ES&T Water 2024, 4, 4828–4835. [Google Scholar] [CrossRef]
Yang, L.; Shami, A. On hyperparameter optimization of machine learning algorithms: Theory and practice. Neurocomputing 2020, 415, 295–316. [Google Scholar] [CrossRef]
Yao, L.; Zhang, Z.; Li, Y.; Zhuo, J.; Chen, Z.; Lin, Z.; Liu, H.; Yao, Z. Precise prediction of CO₂ separation performance of metal-organic framework mixed matrix membranes based on feature selection and machine learning. Sep. Purif. Technol. 2024, 349, 127894. [Google Scholar] [CrossRef]
Alizamir, M.; Keshavarz, A.; Abdollahi, F.; Khosravi, A.; Karagoz, S. Accurately predicting the performance of MOF-based mixed matrix membranes for CO₂ removal using a novel optimized extreme learning machine by BAT algorithm. Sep. Purif. Technol. 2023, 325, 124689. [Google Scholar] [CrossRef]
Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 4768–4777. [Google Scholar]
Wang, Z.; Zhou, T.; Sundmacher, K. Interpretable machine learning for accelerating the discovery of metal-organic frameworks for ethane/ethylene separation. Chem. Eng. J. 2022, 444, 136651. [Google Scholar] [CrossRef]
Situ, Y.; Yuan, X.; Bai, X.; Li, S.; Liang, H.; Zhu, X.; Wang, B.; Qiao, Z. Large-Scale Screening and Machine Learning for Metal-Organic Framework Membranes to Capture CO₂ from Flue Gas. Membranes 2022, 12, 700. [Google Scholar] [CrossRef]
Yang, Y.; Tu, C.; Guo, L.; Wang, L.; Cheng, F.; Luo, F. Metal-organic frameworks for xenon and krypton separation. Cell Rep. Phys. Sci. 2023, 4, 101694. [Google Scholar] [CrossRef]
Guan, J.; Huang, T.; Liu, W.; Feng, F.; Japip, S.; Li, J.; Wang, X.; Zhang, S. Design and prediction of metal organic framework-based mixed matrix membranes for CO₂ capture via machine learning. Cell Rep. Phys. Sci. 2022, 3, 100864. [Google Scholar] [CrossRef]
Zhang, S.; He, Y.; Zhang, Z.; Zhong, C. Machine learning aided investigation on the structure-performance correlation of MOF for membrane-based He/H₂ separation. Green Chem. Eng. 2024, 5, 526–532. [Google Scholar] [CrossRef]
Zhou, M.; Vassallo, A.; Wu, J. Toward the inverse design of MOF membranes for efficient D₂/H₂ separation by combination of physics-based and data-driven modeling. J. Membr. Sci. 2020, 598, 117675. [Google Scholar] [CrossRef]
Li, H.; Wang, C.; Zeng, Y.; Li, D.; Yan, Y.; Zhu, X.; Qiao, Z. Combining Computational Screening and Machine Learning to Predict Metal-Organic Framework Adsorbents and Membranes for Removing CH₄ or H₂ from Air. Membranes 2022, 12, 830. [Google Scholar] [CrossRef]
Xin, B.; Feng, M.; Cheng, M.; Dai, Z.; Ye, S.; Zhou, L.; Dai, Y.; Ji, X. Combining Interpretable Machine Learning and Molecular Simulation to Advance the Discovery of COF-Based Membranes for Acid Gas Separation. Ind. Eng. Chem. Res. 2024, 63, 8369–8382. [Google Scholar] [CrossRef]
Lang, Y.; Sun, L.; Pan, Y.; Xu, M.; Zhai, D.; Liu, J.; Deng, W.; Yang, L. A universal model for predicting helium separation in covalent organic frameworks and Metal-organic frameworks. Chem. Eng. Sci. 2025, 305, 121175. [Google Scholar] [CrossRef]
Friedlingstein, P.; O’Sullivan, M.; Jones, M.W.; Andrew, R.M.; Hauck, J.; Landschützer, P.; Le Quéré, C.; Li, H.; Luijkx, I.T.; Olsen, A.; et al. Global Carbon Budget 2024. Earth Syst. Sci. Data 2025, 17, 965–1039. [Google Scholar] [CrossRef]
Bui, M.; Adjiman, C.S.; Bardow, A.; Anthony, E.J.; Boston, A.; Brown, S.; Fennell, P.S.; Fuss, S.; Galindo, A.; Hackett, L.A.; et al. Carbon capture and storage (CCS): The way forward. Energy Environ. Sci. 2018, 11, 1062–1176. [Google Scholar] [CrossRef]
Zhang, Z.; Cao, X.; Geng, C.; Sun, Y.; He, Y.; Qiao, Z.; Zhong, C. Machine learning aided high-throughput prediction of ionic liquid@MOF composites for membrane-based CO₂ capture. J. Membr. Sci. 2022, 650, 120399. [Google Scholar] [CrossRef]
Dechnik, J.; Gascon, J.; Doonan, C.J.; Janiak, C.; Sumby, C.J. Mixed-Matrix Membranes. Angew. Chem. Int. Ed. 2017, 56, 9292–9310. [Google Scholar] [CrossRef]
Wan, H.; Fang, Y.; Hu, M.; Guo, S.; Sui, Z.; Huang, X.; Liu, Z.; Zhao, Y.; Liang, H.; Wu, Y.; et al. Interpretable Machine-Learning and Big Data Mining to Predict the CO₂ Separation in Polymer-MOF Mixed Matrix Membranes. Adv. Sci. 2025, 12, e2405905. [Google Scholar] [CrossRef]
Bernardo, G.; Araújo, T.; da Silva Lopes, T.; Sousa, J.; Mendes, A. Recent advances in membrane technologies for hydrogen purification. Int. J. Hydrogen Energy 2020, 45, 7313–7338. [Google Scholar] [CrossRef]
Bai, X.; Shi, Z.; Xia, H.; Li, S.; Liu, Z.; Liang, H.; Liu, Z.; Wang, B.; Qiao, Z. Machine-Learning-Assisted High-Throughput computational screening of Metal-Organic framework membranes for hydrogen separation. Chem. Eng. J. 2022, 446, 136783. [Google Scholar] [CrossRef]
He, Y.; Zhang, S.; Zhong, C. Unlocking the potential of ionic liquids in Anion-Pillared MOFs for enhanced He/H₂ separation Performance: A combined computational screening and Machine learning study. Sep. Purif. Technol. 2025, 363, 132253. [Google Scholar] [CrossRef]
Gulbalkan, H.C.; Uzun, A.; Keskin, S. Combining computational screening and machine learning to explore MOFs and COFs for methane purification. Appl. Phys. Lett. 2024, 124, 200501. [Google Scholar] [CrossRef]
Qiu, Y.; Chen, L.; Zhang, X.; Ping, D.; Tian, Y.; Zhou, Z. A universal machine learning framework to automatically identify high-performance covalent organic framework membranes for CH₄/H₂ separation. AIChE J. 2024, 70, e18575. [Google Scholar] [CrossRef]
Cao, X.; He, Y.; Zhang, Z.; Sun, Y.; Han, Q.; Guo, Y.; Zhong, C. Predicting of Covalent Organic Frameworks for Membrane-based Isobutene/1,3-Butadiene Separation: Combining Molecular Simulation and Machine Learning. Chem. Res. Chin. Univ. 2022, 38, 421–427. [Google Scholar] [CrossRef]
Huang, Q.; Yuan, X.; Li, L.; Yan, Y.; Yang, X.; Wang, W.; Chen, Y.; Liang, H.; Gao, H.; Wu, Y.; et al. Machine learning and molecular fingerprint screening of high-performance 2D/3D MOF membranes for Kr/Xe separation. Chem. Eng. Sci. 2023, 280, 119031. [Google Scholar] [CrossRef]
Usman, J.; Abba, S.I.; Baig, N.; Abu-Zahra, N.; Hasan, S.W.; Aljundi, I.H. Design and Machine Learning Prediction of In Situ Grown PDA-Stabilized MOF (UiO-66-NH₂) Membrane for Low-Pressure Separation of Emulsified Oily Wastewater. ACS Appl. Mater. Interfaces 2024, 16, 16271–16289. [Google Scholar] [CrossRef] [PubMed]
Zhang, Z.; Li, Y.; Chen, Z.; Yao, L. Improving performance prediction of metal-organic framework membranes for reverse osmosis via genetic algorithm optimized artificial neural networks. Mater. Today Sustain. 2024, 26, 100734. [Google Scholar] [CrossRef]
Obotey Ezugbe, E.; Rathilal, S. Membrane Technologies in Wastewater Treatment: A Review. Membranes 2020, 10, 89. [Google Scholar] [CrossRef]
Pan, R.; Tu, X.; Ma, X.; Liu, L.; Yan, T.; Tong, M. Interpretable machine learning on C₃H₆ and C₃H₈ diffusion in covalent organic frameworks: Incorporating the effects of framework flexibility. Chem. Eng. Sci. 2025, 310, 121520. [Google Scholar] [CrossRef]
Noé, F.; Tkatchenko, A.; Müller, K.-R.; Clementi, C. Machine Learning for Molecular Simulation. Annu. Rev. Phys. Chem. 2020, 71, 361–390. [Google Scholar] [CrossRef]
Hu, Q.; Chen, K.; Li, J.; Zhao, T.; Liang, F.; Xue, D. Speeding up the development of solid state electrolyte by machine learning. Next Energy 2024, 5, 100159. [Google Scholar] [CrossRef]
Li, W.; Xia, X.; Li, S. Screening of Covalent–Organic Frameworks for Adsorption Heat Pumps. ACS Appl. Mater. Interfaces 2020, 12, 3265–3273. [Google Scholar] [CrossRef]

Figure 1. The workflows for machine learning (ML) in organic framework membrane (OFM) design.

Figure 2. (a) The atomistic structures of top-performing metal–organic framework (MOF) membranes: CARGEI, YUJWAD, RIPWEU, VEHNED, WOCJII, YUJWOR, and YUJWUX (the order is from top left, to bottom left, to top right, and finally to bottom right). Reproduced from [35] with Open Access; (b) Ea_CH₄/E_aCO₂ values exhibit periodic fluctuations rather than smooth variations (e.g., 3.2 at 90% CO₂ and 4 atm), likely due to hindered CH₄ diffusion under high CO₂ concentrations (90%), which amplifies chaotic effects in molecular dynamics (MD) simulations and causes fluctuations in the diffusion coefficient. Reproduced from [25] with Open Access.

Figure 3. (a) A comparative study of D₂/H₂ separation selectivity between ROQFUA07 and other MOF materials. Reprinted with copyright permission from Ref. [39] Copyright 2019 Elsevier B.V.; (b) relative importance values of the seven descriptors predicted by the RF algorithm for CH₄/O₂ + N₂ (left) and H₂/O₂ + N₂ (right). Reproduced from [40] with Open Access; (c) the descriptor importance for lnMPS, comparing the top 1% IL@APMOF set (left bar chart) with the entire IL@APMOF set (right bar chart). The SHAP values are listed on the horizontal axis. The circular diagrams show the distribution and relative importance of structural descriptors (green), chemical descriptors (red), and the ionic liquid content (IL%) (yellow) across various datasets, progressing inward from the outer circle (representing the top 1% IL@APMOFs) to the top 2%, top 5%, and the entire IL@APMOF dataset. Reprinted with copyright permission from Ref. [50] Copyright 2025 Elsevier B.V.; (d) the relationship between the IL mass percent and the performance metrics of the IL@APMOF composite membrane, including He permeability and membrane selectivity. Reprinted with copyright permission from Ref. [50] Copyright 2025 Elsevier B.V.

Figure 4. (a) Unsupervised learning implemented for classification of adsorption selectivity. Classification results from principal component analysis (PCA) algorithm (top), with color mapping representing the adsorption selectivity percentage and the horizontal and vertical axes representing two principal components, and the T-SNE algorithm (bottom). Weighting map of the descriptors corresponding to the two principal components in the PCA, with color mapping from darker to lighter indicating stronger correlation. Reprinted with copyright permission from Ref. [52] Copyright 2024 American Institute of Chemical Engineers; (b) radar plots of feature importance for the gas adsorption, diffusion, and permeability of covalent organic frameworks (COFs). The SHAP values for each type of explainer have been normalized, and the scale range shows a Log distribution from 0.01 to 1. Reprinted with copyright permission from Ref. [41] Copyright 2024 American Chemical Society.

Figure 5. (a) Design strategies of 3D and 2D MOF membranes for boosting Kr/Xe separation. Reprinted with copyright permission from Ref. [54] Copyright 2019 Elsevier Ltd.; (b) a workflow for developing universal machine learning models to screen high-performance COFs/MOFs membrane materials for helium separation. Reprinted with copyright permission from Ref. [42] Copyright 2024 Elsevier Ltd.

Figure 6. (a) An in situ fabrication of a hierarchical PDA-stabilized UiO-66-NH₂ membrane with machine learning-optimized oil/water separation performance. Reprinted with copyright permission from Ref. [55] Copyright 2024 American Chemical Society; (b) the relative importance of input variables for membrane performance: water permeability and salt rejection. Reprinted with copyright permission from Ref. [56] Copyright 2024 Elsevier Ltd.

Figure 7. (a) The computational and machine learning approach for evaluating COFs in ethanol-based adsorption heat pumps. Reprinted with copyright permission from Ref. [61] Copyright 2020 American Chemical Society; (b) regression models for predicting the diffusion coefficients C₃H₈ (top) and C₃H₆ (bottom). Reprinted with copyright permission from Ref. [58] Copyright 2025 Elsevier Ltd.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wu, T.; Zhang, J.; Yan, Q.; Wang, J.; Yang, H. Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects. Membranes 2025, 15, 178. https://doi.org/10.3390/membranes15060178

AMA Style

Wu T, Zhang J, Yan Q, Wang J, Yang H. Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects. Membranes. 2025; 15(6):178. https://doi.org/10.3390/membranes15060178

Chicago/Turabian Style

Wu, Tong, Jiawei Zhang, Qinghao Yan, Jingxiang Wang, and Hao Yang. 2025. "Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects" Membranes 15, no. 6: 178. https://doi.org/10.3390/membranes15060178

APA Style

Wu, T., Zhang, J., Yan, Q., Wang, J., & Yang, H. (2025). Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects. Membranes, 15(6), 178. https://doi.org/10.3390/membranes15060178

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine Learning in the Design and Performance Prediction of Organic Framework Membranes: Methodologies, Applications, and Industrial Prospects

Abstract

1. Introduction

2. Methodologies and Workflows for Machine Learning

3. Machine Learning-Guided Rational Design of OFMs

3.1. OFMs for Gas Separations

3.1.1. Carbon Capture

3.1.2. Hydrogen Separation

3.1.3. Natural Gas Purification

3.1.4. Rare Gas Processing

3.2. OFMs for Liquid Separations

3.3. Industrial Translation Pathways

4. Machine Learning-Enabled Multiscale Optimization of Membrane Systems

5. Outlook

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI