Integrating NSGA-II and TOPSIS for Stacking Model Optimization in Pursuit of Halide Double Perovskite Screening
Abstract
1. Introduction
- (1)
- Construction of a new feature set easily obtained from the periodic table as input for ML models and utilization of Shapley Additive exPlanations (SHAP) in feature selection engineering for predicting Eg and ΔHf of halide double perovskites were implemented.
- (2)
- A method integrated NSGA-II and TOPSIS for stacking regression model optimization, simultaneously considering four regression metrics (MSE, MAE, RMSE and R2) and the number of base models constructed in the stacking models in the Pareto front was proposed.
- (3)
- The optimal stacking regression model with high predicting accuracy was validated by a new dataset, providing guidance for discovering potential compounds for solar cells from a large quantity of materials.
2. Methodology
2.1. Feature Selection with Shapley Additive exPlanations (SHAP)
2.2. Model Evaluation
2.3. NSGA-II for Stacking Model Optimization with
2.4. TOPSIS for Optimal Stacking Model Selection
3. Experimental Data and Model Construction
3.1. Dataset
3.2. Input Features and Feature Selection
3.2.1. Original Feature Set and New Constructed Feature Set
3.2.2. Feature Selection by SHAP
3.3. Stacking Ensemble Regression Model
4. Results and Discussion
4.1. Performance Comparisons Between Single Models and Stacking Models
4.2. Stacking Model with Different Combinations of Base Models
4.3. Stacking Model Optimization with NSGA-II +TOPSIS
4.4. Comparison of the Optimal Stacking Model Selected by TOPSIS Versus the Optimal Model Identified by Random Search
4.5. Model Validation on Completely New Test Data
4.6. Generalizability to Other Perovskite Families
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Shi, Y.; Wen, J.; Wen, C.; Jiang, L.; Wu, B.; Qiu, Y.; Sa, B. Interpretable machine learning insights of power conversion efficiency for hybrid perovskites solar cells. Sol. Energy 2025, 290, 113373. [Google Scholar] [CrossRef]
- Cai, Y.; Bai, Z.; Chen, C.; Sun, M.; Wang, Z.; Wang, S.; Zhang, Z.; Xie, J.; Li, D.; Guan, X.; et al. Machine learning prediction of small molecule passivators and their impacts on the passivation and photocatalytic performance of organic-inorganic hybrid perovskite interfaces. Energy Mater. 2025, 5, 500043. [Google Scholar] [CrossRef]
- Feng, H.-J.; Wu, K.; Deng, Z.-Y. Predicting Inorganic Photovoltaic Materials with Efficiencies >26% via Structure-Relevant Machine Learning and Density Functional Calculations. Cell Rep. Phys. Sci. 2020, 1, 100179. [Google Scholar] [CrossRef]
- Yan, F.; Song, K.; Liu, Y.; Chen, S.; Chen, J. Predictions and mechanism analyses of the fatigue strength of steel based on machine learning. J. Mater. Sci. 2020, 55, 15334–15349. [Google Scholar] [CrossRef]
- Maddah, H.A.; Berry, V.; Behura, S.K. Cuboctahedral stability in Titanium halide perovskites via machine learning. Comput. Mater. Sci. 2020, 173, 109415. [Google Scholar] [CrossRef]
- Bayon, A.; de la Calle, A.; Ghose, K.K.; Page, A.; McNaughton, R. Experimental, computational and thermodynamic studies in perovskites metal oxides for thermochemical fuel production: A review. Int. J. Hydrogen Energy 2020, 45, 12653–12679. [Google Scholar] [CrossRef]
- Li, C.; Chen, Z.; Zhou, C.; Hong, S.; Yang, S. Mixed Antisolvent Strategy Enables Synergistic Crystallization Control and Defect Passivation in Perovskite Solar Cells. ACS Appl. Mater. Interfaces 2025, 17, 29639–29648. [Google Scholar] [CrossRef]
- Farias-Basulto, G.; Mehlhop, T.; Otto, N.J.; Bertram, T.; Jäger, K.; Gall, S.; Weinberger, N.; Schlatmann, R.; Lauermann, I.; Klenk, R.; et al. Improving Perovskite/CIGS Tandem Solar Cells for Higher Power Conversion Efficiency through Light Management and Bandgap Engineering. ACS Appl. Mater. Interfaces 2025, 17, 56250–56255. [Google Scholar] [CrossRef]
- Bărar, A.; Boscornea, C.; Bălăşoiu, M.; Mănănăilă-Maximean, D. Two-diode modelling of perovskite solar cells and parameter extraction using the lambert w function. UPB Sci. Bull. Ser. A Appl. Math. Phys. 2021, 83, 217–225. [Google Scholar]
- Im, J.; Lee, S.; Ko, T.W.; Kim, H.W.; Hyon, Y.K.; Chang, H. Identifying Pb-free perovskites for solar cells by machine learning. npj Comput. Mater. 2019, 5, 37. [Google Scholar] [CrossRef]
- Himanen, L.; Geurts, A.; Foster, A.S.; Rinke, P. Data-Driven Materials Science: Status, Challenges, and Perspectives. Adv. Sci. 2019, 6, 1900808. [Google Scholar] [CrossRef]
- Li, Z.; Xu, Q.; Sun, Q.; Hou, Z.; Yin, W.J. Thermodynamic Stability Landscape of Halide Double Perovskites via High-Throughput Computing and Machine Learning. Adv. Funct. Mater. 2019, 29, 1807280. [Google Scholar] [CrossRef]
- Brown, K.A.; Brittman, S.; Maccaferri, N.; Jariwala, D.; Celano, U. Machine Learning in Nanoscience: Big Data at Small Scales. Nano Lett. 2020, 20, 2–10. [Google Scholar] [CrossRef] [PubMed]
- Ma, X.-Y.; Lyu, H.-Y.; Hao, K.-R.; Zhao, Y.-M.; Qian, X.; Yan, Q.-B.; Su, G. Large family of two-dimensional ferroelectric metals discovered via machine learning. Sci. Bull. 2020, 66, 233–242. [Google Scholar] [CrossRef]
- Wang, T.; Zhang, C.; Snoussi, H.; Zhang, G. Machine Learning Approaches for Thermoelectric Materials Research. Adv. Funct. Mater. 2020, 30, 1906041. [Google Scholar] [CrossRef]
- Agiorgousis, M.L.; Sun, Y.; Choe, D.; West, D.; Zhang, S. Machine learning augmented discovery of chalcogenide double perovskites for photovoltaics. Adv. Theory Simul. 2019, 2, 1800173. [Google Scholar] [CrossRef]
- Wang, J.; Xu, P.; Ji, X.; Li, M.; Lu, W. Feature Selection in Machine Learning for Perovskite Materials Design and Discovery. Materials 2023, 16, 3134. [Google Scholar] [CrossRef] [PubMed]
- Kusuma, F.J.; Widianto, E.; Wahyono; Santoso, I.; Sholihun; Absor, M.A.U.; Sakti, S.P.; Triyana, K. Multi-Properties prediction of perovskite materials using Machine learning and Meta-Heuristic feature selection. Sol. Energy 2025, 286, 113189. [Google Scholar] [CrossRef]
- Park, H.; Mall, R.; Ali, A.; Sanvito, S.; Bensmail, H.; El-Mellouhi, F. Importance of structural deformation features in the prediction of hybrid perovskite bandgaps. Comput. Mater. Sci. 2020, 184, 109858. [Google Scholar] [CrossRef]
- Guo, Z.; Lin, B. Machine learning stability and band gap of lead-free halide double perovskite materials for perovskite solar cells. Sol. Energy 2021, 228, 689–699. [Google Scholar] [CrossRef]
- Yang, X.; Li, L.; Tao, Q.; Lu, W.; Li, M. Rapid discovery of narrow bandgap oxide double perovskites using machine learning. Comput. Mater. Sci. 2021, 196, 110528. [Google Scholar] [CrossRef]
- Yang, Z.; Liu, Y.; Zhang, Y.; Wang, L.; Lin, C.; Lv, Y.; Ma, Y.; Shao, C. Machine Learning Accelerates the Discovery of Light-Absorbing Materials for Double Perovskite Solar Cells. J. Phys. Chem. C 2021, 125, 22483–22492. [Google Scholar] [CrossRef]
- Kim, J.; Kim, E.; Min, K. Synthesizable Double Perovskite Oxide Search via Machine Learning and High-Throughput Computational Screening. Adv. Theory Simul. 2021, 4, 2100263. [Google Scholar] [CrossRef]
- Gao, Z.; Zhang, H.; Mao, G.; Ren, J.; Chen, Z.; Wu, C.; Gates, I.D.; Yang, W.; Ding, X.; Yao, J. Screening for lead-free inorganic double perovskites with suitable band gaps and high stability using combined machine learning and DFT calculation. Appl. Surf. Sci. 2021, 568, 150916. [Google Scholar] [CrossRef]
- Tian, G.; Zhao, P.; Wang, Y.; Zhang, H.; Xing, L.; Cheng, X. Research on Performance Prediction Method of Refractory High-Entropy Alloy Based on Ensemble Learning. Metals 2025, 15, 371. [Google Scholar] [CrossRef]
- Chen, H.; Shang, Z.; Lu, W.; Li, M.; Tan, F. A Property-Driven Stepwise Design Strategy for Multiple Low-Melting Alloys via Machine Learning. Adv. Eng. Mater. 2021, 23, 2100612. [Google Scholar] [CrossRef]
- Lu, T.; Li, H.; Li, M.; Wang, S.; Lu, W. Inverse Design of Hybrid Organic—Inorganic Perovskites with Suitable Bandgaps via Proactive Searching Progress. ACS Omega 2022, 7, 21583–21594. [Google Scholar] [CrossRef] [PubMed]
- Oriola, O. A Stacked Generalization Ensemble Approach for Improved Intrusion Detection. Int. J. Comput. Sci. Inf. Secur. 2020, 18, 62–67. [Google Scholar]
- R., S.; M., S.; Hasan, M.K.; Saeed, R.A.; Alsuhibany, S.A.; Abdel-Khalek, S. An Empirical Model to Predict the Diabetic Positive Using Stacked Ensemble Approach. Front. Public Health 2021, 9, 792124. [Google Scholar] [CrossRef]
- Liu, R.; Liu, Y.; Duan, J.; Hou, F.; Wang, L.; Zhang, X.; Li, G. Ensemble learning directed classification and regression of hydrocarbon fuels. Fuel 2022, 324, 124520. [Google Scholar] [CrossRef]
- Meharie, M.G.; Mengesha, W.J.; Gariy, Z.A.; Mutuku, R.N.N. Application of stacking ensemble machine learning algorithm in predicting the cost of highway construction projects. Eng. Constr. Archit. Manag. 2022, 29, 2836–2853. [Google Scholar] [CrossRef]
- Asante-Okyere, S.; Marfo, S.A.; Ziggah, Y.Y. Estimating total organic carbon (TOC) of shale rocks from their mineral composition using stacking generalization approach of machine learning. Upstream Oil Gas Technol. 2023, 11, 100089. [Google Scholar] [CrossRef]
- Wang, T.; Zhang, K.; Thé, J.; Yu, H. Accurate prediction of band gap of materials using stacking machine learning model. Comput. Mater. Sci. 2022, 201, 110899. [Google Scholar] [CrossRef]
- Chen, R.; Zhou, S.; Liu, X.; Zhang, W.; Cheng, Q.; Jiang, N.; Zhang, Y.; Liu, D.; Yan, D. Application of Stacking machine learning model with SHapley Additive explanation (SHAP) in estimating the changes of mechanical properties in coals under CO2. Fuel 2026, 405, 136461. [Google Scholar] [CrossRef]
- Deng, X.; Zhu, S.P.; Zhang, S.; Zhang, X.; Xiong, R.; Dong, Y.; Yan, D. Physics-informed machine learning framework for creep-fatigue life prediction of a Ni-based superalloy using ensemble learning. Mater. Today Commun. 2024, 41, 110260. [Google Scholar] [CrossRef]
- Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6, 182–197. [Google Scholar] [CrossRef]
- Do Lee, B.; Lee, J. Discovery of Pb-free hybrid organic–inorganic 2D perovskites using a stepwise optimization strategy. npj Comput. Mater. 2022, 8, 83. [Google Scholar] [CrossRef]
- Etghani, M.M.; Shojaeefard, M.H.; Khalkhali, A.; Akbari, M. A hybrid method of modified NSGA-II and TOPSIS to optimize performance and emissions of a diesel engine using biodiesel. Appl. Therm. Eng. 2013, 59, 309–315. [Google Scholar] [CrossRef]
- Zhou, Z.; Lin, Z.; Ma, Y.; Niu, J.R.; Liu, J.; Wang, X. Optimal design of colour formulation prediction for cotton fabrics based on NSGA-II and TOPSIS. Color. Technol. 2025, 141, 63–79. [Google Scholar] [CrossRef]
- Kocaturk, A.; Orkcu, H.H.; Altunkaynak, B. Parameter optimization in biclustering algorithms for large datasets using a combined approach of NSGA-II and TOPSIS. Int. J. Data Sci. Anal. 2025, 20, 5499–5516. [Google Scholar] [CrossRef]
- Sadni, F.E.; Salhi, I.; Belhora, F.; Hajjaji, A. Multi-objective optimization of energy and exergy efficiencies in ORC configurations using NSGA-II and TOPSIS. Therm. Sci. Eng. Prog. 2025, 63, 103606. [Google Scholar] [CrossRef]
- Welchowski, T.; Schmid, M. A framework for parameter estimation and model selection in kernel deep stacking networks. Artif. Intell. Med. 2016, 70, 31–40. [Google Scholar] [CrossRef]
- Tandon, S.; Kumar, V.; Singh, V.B. Empirical Analysis of Machine Learning Classifiers for Architectural Smells Using Critic-Topsis Approach. In Proceedings of the 2024 3rd Edition of IEEE Delhi Section Flagship Conference (DELCON), New Delhi, India, 21–23 November 2024. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Erion, G.G.; Lee, S.-I. Consistent individualized feature attribution for tree ensembles. arXiv 2018, arXiv:1802.03888. [Google Scholar]
- Lyngdoh, G.A.; Li, H.; Zaki, M.; Krishnan, N.M.A.; Das, S. Elucidating the constitutive relationship of calcium–silicate–hydrate gel using high throughput reactive molecular simulations and machine learning. Sci. Rep. 2020, 10, 21336. [Google Scholar] [CrossRef]
- Kusdhany, M.I.M.; Lyth, S.M. New insights into hydrogen uptake on porous carbon materials via explainable machine learning. Carbon 2021, 179, 190–201. [Google Scholar] [CrossRef]
- Wang, G.; Fearn, T.; Wang, T.; Choy, K.L. Machine-Learning Approach for Predicting the Discharging Capacities of Doped Lithium Nickel-Cobalt-Manganese Cathode Materials in Li-Ion Batteries. ACS Cent. Sci. 2021, 7, 1551–1560. [Google Scholar] [CrossRef]
- Lu, T.; Li, H.; Li, M.; Wang, S.; Lu, W. Predicting Experimental Formability of Hybrid Organic-Inorganic Perovskites via Imbalanced Learning. J. Phys. Chem. Lett. 2022, 13, 3032–3038. [Google Scholar] [CrossRef]
- Liang, G.Q.; Zhang, J. A machine learning model for screening thermodynamic stable lead-free halide double perovskites. Comput. Mater. Sci. 2022, 204, 111172. [Google Scholar] [CrossRef]
- Grojean, C.; Paul, A.; Qian, Z.N.; Strumke, I. Lessons on interpretable machine learning from particle physics. Nat. Rev. Phys. 2022, 4, 284–286. [Google Scholar] [CrossRef]
- Bartel, C.J.; Sutton, C.; Goldsmith, B.R.; Ouyang, R.; Musgrave, C.B.; Ghiringhelli, L.M.; Scheffler, M. New tolerance factor to predict the stability of perovskite oxides and halides. Sci. Adv. 2019, 5, eaav0693. [Google Scholar] [CrossRef]
- Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Routledge: Abingdon, UK, 2017. [Google Scholar]
- Huynh-Thu, V.A.; Irrthum, A.; Wehenkel, L.; Geurts, P. Inferring regulatory networks from expression data using tree-based methods. PLoS ONE 2010, 5, e12776. [Google Scholar] [CrossRef] [PubMed]
- Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef]
- Li, Y.; Tie, W.; Hu, X.; Duan, B.; Shao, Y.; Meng, Y.; Fang, J. Optimization of Beamline Optical Systems Using NSGA-II and TOPSIS Algorithms. In Proceedings of the 2025 4th International Conference on Advanced Manufacturing Technology and Manufacturing System (ICAMTMS), Wuhu, China, 8–10 August 2025. [Google Scholar] [CrossRef]
- Liu, H.; Cheng, J.; Dong, H.; Feng, J.; Pang, B.; Tian, Z.; Ma, S.; Xia, F.; Zhang, C.; Dong, L. Screening stable and metastable ABO3 perovskites using machine learning and the materials project. Comput. Mater. Sci. 2020, 177, 109614. [Google Scholar] [CrossRef]
- Zhai, X.; Chen, M. Accelerated Design for Perovskite-Oxide-Based Photocatalysts Using Machine Learning Techniques. Materials 2024, 17, 3026. [Google Scholar] [CrossRef]
- Song, K.; Yan, F.; Ding, T.; Gao, L.; Lu, S. A steel property optimization model based on the XGBoost algorithm and improved PSO. Comput. Mater. Sci. 2020, 174, 109472. [Google Scholar] [CrossRef]
- Hancock, J.T.; Khoshgoftaar, T.M. CatBoost for big data: An interdisciplinary review. J. Big Data 2020, 7, 94. [Google Scholar] [CrossRef] [PubMed]
- Degtyarev, V.V.; Naser, M.Z. Boosting machines for predicting shear strength of CFS channels with staggered web perforations. Structures 2021, 34, 3391–3403. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
- Song, Z.; Chen, X.; Meng, F.; Cheng, G.; Wang, C.; Sun, Z.; Yin, W.-J. Machine learning in materials design: Algorithm and application. Chin. Phys. B 2020, 29, 116103. [Google Scholar] [CrossRef]
- Xu, Y.; Zhao, X.; Chen, Y.; Yang, Z. Research on a mixed gas classification algorithm based on extreme random tree. Appl. Sci. 2019, 9, 1728. [Google Scholar] [CrossRef]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. LightGBM: A highly efficient gradient boosting decision tree. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Quinlan, J.R. Induction of decision trees. Mach. Learn. 1986, 1, 81–106. [Google Scholar] [CrossRef]
- Bartlett, P.; Freund, Y.; Lee, W.S.; Schapire, R.E. Boosting the margin: A new explanation for the effectiveness of voting methods. Ann. Stat. 1998, 26, 1651–1686. [Google Scholar] [CrossRef]
- Blum, L.C.; Reymond, J.L. 970 Million druglike small molecules for virtual screening in the chemical universe database GDB-13. J. Am. Chem. Soc. 2009, 131, 8732–8733. [Google Scholar] [CrossRef] [PubMed]
- Bengio, Y.; Courville, A.; Vincent, P. Representation learning: A review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1798–1828. [Google Scholar] [CrossRef]
- Zhai, X.; Chen, M.; Lu, W. Accelerated search for perovskite materials with higher Curie temperature based on the machine learning methods. Comput. Mater. Sci. 2018, 151, 41–48. [Google Scholar] [CrossRef]
- Li, C.; Hao, H.; Xu, B.; Shen, Z.; Zhou, E.; Jiang, D.; Liu, H. Improved physics-based structural descriptors of perovskite materials enable higher accuracy of machine learning. Comput. Mater. Sci. 2021, 198, 110714. [Google Scholar] [CrossRef]
- Rajan, A.C.; Mishra, A.; Satsangi, S.; Vaish, R.; Mizuseki, H.; Lee, K.R.; Singh, A.K. Machine-learning-assisted accurate band gap predictions of functionalized mxene. Chem. Mater. 2018, 30, 4031–4038. [Google Scholar] [CrossRef]
- Chou, J.S.; Tsai, C.F.; Pham, A.D.; Lu, Y.H. Machine learning in concrete strength simulations: Multi-nation data analytics. Constr. Build. Mater. 2014, 73, 771–780. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Zhao, X.G.; Yang, J.H.; Fu, Y.; Yang, D.; Xu, Q.; Yu, L.; Wei, S.H.; Zhang, L. Design of Lead-Free Inorganic Halide Perovskites for Solar Cells via Cation-Transmutation. J. Am. Chem. Soc. 2017, 139, 2630–2638. [Google Scholar] [CrossRef]
- Li, W.; Jacobs, R.; Morgan, D. Predicting the thermodynamic stability of perovskite oxides using machine learning models. Comput. Mater. Sci. 2018, 150, 454–463. [Google Scholar] [CrossRef]









| Model ID | Model Name | Note |
|---|---|---|
| 1 | CatBR | Gradient Boosting with categorical feature support |
| 2 | XGBR | Extreme Gradient Boosting Regressor |
| 3 | RFR | Random Forest Regressor |
| 4 | LGBR | Light Gradient Boosting Machine Regression |
| 5 | Bag | Bagging Regressor |
| 6 | GBR | Gradient Boosting Regressor |
| 7 | ETR | Extra-Tree Regressor |
| 8 | DTR | Decision Tree Regressor |
| 9 | Ada | AdaBoost Regressor |
| 10 | LR | Linear regressor |
| 11 | BR | Bayesian Ridge |
| 12 | SVR | SVR with RBF kernel |
| Stacking Models | Single Models | ||||||
|---|---|---|---|---|---|---|---|
| Meta-Learner | RMSE (eV) | MAE (eV) | R2 | Model Name | RMSE (eV) | MAE (eV) | R2 |
| CatBR | 0.1944 | 0.1131 | 0.9381 | CatBR | 0.2091 | 0.1230 | 0.9282 |
| XGBR | 0.2047 | 0.1173 | 0.9308 | XGBR | 0.2375 | 0.1450 | 0.9073 |
| RFR | 0.1903 | 0.1070 | 0.9402 | RFR | 0.2681 | 0.1801 | 0.8833 |
| LGBR | 0.2145 | 0.1307 | 0.9256 | LGBR | 0.2730 | 0.1909 | 0.8788 |
| Bag | 0.1965 | 0.1124 | 0.9364 | Bag | 0.2899 | 0.1939 | 0.8632 |
| GBR | 0.1932 | 0.1129 | 0.9384 | GBR | 0.2927 | 0.2126 | 0.8619 |
| ETR | 0.2614 | 0.1551 | 0.8871 | ETR | 0.3557 | 0.1949 | 0.7895 |
| DTR | 0.2469 | 0.1442 | 0.8997 | DTR | 0.3607 | 0.2012 | 0.7848 |
| Ada | 0.2334 | 0.1659 | 0.9110 | Ada | 0.4419 | 0.3684 | 0.6887 |
| LR | 0.1862 | 0.1102 | 0.9427 | LR | 0.4865 | 0.3920 | 0.6231 |
| BR | 0.1868 | 0.1108 | 0.9423 | BR | 0.4891 | 0.3933 | 0.6192 |
| SVR | 0.1803 | 0.1050 | 0.9459 | SVR | 0.7182 | 0.5759 | 0.5841 |
| Stacking Models | Single Models | ||||||
|---|---|---|---|---|---|---|---|
| Meta-Learner | RMSE (eV/atom) | MAE (eV/atom) | R2 | Model Name | RMSE (eV/atom) | MAE (eV/atom) | R2 |
| CatBR | 0.01460 | 0.00985 | 0.99598 | CatBR | 0.01422 | 0.00748 | 0.99585 |
| GBR | 0.01344 | 0.00830 | 0.99642 | GBR | 0.01475 | 0.01010 | 0.99572 |
| XGBR | 0.01543 | 0.00973 | 0.99536 | XGBR | 0.01608 | 0.01011 | 0.99493 |
| LGBR | 0.01827 | 0.01090 | 0.99370 | LGBR | 0.01662 | 0.01148 | 0.99464 |
| RFR | 0.01269 | 0.00761 | 0.99675 | RFR | 0.02460 | 0.01856 | 0.98857 |
| Bag | 0.01325 | 0.00807 | 0.99649 | Bag | 0.02672 | 0.02046 | 0.98654 |
| LR | 0.01090 | 0.00600 | 0.99750 | LR | 0.03314 | 0.02542 | 0.97962 |
| BR | 0.01092 | 0.00601 | 0.99749 | BR | 0.03324 | 0.02558 | 0.97950 |
| DTR | 0.01694 | 0.01018 | 0.99425 | DTR | 0.03336 | 0.02529 | 0.97907 |
| ETR | 0.01817 | 0.01150 | 0.99348 | ETR | 0.03639 | 0.02816 | 0.97506 |
| Ada | 0.02223 | 0.01569 | 0.99074 | Ada | 0.04637 | 0.03654 | 0.95982 |
| SVR | 0.05297 | 0.04787 | 0.94750 | SVR | 0.06494 | 0.05328 | 0.92149 |
| Parameter | Value | Description |
|---|---|---|
| Population Size | 20 | Number of candidate stacking model configurations in each generation. |
| Generations | 20 | Total iterations of the evolutionary process. |
| Crossover Probability | 0.8 | Probability of performing crossover between two parent solutions. |
| Mutation Probability | 0.2 | The probability that an individual will undergo mutation. |
| Number of fitness evaluations | 40 runs × 5-fold CV | Each stacking model evaluated by 40 repeated runs of five-fold cross-validation. |
| Fitness metrics | MSE, RMSE, MAE, R2, Nbase | Five regression metrics used as multi-objective optimization criteria. |
| (a) | |||||||||
| Pareto Solutions | Base_Model_Size | Base_Model | Meta-Learner | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | C_I | Optimal Model |
| Solution_A | 4 | CatBR, GBR, ETR, Ada | SVR | 0.0324 | 0.1759 | 0.1010 | 0.9483 | 0.3559 | N |
| Solution_B | 4 | CatBR, RFR, GBR, ETR | SVR | 0.0323 | 0.1755 | 0.1011 | 0.9485 | 0.6441 | Y |
| (b) | |||||||||
| Pareto Solutions | Base_Model_Size | Base_Model | Meta-Learner | MSE (eV2/atom2) | RMSE (eV/atom) | MAE (eV/atom) | R2 | C_I | Optimal Model |
| Solution_C | 8 | CatBR, RFR, XGBR, Bag, GBR, ETR, Ada, LGBR | BR | 0.000136 | 0.010915 | 0.006039 | 0.997495 | 0.0447 | N |
| Solution_D | 6 | CatBR, Bag, XGBR, GBR, Ada, LGBR | BR | 0.000136 | 0.010901 | 0.006059 | 0.997499 | 0.9553 | Y |
| Metric | Best Stacking Model (Unoptimized) | Optimal Stacking Model (Optimized) | Improvement |
|---|---|---|---|
| MSE (eV2) | 0.0340 | 0.0323 | ↓ 5.02% |
| RMSE (eV)) | 0.1803 | 0.1755 | ↓ 2.70% |
| MAE (eV) | 0.1050 | 0.1011 | ↓ 3.72% |
| R2 | 0.9459 | 0.9485 | ↑ 0.28% |
| Base Model Size | 12 base models | 4 base models | ↓ 66.7% |
| Meta-learner | SVR | SVR | — |
| Metric | Best Stacking Model (Unoptimized) | Optimal Stacking Model (Optimized) | Improvement |
|---|---|---|---|
| MSE (eV2/atom2) | 0.000136 | 0.000136 | 0% |
| RMSE (eV/atom) | 0.010907 | 0.010901 | ↓ 0.06% |
| MAE (eV/atom) | 0.006011 | 0.006059 | ↓ 0.8% |
| R2 | 0.997494 | 0.997499 | ↑ 0.0005% |
| Base Model Size | 12 base models | 6 base models | ↓ 50% |
| Meta-learner | LR | BR | — |
| (a) | |||||||
| Model | Base_Model_Size | Base_Model | Meta-Learner | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 |
| TOPSIS-selected | 4 | CatBR, RFR, GBR, ETR | SVR | 0.0323 (±0.0134) | 0.1755 (±0.0375) | 0.1011 (±0.0139) | 0.9485 (±0.0223) |
| Optimal from 400 RS | 6 | CatBR, XGBR, LR, GBR, ETR, Ada, | SVR | 0.0331 (±0.0135) | 0.17834 (±0.0382) | 0.1041 (±0.0142) | 0.9472 (±0.0225) |
| Improvement (%) | +33.33% | — | — | +2.47% | +1.59% | +2.91% | +0.14% |
| (b) | |||||||
| Model | Base_Model_Size | Base_Model | Meta-Learner | MSE (eV2/atom2) | RMSE (eV/atom) | MAE (eV/atom) | R2 |
| TOPSIS-selected | 6 | CatBR, Bag, XGBR, GBR, Ada, LGBR | BR | 0.000136 (±0.000111) | 0.010901 (±0.004134) | 0.006059 (±0.000932) | 0.997499 (±0.002038) |
| Optimal from 400 RS | 7 | CatBR, Bag, GBR, ETR, Ada, LGBM, LR | BR | 0.000141 (±0.000113) | 0.011119 (±0.004181) | 0.006430 (±0.000928) | 0.997406 (±0.002081) |
| Improvement (%) | 14.29% | — | — | +3.49% | +1.96% | +5.76% | +0.01% |
| Nos. | Compounds | Predicted Eg (eV) | Classified with Predicted Ega | Classified with Calculated Egb | Predicted ΔHf (eV/atom) | Hfa | Hb |
|---|---|---|---|---|---|---|---|
| 1 | Cs2NaSbBr6 | 1.2198 | N | N | −0.7630 | Y | Y |
| 2 | Cs2KSbBr6 | 0.9398 | N | N | −0.8058 | Y | Y |
| 3 | Cs2RbSbBr6 | 1.2994 | N | N | −0.8484 | Y | Y |
| 4 | Cs2NaBiBr6 | 0.7174 | Y | N | −0.8432 | Y | Y |
| 5 | Cs2KBiBr6 | 0.9142 | N | N | −0.8860 | Y | Y |
| 6 | Cs2RbBiBr6 | 1.0474 | N | N | −0.9317 | Y | Y |
| 7 | Cs2NaSbCl6 | 1.3333 | N | N | −1.0036 | Y | Y |
| 8 | Cs2KSbCl6 | 1.3671 | N | N | −1.0817 | Y | Y |
| 9 | Cs2RbSbCl6 | 1.7458 | N | N | −1.1066 | Y | Y |
| 10 | Cs2NaBiCl6 | 1.1677 | N | N | −1.0937 | Y | Y |
| 11 | Cs2KBiCl6 | 1.3577 | N | N | −1.1813 | Y | Y |
| 12 | Cs2RbBiCl6 | 1.9255 | N | N | −1.2019 | Y | Y |
| 13 | Cs2NaSbI6 | 1.2048 | N | N | −0.7630 | Y | Y |
| 14 | Cs2KSbI6 | 0.9420 | N | N | −0.8058 | Y | N |
| 15 | Cs2RbSbI6 | 1.2464 | N | N | −0.8484 | Y | N |
| 16 | Cs2NaBiI6 | 0.7065 | Y | N | −0.8432 | Y | Y |
| 17 | Cs2KBiI6 | 0.8835 | N | N | −0.8860 | Y | N |
| 18 | Cs2RbBiI6 | 0.9172 | N | N | −0.9317 | Y | N |
| 19 | Cs2NaSbF6 | 1.3905 | N | N | −1.1677 | Y | Y |
| 20 | Cs2NaBiF6 | 1.3297 | N | N | −1.2771 | Y | Y |
| 21 | Cs2KBiF6 | 2.0333 | N | N | −1.4035 | Y | Y |
| 22 | Cs2KSbF6 | 1.9047 | N | N | −1.2915 | Y | Y |
| 23 | Cs2RbSbF6 | 2.1867 | N | N | −1.3060 | Y | Y |
| 24 | Cs2RbBiF6 | 2.5132 | N | N | −1.4065 | Y | Y |
| Method | Accuracy (%) | F1 Score (%) | Precision |
|---|---|---|---|
| Bartel’s τ descriptor | 91.00 | — | — |
| Our method | 96.22 (±1.01) | 98.03 (±0.53) | 96.89 (±0.83) |
| Improvement | +5.22% | — | — |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liang, G.; Zhang, J. Integrating NSGA-II and TOPSIS for Stacking Model Optimization in Pursuit of Halide Double Perovskite Screening. Materials 2026, 19, 2018. https://doi.org/10.3390/ma19102018
Liang G, Zhang J. Integrating NSGA-II and TOPSIS for Stacking Model Optimization in Pursuit of Halide Double Perovskite Screening. Materials. 2026; 19(10):2018. https://doi.org/10.3390/ma19102018
Chicago/Turabian StyleLiang, Guiqin, and Jian Zhang. 2026. "Integrating NSGA-II and TOPSIS for Stacking Model Optimization in Pursuit of Halide Double Perovskite Screening" Materials 19, no. 10: 2018. https://doi.org/10.3390/ma19102018
APA StyleLiang, G., & Zhang, J. (2026). Integrating NSGA-II and TOPSIS for Stacking Model Optimization in Pursuit of Halide Double Perovskite Screening. Materials, 19(10), 2018. https://doi.org/10.3390/ma19102018

