Next Article in Journal
Ocular Toxicity and Mechanistic Investigation for Berberine and Its Metabolite Berberrubine on Zebrafish
Previous Article in Journal
Blood in Capsules: Multi-Technique Forensic Investigation of Suspicious Food Supplement
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions

1
College of Chemical Engineering, China University of Petroleum, Qingdao 266580, China
2
School of Chemical Engineering, Shandong Institute of Petrochemical and Chemical Technology, Dongying 257061, China
*
Author to whom correspondence should be addressed.
Molecules 2025, 30(23), 4601; https://doi.org/10.3390/molecules30234601 (registering DOI)
Submission received: 9 October 2025 / Revised: 21 November 2025 / Accepted: 28 November 2025 / Published: 29 November 2025

Abstract

In chemical reaction processes, yield prediction frequently faces challenges, such as multi-variable coupling, significant nonlinearity, and the limited accuracy of traditional mechanistic models. This study develops a datadriven prediction model that integrates the genetic algorithm (GA) with CatBoost to address these challenges. Four variables, including reactant ratio (nbutanol to trioxane), reaction temperature, reaction time, and catalyst concentration, were selected as model inputs based on 88 sets of experimental data. The model outputs focused on the yield of polymethoxy dibutyl ether with a polymerization degree of 1 (BTPOM1) and the total yield of polymethoxy dibutyl ether with polymerization degrees of 1 to 8 (BTPOM1–8). The model achieved automatic optimization of CatBoost on hyperparameters by combining a hybrid-coding genetic algorithm. The results demonstrated that the GACatBoost model significantly outperformed GAAdaBoost for both datasets: for BTPOM1, it reduced the mean squared error (MSE) by 50.1%, mean absolute error (MAE) by 40.6%, and mean absolute percentage error (MAPE) by 17.8% relative to GAAdaBoost. For BTPOM1–8, the reductions were more pronounced, with MSE decreasing by 54.0%, MAE by 45.0%, and MAPE by 33.8% compared to GAAdaBoost. Additionally, the GACatBoost model significantly outperformed three classical machine learning algorithms: Support Vector Regression (SVR), Random Forest (RF), and KNearest Neighbor (KNN). Feature importance analysis revealed that reaction time and reaction temperature are the key factors influencing BTPOMn yield. This research provides a feasible approach for accurate synthesis yield prediction and process optimization under small sample conditions. It is particularly valuable for early-stage laboratory research where experimental data is often limited.
Keywords: polymethoxy dibutyl ether (BTPOMn); yield prediction; genetic algorithm (GA); CatBoost; small sample size; datadriven modeling polymethoxy dibutyl ether (BTPOMn); yield prediction; genetic algorithm (GA); CatBoost; small sample size; datadriven modeling

Share and Cite

MDPI and ACS Style

Wang, X.; Lu, L.; Ma, Q.; Shang, H.; Sun, L. Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions. Molecules 2025, 30, 4601. https://doi.org/10.3390/molecules30234601

AMA Style

Wang X, Lu L, Ma Q, Shang H, Sun L. Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions. Molecules. 2025; 30(23):4601. https://doi.org/10.3390/molecules30234601

Chicago/Turabian Style

Wang, Xue, Linyu Lu, Qiuxin Ma, Hongyan Shang, and Lanyi Sun. 2025. "Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions" Molecules 30, no. 23: 4601. https://doi.org/10.3390/molecules30234601

APA Style

Wang, X., Lu, L., Ma, Q., Shang, H., & Sun, L. (2025). Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions. Molecules, 30(23), 4601. https://doi.org/10.3390/molecules30234601

Article Metrics

Back to TopTop