This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions
by
Xue Wang
Xue Wang 1,2,
Linyu Lu
Linyu Lu 1,
Qiuxin Ma
Qiuxin Ma 1,
Hongyan Shang
Hongyan Shang 1 and
Lanyi Sun
Lanyi Sun 1,*
1
College of Chemical Engineering, China University of Petroleum, Qingdao 266580, China
2
School of Chemical Engineering, Shandong Institute of Petrochemical and Chemical Technology, Dongying 257061, China
*
Author to whom correspondence should be addressed.
Molecules 2025, 30(23), 4601; https://doi.org/10.3390/molecules30234601 (registering DOI)
Submission received: 9 October 2025
/
Revised: 21 November 2025
/
Accepted: 28 November 2025
/
Published: 29 November 2025
Abstract
In chemical reaction processes, yield prediction frequently faces challenges, such as multi-variable coupling, significant nonlinearity, and the limited accuracy of traditional mechanistic models. This study develops a datadriven prediction model that integrates the genetic algorithm (GA) with CatBoost to address these challenges. Four variables, including reactant ratio (nbutanol to trioxane), reaction temperature, reaction time, and catalyst concentration, were selected as model inputs based on 88 sets of experimental data. The model outputs focused on the yield of polymethoxy dibutyl ether with a polymerization degree of 1 (BTPOM1) and the total yield of polymethoxy dibutyl ether with polymerization degrees of 1 to 8 (BTPOM1–8). The model achieved automatic optimization of CatBoost on hyperparameters by combining a hybrid-coding genetic algorithm. The results demonstrated that the GACatBoost model significantly outperformed GAAdaBoost for both datasets: for BTPOM1, it reduced the mean squared error (MSE) by 50.1%, mean absolute error (MAE) by 40.6%, and mean absolute percentage error (MAPE) by 17.8% relative to GAAdaBoost. For BTPOM1–8, the reductions were more pronounced, with MSE decreasing by 54.0%, MAE by 45.0%, and MAPE by 33.8% compared to GAAdaBoost. Additionally, the GACatBoost model significantly outperformed three classical machine learning algorithms: Support Vector Regression (SVR), Random Forest (RF), and KNearest Neighbor (KNN). Feature importance analysis revealed that reaction time and reaction temperature are the key factors influencing BTPOMn yield. This research provides a feasible approach for accurate synthesis yield prediction and process optimization under small sample conditions. It is particularly valuable for early-stage laboratory research where experimental data is often limited.
Share and Cite
MDPI and ACS Style
Wang, X.; Lu, L.; Ma, Q.; Shang, H.; Sun, L.
Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions. Molecules 2025, 30, 4601.
https://doi.org/10.3390/molecules30234601
AMA Style
Wang X, Lu L, Ma Q, Shang H, Sun L.
Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions. Molecules. 2025; 30(23):4601.
https://doi.org/10.3390/molecules30234601
Chicago/Turabian Style
Wang, Xue, Linyu Lu, Qiuxin Ma, Hongyan Shang, and Lanyi Sun.
2025. "Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions" Molecules 30, no. 23: 4601.
https://doi.org/10.3390/molecules30234601
APA Style
Wang, X., Lu, L., Ma, Q., Shang, H., & Sun, L.
(2025). Prediction of Synthesis Yield of Polymethoxy Dibutyl Ether Under Small Sample Conditions. Molecules, 30(23), 4601.
https://doi.org/10.3390/molecules30234601
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.