Abstract
The accurate and rapid determination of olefin content in gasoline is crucial for fuel quality control. While near-infrared spectroscopy (NIR) offers a rapid analytical solution, multiple parameters in the conventional partial least squares regression (PLSR) modeling process rely on the modeler’s subjective judgment. Consequently, the quantitative accuracy of the model is often influenced by the modeler’s experience. To address this limitation, this study developed an integrated adaptive PLSR framework. The methodology incorporates four core adaptive components: automated selection of latent variables based on the rate of decrease in PRESS values, dynamic formation of calibration subsets using Spectral Angle Distance and sample number thresholds, optimization of informative wavelength regions via correlation coefficients, and systematic database cleaning through iterative residual analysis. Applied to 248 gasoline samples, this strategy dramatically enhanced model performance, increasing the coefficient of determination (R2) from 0.7391 to 0.9102 and reducing the root mean square error (RMSE) from 1.51% to 0.866% compared to the global PLSR model. This work demonstrates that the adaptive PLSR framework effectively mitigates spectral nonlinearity and improves predictive robustness, thereby providing a reliable and practical solution for the on-site, rapid monitoring of gasoline quality using handheld NIR spectrometers.