Next Article in Journal
Evaluation Rigor from Graph Neural Networks to Graph Foundation Models: A Systematic Review and a Four-Axis Reporting Standard
Previous Article in Journal
Multi-Stage Evaluation Framework for Identifying Deployment-Ready Prediabetes Prediction Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization

1
Department of Economics Management and Statistics, University of Milano-Bicocca, 20126 Milan, Italy
2
Department of Computer Science Systems and Communication, University of Milano-Bicocca, 20126 Milan, Italy
*
Author to whom correspondence should be addressed.
Mach. Learn. Knowl. Extr. 2026, 8(7), 193; https://doi.org/10.3390/make8070193
Submission received: 29 April 2026 / Revised: 25 June 2026 / Accepted: 1 July 2026 / Published: 3 July 2026

Abstract

Gaussian process-based Bayesian optimization (BO) is a sample-efficient sequential strategy for optimizing expensive black-box functions. The Gaussian process provides a probabilistic approximation of the unknown function, while an acquisition function balances exploration and exploitation to select the next evaluation point. Despite significant research efforts, no master acquisition function has been identified. This paper proposes a novel adaptive acquisition function that dynamically adjusts the exploration–exploitation trade-off based on the evolution of the optimization process, rather than using fixed or random scheduling. While implemented here within a GP-based BO framework, the core switching mechanism is surrogate-agnostic: the exploitative component requires only a surrogate point prediction, and the explorative component is entirely model-free. Unlike traditional approaches, where mechanisms like UCB/LCB lean toward exploration over iterations, or fixed strategies that switch from exploratory (EI) to exploitative (PI) behavior at predetermined points, the proposed method makes purely exploitative decisions using only the GP’s prediction. However, it discards these decisions when they have low potential for significant improvement, instead focusing on uncertainty reduction. Notably, this approach uses inverse distance weighting for uncertainty quantification rather than the GP’s predictive uncertainty, avoiding bias from the GP’s predictions. Testing on benchmark functions demonstrates that the proposed acquisition function is almost always Pareto optimal, offering the most balanced trade-off between convergence to the global optimum and exploration capability compared to state-of-the-art alternatives.
Keywords: Bayesian optimization; Gaussian process; acquisition function; confidence bound Bayesian optimization; Gaussian process; acquisition function; confidence bound

Share and Cite

MDPI and ACS Style

Candelieri, A.; Archetti, F.; Seyedi, I. When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization. Mach. Learn. Knowl. Extr. 2026, 8, 193. https://doi.org/10.3390/make8070193

AMA Style

Candelieri A, Archetti F, Seyedi I. When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization. Machine Learning and Knowledge Extraction. 2026; 8(7):193. https://doi.org/10.3390/make8070193

Chicago/Turabian Style

Candelieri, Antonio, Francesco Archetti, and Iman Seyedi. 2026. "When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization" Machine Learning and Knowledge Extraction 8, no. 7: 193. https://doi.org/10.3390/make8070193

APA Style

Candelieri, A., Archetti, F., & Seyedi, I. (2026). When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization. Machine Learning and Knowledge Extraction, 8(7), 193. https://doi.org/10.3390/make8070193

Article Metrics

Back to TopTop