When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization

Candelieri, Antonio; Archetti, Francesco; Seyedi, Iman

doi:10.3390/make8070193

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization

by

Antonio Candelieri

¹

,

Francesco Archetti

² and

Iman Seyedi

^2,*

¹

Department of Economics Management and Statistics, University of Milano-Bicocca, 20126 Milan, Italy

²

Department of Computer Science Systems and Communication, University of Milano-Bicocca, 20126 Milan, Italy

^*

Author to whom correspondence should be addressed.

Mach. Learn. Knowl. Extr. 2026, 8(7), 193; https://doi.org/10.3390/make8070193

Submission received: 29 April 2026 / Revised: 25 June 2026 / Accepted: 1 July 2026 / Published: 3 July 2026

Download Versions Notes

Abstract

Gaussian process-based Bayesian optimization (BO) is a sample-efficient sequential strategy for optimizing expensive black-box functions. The Gaussian process provides a probabilistic approximation of the unknown function, while an acquisition function balances exploration and exploitation to select the next evaluation point. Despite significant research efforts, no master acquisition function has been identified. This paper proposes a novel adaptive acquisition function that dynamically adjusts the exploration–exploitation trade-off based on the evolution of the optimization process, rather than using fixed or random scheduling. While implemented here within a GP-based BO framework, the core switching mechanism is surrogate-agnostic: the exploitative component requires only a surrogate point prediction, and the explorative component is entirely model-free. Unlike traditional approaches, where mechanisms like UCB/LCB lean toward exploration over iterations, or fixed strategies that switch from exploratory (EI) to exploitative (PI) behavior at predetermined points, the proposed method makes purely exploitative decisions using only the GP’s prediction. However, it discards these decisions when they have low potential for significant improvement, instead focusing on uncertainty reduction. Notably, this approach uses inverse distance weighting for uncertainty quantification rather than the GP’s predictive uncertainty, avoiding bias from the GP’s predictions. Testing on benchmark functions demonstrates that the proposed acquisition function is almost always Pareto optimal, offering the most balanced trade-off between convergence to the global optimum and exploration capability compared to state-of-the-art alternatives.

Keywords: Bayesian optimization; Gaussian process; acquisition function; confidence bound

Share and Cite

MDPI and ACS Style

Candelieri, A.; Archetti, F.; Seyedi, I. When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization. Mach. Learn. Knowl. Extr. 2026, 8, 193. https://doi.org/10.3390/make8070193

AMA Style

Candelieri A, Archetti F, Seyedi I. When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization. Machine Learning and Knowledge Extraction. 2026; 8(7):193. https://doi.org/10.3390/make8070193

Chicago/Turabian Style

Candelieri, Antonio, Francesco Archetti, and Iman Seyedi. 2026. "When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization" Machine Learning and Knowledge Extraction 8, no. 7: 193. https://doi.org/10.3390/make8070193

APA Style

Candelieri, A., Archetti, F., & Seyedi, I. (2026). When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization. Machine Learning and Knowledge Extraction, 8(7), 193. https://doi.org/10.3390/make8070193

Article Menu

When to Explore and When to Exploit: Adaptive Decisions in Bayesian Optimization

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI