Article

Bayesian Optimization for Categorical and Mixed Variables Using a Multinomial Logit Surrogate

by Muhammad Amir Saeed * and Antonio Candelieri *
Department of Economics, Quantitative Methods and Business Strategies, University of Milano-Bicocca, 20126 Milan, Italy
* Authors to whom correspondence should be addressed.
Algorithms 2026, 19(5), 361; https://doi.org/10.3390/a19050361
Submission received: 9 March 2026 / Revised: 24 April 2026 / Accepted: 29 April 2026 / Published: 4 May 2026

Abstract

Bayesian optimization (BO) is a widely used framework for optimizing expensive black-box functions. Most BO methods rely on Gaussian process (GP) surrogates, which perform well in continuous domains but encounter difficulties when decision variables include categorical or mixed discrete–continuous components. In particular, GP-based approaches typically require ad hoc numerical encodings of categorical variables that may fail to capture the structure of discrete decision spaces. In this work, we propose MNL-BO (Multinomial Logit Bayesian Optimization), a preference-based Bayesian optimization framework that replaces the GP surrogate with a multinomial logit (MNL) model trained on pairwise preference comparisons. The resulting surrogate provides a natural and interpretable representation of categorical alternatives while allowing continuous, discrete, and categorical variables to be handled within a unified optimization framework. The predictive utility estimates and uncertainty indicators generated by the MNL model are used to formulate acquisition functions that balance exploration and exploitation. The proposed methodology is evaluated on three progressively more complex optimization problems: a purely categorical benchmark, a combinatorial Traveling Salesman Problem, and a constrained mixed-variable engineering design problem concerning material selection in pressure vessel optimization. Across multiple runs, MNL-BO shows consistent advantages over random search and stable convergence behavior across diverse random initializations. In addition to heuristic baselines such as local search and classical metaheuristics, we also compare against tree-based Bayesian optimization baselines inspired by the Sequential Model-based Algorithm Configuration (SMAC) framework. The results indicate that the proposed MNL-BO method achieves competitive performance under comparable evaluation budgets while providing an interpretable probabilistic surrogate for categorical decision spaces. These findings suggest that preference-based surrogate modeling provides a practical and flexible alternative for Bayesian optimization in categorical and mixed-variable optimization problems.
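
To make the idea of a preference-trained logit surrogate concrete, the following is a minimal sketch, not the authors' implementation. It assumes a two-alternative logit (the pairwise special case of the MNL model) over one-hot encoded categorical configurations, fitted by plain gradient ascent with an L2 penalty. The toy objective `black_box`, the level layout `levels`, the helper names, and the count-based exploration bonus in `acquisition` are all hypothetical and chosen only for illustration.

```python
import itertools
import numpy as np

def one_hot(config, levels):
    """Encode a tuple of category indices as a concatenated one-hot vector."""
    vec = np.zeros(sum(levels))
    offset = 0
    for value, n in zip(config, levels):
        vec[offset + value] = 1.0
        offset += n
    return vec

def fit_pairwise_logit(winners, losers, l2=1e-2, lr=0.5, steps=500):
    """Fit utility weights w so that P(a preferred to b) = sigmoid(w . (x_a - x_b))."""
    diff = winners - losers
    w = np.zeros(diff.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-diff @ w))             # predicted win probability
        grad = diff.T @ (1.0 - p) / len(diff) - l2 * w  # penalized log-likelihood gradient
        w += lr * grad                                   # gradient ascent step
    return w

def acquisition(x, w, level_counts, beta=1.0):
    """Hypothetical score: estimated utility plus a count-based exploration bonus."""
    bonus = x @ (1.0 / np.sqrt(1.0 + level_counts))
    return x @ w + beta * bonus

# Toy problem (assumed): two categorical variables with 3 and 2 levels.
levels = (3, 2)
cost_a, cost_b = [3.0, 1.0, 2.0], [0.5, 0.0]

def black_box(config):
    """Stand-in for an expensive objective; lower cost is better."""
    return cost_a[config[0]] + cost_b[config[1]]

candidates = list(itertools.product(*(range(n) for n in levels)))
costs = {c: black_box(c) for c in candidates}

# Turn evaluations into pairwise preferences: the lower-cost configuration wins.
pairs = [(a, b) if costs[a] < costs[b] else (b, a)
         for a, b in itertools.combinations(candidates, 2)]
winners = np.array([one_hot(a, levels) for a, _ in pairs])
losers = np.array([one_hot(b, levels) for _, b in pairs])

w = fit_pairwise_logit(winners, losers)
X = np.array([one_hot(c, levels) for c in candidates])
best = candidates[int(np.argmax(X @ w))]  # configuration with highest estimated utility
```

Each entry of `w` is the estimated utility contribution of one category level, which is what makes this kind of surrogate easy to read. In a sequential loop, one would score unevaluated candidates with something like `acquisition` and evaluate the top-scoring one next; how the paper actually quantifies uncertainty is not specified in the abstract.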
Keywords: Bayesian optimization; categorical variables; multinomial logit model; black-box optimization; discrete optimization
