Decision-Making Under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets
Abstract
1. Introduction
2. Background
2.1. Bayesian Formulations of DRO
2.2. Robust Bayesian Inference via Divergences
Robust NPL Posterior
- Sample from the posterior .
- Compute where as in (6).
2.3. Maximum Mean Discrepancy
2.4. DRO with the Maximum Mean Discrepancy
3. DRO with Robust Bayesian Ambiguity Sets
3.1. Duality of the DRO-RoBAS Problem
- Computation of (15):
3.2. Tolerance Level Guarantees
4. Experiments
- Model misspecification which occurs when the DGP does not belong to the model family , e.g., if is multimodal while assumes unimodality. This affects Bayesian DRO methods (BDRO, DRO-BAS, DRO-RoBAS) but not empirical approaches, as the latter do not rely on a model.
- Huber’s contamination model [41] which is a specific type of model misspecification (see Figure 2) wherein the training DGP is for some and . Contamination, limited to the training set, impacts both Bayesian and empirical DRO methods since the test distribution is assumed to be . Huber contamination relates to concepts like distribution shift and out-of-distribution robustness (e.g., [42]) and its importance in DRO has attracted increasing attention in recent work [43,44].
4.1. The Newsvendor Problem
4.2. The Portfolio Optimisation Problem
4.3. Computational Time
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A. Proofs of Theoretical Results
Appendix A.1. Proof of Proposition 1
Appendix A.2. Proof of Corollary 1
Appendix A.3. Proof of Theorem 2
Appendix A.4. Proof of Corollary 3
Appendix B. Additional Experimental Details
Data-Generating Process Settings
| DGP | RoBAS | Empirical MMD | KL-BDRO | DRO-BASPE | DRO-BASPP |
|---|---|---|---|---|---|
| 1D bimodal | 275.33 (25.98) | 2.17 (0.25) | 3.90 (0.55) | 0.29 (0.03) | 0.30 (0.05) |
| 5D bimodal | 262.94 (31.72) | 2.18 (0.36) | 2.63 (0.26) | 1.24 (0.19) | 1.38 (0.29) |
| 286.27 (16.14) | 2.22 (0.28) | 4.01 (0.49) | 0.29 (0.04) | 0.30 (0.04) | |
| 284.62 (17.42) | 2.23 (0.24) | 3.99 (0.42) | 0.29 (0.03) | 0.30 (0.04) | |
| 260.67 (35.90) | 2.20 (0.24) | 3.74 (0.52) | 0.28 (0.03) | 0.28 (0.04) | |
| Exp, | 311.74 (37.48) | 5.59 (0.68) | 0.43 (0.06) | 0.40 (0.06) | 0.49 (0.06) |
| Exp, | 296.65 (46.10) | 5.39 (0.54) | 0.44 (0.06) | 0.42 (0.05) | 0.50 (0.06) |
| Exp, | 282.55 (53.23) | 5.22 (0.47) | 0.44 (0.06) | 0.42 (0.05) | 0.50 (0.06) |
| DGP | RoBAS | Empirical MMD | KL-BDRO | DRO-BASPE | DRO-BASPP |
|---|---|---|---|---|---|
| 1D bimodal | 15.65 (4.76) | 0.0 (0.0) | 0.001 (0.002) | 0.0002 (0.0) | 0.0002 (0.0) |
| 5D bimodal | 15.86 (4.84) | 0.0 (0.0) | 0.0104 (0.0004) | 0.0006 (0.0003) | 0.001 (0.0004) |
| 15.85 (4.67) | 0.0 (0.0) | 0.0009 (0.002) | 0.0002 (0.0) | 0.0002 (0.0) | |
| 15.78 (4.59) | 0.0 (0.0) | 0.0009 (0.0017) | 0.0002 (0.0) | 0.0002 (0.0) | |
| 15.68 (4.58) | 0.0 (0.0) | 0.0008 (0.0014) | 0.0002 (0.0) | 0.0002 (0.0) | |
| Exp, | 38.10 (5.62) | 0.0 (0.0) | 0.1 (0.01) | 0.0 (0.0) | 0.0 (0.0) |
| Exp, | 37.77 (4.78) | 0.0 (0.0) | 0.1 (0.01) | 0.0 (0.0) | 0.0 (0.0) |
| Exp, | 37.82 (4.85) | 0.0 (0.0) | 0.1 (0.01) | 0.0 (0.0) | 0.0 (0.0) |
Appendix C. Alternative RoBAS Formulations
References
- Black, B. Parametric Distributionally Robust Optimisation Models for Resource and Inventory Planning Problems; Lancaster University: Lancaster, UK, 2022. [Google Scholar]
- Li, J.Y.; Kwon, R.H. Portfolio selection under model uncertainty: A penalized moment-based optimization approach. J. Glob. Optim. 2013, 56, 131–164. [Google Scholar] [CrossRef]
- Zhang, J.; Menon, A.K.; Veit, A.; Bhojanapalli, S.; Kumar, S.; Sra, S. Coping with label shift via distributionally robust optimisation. In Proceedings of the International Conference on Learning Representations, Virtual, 3–7 May 2021. [Google Scholar]
- Kuhn, D.; Esfahani, P.M.; Nguyen, V.A.; Shafieezadeh-Abadeh, S. Wasserstein Distributionally Robust Optimization: Theory and Applications in Machine Learning. In Operations Research & Management Science in the Age of Analytics; Informs: Catonsville, MD, USA, 2019; Chapter 6; pp. 130–166. [Google Scholar] [CrossRef]
- Staib, M.; Jegelka, S. Distributionally robust optimization and generalization in kernel methods. Adv. Neural Inf. Process. Syst. 2019, 32, 9134–9144. [Google Scholar]
- Zhu, J.J.; Jitkrittum, W.; Diehl, M.; Schölkopf, B. Kernel distributionally robust optimization: Generalized duality theorem and stochastic approximation. In Proceedings of the International Conference on Artificial Intelligence and Statistics; PMLR: Cambridge, MA, USA, 2021; pp. 280–288. [Google Scholar]
- Hu, Z.; Hong, L.J. Kullback-Leibler divergence constrained distributionally robust optimization. Available Optim. Online 2013, 1, 9. [Google Scholar]
- Iyengar, G.; Lam, H.; Wang, T. Hedging against complexity: Distributionally robust optimization with parametric approximation. In Proceedings of the International Conference on Artificial Intelligence and Statistics; PMLR: Cambridge, MA, USA, 2023; pp. 9976–10011. [Google Scholar]
- Michel, P.; Hashimoto, T.; Neubig, G. Modeling the second player in distributionally robust optimization. In Proceedings of the International Conference on Learning Representations, Virtual, 3–7 May 2021. [Google Scholar]
- Michel, P.; Hashimoto, T.; Neubig, G. Distributionally robust models with parametric likelihood ratios. In Proceedings of the International Conference on Learning Representations, Virtual, 26–28 October 2022. [Google Scholar]
- Shapiro, A.; Zhou, E.; Lin, Y. Bayesian distributionally robust optimization. SIAM J. Optim. 2023, 33, 1279–1304. [Google Scholar] [CrossRef]
- Dellaporta, C.; O’Hara, P.; Damoulas, T. Decision Making under the Exponential Family: Distributionally Robust Optimisation with Bayesian Ambiguity Sets. In Proceedings of the International Conference on Machine Learning, Vancouver, BC, Canada, 13–19 July 2025. [Google Scholar]
- Grünwald, P. The safe Bayesian: Learning the learning rate via the mixability gap. In Proceedings of the International Conference on Algorithmic Learning Theory; Springer: Berlin/Heidelberg, Germany, 2012; pp. 169–183. [Google Scholar]
- Walker, S.G. Bayesian inference with misspecified models. J. Stat. Plan. Inference 2013, 143, 1621–1633. [Google Scholar] [CrossRef]
- Lyddon, S.; Walker, S.; Holmes, C.C. Nonparametric learning from Bayesian models with randomized objective functions. Adv. Neural Inf. Process. Syst. 2018, 31, 2075–2085. [Google Scholar]
- Fong, E.; Lyddon, S.; Holmes, C. Scalable nonparametric sampling from multimodal posteriors with the posterior bootstrap. In Proceedings of the International Conference on Machine Learning; PMLR: Cambridge, MA, USA, 2019; pp. 1952–1962. [Google Scholar]
- Rahimian, H.; Mehrotra, S. Distributionally robust optimization: A review. arXiv 2019, arXiv:1908.05659. [Google Scholar]
- Husain, H.; Nguyen, V.; van den Hengel, A. Distributionally Robust Bayesian Optimization with φ-divergences. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2023; Volume 36. [Google Scholar]
- Wang, S.; Wang, H.; Li, X.; Honorio, J. Learning against distributional uncertainty: On the trade-off between robustness and specificity. IEEE J. Sel. Top. Signal Process. 2025, 19, 1420–1435. [Google Scholar] [CrossRef]
- Dellaporta, C.; Knoblauch, J.; Damoulas, T.; Briol, F.X. Robust Bayesian inference for simulator-based models via the MMD posterior bootstrap. In Proceedings of the International Conference on Artificial Intelligence and Statistics; PMLR: Cambridge, MA, USA, 2022; pp. 943–970. [Google Scholar]
- Bissiri, P.G.; Holmes, C.C.; Walker, S.G. A general framework for updating belief distributions. J. R. Stat. Soc. Ser. B Stat. Methodol. 2016, 78, 1103–1130. [Google Scholar] [CrossRef]
- Ghosh, A.; Basu, A. Robust Bayes estimation using the density power divergence. Ann. Inst. Stat. Math. 2016, 68, 413–437. [Google Scholar] [CrossRef]
- Knoblauch, J.; Jewson, J.; Damoulas, T. An optimization-centric view on Bayes’ rule: Reviewing and generalizing variational inference. J. Mach. Learn. Res. 2022, 23, 1–109. [Google Scholar]
- Matsubara, T.; Knoblauch, J.; Briol, F.X.; Oates, C.J. Robust generalised Bayesian inference for intractable likelihoods. J. R. Stat. Soc. Ser. B Stat. Methodol. 2022, 84, 997–1022. [Google Scholar] [CrossRef]
- Altamirano, M.; Briol, F.X.; Knoblauch, J. Robust and scalable Bayesian online changepoint detection. In Proceedings of the International Conference on Machine Learning; PMLR: Cambridge, MA, USA, 2023; pp. 642–663. [Google Scholar]
- Altamirano, M.; Briol, F.X.; Knoblauch, J. Robust and Conjugate Gaussian Process Regression. In Proceedings of the International Conference on Machine Learning; PMLR: Cambridge, MA, USA, 2024; pp. 1155–1185. [Google Scholar]
- Laplante, W.; Altamirano, M.; Duncan, A.; Knoblauch, J.; Briol, F.X. Robust and conjugate spatio-temporal gaussian processes. arXiv 2025, arXiv:2502.02450. [Google Scholar]
- Müller, A. Integral probability metrics and their generating classes of functions. Adv. Appl. Probab. 1997, 29, 429–443. [Google Scholar] [CrossRef]
- Gretton, A.; Borgwardt, K.M.; Rasch, M.J.; Schölkopf, B.; Smola, A. A kernel two-sample test. J. Mach. Learn. Res. 2012, 13, 723–773. [Google Scholar]
- Briol, F.X.; Barp, A.; Duncan, A.B.; Girolami, M. Statistical inference for generative models with maximum mean discrepancy. arXiv 2019, arXiv:1906.05944. [Google Scholar]
- Gao, R. Finite-sample guarantees for Wasserstein distributionally robust optimization: Breaking the curse of dimensionality. Oper. Res. 2023, 71, 2291–2306. [Google Scholar] [CrossRef]
- Chen, Y.; Kim, J.; Anderson, J. Distributionally robust decision making leveraging conditional distributions. In Proceedings of the IEEE Conference on Decision and Control, Cancun, Mexico, 6–9 December 2022; pp. 5652–5659. [Google Scholar]
- Romao, L.; Hota, A.R.; Abate, A. Distributionally robust optimal and safe control of stochastic systems via kernel conditional mean embedding. In Proceedings of the IEEE Conference on Decision and Control; IEEE: New York, NY, USA, 2023; pp. 2016–2021. [Google Scholar]
- Gao, R.; Kleywegt, A. Distributionally robust stochastic optimization with Wasserstein distance. Math. Oper. Res. 2023, 48, 603–655. [Google Scholar] [CrossRef]
- Vázquez, F.G.; Rückmann, J.J.; Stein, O.; Still, G. Generalized semi-infinite programming: A tutorial. J. Comput. Appl. Math. 2008, 217, 394–419. [Google Scholar] [CrossRef]
- Chérief-Abdellatif, B.E.; Alquier, P. Finite sample properties of parametric MMD estimation: Robustness to misspecification and dependence. Bernoulli 2022, 28, 181–213. [Google Scholar] [CrossRef]
- Alquier, P.; Gerber, M. Universal robust regression via maximum mean discrepancy. Biometrika 2024, 111, 71–92. [Google Scholar] [CrossRef]
- Seeger, M. Gaussian processes for machine learning. Int. J. Neural Syst. 2004, 14, 69–106. [Google Scholar] [CrossRef]
- Mohajerin Esfahani, P.; Kuhn, D. Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Program. 2018, 171, 115–166. [Google Scholar] [CrossRef]
- Bertsimas, D.; Gupta, V.; Kallus, N. Robust saa. arXiv 2014, arXiv:1408.4445. [Google Scholar]
- Huber, P.J. Robust estimation of a location parameter. In Breakthroughs in Statistics: Methodology and Distribution; Springer: Berlin/Heidelberg, Germany, 1992; pp. 492–518. [Google Scholar]
- Liu, J.; Shen, Z.; He, Y.; Zhang, X.; Xu, R.; Yu, H.; Cui, P. Towards out-of-distribution generalization: A survey. arXiv 2021, arXiv:2108.13624. [Google Scholar]
- Zhai, R.; Dan, C.; Kolter, Z.; Ravikumar, P. Doro: Distributional and outlier robust optimization. In Proceedings of the International Conference on Machine Learning; PMLR: Cambridge, MA, USA, 2021; pp. 12345–12355. [Google Scholar]
- Nietert, S.; Goldfeld, Z.; Shafiee, S. Outlier-robust wasserstein dro. Adv. Neural Inf. Process. Syst. 2023, 36, 62792–62820. [Google Scholar]
- Porteus, E.L. Stochastic inventory theory. Handb. Oper. Res. Manag. Sci. 1990, 2, 605–652. [Google Scholar]
- Rahimi, A.; Recht, B. Random features for large-scale kernel machines. Adv. Neural Inf. Process. Syst. 2007, 20, 1177–1184. [Google Scholar]
- Bach, F. Sharp analysis of low-rank kernel matrix approximations. In Proceedings of the Conference on Learning Theory; PMLR: Cambridge, MA, USA, 2013; pp. 185–209. [Google Scholar]
- Jewson, J.; Smith, J.Q.; Holmes, C. Principles of Bayesian inference using general divergence criteria. Entropy 2018, 20, 442. [Google Scholar] [CrossRef]
- Chérief-Abdellatif, B.E.; Alquier, P. MMD-Bayes: Robust Bayesian estimation via maximum mean discrepancy. In Proceedings of the Symposium on Advances in Approximate Bayesian Inference; PMLR: Cambridge, MA, USA, 2020; pp. 1–21. [Google Scholar]
- McDiarmid, C. On the method of bounded differences. Surv. Comb. 1989, 141, 148–188. [Google Scholar]
- Kingma, D.P. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Dellaporta, C.; O’Hara, P.; Damoulas, T. Decision-Making Under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets. Entropy 2026, 28, 430. https://doi.org/10.3390/e28040430
Dellaporta C, O’Hara P, Damoulas T. Decision-Making Under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets. Entropy. 2026; 28(4):430. https://doi.org/10.3390/e28040430
Chicago/Turabian StyleDellaporta, Charita, Patrick O’Hara, and Theodoros Damoulas. 2026. "Decision-Making Under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets" Entropy 28, no. 4: 430. https://doi.org/10.3390/e28040430
APA StyleDellaporta, C., O’Hara, P., & Damoulas, T. (2026). Decision-Making Under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets. Entropy, 28(4), 430. https://doi.org/10.3390/e28040430

