Towards Generative Interest-Rate Modeling: Neural Perturbations Within the Libor Market Model
Abstract
1. Introduction
2. Materials and Methods
2.1. Market Data and Instruments
2.1.1. Yield Curves
2.1.2. Swaption Volatility Surface
2.2. Classical LIBOR Market Model Specification
2.2.1. Forward Dynamics
2.2.2. Functional Volatility and Correlation
2.3. Neural Parametrization of Volatility and Correlation
Architecture
2.4. Monte Carlo Simulation and Swaption Pricing
2.4.1. Path Simulation
2.4.2. Swaption Pricing and Implied Volatility
2.5. Calibration Objectives and Diagnostics
2.5.1. Vega-Weighted Swaption Data Loss
2.5.2. Structural Regularization and Diagnostics
- A minimum eigenvalue of over time and paths (PSD check);
- The fraction of simulated forward rates that become negative (positivity check);
- Deviations from martingale conditions for discounted swap rates;
- Gradient norms of network parameters and incidence of NaN/Inf values.
2.5.3. Total Objective
2.6. Pricing Error Diagnostics
2.7. Bucketed RMSE Analysis
- Expiry buckets: ;
- Tenor buckets: .
2.8. Drop-One Expiry Jackknife
2.9. Statistical Significance Tests
- Paired t-test: It tests mean error differences across all instruments.
- Wilcoxon signed-rank test: A non-parametric test on absolute error ranks.
- Cohen’s d: It measures effect size for improvement:
- Proportion improved: The fraction of instruments where neural RMSE is lower than that of the classical instrument.
2.10. Summary Metrics
- Overall implied volatility RMSE: ;
- Overall pricing RMSE in basis points: ;
- Percentage improvement: ;
- Statistical test p-values and effect sizes.
2.11. Training Procedure and Numerical Safeguards
2.11.1. Optimization Scheme
2.11.2. Stability Mechanisms
- Gradient clipping: Global norm clipping is applied to parameter gradients to prevent exploding updates.
- Value clipping: Neural outputs are clipped to predefined bounds before constructing volatilities and correlation factors.
- Correlation projection: The correlation output is normalized to unit scale, symmetrized, and diagonally jittered to ensure stable Cholesky factorization.
- Numerics checks: All intermediate tensors involved in loss computation are passed through finite-value checks; NaNs or Infs trigger diagnostic flags rather than silent failure.
- Shared code path: Classical and neural simulations share the same simulation and pricing routines, with the neural block deactivated when is absent, ensuring that the classical benchmark remains a stable point of reference.
3. Discussion
3.1. Computational Environment and Performance
3.1.1. Hardware and Software Configuration
- CPU: Intel® Core™ i7–8700K @ 3.70 GHz (6 cores/12 threads);
- System Memory: 16 GB RAM;
- GPU: NVIDIA GeForce RTX 2080 SUPER (8 GB VRAM);
- Driver/CUDA: NVIDIA driver 440.33.01, CUDA 10.2;
- Operating Environment: Docker container; (tensorflow/tensorflow:2.12.0-gpu-jupyter)
- Deep Learning Framework: TensorFlow 2.12.0.
3.1.2. Model Configuration and Runtime Parameters
- Number of Monte Carlo paths: ;
- Time step: ;
- Number of time steps: (model-dependent);
- Mini-batch size: equal to the number of surface grid cells;
- Optimization: gradient-based calibration using automatic differentiation.
Initialization and Parameterization Details (Directly from the Code)
3.1.3. Volatility Map (Fixed by Code)
3.1.4. Correlation Map
3.1.5. Overfitting Mitigation Under Mini-Batch Calibration (Clarified Using the Code Design)
- Low capacity: The shared representation comprises only four units, and the heads are small (eight and four hidden units), limiting function complexity.
- Bounded, small volatility deviations: Perturbations are bounded by tanh and scaled by perturb_scale, and then volatility is clipped to . This prevents the network from fitting idiosyncratic quotes by extreme local volatility distortions.
- PSD-by-construction correlation with anchored mixing: The correlation candidate is constrained via and correlation normalization, and its impact is restricted by a capped mixing weight (with w_cap non-trainable in the provided code). Thus, the learned correlation can only deviate modestly from the interpretable exponential baseline unless the cap is explicitly relaxed.
- Numerical regularization: Small diagonal jitter () is added to the correlation output, improving conditioning and reducing sensitivity to mini-batch noise.
3.1.6. Runtime Performance
3.2. Model Comparison on OOS and Long-Tenor Holdout
3.2.1. Per-Row 80/20 Masked-Holdout Evaluation
3.2.2. Long-Tenor Holdout Evaluation
3.2.3. Relative Improvement Summary
3.2.4. Classical vs. Neural Scatter
3.2.5. Interpretation Note
3.3. Implied Volatility Error
| Currency | Year | Quarter | IV RMSE (Classic) | IV RMSE (Neural) | IV (%) | PV RMSE bp (Classic) | PV RMSE bp (Neural) | PV (%) | n |
| EUR | 2024 | Q2 | 0.216 | 0.200 | 7.69 | 289.8 | 258.3 | 10.87 | 213 |
| EUR | 2024 | Q3 | 0.216 | 0.195 | 9.70 | 306.3 | 265.2 | 13.44 | 213 |
| EUR | 2024 | Q4 | 0.231 | 0.207 | 10.21 | 293.7 | 251.7 | 14.32 | 213 |
| EUR | 2025 | Q2 | 0.172 | 0.155 | 10.13 | 277.0 | 236.1 | 14.74 | 213 |
| GBP | 2024 | Q2 | 0.160 | 0.148 | 7.63 | 230.3 | 201.9 | 12.33 | 213 |
| GBP | 2024 | Q3 | 0.159 | 0.147 | 7.41 | 229.8 | 202.4 | 11.93 | 213 |
| GBP | 2024 | Q4 | 0.147 | 0.137 | 6.69 | 225.7 | 202.1 | 10.49 | 213 |
| GBP | 2025 | Q2 | 0.133 | 0.120 | 9.50 | 238.4 | 205.9 | 13.64 | 213 |
| USD | 2024 | Q2 | 0.175 | 0.160 | 8.63 | 253.8 | 222.8 | 12.23 | 213 |
| USD | 2024 | Q3 | 0.187 | 0.170 | 8.96 | 267.5 | 231.1 | 13.61 | 213 |
| USD | 2024 | Q4 | 0.160 | 0.145 | 9.22 | 258.9 | 226.2 | 12.61 | 213 |
| USD | 2025 | Q2 | 0.155 | 0.141 | 8.56 | 259.1 | 226.6 | 12.52 | 213 |
| Currency | Year | Quarter | VW Vol-Pts RMSE (Classic) | VW Vol-Pts RMSE (Neural) | VW (%) | n |
| EUR | 2024 | Q2 | 0.213 | 0.197 | 7.17 | 213 |
| EUR | 2024 | Q3 | 0.211 | 0.192 | 8.92 | 213 |
| EUR | 2024 | Q4 | 0.226 | 0.205 | 9.42 | 213 |
| EUR | 2025 | Q2 | 0.170 | 0.154 | 9.63 | 213 |
| GBP | 2024 | Q2 | 0.159 | 0.147 | 7.39 | 213 |
| GBP | 2024 | Q3 | 0.158 | 0.146 | 7.17 | 213 |
| GBP | 2024 | Q4 | 0.146 | 0.137 | 6.50 | 213 |
| GBP | 2025 | Q2 | 0.132 | 0.120 | 9.26 | 213 |
| USD | 2024 | Q2 | 0.173 | 0.159 | 8.31 | 213 |
| USD | 2024 | Q3 | 0.185 | 0.169 | 8.55 | 213 |
| USD | 2024 | Q4 | 0.158 | 0.144 | 8.90 | 213 |
| USD | 2025 | Q2 | 0.153 | 0.141 | 8.25 | 213 |
4. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| ATM | At-the-money; |
| BSDE | Backward stochastic differential equation; |
| CDF | Cumulative distribution function; |
| DF | Discount factor; |
| EUR | Euro area currency (euro); |
| EURIBOR | Euro Interbank Offered Rate; |
| FRA | Forward-rate agreement; |
| GAN | Generative adversarial network; |
| GBP | British pound sterling; |
| HJM | Heath–Jarrow–Morton model; |
| IBOR | Interbank Offered Rate (generic benchmark family); |
| IV | Implied volatility; |
| LMM | Libor market model; |
| MC | Monte Carlo; |
| MLP | Multi-layer perceptron; |
| OIS | Overnight indexed swap (discounting curve); |
| OLS | Ordinary least squares (if used anywhere); |
| Probability density function; | |
| PINN | Physics-informed neural network; |
| PV | Present value; |
| PDE | Partial differential equation; |
| PSD | Positive semidefinite; |
| Q | Risk-neutral probability measure (when used as ); |
| RMSE | Root mean squared error; |
| SABR | Stochastic Alpha Beta Rho volatility model; |
| SDE | Stochastic differential equation; |
| SOFR | Secured Overnight Financing Rate; |
| SONIA | Sterling Overnight Index Average; |
| USD | United States dollar; |
| VW | Vega-weighted. |
Appendix A. Charts
Appendix A.1. USD Surface Comparison


Appendix A.2. EUR Surface Comparison


Appendix A.3. GBP Surface Comparison


Appendix A.4. USD Correlation Difference

Appendix A.5. EUR Correlation Difference

Appendix A.6. GBP Correlation Difference

Appendix A.7. USD Model Implied Smile

Appendix A.8. EUR Model Implied Smile

Appendix A.9. GBP Model Implied Smile

Appendix A.10. USD NN Volatility Output Across Time and Tenor

Appendix A.11. EUR NN Volatility Output Across Time and Tenor

Appendix A.12. GBP NN Volatility Output Across Time and Tenor

Appendix A.13. USD Correlation Mix Weight

Appendix A.14. EUR Correlation Mix Weight

Appendix A.15. GBP Correlation Mix Weight

Appendix A.16. USD Volatility Term Structure

Appendix A.17. EUR Volatility Term Structure

Appendix A.18. GBP Volatility Term Structure

References
- Alaya, M. B., Kebaier, A., & Sarr, D. (2021). Deep calibration of interest rates model. arXiv, arXiv:2110.15133. Available online: https://arxiv.org/abs/2110.15133 (accessed on 28 November 2025).
- Andersen, L. B., & Piterbarg, V. V. (2010). Interest rate modeling, volume iii: Products and risk management. Atlantic Financial Press. [Google Scholar]
- Belomestny, D., & Schoenmakers, J. (2009). A jump-diffusion Libor model and its robust calibration (Tech. Rep. No. RQUF-2008-0135). Weierstrass Institute for Applied Analysis and Stochastics (WIAS). Available online: https://www.wias-berlin.de/people/schoenma/RQUF-2008-0135_Final.pdf (accessed on 28 November 2025).
- Berg, J., & Nystrom, K. (2018). A unified deep artificial neural network approach to partial differential equations in complex geometries. Neurocomputing, 317, 28–41. [Google Scholar] [CrossRef]
- Brace, A., Gatarek, D., & Musiela, M. (1997). The market model of interest rate dynamics. Mathematical Finance, 7(2), 127–147. [Google Scholar] [CrossRef]
- Chen, R. T., Rubanova, Y., Bettencourt, J., & Duvenaud, D. K. (2018). Neural ordinary differential equations. Advances in Neural Information Processing Systems, 31, 7. Available online: https://user.eng.umd.edu/~austin/ence688p.d/papers-reading/NN-Ordinary-Differential-Equations2018.pdf (accessed on 28 November 2025).
- Glasserman, P., & Kou, S. (2003). The term structure of simple forward rates with jump risk. Mathematical Finance, 13(3), 383–410. Available online: https://www.columbia.edu/~sk75/GlassKou.pdf (accessed on 28 November 2025). [CrossRef]
- Glasserman, P., & Merener, N. (2003). Numerical solution of jump-diffusion LIBOR market models. Finance and Stochastics, 7(1), 1–27. Available online: https://link.springer.com/article/10.1007/s007800200076 (accessed on 28 November 2025). [CrossRef]
- Hagan, P. S., Kumar, D. K., Lesniewski, A. S., & Woodward, D. E. (2002). Managing smile risk. Wilmott Magazine, 1, 84–108. [Google Scholar]
- Han, J., Jentzen, A., & E, W. (2018). Solving high-dimensional partial differential equations using deep learning. Proceedings of the National Academy of Sciences, 115(34), 8505–8510. [Google Scholar] [CrossRef] [PubMed]
- Henry-Labordère, P. (2008). Analysis, geometry, and modeling in finance: Advanced methods in option pricing. Chapman and Hall/CRC. [Google Scholar]
- Horváth, B., Muguruza, A., & Tomas, J. (2021). Deep learning volatility: A deep neural network perspective on pricing and calibration in (rough) volatility models. Quantitative Finance, 21(11), 1731–1749. [Google Scholar] [CrossRef]
- Hull, J. C. (2022). Options, futures, and other derivatives (11th ed.). Pearson Education. [Google Scholar]
- Huré, C., Pham, H., & Warin, X. (2019). Deep neural networks algorithms for stochastic control problems on finite horizon: Numerical applications. ESAIM: Proceedings and Surveys, 65, 32–50. [Google Scholar] [CrossRef]
- James, J., & Webber, N. (2000). Interest rate modelling. Wiley. [Google Scholar]
- Karlsson, P., Pilz, K. F., & Schlögl, E. (2017). Calibrating a market model with stochastic volatility to commodity and interest rate risk. Quantitative Finance, 17(6), 907–925. [Google Scholar] [CrossRef]
- Kidger, P., Foster, J., Li, X., Oberhauser, H., & Lyon, T. (2021). Neural SDEs as infinite-dimensional GANs. arXiv, arXiv:2106.01845. [Google Scholar] [CrossRef]
- Liu, S., Borovykh, A., Grzelak, L. A., & Oosterlee, C. W. (2019). A neural network-based framework for financial model calibration. Journal of Mathematics in Industry, 9, 1–24. [Google Scholar] [CrossRef]
- Marshall, N., Xiao, K. L., Agarwala, A., & Paquette, E. (2024). The Dynamics of SGD with gradient clipping in high dimensions. arXiv, arXiv:2406.11733. [Google Scholar] [CrossRef]
- Obłój, J. (2007). Fine-tune your smile: Correction to Hagan et al.’s formula. arXiv, arXiv:0708.0998. [Google Scholar] [CrossRef]
- Raissi, M., Perdikaris, P., & Karniadakis, G. E. (2019). Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics, 378, 686–707. [Google Scholar] [CrossRef]
- Rebonato, R. (2002). Modern pricing of interest-rate derivatives: The libor market model and beyond. Princeton University Press. [Google Scholar]
- Rebonato, R. (2004). Volatility and Correlation: The Perfect Hedger and the Fox. John Wiley & Sons. [Google Scholar]
- Rebonato, R., & White, M. (2009). Linking caplets and swaptions prices in the LMM–SABR model. The Journal of Computational Finance, 13(2), 1–43. [Google Scholar] [CrossRef]
- Richert, I., & Buch, R. (2022). Interpolation of missing swaption volatility data using gibbs sampling on variational autoencoders. arXiv, arXiv:2204.10400. [Google Scholar] [CrossRef]
- Sridi, A., & Bilokon, P. (2023). Applying deep learning to calibrate stochastic volatility models. arXiv, arXiv:2309.07843. Available online: https://arxiv.org/abs/2309.07843 (accessed on 28 November 2025).
- Steinrücke, M., Zagst, R., & Swishchuk, A. (2015). The Markov-switching jump-diffusion LIBOR market model. Quantitative Finance, 15(3), 455–476. [Google Scholar] [CrossRef]





| Dataset | IV RMSE Improvement (%) | PV RMSE Improvement (%) |
|---|---|---|
| EUR 2024 Q2 | 7.69 | 10.87 |
| EUR 2024 Q3 | 9.70 | 13.44 |
| EUR 2024 Q4 | 10.21 | 14.32 |
| EUR 2025 Q2 | 10.13 | 14.74 |
| GBP 2024 Q2 | 7.63 | 12.33 |
| GBP 2024 Q3 | 7.41 | 11.93 |
| GBP 2024 Q4 | 6.69 | 10.49 |
| GBP 2025 Q2 | 9.50 | 13.64 |
| USD 2024 Q2 | 8.63 | 12.23 |
| USD 2024 Q3 | 8.96 | 13.61 |
| USD 2024 Q4 | 9.22 | 12.61 |
| USD 2025 Q2 | 8.56 | 12.52 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Knezevic, A. Towards Generative Interest-Rate Modeling: Neural Perturbations Within the Libor Market Model. J. Risk Financial Manag. 2026, 19, 82. https://doi.org/10.3390/jrfm19010082
Knezevic A. Towards Generative Interest-Rate Modeling: Neural Perturbations Within the Libor Market Model. Journal of Risk and Financial Management. 2026; 19(1):82. https://doi.org/10.3390/jrfm19010082
Chicago/Turabian StyleKnezevic, Anna. 2026. "Towards Generative Interest-Rate Modeling: Neural Perturbations Within the Libor Market Model" Journal of Risk and Financial Management 19, no. 1: 82. https://doi.org/10.3390/jrfm19010082
APA StyleKnezevic, A. (2026). Towards Generative Interest-Rate Modeling: Neural Perturbations Within the Libor Market Model. Journal of Risk and Financial Management, 19(1), 82. https://doi.org/10.3390/jrfm19010082

