Deep Equal Risk Pricing of Financial Derivatives with Non-Translation Invariant Risk Measures †
Abstract
:1. Introduction
2. Literature Review
3. Mathematical Setup for the Financial Market
4. Methodology
4.1. Neural Network Approximation of the Optimal Solution
4.2. Calibration of Neural Networks through Reinforcement Learning
4.2.1. Fixed and Given Case
4.2.2. Non-Translation Invariant Risk Measures Case
- (1)
- For a given value of , train the long and short neural networks and on the training set.
- (2)
- Evaluate the optimal residual hedging risk and with and on a test set of additional independent simulated paths.
- (3)
- If according to some closeness criterion, then is the equal risk price. Otherwise, update with the bisection algorithm and go back to step (1).
5. Numerical Experiments
5.1. Experiments Setup
5.2. Sensitivity Analysis to Risk Measures
5.2.1. Regime-Switching Model
5.2.2. Numerical Results of the Sensitivity Analysis to the Objective Function
5.2.3. Hedging Performance Benchmarking
5.3. Sensitivity Analysis to Dynamics of Risky Assets
5.3.1. Black–Scholes Model
5.3.2. GJR-GARCH Model
5.3.3. Merton Jump Diffusion Model
5.3.4. Numerical Results of the Sensitivity Analysis to Underlying Asset Dynamics
5.4. Long-Term Maturity ERP with Option Hedges
Numerical Results with Option Hedges
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
Appendix A. Pseudo-Code
Algorithm A1 Pseudo-code training neural networks with underlying stock hedges. |
Input: Output: 1: for do ▹ Loop over each path of minibatch 2: ▹ Time-0 feature vector of 3: for do 4: time- output of FFNN with 5: ▹ Sample next stock price 6: ▹ See (Appendix A) for details 7: update additional state variables 8: ▹ Time feature vector of 9: end for 10: 11: 12: end for 13: 14: Adam algorithm 15: ▹ computed with Tensorflow Notes: Subscript i represents the i-th simulated path among the minibatch of size Also, the time-0 feature vector is fixed for all paths, i.e., , and . |
Algorithm A2 Pseudo-code bisection algorithm. |
Input: and trained neural networks, initial search range and test set paths Output: 1: nbs_iter = 0, 2: while and nbs_iter < max_iter do 3: 4: Compute and on the test set with and 5: 6: if then 7: 8: else 9: 10: end if 11: nbs_iter ← nbs_iter + 1 12: end while 13: . Notes: and max_iter represent, respectively, the admissible level of pricing error and the maximum number of iterations for the bisection algorithm. For all numerical experiments conducted in Section 5, is set to and max_iter to 100. |
Appendix B. Validation of Modified Training Algorithm
- (1)
- Train with the procedure described Section 4.2.2, where is sampled in the interval at the beginning of each SGD step, with being the risk-neutral price. A total of 100 epochs is used on the train set.
- (2)
- For a fixed randomly sampled value set and train with the methodology described in Section 4.2.1. A total of three iterations of this step is performed (i.e., three different values of are considered).
- (3)
- For the three sampled values of , compute the semi- statistics on the test set with and .
Appendix C. Maximum Likelihood Estimates Results
Regime | ||
---|---|---|
Parameter | 1 | 2 |
Appendix D. Risk-Neutral Dynamics
Appendix D.1. Regime-Switching
Appendix D.2. BSM
Appendix D.3. GARCH
Appendix D.4. Merton Jump-Diffusion
1 | |
2 | The original work from Guo and Zhu (2017) considers expected penalties as risk measures, which do not possess all properties of convex risk measures (e.g., most lack the translation invariance property). For instance, the tail value at risk (TVaR) is not a particular case of an expected penalty. |
3 | The class of coherent risk measures is a subset of the class of convex risk measure, which assumes, for instance, the subadditivity and positive homogeneity properties; the latter are more stringent than the convexity property satisfied by all convex risk measures. |
4 | This means is -measurable and is -measurable for . |
5 | Details characterizing well-behavedness in the context of the present study are omitted to avoid lengthy discussions straying us away from the main research objectives of this work. |
6 | A risk measure is a mapping taking a random variable representing a random loss as input, and returning a real number representing its perceived risk as an output. |
7 | Recall that since the trading strategy is self-financing, is characterized by and . |
8 | While the neural network architecture of and considered in this paper is the same for both neural networks in terms of the number of hidden layers and neurons per hidden layer, and thus the total number q of parameters to fit is the same for both neural networks, one could also consider two different architectures for and , with no additional difficulty. |
9 | If the equal risk price is outside the initial search interval , the bisection algorithm must be applied once again with a new initial search interval, and the neural networks and must be trained once again on this new interval. |
10 | An epoch is defined as a complete iteration of the training set with stochastic gradient descent. For example, for a training set of paths and a minibatch size of 1000, one epoch consists of 400 updates of the set of trainable parameters . |
11 | Recall that optimal policies under the CVaR risk measures are independent of due to the translation invariance property. Furthermore, the optimal policies obtained under the semi- risk measures can be used not only with a specific value for but with an interval of initial capital investments that include the risk-neutral price due to the proposed modified training algorithm in this paper. |
12 | The convention that if is adopted. |
13 | Note that traded options with different maturities are never used simultaneously in the same hedging simulation. |
14 | Note that with option hedges, the implied volatility of the options used as hedging instruments is added to feature vectors, not the price of each asset. This has the benefit of necessitating one less state variable with the implied volatility instead of adding two state variables with the price of the call and put used for hedging. Furthermore, this is a reasonable choice from a theoretical standpoint, as implied volatilities are simply a nonlinear transformation of options prices due to the bijection relation between the two values. |
15 | The type of neural networks considered by Carbonneau and Godin (2021a) is the long short-term memory (LSTM). The current paper found that FFNN trading policies performed significantly better for the numerical experiments conducted under the semi- risk measure, which motivated their use over LSTMs. The reader is referred to Section 3 of Carbonneau and Godin (2021a) for the formal description of the LSTM architecture. |
16 | For instance, if and are respectively equal risk prices under the CVaR and semi- objective functions, the relative reduction is computed as |
References
- Abadi, Martín, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, and et al. 2016. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv arXiv:1603.04467. [Google Scholar]
- Alfeus, Mesias, Xin-Jiang He, and Song-Ping Zhu. 2022. An empirical analysis of option pricing with short sell bans. International Journal of Theoretical and Applied Finance 25: 2250012. [Google Scholar] [CrossRef]
- Ang, Andrew, and Allan Timmermann. 2012. Regime changes and financial markets. Annual Review of Financial Economics 4: 313–37. [Google Scholar] [CrossRef] [Green Version]
- Artzner, Philippe, Freddy Delbaen, Jean-Marc Eber, and David Heath. 1999. Coherent measures of risk. Mathematical Finance 9: 203–28. [Google Scholar] [CrossRef]
- Augustyniak, Maciej, Frédéric Godin, and Clarence Simard. 2017. Assessing the effectiveness of local and global quadratic hedging under GARCH models. Quantitative Finance 17: 1305–18. [Google Scholar] [CrossRef]
- Bao, Wenhang, and Xiao-Yang Liu. 2019. Multi-agent deep reinforcement learning for liquidation strategy analysis. arXiv arXiv:1906.11046. [Google Scholar]
- BCBS. 2016. Standards. Minimum Capital Requirements for Market Risk. January 2016. Basel Committee on Banking Supervision. Basel: Bank for International Settlements. [Google Scholar]
- Bertsimas, Dimitris, Leonid Kogan, and Andrew W. Lo. 2001. Hedging derivative securities and incomplete markets: An ϵ-arbitrage approach. Operations Research 49: 372–97. [Google Scholar] [CrossRef] [Green Version]
- Betancourt, Carlos, and Wen-Hui Chen. 2021. Deep reinforcement learning for portfolio management of markets with a dynamic number of assets. Expert Systems with Applications 164: 114002. [Google Scholar] [CrossRef]
- Black, Fischer, and Myron Scholes. 1973. The pricing of options and corporate liabilities. Journal of Political Economy 81: 637–54. [Google Scholar] [CrossRef] [Green Version]
- Breton, Michèle, and Frédéric Godin. 2017. Global hedging through post-decision state variables. Journal of Risk and Financial Management 10: 16. [Google Scholar] [CrossRef] [Green Version]
- Buehler, Hans, Lukas Gonon, Josef Teichmann, and Ben Wood. 2019. Deep hedging. Quantitative Finance 19: 1271–91. [Google Scholar] [CrossRef]
- Cao, Hongkai, Zhenyu Cui, and Yanchu Liu. 2020. Discrete-time variance-optimal deep hedging in affine GARCH models. SSRN, 3659275. [Google Scholar] [CrossRef]
- Cao, Jay, Jacky Chen, John Hull, and Zissis Poulos. 2021. Deep hedging of derivatives using reinforcement learning. The Journal of Financial Data Science 3: 10–27. [Google Scholar] [CrossRef]
- Carbonneau, Alexandre. 2021. Deep hedging of long-term financial derivatives. Insurance: Mathematics and Economics 99: 327–40. [Google Scholar] [CrossRef]
- Carbonneau, Alexandre, and Frédéric Godin. 2021a. Deep equal risk pricing of financial derivatives with multiple hedging instruments. arXiv arXiv:2102.12694. [Google Scholar]
- Carbonneau, Alexandre, and Frédéric Godin. 2021b. Equal risk pricing of derivatives with deep hedging. Quantitative Finance 21: 593–608. [Google Scholar] [CrossRef]
- Chong, Wing Fung, Haoen Cui, and Yuxuan Li. 2021. Pseudo-model-free hedging for variable annuities via deep reinforcement learning. arXiv arXiv:2107.03340. [Google Scholar] [CrossRef]
- Christoffersen, Peter, Redouane Elkamhi, Bruno Feunou, and Kris Jacobs. 2010. Option valuation with conditional heteroskedasticity and nonnormality. The Review of Financial Studies 23: 2139–83. [Google Scholar] [CrossRef] [Green Version]
- Coleman, Thomas F., Yohan Kim, Yuying Li, and M. Patron. 2007. Robustly hedging variable annuities with guarantees under jump and volatility risks. Journal of Risk and Insurance 74: 347–76. [Google Scholar] [CrossRef] [Green Version]
- Cont, Rama, and José Da Fonseca. 2002. Dynamics of implied volatility surfaces. Quantitative Finance 2: 45–60. [Google Scholar] [CrossRef] [Green Version]
- Delbaen, Freddy, and Walter Schachermayer. 1994. A general version of the fundamental theorem of asset pricing. Mathematische Annalen 300: 463–520. [Google Scholar] [CrossRef]
- Dempster, Arthur P., Nan M. Laird, and Donald B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) 39: 1–22. [Google Scholar]
- Duan, Jin-Chuan. 1995. The GARCH option pricing model. Mathematical Finance 5: 13–32. [Google Scholar] [CrossRef]
- Elliott, Robert J., and Dilip B. Madan. 1998. A discrete time equivalent martingale measure. Mathematical Finance 8: 127–52. [Google Scholar] [CrossRef]
- Föllmer, Hans, and Martin Schweizer. 1988. Hedging by sequential regression: An introduction to the mathematics of option trading. ASTIN Bulletin: The Journal of the IAA 18: 147–60. [Google Scholar] [CrossRef]
- Föllmer, Hans, and Peter Leukert. 1999. Quantile hedging. Finance and Stochastics 3: 251–73. [Google Scholar] [CrossRef]
- Föllmer, Hans, and Peter Leukert. 2000. Efficient hedging: Cost versus shortfall risk. Finance and Stochastics 4: 117–46. [Google Scholar] [CrossRef] [Green Version]
- François, Pascal, Geneviève Gauthier, and Frédéric Godin. 2014. Optimal hedging when the underlying asset follows a regime-switching Markov process. European Journal of Operational Research 237: 312–22. [Google Scholar] [CrossRef] [Green Version]
- Frittelli, Marco. 2000. The minimal entropy martingale measure and the valuation problem in incomplete markets. Mathematical Finance 10: 39–52. [Google Scholar] [CrossRef]
- García-Galicia, Mauricio, Alin A. Carsteanu, and Julio B. Clempner. 2019. Continuous-time reinforcement learning approach for portfolio management with time penalization. Expert Systems with Applications 129: 27–36. [Google Scholar] [CrossRef]
- Gerber, Hans U., and Elias S. W. Shiu. 1994. Option pricing by Esscher transforms. Transactions of the Society of Actuaries 46: 99–191. [Google Scholar]
- Glorot, Xavier, and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. Paper presented at Thirteenth International Conference on Artificial Intelligence and Statistics, Sardinia, Italy, May 13–15; JMLR Workshop and Conference Proceedings. pp. 249–56. [Google Scholar]
- Glosten, Lawrence R., Ravi Jagannathan, and David E. Runkle. 1993. On the relation between the expected value and the volatility of the nominal excess return on stocks. The Journal of Finance 48: 1779–801. [Google Scholar] [CrossRef]
- Godin, Frédéric. 2016. Minimizing CVaR in global dynamic hedging with transaction costs. Quantitative Finance 16: 461–75. [Google Scholar] [CrossRef]
- Godin, Frédéric, and Denis-Alexandre Trottier. 2019. Option pricing under regime-switching models: Novel approaches removing path-dependence. Insurance: Mathematics and Economics 87: 130–42. [Google Scholar] [CrossRef]
- Goodfellow, Ian, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. Cambridge: MIT Press. [Google Scholar]
- Guo, Ivan, and Song-Ping Zhu. 2017. Equal risk pricing under convex trading constraints. Journal of Economic Dynamics and Control 76: 136–51. [Google Scholar] [CrossRef]
- Halperin, Igor. 2020. Qlbs: Q-learner in the Black-Scholes (-Merton) worlds. The Journal of Derivatives 28: 99–122. [Google Scholar] [CrossRef] [Green Version]
- Hamilton, James D. 1989. A new approach to the economic analysis of nonstationary time series and the business cycle. Econometrica: Journal of the Econometric Society 57: 357–84. [Google Scholar] [CrossRef]
- Hardy, Mary R. 2001. A regime-switching model of long-term stock returns. North American Actuarial Journal 5: 41–53. [Google Scholar] [CrossRef]
- He, Xin-Jiang, and Song-Ping Zhu. 2020. A revised option pricing formula with the underlying being banned from short selling. Quantitative Finance 20: 935–48. [Google Scholar] [CrossRef]
- Hendricks, Dieter, and Diane Wilcox. 2014. A reinforcement learning extension to the Almgren-Chriss framework for optimal trade execution. Paper presented at 2014 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr), London, UK, March 27–28; pp. 457–64. [Google Scholar]
- Hodges, Stewart, and Anthony Neuberger. 1989. Optimal replication of contingent claims under transactions costs. Review of Futures Markets 8: 222–39. [Google Scholar]
- Hornik, Kurt. 1991. Approximation capabilities of multilayer feedforward networks. Neural Networks 4: 251–57. [Google Scholar] [CrossRef]
- Horvath, Blanka, Josef Teichmann, and Zan Zuric. 2021. Deep hedging under rough volatility. Risks 9: 138. [Google Scholar] [CrossRef]
- Jiang, Zhengyao, Dixing Xu, and Jinjun Liang. 2017. A deep reinforcement learning framework for the financial portfolio management problem. arXiv arXiv:1706.10059. [Google Scholar]
- Kingma, Diederik P., and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv arXiv:1412.6980. [Google Scholar]
- Lamberton, Damien, and Bernard Lapeyre. 2007. Introduction to Stochastic Calculus Applied to Finance. Boca Raton: CRC Press. [Google Scholar]
- Li, Yuxi, Csaba Szepesvari, and Dale Schuurmans. 2009. Learning exercise policies for American options. In Artificial Intelligence and Statistics. Clearwater Beach: PMLR, pp. 352–59. [Google Scholar]
- Lu, David W. 2017. Agent inspired trading using recurrent reinforcement learning and LSTM neural networks. arXiv arXiv:1707.07338. [Google Scholar]
- Lütkebohmert, Eva, Thorsten Schmidt, and Julian Sester. 2022. Robust deep hedging. Quantitative Finance 22: 1465–80. [Google Scholar] [CrossRef]
- Ma, Guiyuan, Song-Ping Zhu, and Ivan Guo. 2022. Valuation of general contingent claims with short selling bans: An equal-risk pricing approach. International Journal of Theoretical and Applied Finance 25: 2250022. [Google Scholar] [CrossRef]
- Marzban, Saeed, Erick Delage, and Jonathan Yumeng Li. 2021. Deep reinforcement learning for equal risk pricing and hedging under dynamic expectile risk measures. arXiv arXiv:2109.04001. [Google Scholar]
- Marzban, Saeed, Erick Delage, and Jonathan Yu-Meng Li. 2022. Equal risk pricing and hedging of financial derivatives with convex risk measures. Quantitative Finance 22: 47–73. [Google Scholar] [CrossRef]
- Melnikov, Alexander, and Ivan Smirnov. 2012. Dynamic hedging of conditional value-at-risk. Insurance: Mathematics and Economics 51: 182–90. [Google Scholar] [CrossRef]
- Merton, Robert C. 1973. Theory of rational option pricing. The Bell Journal of Economics and Management Science 4: 141–83. [Google Scholar] [CrossRef] [Green Version]
- Merton, Robert C. 1976. Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics 3: 125–44. [Google Scholar] [CrossRef] [Green Version]
- Moody, John, and Lizhong Wu. 1997. Optimization of trading systems and portfolios. Paper presented at IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr), New York City, NY, USA, March 24–25; pp. 300–7. [Google Scholar]
- Ning, Brian, Franco Ho Ting Lin, and Sebastian Jaimungal. 2021. Double deep Q-learning for optimal execution. Applied Mathematical Finance 28: 361–80. [Google Scholar] [CrossRef]
- Pendharkar, Parag C., and Patrick Cusatis. 2018. Trading financial indices with reinforcement learning agents. Expert Systems with Applications 103: 1–13. [Google Scholar] [CrossRef]
- Rockafellar, R. Tyrrell, and Stanislav Uryasev. 2002. Conditional value-at-risk for general loss distributions. Journal of Banking & Finance 26: 1443–71. [Google Scholar]
- Schweizer, Martin. 1995. Variance-optimal hedging in discrete time. Mathematics of Operations Research 20: 1–32. [Google Scholar] [CrossRef] [Green Version]
- Wang, Haoran, and Xun Yu Zhou. 2020. Continuous-time mean-variance portfolio selection: A reinforcement learning framework. Mathematical Finance 30: 1273–308. [Google Scholar] [CrossRef]
- Wu, David, and Sebastian Jaimungal. 2023. Robust risk-aware option hedging. arXiv arXiv:2303.15216. [Google Scholar] [CrossRef]
- Xu, Mingxin. 2006. Risk measure pricing and hedging in incomplete markets. Annals of Finance 2: 51–71. [Google Scholar] [CrossRef]
- Ye, Yunan, Hengzhi Pei, Boxin Wang, Pin-Yu Chen, Yada Zhu, Ju Xiao, and Bo Li. 2020. Reinforcement-learning based portfolio management with augmented asset movement prediction states. Paper presented at AAAI Conference on Artificial Intelligence, New York, NY, USA, February 7–12; Vol. 34, pp. 1112–19. [Google Scholar]
under | under Semi- | ||||||||
---|---|---|---|---|---|---|---|---|---|
Moneyness | |||||||||
OTM | |||||||||
ATM | |||||||||
ITM |
Semi- | ||||||||
---|---|---|---|---|---|---|---|---|
Penalty | ||||||||
Mean | ||||||||
SMSE | ||||||||
MSE |
under | under Semi- | ||||||||
---|---|---|---|---|---|---|---|---|---|
Dynamics | |||||||||
BSM | |||||||||
MJD | |||||||||
GJR−GARCH |
under | under Semi- | |||||
---|---|---|---|---|---|---|
Moneyness | ||||||
OTM | ||||||
ATM | ||||||
ITM |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Carbonneau, A.; Godin, F. Deep Equal Risk Pricing of Financial Derivatives with Non-Translation Invariant Risk Measures. Risks 2023, 11, 140. https://doi.org/10.3390/risks11080140
Carbonneau A, Godin F. Deep Equal Risk Pricing of Financial Derivatives with Non-Translation Invariant Risk Measures. Risks. 2023; 11(8):140. https://doi.org/10.3390/risks11080140
Chicago/Turabian StyleCarbonneau, Alexandre, and Frédéric Godin. 2023. "Deep Equal Risk Pricing of Financial Derivatives with Non-Translation Invariant Risk Measures" Risks 11, no. 8: 140. https://doi.org/10.3390/risks11080140
APA StyleCarbonneau, A., & Godin, F. (2023). Deep Equal Risk Pricing of Financial Derivatives with Non-Translation Invariant Risk Measures. Risks, 11(8), 140. https://doi.org/10.3390/risks11080140