Abstract
This paper demonstrates that it is possible to improve significantly on the estimated call prices obtained with the regression and simulation-based least-squares Monte Carlo method by using put-call symmetry. The results show that, for a large sample of options with characteristics of relevance in real-life applications, the symmetric method performs much better on average than the regular pricing method, is the best method for most of the options, never performs poorly and, as a result, is extremely efficient compared to the optimal, but unfeasible method that picks the method with the smallest Root Mean Squared Error (RMSE). A simple classification method is proposed that, by optimally selecting among estimates from the symmetric method with a reasonably small order used in the polynomial approximation, achieves a relative efficiency of more than . The relative importance of using the symmetric method increases with option maturity and with asset volatility. Using the symmetric method to price, for example, real options, many of which are call options with long maturities on volatile assets, for example energy, could therefore improve the estimates significantly by decreasing their bias and RMSE by orders of magnitude.
JEL Classification:
C15; G12; G13
1. Introduction
In a paper published some 20 years ago, McDonald and Schroder (1998) demonstrated that when the price of the underlying asset is governed by a Geometric Brownian Motion (GBM), the price of a call option with underlying asset price S, strike price K, interest rate r and dividend yield d is equal to the price of an otherwise identical put option with asset price K, strike price S, interest rate d and dividend yield r. The result for the GBM case has since been generalized to more realistic dynamics in (Schroder 1999), among others, and essentially, some version of this Put-Call Symmetry (PCS), potentially with other fundamental parameters changed accordingly, will hold for virtually all the models that have been considered in the existing literature on option pricing, for options with several different payoffs and which are written on multiple assets.1
In this paper, we show that this simple result can be used to improve on one of today’s state-of-the-art numerical option pricing methods, the well-known Least-Squares Monte-Carlo (LSMC) method proposed by Longstaff and Schwartz (2001). In particular, we show that using PCS with LSMC results in estimates that are much less biased and have significantly lower Root Mean Squared Error (RMSE) when pricing American call options for the set of options used in (Longstaff and Schwartz 2001) and for a very large sample of options with realistic characteristics. Using standard choices for the LSMC method, which we implement with 100,000 paths and a polynomial of order , we price options with different strike prices and maturities in a world with different values for the interest rate, dividend yield and volatility. For a large sample of 3125 different options, the average RMSE of the estimates obtained with the symmetric method is only of the RMSE of the estimates from the regular method, and the symmetric estimates have smaller RMSEs for of the options in the sample.
Our results show that the relative performance of the symmetric method, i.e., when call options are priced as put options using PCS, improves as the time to maturity and volatility increase. Moreover, using the symmetric method is most effective for options that are out of the money. The simple intuition for this results is that when option maturity is long and volatility is high, asset values along simulated paths may become “very” large and be spread out over a large interval. Large and widely-spread out asset values lead to poorly-conditioned cross-sectional regressions and this in terms results in poor approximations of the optimal early exercise strategy and precisely determining this strategy is most important for out of the money options. Widely-dispersed asset values also lead to estimates that have higher variance because of the spread out payoffs being discounted back to estimate the price.
The magnitude of the relative improvement obtained with the symmetric method depends on the choice of parameters used in the LSMC algorithm, that is the number of simulated paths, N, and the number of regressors used in the cross-sectional regression, L, in a non-trivial way. In particular, while it is well known (see for example Stentoft (2004b)) that the option price estimated with the LSMC converges to the true value when the number of paths and the number of regressors tend to infinity, this is of little use with finite choices of the number of paths, N, and the order of the polynomial used in the regression, L. However, even with the “worst possible” configuration for the symmetric method, which occurs when 100,000 and where the symmetric method only performs the best for roughly of the individual options, the average RMSE is much smaller than for the regular method and only larger than what could have been obtained with an infeasible method that picks from the regular and symmetric method the one with the smallest RMSE.
One reason that the choice of polynomial is important is that the LSMC method mixes two types of biases: a low bias due to having to approximate the optimal stopping time with a finite degree polynomial and a high bias coming from using the same paths to determine the optimal early exercise strategy and to price the option, potentially leading to over fitting to the simulated paths. For example, the bias just happens to be somewhat smaller without symmetry, a value of , than when using symmetry, a value of when using regressors with 100,000 paths. For all other choices of the number of paths with this number of regressors and when using other numbers of regressors with this number of simulated paths, the symmetric estimates are less biased. An easy way to control the bias is to conduct so-called out-of-sample pricing in which a new set of simulated paths is used to price the option instead of using the same set of paths used for determining the optimal early exercise strategy.
When using out-of-sample pricing, the relative importance of the symmetric method is even more striking. In particular, the symmetric method almost always, and in some cases for more than of the individual options, has the lowest RMSE, and the average RMSE for the large sample of options is around or less of what is obtained with the regular method for most configurations. The efficiency of the symmetric method, when compared to the infeasible optimal method, is extraordinary and in most cases above across various values of the number of paths, N, and number of regressors, L, whereas the regular method only achieves an efficiency of around . Finally, while it is difficult to pick the best method in general, in the case of out of sampling pricing, we propose a simple classification algorithm that, by optimally selecting among estimates from the symmetric method with a reasonably small order used in the polynomial approximation, achieves a relative efficiency of more than compared to the infeasible method that minimizes the RMSE across all estimates.
As noted by Detemple (2001), PCS is a useful property of many option pricing models since it reduces the computational burden when implementing these model. Indeed, a consequence of the property is that the same numerical algorithm can be used to price put and call options and to determine their associated optimal exercise policy. Another benefit is that it reduces the dimensionality of the pricing problem for some payoff functions. Examples include exchange options or quanto options. PCS also provides useful insights about the economic relationship between derivatives contracts. Puts and calls, forward prices and discount bonds and exchange options and standard options are simple examples of derivatives that are theoretically closely connected by symmetry relations. Compared to this literature, our objective is somewhat different. In particular, though PCS can be used to demonstrate theoretically the convergence of a particular numerical scheme for call option pricing using results for put options, our interest here is primarily of a numerical nature, and the objective is to show that PCS can be used to improve significantly on the estimated call option prices obtained with a particular numerical scheme.2
Our findings and proposed method for selecting optimally the configuration to use for option pricing should have broad implications. In particular, we show that improvements are found for a very large sample of options with reasonable characteristics, and since the symmetric method never performs very poorly and simple classification methods can be used to achieve very high relative efficiency, there are strong arguments for always using the symmetric method to price call options. Moreover, our results show that the relative importance of using the symmetric method increases with option maturity and asset volatility, and using symmetry to price long-term options in high volatility situations thus improves massively on the price estimates. The LSMC method is routinely used to price real options, most of which are call options with long maturities on volatile assets, for example energy. We conjecture that pricing such options using the symmetric method could improve significantly on the estimates by decreasing their bias and RMSE by orders of magnitude.
The rest of this paper is organized as follows: In Section 2, we provide motivating results for the small sample of simple vanilla options from Longstaff and Schwartz (2001). In Section 3, we briefly introduce the use of simulation methods for American option pricing in general and discuss the implementation of the method proposed by Longstaff and Schwartz (2001) when combined with PCS. In Section 4, we perform a large-scale study on 3125 options, showing that our proposed method works extremely well. In Section 5, we conduct several robustness checks, and in Section 6, we propose a new metric for efficiency and suggest a method for choosing the optimal specification to use for option pricing. Section 7 offers concluding remarks.
2. Motivation
In this section, we present results for a set of options similar to those used in Longstaff and Schwartz (2001), but to illustrate the effect of PCS, we consider pricing call options instead of put options. In all cases, we use a current value of the stock of 40 and an interest rate and dividend rate of . A non-zero dividend is needed to make the American call option pricing non-trivial and to have positive early exercise premia. Options range between being In The Money (ITM) and Out Of The Money (OTM), have maturities of or years and have early exercise possibilities per year. We also consider two levels of the volatility and set or . The reported estimates are based on independent simulations, each of which uses 100,000 paths and the first weighted Laguerre polynomials and a constant term as regressors in the cross-sectional regressions. We assess model performance using , and , respectively, where P is the true option price, is the simulated price and the average model price.
2.1. Regular Call Option Prices
Table 1 shows the pricing results for the sample of call options. The first thing to notice is that the majority of the estimated prices, shown in Column 5, are close to the benchmark values provided by the binomial model, shown in Column 4, and the bias, shown in Column 6, is in most cases less than one cent. However, for the longer term options with high volatility, shown in the last five rows, biases are large and significant at all reasonable levels. The size of the bias increases with moneyness, and the ITM option has a bias of 15 cents. Moreover, for these options, the estimated price also has a very large standard deviation, StDev, shown in Column 7, and as a result, the RMSE, shown in Column 8, is very large. For example, the RMSE of the option with , and is almost 40-times larger than the RMSE of any of the other deep ITM options.
Table 1.
Call option prices.
The results in Table 1 may hint at why often only put options are studied: it is potentially difficult to price long maturity call options in high volatility settings using the LSMC method. However, in many situations where the LSMC method is used, e.g., for real option pricing, the options considered are exactly long maturity call options. Therefore, what could (and does) go wrong? The fact that the standard deviation of these estimates is larger by (almost) an order of magnitude than that of any of the shorter term options indicates that this is likely caused by numerical issues. This conjecture is further supported by the fact that the skewness and kurtosis of the independent simulations are very far away from what we would expect, i.e., zero skewness and no excess kurtosis, when using independent simulations.3
Therefore, why then would you have numerical issues? The LSMC method estimates the early exercise strategy by performing a series of cross-sectional regressions of future path-wise payoffs on transformations of the current values of the stock price for the paths that are in the money, and the most obvious explanation for the numerical issues arising is that these regressions “break down” in one way or another. In particular, the properties of the input to the regression are very different when pricing calls, where regressors are unbounded, compared to when pricing puts, where regressors are bounded above by the strike price. Thus, one may end up performing regressions with regressors that have very large numerical values, and the probability of this happening increases with maturity and volatility. Note that this issue does not vanish when increasing the number of simulated paths, N.
2.2. Call Options Priced by Symmetry
When pricing call options using the “symmetric” method, the regressions carried out to price the, now, put option may be expected to be better behaved. In particular, the independent variable and the regressors are now bounded above by the strike price when using only the paths that are in the money. Columns 10–13 of Table 1 show the resulting price estimates, which may be compared directly to the estimates from the “regular” method in Columns 5–8. The first thing to notice is that with this approach, the estimated prices for the long-term high volatility options are now much closer to the benchmark values, and in fact, none of them are statistically different from the benchmark values provided by the binomial model. Note that some of the biases, five to be precise, are slightly larger for the symmetric method than for the regular method, yet in all cases, they are very small.
However, not only is the bias of these estimates comparable across options, the standard deviation of the estimates is also similar across options. More importantly, the standard errors of the estimates are always lower than what is obtained with the regular method, and this is so even for the short-term options with low volatility in the first five rows. Across the 20 options, the regular method yields estimates with a standard error that is on average three-times larger, with the best case being roughly worse (the option with , and ), and the worst case having a standard deviation almost nine-times larger.
Because of the low bias and the much lower standard deviation, the RMSE of the call price estimates obtained using symmetry is much lower than that obtained when pricing the option with the regular method across the benchmark sample. For half of the options, the RMSE is less than half that obtained with the regular method when using the symmetric method. In the best case across the 20 options, the regular method is only worse than the symmetry method; however, one would never do worse when pricing this set of call options using symmetry than with the regular method. This is a very strong argument for using symmetry to price call options.
3. Implementation
The first step in implementing any type of numerical algorithm to price American options is to assume that time can be discretized. Thus, we assume that the derivative considered may be exercised at J points in time. We specify the potential exercise points as , with and T corresponding to the current time and maturity of the option, respectively. An American option can be approximated by increasing the number of exercise points J, and a European option can be valued by setting . We assume a complete probability space equipped with a discrete filtration . The derivative’s value depends on one or more underlying assets, which are modelled using a Markovian process, with state variables adapted to the filtration and with known. We denote by an adapted payoff process for the derivative satisfying for a suitable function , which is assumed to be square integrable. Following, e.g., Karatzas (1988) and Duffie (1996), in the absence of arbitrage, we can specify the American option price as:
where denotes the set of all stopping times with values in and where it is therefore implicitly assumed that the option cannot be exercised at time .
In the literature, the problem of calculating the American option price in Equation (1), i.e., with , is referred to as a discrete time optimal stopping time problem. The preferred way to solve such problems is to use the dynamic programming principle. Intuitively, this procedure can be motivated by considering the choice faced by the option holder at time : either to exercise the option immediately or to continue to hold the option until the next period. Obviously, at any time, the optimal choice will be to exercise immediately if the value of this is positive and larger than the expected payoff from holding the option until the next period and behaving optimally from there on forward. To fix notation, in the following, we let denote the value of the option for state variables X at a time prior to expiration. We define as the expected conditional payoff, where is the optimal stopping time. It follows that:
and it is easily seen that it is possible to derive the optimal stopping time iteratively using the following algorithm:
Based on this, the value of the option in Equation (1) can be calculated as:
The backward induction theorem of Chow et al. (1971) (Theorem 3.2) provides the theoretical foundation for the algorithm in Equation (3) and establishes the optimality of the derived stopping time and the resulting price estimate in Equation (4).
3.1. Simulation and Regression Methods
The idea behind using simulation for option pricing is quite simple and involves estimating expected values and therefore option prices by an average of a number of random draws. However, when the option is American, one needs to determine simultaneously the optimal early exercise strategy, and this complicates matters. In particular, it is generally not possible to implement the exact algorithm in Equation (3) because the conditional expectations are unknown, and therefore, the price estimate in Equation (4) is infeasible. Instead, an approximate algorithm is needed. Because conditional expectations can be represented as a countable linear combination of basis functions, we may write , where form a basis.4 In order to make this operational, we further assume that it is possible to approximate well the conditional expectation function by using the first terms such that and that we can obtain an estimate of this function by:
where are approximated or estimated using simulated paths. Based on the estimate in Equation (5), we can derive an estimate of the optimal stopping time from:
From the algorithm in Equation (6), a natural estimate of the option value in Equation (4) is given by:
where is the payoff from exercising the option at the optimal stopping time determined for path n according to Equation (6).
3.2. Implementation of the LSMC Method
When implementing the method outlined above, one has to choose at least two things: how to generate the data, the simulated state variables, and how to approximate the value function, that is how to estimate the parameters in the approximation. The key contribution of Longstaff and Schwartz (2001) is to suggest that the coefficients in the approximation of the continuation value, , can be estimated in a simple cross-sectional ordinary linear (OLS) regression, where the independent variable is the discounted path-wise future payoff and the dependent variables are functions of the current state variables. In this paper, we propose to merge the LSMC method with PCS and use the symmetric method when pricing call options. Thus, instead of simulating paths from a dynamic model with a risk-free rate of r and dividend yield of d, we simulate from the same dynamic model, but with a risk-free rate of d and dividend yield of r, and instead of pricing the option as a call option with a strike price of K and a current value of the underlying asset of S, we price the option as if it had been a put option with a strike price of S and a current value of the underlying asset of K.5 These changes are simple to make and involve no extra computational complexity or changes to the numerical procedure. In fact, for consistency, it is important to note that we use the exact same numerical procedures for simulating the paths and to implement the cross-sectional regression.6
There are two very intuitive reasons why using the symmetric method to price call options may work better than when pricing the call option using the regular method. First, as explained above, in simulation-based methods, the option price, an expectation under the risk neutral measure, is approximated by the average of a number of random realizations of future payoffs, obtained from simulated values of the appropriate state variables. This mean obviously behaves better, and the estimator will have a smaller variance when the possible realizations are bounded, as they are in the case of the payoff of a put option, than when they are unbounded, as they are in the case of the payoff of a call option. Our numerical results for the benchmark options in Section 2 indeed showed that the standard deviation of the estimates is always lower when using the symmetric method than when using the regular method.7
Second, it is easier to approximate the continuation value when this is a bounded function on a bounded interval than when this is an unbounded function on an unbounded interval. In particular, theoretically, it is straightforward to design a robust approximation scheme for the continuation value of a put option using only the simulated paths that are in the money.8 For call options, on the other hand, no general theoretical results exist to justify that this is in fact feasible, and though numerical schemes are available and polynomial families that have nice properties can be used, approximating the continuation value is likely much more complicated. A further complication with the continuation value of a call option is that this is bounded below by the exercise value for large values of the underlying asset and thus asymptotically linear in the stock value. It is obviously difficult to approximate a function with these characteristics.
4. Results
The motivating example in Section 2 clearly demonstrated that there might be significant value to pricing call options as put options using symmetry properties when using the LSMC algorithm. To test this further, we now price a large sample of call options with five different strike prices, , maturities, years, interest rates, , dividend yields, , and volatilities, , for a total of options. This sample arguably spans most of the important cases one would come across in real-life applications of option pricing. We first consider performance for various numbers of exercise possibilities J and across option characteristics, i.e., K and T. Next, we consider performance across model parameters, i.e., r, d and . Benchmark values are from the Cox et al. (1979) binomial model with 25,000 steps and J early exercise possibilities.
In this section, we use a slightly different setup for the LSMC algorithm in that we use monomials as regressors and we use simple “plain vanilla” Monte Carlo simulation. We choose monomials instead of Laguerre polynomials because they are simpler and faster to use. We choose a plain Monte Carlo simulation without any variance reduction techniques such that our results are not potentially dependent on a particular variance reduction method. We again report results with the LSMC method using independent simulations with 100,000 paths and regressors. We assess model performance using the and error metrics where is the simulated estimate of the price P. Since we cannot report all the individual errors, we report average errors instead. We also consider the fraction of options for which the regular and symmetric method have the highest bias and RMSE, respectively.9
4.1. Performance across Option Characteristics
Table 2 reports results for our benchmark implementation of the LSMC method for the large sample of call options considering different numbers of total early exercises, J, constant across the maturity, from to (a close approximation to the continuously-exercisable American option) and across different strike prices, K, and different maturities, T, for options with early exercises. Figure 1 plots the relative performance of the symmetric method compared to the regular method for the four aggregate error metrics across these three dimensions.
Table 2.
Pricing errors across option characteristics.
Figure 1.
Relative pricing performance across option characteristics. This figure plots the relative performance of the symmetric method compared to the regular method across the number of early exercises, J, strike price, K, andmaturity, T. Panels B and C use results for J = 50 early exercise points only.
Panel A of Table 2 first of all shows that using the regular method for this sample of options leads to significantly low biased price estimates. For example, when considering the case with exercise times, the average bias is almost six cents with this method, whereas it is less than a cent if the symmetric method is used leading to an average improvement of . The improvement in performance with the symmetric approach is large also for the RMSE. Moreover, the improvement in performance is not only large on average, but also across most of the options, as the counting metrics show. In particular, improvements occur for and of the options in terms of bias and the RMSE, respectively. Figure 1a shows that the improved performance is not limited to a particular choice of the number of early exercise possibilities, J, and improvements are found for all the reported values of J. Across the number of early exercises, the figure does indicate that the relative performance improves rapidly with J when there are only a small number of exercise possibilities. Once J reaches 50 or 100, the effect in terms or RMSE tapers off, and the relative improvement in performance does not change much when increasing the number of early exercise points further.10
Panel B of Table 2 shows the results across moneyness and demonstrates that the absolute errors of both methods decrease when the strike price increases. In terms of the counting metrics the panel shows that the symmetric method has the smallest errors for of the out of the money options. For in the money options, where determining the early exercise strategy is of less importance, the improvement is relatively smaller, though the symmetric method continues to yield more precise price estimates on average and for the majority of the options. In relative terms, Figure 1b shows that the symmetric method performs better than the regular method across all strike prices. The relative performance is best for options with low strike prices, i.e., call options that are out of the money, and for these options, the relative performance in terms of the counting metrics is quite extraordinary.
Finally, Panel C of Table 2 shows the results across maturity and documents clear and significant improvements in the relative performance of the symmetric method for all metrics when maturity increases. It is noteworthy that the symmetric method actually leads to more precise, in terms of RMSE, estimates for all subcategories. In terms of the counting metrics, the symmetric method also largely outperforms the regular method and leads to prices estimated with smaller errors in at least of the cases, the worst relative performance being for the shortest maturity options. In relative terms, Figure 1c shows that the symmetric method performs better than the regular method across all maturities and error metrics. In fact, for the majority of the categories, that is for options with maturity of years or more, the symmetric method leads to lower RMSE for at least nine out of 10 of the options. Thus, the results from Section 2 hold true in general for a much larger sample of options.
4.2. Performance across Model Parameters
We now consider the relative performance of the two methods for some interesting subgroups of model parameters like the interest rate r, the dividend yield d and the volatility of the underlying asset . Table 3 shows the results across interest rate, r, dividend yield, d, and volatility, , for our large sample of options. Figure 2 plots the relative performance of the symmetric method compared to the regular method across these three dimensions.
Table 3.
Pricing errors across model characteristics.
Figure 2.
Relative pricing performance across model characteristics. This figure plots the relative performance of the symmetricmethod compared to the regular method across interest rates, r, dividend yields, d, and volatility levels, .
Figure 2a,b along with Panels A and B of Table 3 show that the relative improvement from using the symmetric method is large across all the possible values of interest rates and dividend yields. This holds both in terms of the average metrics and in terms of the number of options for which the symmetric method has the smallest error. In terms of absolute errors, the relative performance of the symmetric method decreases slightly when the interest rate increases, though the method produces estimates with errors that are very small and never above one third of the errors obtained with the regular method. When the dividend yield increases, the relative performance of the symmetric method increases somewhat. Note that the case with is special since in this situation, the American call option should never be exercised early.
Figure 2c and Panel C of Table 3 document clear and significant improvements in the relative performance of the symmetric method for all metrics in absolute, as well as relative terms when volatility increases. It is noteworthy that the symmetric method actually leads to more precise, in terms of RMSE, estimates for all subcategories. In terms of the counting metrics, the symmetric method also largely outperforms the regular method and leads to price estimates with smaller errors in at least of the cases, the worst relative performance being for the lowest volatility options. For the majority of the categories, that is for options with volatility of or more, the symmetric method leads to lower RMSE for at least nine out of 10 of the options. Thus, the results from Section 2 hold true in general for a much larger sample of options.
5. Robustness
The previous section provides strong evidence in favour of using symmetric pricing for call options. In this section, we examine the robustness of these results along two dimensions. We first examine the importance of the choice of the number of paths, N, and the number of regressors, L, used in the Monte Carlo simulation and whether or not our reported results are robust to using so-called out-of-sample pricing. Next, we examine the robustness of our results to using alternative option pricing models. Here, we consider the case with multiple underlying assets and the case in which the underlying asset follows the stochastic volatility model of Heston (1993).
5.1. Alternative Choices for the Number of Paths and Regressors
When implementing the LSMC method, one needs to choose the number of paths to simulate, N, and the number of regressors, L, to use in the cross-sectional regressions. While it is well known that the estimated prices converge to the true price when both N and L tend to infinity (see, e.g., Stentoft (2004b)), any real application involves choosing a finite number of paths and regressors. Table 4 shows the results across the number of simulated paths, N, and the number of regressors, L, for our large sample of options. Figure 3 plots the relative performance of the symmetric method compared to the regular method across these two dimensions.
Table 4.
Pricing errors across algorithm characteristics.
Figure 3.
Relative pricing performance across algorithm characteristics. This figure plots the relative performance of the symmetric method compared to the regular method across the number of simulated paths, N, and number of regressors, L.
Panel A of Table 4 shows the results when increasing the number of simulated paths from 20,000 to 500,000, while keeping the number of regressors fixed at . In this case, we know that the methods converge to a low estimate of the true value, one that is based on using a rather rough approximation of the conditional expectation function used to determine the optimal early exercise. The table confirms this numerically in that the bias for both methods, regular as well as symmetric, becomes more negative with increasing N. Note also that the regular method always yields price estimates with a low bias on average even when using as low as 20,000 paths, whereas the symmetric method yields high biased estimates for low N.
When comparing the two methods, it is noteworthy, though, that in all cases, the absolute bias and the RMSE is lowest with the symmetric method, and this method consistently outperforms the regular method across all choices of N, as can be seen from Figure 3a. When it comes to the number of times the symmetric method has lower errors, a pattern very similar to what was seen when increasing T or is found.
Panel B of Table 4 shows the results when increasing the number of regressors from to , while keeping the number of simulated paths fixed at 100,000. In this case, we know that, everything else equal, the estimated prices should increase as the approximation gets better and better, although this may eventually result in a high bias because of over fitting the function on a finite number of simulated paths. The table confirms this numerically in that the bias for both methods, regular as well as symmetric, becomes more positive with increasing L. The change is most dramatic for the regular method, which goes from having an average negative bias of close to eight cents to having an average positive bias of more than six cents.
When comparing the two methods, the table shows that the symmetric method almost always provides estimates with smaller errors than does the regular method across the choice of L. The exception to this is when using and looking at the average bias, where the absolute value from using the regular method is half that of using the symmetric method. When it comes to the number of times the symmetric method has lower errors, a pattern very similar to what was observed previously is found. The main difference is that, for the first configuration of all the ones considered up to this point, a case occurs where the regular method on average provides estimates that are better in terms of the RMSE. Unsurprisingly, this happens when for which of the estimated regular prices have smaller errors.
When looking at Panel B of the table, it is noteworthy that the performance of the symmetric method is better for a small, i.e., , or a large, i.e., , choice of regressors. A similar, though less pronounced, non-linear relationship is found in Panel A of the table when the number of simulated paths is increased. Given this concave relationship, as a function of L, and convex relationship, as a function of N, in the relative performance, it is indeed possible that one could find a combination of L and N for which the regular method would outperform the symmetric method for our large sample of options. This though would be largely due to luck (or would require one to consider a large number of possible combinations) and as such is not of much help or relevance.
One reason that the choice of polynomial is important is that the LSMC method mixes two types of biases: a low bias due to having to approximate the optimal stopping time with a finite degree polynomial and a high bias stemming from using the same paths to determine the optimal early exercise strategy and to price the option, potentially leading to over fitting to the simulated paths. As a result of this, the bias just happens to be somewhat smaller without symmetry, a value of , then when using symmetry, a value of when using regressors with 100,000 paths.11 One easy way to control the sign of the bias is to conduct so-called out-of-sample pricing in which a new set of simulated paths is used to price the option instead of using the same set of paths that were used for determining the optimal early exercise strategy. Table 5 shows the results for the different configurations of N and L and, as expected and in line with the theory, shows that the bias of the estimates from the regular, as well as the symmetric method is negative, i.e., the estimates are low biased.
Table 5.
Pricing errors across algorithm characteristics using out-of-sample pricing.
Compared to Table 4, Panel A of Table 5 shows that when using out-of-sample pricing, the estimated prices with the regular method improve significantly when the number of simulated paths, N, increases. The estimates from the symmetric method, however, are much less affected by the number of paths used. The reason for this is related to the over fitting and large variance in the estimates in the cross-sectional regressions with the regular method, which for a given choice of L, becomes less of an issue with increasing N. When using symmetric pricing, this is much less of an issue since the regressors are bounded. Compared to Figure 3, Figure 4 shows that when using out-of-sample pricing, the relative performance of the symmetric method is much less dependent on the choice of N and L. In particular, the symmetric method now improves significantly on the regular method irrespective of the choice of L. Panel B of Table 5 shows that once or more regressors are used, the RMSE of the symmetric method is around of the RMSE of the regular method. In terms of the number of times the symmetric method leads to the smallest RMSE, this is around or more for all values of L.
Figure 4.
Relative pricing performance across algorithm characteristics using out-of-sample pricing. This figure plots the relative performance of the symmetric method compared to the regular method using out-of-sample pricing across the number of simulated paths, N, and number of regressors, L.
5.2. Extensions to Other Option Pricing Models
Until now, we have presented results for the simple Black–Scholes–Merton setup. The reason for this was obvious: we wanted to have fast and precise benchmark results available. Without these, it makes no sense to talk about one method being more efficient than another since we measure efficiency by loss functions such as the RMSE for which a benchmark is required. However, we have argued that since our results rely on nothing but simulation and regression, our conclusions should be valid for other settings in terms of asset dynamics and option payoffs for which PCS holds. We now consider two obvious alternatives and demonstrate that our previous conclusions indeed continue to hold. We first provide results when the option payoffs depend on the average of several assets in a multivariate model, and second, we consider the case where the asset dynamics are instead given by the stochastic volatility (SV) model of Heston (1993).
The case with options written on multiple assets is the most obvious generalization of the standard constant volatility case. Options can be written on the maximum, minimum or average of multiple assets. These types of payoff functions have been used widely in the literature, and the work in Boyle and Tse (1990) gave examples on where these types of options are traded. The work in Stentoft (2004a) demonstrated that as the dimension of the problem increases, simulation-based methods like the LSMC are the most efficient methods to use for pricing. While put-call symmetry properties have been established in several cases (see for example Detemple (2001)), the most clean-cut case occurs with options written on the geometric average, i.e., options for which the payoff is given by:
where , are the prices of the underlying assets, M being the dimension, and K is the strike price as before. The reason that this case is a “clean-cut” example is that since the product of lognormals is lognormal, the pricing problem essentially reduces to that of pricing single asset options on an asset that follows a (particular and slightly non-standard) GBM.12
In the LSMC method, we again consider the complete set of polynomials of order or less and therefore use a total of 10, 20 and 56 regressors when the dimension of the problem, M, is 2, 3 and 5, respectively. In all cases, we use independent simulations with 100,000 simulated paths. We consider options with different features and different asset dynamics. In particular, we consider three values of the strike price, , the time to maturity, years, the volatility, , the correlation between the assets, , and the number of assets, , for a total of options. Pricing options in multiple dimensions quickly become computationally complex, and for this reason, we consider a smaller sample of options than in the benchmark case. Overall, our results show that the average RMSE obtained with the symmetric method is smaller than that obtained with the regular method, and the RMSE is smallest for of the individual options when using the symmetric method. These results clearly show that symmetry is valuable also for more advanced multivariate models.
Figure 5a and Panel A of Table 6 show the results across the number of assets, M, and demonstrate clearly that our suggestion of pricing call options as put options becomes more important as the dimension, and hence the computational complexity, of the option increases. In particular, the relative error of the symmetric method in terms of RMSE in Figure 5a goes from – as the dimension increases from 2–5. Moreover, when the dimension of the problem is high, the symmetric method almost always, in of the cases, has the lowest RMSE, as shown in Panel A of Table 6.
Figure 5.
Relative pricing performance in a multivariate model. This figure plots the relative performance of the symmetric method compared to the regular method across the mean reversion rate κ, long-term variance, θ, initial volatility, V0, and correlation, ρ.
Table 6.
Pricing errors across model characteristics in a multivariate model.
Figure 5b and Panel B of Table 6 show the results across the correlation between the assets, . From the figure, it is seen that symmetry becomes more important when correlations between assets increase. In particular, in terms of the absolute metrics, the regular method performs on par with the symmetric method when the correlation is low and , but when the correlation is high and , the error in pricing of the symmetric method is only around half that of the regular method, as shown in Figure 5b. Moreover, the symmetric method is always the method that yields the smallest errors for the largest fraction of the options. For example, when correlations are high among the underlying assets, the symmetric method has larger RMSE for only of the options as shown in Panel B of Table 6.
The SV model of Heston (1993) is one of the most famous extensions to the constant volatility model. While it may not have been the first SV model, e.g., earlier examples include Hull and White (1987), Scott (1987) and Wiggins (1987), this particular model has emerged as the most important one and now serves as a benchmark against which many other SV models are compared. In the model of Heston (1993), the variance followed a Cox et al. (1985) process specified as:
Here, represents the mean reversion rate of the variance, is the long-term variance, is the volatility of volatility (vol of vol) and the stock dynamics and variance are allowed to be correlated with correlation coefficient . Put-call symmetry also holds in this model as demonstrated by, e.g., Battauz et al. (2014), which also listed the appropriate changes of parameters. In particular, the paper shows that in the model of Heston (1993), the following parity holds between a call option and a put option:
In other words, a call option can be priced as a put option in which the stock price and the strike price and the interest rate and the dividend yield are interchanged, as was the case in the constant volatility setting, and in which the mean reversion, , the long-term variance, , and the correlation, , for the symmetric put option are changed to:
where , and are the actual mean reversion, long-term variance and correlation.
We again price each of the individual options -times with independently-simulated state variables used in the LSMC method, which we implemented using 100,000 paths. In the cross-sectional regressions, we use the complete set of polynomials in two dimensions of order or less as regressors for a total of 10 regressors including the constant term. We consider three different values of the strike price, , the time to maturity, years, the mean reversion rate, , the long-term variance, , the initial level of the variance, , the volatility of volatility, , and the correlation between the two Brownian motions, , for a total of options. Overall, our results show that the average RMSE obtained with the symmetric method is smaller than that obtained with the regular method, and the RMSE is smallest for of the individual options when using the symmetric method. These results clearly show that symmetry is valuable also for more advanced models, and though the results are somewhat closer to each other than with our benchmark model, this is to be expected since we are considering options here that have on average shorter maturities.
Figure 6 and Table 7 show the results across the various values of the mean reversion rate, , the long-term variance, , the initial level of the variance, , and the correlation, . The first thing to notice from the table is that across all these interesting parameters and parameter values, the symmetric method most of the time outperforms the regular approach. The two exceptions to this are options with very low long-term variance and and when the initial level of the variance is very high and , where the RMSE is slightly lower with the regular method than with the symmetric method. In both of these cases, the symmetric method also has larger RMSE for more options than does the regular method, whereas in all the other cases, the symmetric method most often has the smallest RMSE.
Figure 6.
Relative pricing performance in a stochastic volatility model. This figure plots the relative performance of the symmetric method compared to the regular method across the mean reversion rate κ, long-term variance, θ, initial volatility, V0, and correlation, ρ.
Table 7.
Pricing errors across model characteristics in a stochastic volatility model.
In terms of performance across parameters, Figure 6 and Table 7 show that the symmetric method performs relatively better the faster the volatility mean reverts, i.e., the larger the value of in Figure 6a, and the larger the value of the long-term variance, i.e., the larger the value of in Figure 6b. The last of these findings is completely in line with the results from our benchmark model where symmetric pricing was most important for high values of the asset volatility. Figure 6d shows that the symmetric method performs relatively better when correlations are negative, which is indeed the empirically most relevant case in the stochastic volatility model of Heston (1993). When the correlation is highly negative and , the symmetric method has errors in terms of bias and RMSE that are around lower than the regular method and has the largest errors for only around one third of the individual options. The effect on the relative performance of the initial variance, , is less clear cut though Figure 6c does indicate that the symmetric method performs relatively better when is not too extreme, and in particular not too high.
6. Discussion
The fact that pricing call options using the symmetry method works best for most and along some dimensions almost all of the options considered is great news. However, since it does not perform the best for all the options, it leaves the obvious question of when to choose one method over the other. As it is, the only solid recommendations that arise from Section 4 and Section 5 are that using the symmetric method with standard choices of the number of paths and number of regressors used in the LSMC method is relatively better the longer the maturity and the larger the volatility and that the methods become more similar when simulating a very large number of paths, e.g., when N is as large as 500,000, and that they diverge when using a large number of regressors, e.g., when L is as large as 15. In this section, we first examine the performance of the individual methods in terms of a relative efficiency measure, which compares the performance of a method to what could have been obtained optimally. Then, using properties of the out-of-sample method for pricing, we propose a method for selecting which specification to use across the methods and the number of regressors, which is simple to implement and achieves a very high degree of efficiency.
6.1. Efficiency as an Alternative Metric
Until know, we have compared performance metrics, i.e., the RMSE or the number of times a method works the best or worst, for the regular and symmetric pricing methods, respectively. An alternative and perhaps more interesting metric for “practitioners” is what one stands to lose in terms of increased pricing errors by picking and sticking to one particular method instead of using the optimal method for a given individual option in our sample. To examine this, we now consider a metric, which we will refer to as the “efficiency” given by the ratio of a specific method’s RMSE to the optimal and infeasible RMSE that could be obtained if one knew which method to use for each of the individual options.
Table 8 shows the efficiency of the two methods using in-sample pricing for various values of N and L in Columns 5 and 6. For comparison, the fraction of the options for which a particular model performs the best in terms of having the lowest RMSE is also reported in Columns 7 and 8.13 Panel A of Table 8 clearly shows that the symmetric method performs extremely well across the number of simulated paths, and one would never lose more than from using this method. In fact, for most realistic specifications, i.e., when 100,000, the loss is less than . The regular method, on the other hand, often has an efficiency of just around , meaning that if this method was used to price the sample of options, one would lose around compared to what could optimally be obtained.
Table 8.
Efficiency across the number of paths N and regressors L.
Panel B of the table is, given the results in the previous section on robustness, even more interesting. In particular, the previous results showed that for some specification, i.e., when picking , the symmetric method actually has larger RMSE than the regular method for most options. The row labelled in Table 8, however, shows that even in this case where the symmetric RMSE is the lowest for only of the options, the method’s efficiency is above . That is, even for settings when the regular method is the best, measured by minimizing the RMSE, for of the options when using the symmetric method, you would not lose more than compared to what could be optimally obtained had you known what would be the best method to use for the individual options. It is also striking that if you, on the other hand, would use the regular method for all options, the efficiency is only around in spite of the fact that this is the method that has the lowest RMSE for most of the options.
Table 9 shows the efficiency of the two methods using out-of-sample pricing for various values of N and L in Columns 5 and 6. The first thing to notice form this table is that when using out-of-sample pricing, i.e., when a new set of paths is used for pricing, the efficiency of the symmetric method is extremely high, and often above , across both the choice of the number of paths, N, and the number of regressors, L. Compared to the in-sample results in Table 8, the efficiency of the symmetric method is most of the time improved, the exception being when using 200,000 paths in the simulation. For the regular method, on the other hand, efficiency is generally much lower, as low as , and does not improve in any systematic way when using out-of-sample pricing. In conclusion, although the symmetric method is not always the model that has the smallest RMSE, the efficiency of this method is generally very high, always significantly higher than that of the corresponding regular method, and therefore, the costs of using this method are always reasonably low. Our suggestion is therefore very naturally to use the symmetric method for call option pricing.
Table 9.
Efficiency across number of paths N and regressors L using OSpricing.
6.2. Picking the Best Configuration
Although we recommend to always use the symmetric method for call option pricing, you may still wonder if it is possible to improve on this recommendation, i.e., if it is possible to pick the “right” model using some “observables”. This is essentially a question of classification. A straightforward classification variable is the estimated price. In particular, we know that when using the out-of-sample pricing technique, estimates are in expectation low biased. Moreover, while we expect the estimates to increase when increasing the number of regressors, L, initially, as this improves the polynomial approximation, when L becomes very large and over fitting to the paths used to determine the optimal exercise strategy becomes a problem, the estimated out-of-sample price could decrease. When comparing results for several different values of L and different methods, i.e., regular versus symmetric, one could therefore propose to choose the method that maximizes the out-of-sample price. In particular, this should result in picking the method that has the smallest bias, and this would potentially also be the method with a small RMSE. The results from implementing this classification strategy are shown in Table 10.
Table 10.
Efficiency across the number of regressors L using OS pricing.
Panel A in Table 10 reports results for individual values of L, i.e., when the method, regular or symmetric, that has the highest price for a given value of L is picked. The first thing to notice from this panel is that the right method for a given option is picked at least of the time, and the efficiency of this method is always above . The symmetric method clearly performs the best on average for all values of L, and this method does have a very high “local” efficiency, that is compared to the optimal RMSE for a particular value of L. The regular method, on the other hand, has a much lower efficiency. Compared to the efficiency of the individual methods, the panel shows that classification according to maximum price does improve on the RMSE in all but one case. In terms of “global” efficiency though, the performance of the methods varies greatly across L.
Panel B in Table 10 reports results across all the values of L used in Panel A, i.e., in this panel, the optimal RMSE is picked across both methods and the values of L. The first thing to notice from this panel is that the classification method performs very well and picks the right method more than of the time and has a very high efficiency of close to . Picking the symmetric method that has the highest price across L also results in estimates that are very efficient, although the optimal RMSE is slightly lower. The efficiency of the regular method is below when compared to the globally optimal method, although measured locally across L picking the method with the highest price results in estimates with an RMSE very close to the minimum RMSE for this method.
The results above show that when using out-of-sample pricing, it is possible to derive a simple classification algorithm that achieves very high efficiency. In reality, the algorithm ends up picking the symmetric method most of the time, and if you only pick within this method, the loss in efficiency is very small, i.e., it decreases from –. Thus, it is possible to save on the computational time by only considering this method. Moreover, for most of the options, the highest price is achieved with when using the symmetric method, and if, instead of picking among all the possible values of L in the table, you only consider , the global efficiency decreases only marginally to .14 This approach thus yields very good estimates across our large sample of options, and it is easy to implement. It is not possible to come up with a similar approach when using in-sample pricing though.
7. Conclusions
This paper shows that it is possible to improve significantly on the estimated call prices obtained with a state-of-the-art algorithm, the Least Squares Monte Carlo (LSMC) method of Longstaff and Schwartz (2001), for option pricing using regression and Monte Carlo simulation by using Put-Call Symmetry (PCS). PCS holds widely and in the classical Black–Scholes–Merton case, for example, implies that a call option has the same price as an otherwise similar put option where strike price and stock level and where interest rate and dividend yield are interchanged, respectively. The immediate implication is that path-wise payoffs in simulations are bounded above by the strike price instead of being unbounded, and for methods that use regression to determine the optimal early exercise strategy, this leads to much improved estimates of the stopping time and more precise option price estimates as measured by, for example, the Root Mean Squared Error (RMSE). Our results show that, for a large sample of options with characteristics of relevance in real-life applications, the symmetric method on average performs much better than the regular pricing method, is the best method for most of the options, never performs very poorly and as a result is very efficient compared to an optimal, but unfeasible method that picks the method with the smallest RMSE. When using out-of-sample pricing, a simple classification algorithm is proposed that, by optimally selecting among estimates from the symmetric method with a reasonably small order used in the polynomial approximation, achieves a relative efficiency of more than compared to the infeasible, but optimal method that minimizes the RMSE across all estimates. Our results also show that the relative importance of using the symmetric method increases with option maturity and with asset volatility and using symmetry methods to price long-term options in high volatility situations improves massively on the price estimates. The LSMC method is routinely used to price real options, many of which are call options with long maturity on volatile assets, for example energy. We therefore conjecture that pricing such options using the symmetric method could improve the estimates significantly by decreasing their bias and RMSE by orders of magnitude.
Funding
This research received no external funding.
Acknowledgments
I thank Pascal Francois for bringing the put-call symmetry property to my attention and the three reviewers for excellent comments that led to a much improved paper. Dayi Li provided expert research assistance. For additional numerical results and further details, please consult the online working paper version, which is available at https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3362426. More than 5,000,000 artificial individual options were priced in this paper, which would have been impossible without the computational resources made available by the Canadian Foundation for Innovation (CFI).
Conflicts of Interest
The author declares no conflict of interest.
References
- Battauz, Anna, Marzia De Donno, and Alessandro Sbuelz. 2014. The Put-Call Symmetry for American Options in the Heston Stochastic Volatility Model. Mathematical Finance Letters 7: 1–8. [Google Scholar]
- Boyle, Phelim P., and Yiu Kuen Tse. 1990. An Algorithm for Computing Values of Options on the Maximum or Minimum of Several Assets. Journal of Financial and Quantitative Analysis 25: 215–27. [Google Scholar] [CrossRef]
- Chow, Yuan-Shih, Herbert Robbins, and David Siegmund. 1971. Great Expectations: The Theory of Optimal Stopping. New York: Houghton Mifflin. [Google Scholar]
- Cox, John C., Jonathan E. Ingersoll, and Stephen A. Ross. 1985. A Theory of the Term Structure of Interest Rates. Econometrica 53: 385–408. [Google Scholar] [CrossRef]
- Cox, John C., Stephen A. Ross, and Mark Rubinstein. 1979. Option pricing: A simplified approach. Journal of Financial Economics 7: 229–63. [Google Scholar] [CrossRef]
- Detemple, Jerome. 2001. American Options: Symmetry Properties. In Handbooks in Mathematical Finance: Topics in Option Pricing, Interest Rates and Risk Management. Edited by Elyes Jouini, Jaksa Cvitanic and Marek Musiela. Cambridge: Cambridge University Press, pp. 67–104. [Google Scholar]
- Duffie, Darrell. 1996. Dynamic Asset Pricing Theory. Princeton: Princeton University Press. [Google Scholar]
- Grabbe, J. Orlin. 1983. The Pricing of Call and Put Options of Foreign Exchange. Journal of International Money and Finance 2: 239–53. [Google Scholar] [CrossRef]
- Heston, Steven L. 1993. A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options. Review of Financial Studies 6: 327–43. [Google Scholar] [CrossRef]
- Hull, John, and Alan White. 1987. The Pricing of Options on Assets with Stochastic Volatilities. Journal of Finance 42: 281–300. [Google Scholar] [CrossRef]
- Karatzas, Ioannis. 1988. On the Pricing of American Options. Applied Mathematics and Optimization 17: 37–60. [Google Scholar] [CrossRef]
- Longstaff, Francis A., and Eduardo S. Schwartz. 2001. Valuing American Options by Simulation: A Simple Least-Squares Approach. Review of Financial Studies 14: 113–47. [Google Scholar] [CrossRef]
- McDonald, Robert C., and Mark Schroder. 1998. A Parity Result for American Options. Journal of Computational Finance 1: 5–13. [Google Scholar] [CrossRef]
- Royden, Halsey. 1988. Real Analysis. Upper Saddle River: Prentice Hall, Inc. [Google Scholar]
- Schroder, Mark. 1999. Changes of Numeraire for Pricing Futures, Forwards, and Options. Review of Financial Studies 12: 1143–63. [Google Scholar] [CrossRef]
- Scott, Louis O. 1987. Option Pricing When the Variance Changes Radomly: Theory, Estimation, and an Application. Journal of Financial and Quantitative Analysis 22: 419–38. [Google Scholar] [CrossRef]
- Stentoft, Lars. 2004a. Assessing the Least Squares Monte-Carlo Approach to American Option Valuation. Review of Derivatives Research 7: 129–68. [Google Scholar] [CrossRef]
- Stentoft, Lars. 2004b. Convergence of the Least Squares Monte Carlo Approach to American Option Valuation. Management Science 50: 1193–203. [Google Scholar] [CrossRef]
- Wiggins, James B. 1987. Option Values Under Stochastic Volatility: Theory and Empirical Estimates. Journal of Financial Economics 19: 351–72. [Google Scholar] [CrossRef]
| 1. | For example, PCS also holds in the stochastic volatility model of Heston (1993) when the parameters of the volatility process and the correlation are changed appropriately. See, e.g., Battauz et al. (2014) for the exact specification, Grabbe (1983) for an intuitive explanation of how to derive the relationship using options on foreign exchange and Detemple (2001) for extensions to derivatives on multiple assets. |
| 2. | The main parts of the paper present results for the simple Black–Scholes–Merton setup. The reason for this is obvious: we want to have fast and precise benchmark results available. Without these, it makes no sense to talk about one method being more efficient than another. Section 5, though, shows that these conclusions extend to other asset dynamics, like the stochastic volatility model of Heston (1993), and to options with other payoff functions, like options written on multivariate underlying assets. |
| 3. | Although the path-wise payoffs obtained with the LSMC method for a given Monte Carlo simulation are dependent and could be very far from normally distributed, the price estimates we report in the table are averages of independent simulations and should therefore be normally distributed by a central limit theorem. The actual values for the skewness and excess kurtosis are not shown in the table, but are available upon request. |
| 4. | This is justified when approximating elements of the space of square-integrable functions relative to some measure. Since is a Hilbert space, it has a countable orthonormal basis (see, e.g., Royden 1988). |
| 5. | For now, we maintain the assumption that dynamics are governed by simple geometric Brownian motion. However, our results generalize to other models for which PCS holds, as we demonstrate in Section 5. |
| 6. | We deal with the difference in, for example, the payoff when exercising the option by using negative values for the strike price and the stock prices for put options since . |
| 7. | Unreported results, available upon request, show that this generalizes to the much larger sample of options we consider in Section 4. |
| 8. | This follows from the Weierstrass approximation theorem, which states that every continuous function defined on a closed interval can be uniformly approximated as closely as desired by a polynomial function. |
| 9. | Using the fraction of times a given method has the highest error metric ensures, as is the case with the bias and RMSE error metrics, that lower numbers are better. |
| 10. | Note, though, that, e.g., the bias of both the regular and symmetric method increases in absolute terms when increasing the number of exercise points. This is likely related to the fact that dependence is introduced between the paths in the LSMC method because of the cross-sectional regression, and this dependence “accumulates” as we go backwards in time in the algorithm and becomes more and more important as the number of early exercise possibilities increases. |
| 11. | Again, for all other values of the number of simulated paths with this number of regressors and when using other numbers of regressors with this number of simulated paths, the symmetric estimates are less biased. |
| 12. | The working version of this paper contains the full details on how to derive these dynamics. |
| 13. | These numbers are the “inverse” of the counting metrics used in previous tables. |
| 14. | The mode of the number of regressors, L, for the regular method is seven, on the other hand. |
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).