Next Article in Journal
Are CDS Spreads Sensitive to the Term Structure of the Yield Curve? A Sector-Wise Analysis under Various Market Conditions
Next Article in Special Issue
Testing the Information-Based Trading Hypothesis in the Option Market: Evidence from Share Repurchases
Previous Article in Journal
Dynamic Responses of Major Equity Markets to the US Fear Index
Previous Article in Special Issue
Volatility Integration in Spot, Futures and Options Markets: A Regulatory Perspective
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Correcting the Bias in the Practitioner Black-Scholes Method

1
School of Business, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau
2
School of Economics, University of East Anglia, Norwich Research Park, Norwich, Norfolk NR4 7TJ, UK
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2019, 12(4), 157; https://doi.org/10.3390/jrfm12040157
Submission received: 16 August 2019 / Revised: 20 September 2019 / Accepted: 23 September 2019 / Published: 26 September 2019
(This article belongs to the Special Issue Option Pricing)

Abstract

:
We address a number of technical problems with the popular Practitioner Black-Scholes (PBS) method for valuing options. The method amounts to a two-stage procedure in which fitted values of implied volatilities (IV) from a linear regression are plugged into the Black-Scholes formula to obtain predicted option prices. Firstly we ensure that the prediction from stage one is positive by using log-linear regression. Secondly, we correct the bias that results from the transformation applied to the fitted values (i.e., the Black-Scholes formula) being a highly non-linear function of implied volatility. We apply the smearing technique in order to correct this bias. An alternative means of implementing the PBS approach is to use the market option price as the dependent variable and estimate the parameters of the IV equation by the method of non-linear least squares (NLLS). A problem we identify with this method is one of model incoherency: the IV equation that is estimated does not correspond to the set of option prices used to estimate it. We use the Monte Carlo method to verify that (1) standard PBS gives biased option values, both in-sample and out-of-sample; (2) using standard (log-linear) PBS with smearing almost completely eliminates the bias; (3) NLLS gives biased option values, but the bias is less severe than with standard PBS. We are led to conclude that, of the range of possible approaches to implementing PBS, log-linear PBS with smearing is preferred on the basis that it is the only approach that results in valuations with negligible bias.

1. Introduction

The Practitioner Black-Scholes (PBS) method (Dumas et al. 1998; Christoffersen and Jacobs 2004) has become a very popular as a benchmark option pricing method, against which other pricing methods can usefully be compared.1 The method amounts to the use of a cross-section sample of market option prices to estimate the implied volatility surface, that is, to estimate the parameters of an equation showing implied volatility (IV) as a function of strike price and time to expiry. The estimated IV equation can then be used to predict option prices, either within-sample or out-of-sample. The purpose of this paper is to raise a number of technical problems arising with the PBS method, and to propose methods for overcoming them.
There are a number of possible ways of implementing the PBS method. One, which we shall call “standard PBS”, is a two-stage approach: (1) use the cross-section of options to estimate a linear regression model with IV as the dependent variable, and strike price, time to expiry, and their squares and cross-product, as explanatory variables; (2) plug the fitted values from this regression into the Black-Scholes formula to obtain predicted option prices.
One immediate problem is that while IV is always positive, there is no guarantee that the prediction of IV from the stage-one regression will be positive. We address this problem by using a log-linear regression in stage one. A possibly more serious problem relates to stage two: as pointed out by Christoffersen and Jacobs (2004), the two-stage method yields biased predictions of option valuations, for the simple reason that the transformation being applied to the fitted values (i.e., the Black-Scholes formula) is a non-linear function of implied volatility. In this paper, we apply the smearing technique (Duan 1983) in order to correct this bias. The smearing technique makes use of the residuals from the IV regression as well as the fitted values and is classified as a non-parametric technique since it is valid whatever the actual distribution of the error term in the IV equation.
An alternative means of implementing the PBS approach is to use the market option price as the dependent variable and estimate the parameters of the IV equation by the method of non-linear least squares (NLLS). A problem we identify with this method is one of model incoherency. The model that underlies the method of NLLS can be shown to be incoherent in the sense that the IV equation that is estimated does not correspond to the set of option prices used to estimate it. Hence NLLS estimation of the IV equation is inconsistent. This may be verified by applying the Hausman test to establish that the IV equation estimated using NLLS is systematically different from that estimated using OLS with IV data.
In this paper, the Monte Carlo method is used to verify that: standard PBS gives biased option values; using standard (log-linear) PBS with smearing corrects the bias, both in-sample and out-of-sample; NLLS gives biased option values, but the bias is less severe than with standard PBS. We also compare other standard measures of performance (MAE and MSE) for each of these prediction measures. We are led to conclude that, of the range of possible approaches to implementing PBS, standard (log-linear) PBS with smearing is preferred on the basis that it is the only approach that results in unbiased valuations, and also performs well in terms of precision.
In Section 2, we outline the PBS method. In Section 3, we outline Duan’s (1983) smearing method, and explain how it can be applied to the PBS method. In Section 4, we consider how the IV equation can be estimated by NLLS applied to option prices. In Section 5 we consider the problem of estimating the IV equation using data from a cross-section of options traded on a particular day. In Section 6 we use estimates from one of the IV equations estimated in Section 5 to design a data generating process (DGP) for a Monte Carlo experiment whose purpose is to evaluate the relative performance of the various prediction methods considered earlier. The principal focus is on which method yields unbiased predictions. Section 7 concludes.

2. The PBS Method

There are three steps in the implementation of the PBS model. The first step is to compute the implied volatility of each option in the sample using (the inverse of) the Black-Scholes formula. That is, to find the implied volatility of call option i, we compute the value of σi that causes the following equality to be satisfied:
C i = C i B S ( σ i )
where Ci is the market price of call option i, and C i B S ( · ) is the Black-Scholes valuation of call option i.2
Second, simple ordinary least squares (OLS) is used to run a regression of implied volatilities on different polynomials in time-to-expiry and strike price. This regression model is sometimes referred to as the “DFW implied volatility (IV) equation”, having been introduced by Dumas et al. (1998). The most general IV equation appearing in the literature is
σ ( θ ) = θ 0 + θ 1 K + θ 2 K 2 + θ 3 τ + θ 4 τ 2 + θ 5 K τ + ε I V
where σ is implied volatility, K is the strike price, and τ is time to expiry.
Finally, the fitted values for IV are obtained as
σ ( θ ^ ) = θ ^ 0 + θ ^ 1 K + θ ^ 2 K 2 + θ ^ 3 τ + θ ^ 4 τ 2 + θ ^ 5 K τ
where hats indicate OLS estimates. These fitted values are plugged back into the Black-Scholes formula to obtain the PBS prices.
C P B S = C B S ( σ ( θ ^ ) )
There is a logical problem with (2), since it does not restrict the implied volatility to be positive. Hence there is a possibility that the fitted values (3) take negative values and this would result in meaningless predictions of option prices when (4) is applied. We will address this problem by assuming a log-linear form for the IV equation3:
ln [ σ ( θ ) ] = θ 0 + θ 1 K + θ 2 K 2 + θ 3 τ + θ 4 τ 2 + θ 5 K τ + ε I V
Let the vector of estimates from (5) be θ ^ log I V . Given fitted values from (5), ln [ σ ( θ ^ log I V ) ] , an obvious procedure for obtaining predicted option prices is to use the following analogue of (4);
C P B S , log I V = C B S [ exp ( ln [ σ ( θ ^ log I V ) ] ) ]
Christoffersen and Jacobs (2004, p. 298), make a very important point when outlining the PBS procedure: “It is clear that simply plugging σ ( θ I V ) into the Black-scholes formula will yield a biased estimate of the observed call price. While OLS will ensure that E [ ε I V ] = 0 , the non-linearity of the dollar option price in volatility and thus in ε I V implies that E [ C ] C B S ( σ ( θ I V ) ) .” This problem arises if either (4) or (6) is used, and is because (4) and (6) are both non-linear functions of the predicted implied volatility.
To our knowledge, no correction has ever been made for this problem. This motivates us to apply the smearing technique (Duan 1983) to correct the bias. This approach to correcting the bias was first suggested by Yin (2018).

3. The Smearing Method in Prediction

Consider the regression model:
y i = x i β + u i i = 1 , , n
Suppose that Equation (7) is estimated by OLS, resulting in predictions y i ^ and residuals u i ^ . But assume that we are interested in predicting not y , but some non-linear function f ( y ) of y . We could just use f ( y ^ ) for this. However, this gives biased predictions unless the function f ( · ) is linear.
Duan (1983) proposed the “smearing” formula which is as follows:
f ^ i = 1 n j = 1 n f ( y ^ i + u ^ j )
The important feature of the smearing formula (8) is that each observation ( i ) is predicted using the mean of a quantity involving the OLS prediction of y i and all of the n residuals ( u j ^ , j = 1 , , n ). Consistency of the smearing estimator (8) is proved by Duan (1983). It is classified as a non-parametric method, since it gives rise to consistent predictions whatever the distribution of the error term (u) in the regression model.
To see exactly how smearing may be applied to the PBS model, consider regression Equation (2) above. The fitted values from this regression, σ i ( θ ^ ) , are given by (3). Further, let the residuals be ε ^ i . The prediction of the price of option i, with smearing, is given by
C i P B S , S M = 1 n j = 1 n C i B S ( σ i ( θ ^ ) + ε ^ j )
As noted in Section 2, a problem with (3) is that it is possible for the predicted IV to be negative, giving rise to meaningless predictions. With (9), this problem is more serious. Even in a situation in which all of the predicted IV’s are positive, it is highly likely that the lowest predicted IV’s become negative when negative residuals are added to them, making some terms of the sum in (9) meaningless.
This problem is avoided if smearing is instead applied to (5) and (6). Letting the residuals from (5) be ε ^ i , the prediction with smearing is
C i P B S , log I V , S M = 1 n j = 1 n C i B S [ exp ( ln [ σ i ( θ ^ log I V ) ] + ε ^ j ) ]
In (10) all terms contained in the outermost brackets are positive, as required. In Appendix B, we provide the STATA code that is required to implement (10) using a cross-section sample of call options.
The smearing technique can also be applied to the problem of out-of-sample prediction. Consider option n + 1 that does not appear in the estimation sample. Equation (10) can be extended to predict the value of option n + 1 , as follows:
C n + 1 P B S , log I V , S M = 1 n + 1 j = 1 n + 1 C n + 1 B S [ exp ( ln [ σ n + 1 ( θ ^ log I V ) ] + ε ^ j ) ]
where ε ^ n + 1 = 0 , that is, the residual for the out-of-sample option is set to zero in the smearing formula.

4. Estimation of the IV Equation by NLLS on Option Price Data

An alternative method for estimating the parameters of the IV equation, (2) or (5), is to apply non-linear least squares to the option price data. Again consider a cross section of call option prices C i , i = 1 , , n . The NLLS estimator of the IV equation parameters, θ, defined in (2) above, is given by
θ ^ N L L S = arg min θ i = 1 n [ C i C i B S ( σ i ( θ ) ) ] 2
A problem we identify with this method is one of model incoherency. The model that underlies the method of NLLS can be shown to be incoherent in the sense that the IV equation that is estimated does not correspond to the set of option prices used to estimate it. Hence NLLS estimation of the IV equation is inconsistent.
The problem is explained as follows. The non-linear regression model underlying (12) is
C i = C i B S ( σ i ( θ ) ) + ε i
The essential difference between (13) and the standard PBS model based around (2) is that while in (2) a stochastic term is applied to the IV equation, in (13) IV is treated as deterministic and the stochastic term is instead applied to the option value.
The problem with (13) is that, unless the error term ( ε i ) is identically zero, it violates the identity that defines implied volatility, namely,
C i C i B S ( σ i ( θ ) )
Put differently, as soon as a non-zero error term is applied to an option price, the option price becomes inconsistent with the implied volatility appearing in brackets on the RHS of (13).
This means that the implied volatility surface estimated using NLLS will not provide implied volatilities that correspond to the option prices used to estimate it. Hence the NLLS estimator of the volatility surface is inconsistent.
This may be verified by applying the Hausman (Hausman 1978) test to establish that the IV equation estimated using NLLS is systematically different from that estimated using log-linear regression on IV data. The formula for the Hausman test statistic is
H = ( θ ^ log I V θ ^ N L L S ) ( V ^ log I V V ^ N L L S ) 1 ( θ ^ log I V θ ^ N L L S )
where V ^ log I V and V ^ N L L S are the estimated variance matrices of the two estimators. Let k be the dimension of θ. If H > χ k , 0.05 2 , there is evidence that θ ^ N L L S is systematically different from θ ^ log I V .

5. Estimation of IV Equations Using Real Data

In this section, we apply the various models to estimate the IV equation using data on S&P500 call options from a single arbitrarily chosen day. The chosen day is 27 July 2000. Data on 51 call options traded on this day are available.4 The full data set is presented in Appendix A.
In Table 1, we report the estimated IV equations resulting from four different models. In Model 1 it is simply assumed that all options have the same IV, equal to the mean of IV taken over all options; this mean is 0.182. Note that this is actually a standard PBS model with no explanatory variables in the IV equation, and is therefore a flat implied volatility surface.
One reason for selecting the chosen day as the data source is that it gives rise to an interesting implied volatility surface in the sense that most of the coefficients in the implied volatility equation are estimated as significantly different from zero. In particular, the coefficients of K and K2 (respectively significantly negative and significantly positive in all models in which they appear) represent strong evidence of the well-known “volatility smile”.
Model 2 is the IV equation used in the standard PBS method, that is, an equation with IV as the dependent variable, and K, K2, τ, τ2, and as explanatory variables. Model 3 is the same as Model 2 but with log(IV) as the dependent variable. Figure 1 compares predicted IV from Models 2 and 3. The two curves are broadly similar, both displaying the “smile”, although perhaps interestingly, the lowest point of the latter is somewhat to the right of that of the former. Model 3 is the model in which we are most interested, since this is the model whose estimates we use as the basis of the DGP for the Monte Carlo study reported in the next section.
It is informative to examine the residuals from Model 3. A histogram of these residuals is presented in Figure 2, with a normal distribution superimposed. The histogram suggests that the distribution of residuals is negatively skewed. With this in mind, the Monte Carlo study reported in the next section includes a DGP with a negatively skewed error. For good measure, we also include DGPs with symmetric and positively skewed errors.
Model 4 applies NLLS to the problem of estimating the parameters of the log(IV) equation the using option price data. It is noted that the estimates from Model 4 appear somewhat different from those of Model 3. One reason for this difference was explained in Section 4: an incorrect equation for log(IV) is being estimated when NLLS is used. The Hausman test statistic has been obtained using (15) and confirms the difference: there is overwhelming evidence of a systematic difference between the two sets of estimates (Models 3 and 4), which confirms that NLLS provides an inconsistent estimate of the true implied volatility surface.

6. A Monte Carlo Analysis of PBS with Smearing

In this section, we set out to verify using the Monte Carlo method that the standard PBS is biased and that PBS with smearing corrects the bias.
For the design matrix (the matrix whose columns are the model’s explanatory variables) we use the same set of 51 S&P 500 (call) options as used in Section 5. We vertically concatenate this matrix with itself to give a total of 102 rows: the first block of 51 rows is used for estimation and in-sample prediction; the second block of 51 observations is used for out-of-sample prediction.
For the data generation process, we use the estimates from the third column of Table 1. That is, the data generating process for IV is
ln ( I V i ) = 11.72 0.0168 K i + 0.000005 K i 2 + 3.98 τ i 0.136 τ i 2 0.0022 K i τ i + ε i V ( ε i ) = 0.0282 2 i = 1 , , 102
We make three different assumptions about the distribution of the error term in (16): normal; positive skew; negative skew. For the skewed errors, we use a suitably standardised chi-squared (3) distribution.5
Having generated a sample of Implied Volatilities, the Black-Scholes formula is then applied to each of them to generate the sample of option prices.
Table 2 presents the results from the Monte Carlo analysis. For each of the three distributional assumptions, we perform 10,000 replications. We then compute the mean of the following three quantities over the 10,000 replications, where predictions ( C ^ i ) for each model have been obtained both in-sample and out-of-sample:
B I A S : 1 n i = 1 n ( C ^ i C i ) M A E : 1 n i = 1 n | C ^ i C i | M S E : 1 n i = 1 n ( C ^ i C i ) 2
Because the focus of the paper is correcting the bias of PBS, we will start by focusing on the bias. We first see in Table 2 that standard PBS (PBS IV) always has a considerably smaller bias than the PBS mean. This simply tells us, unsurprisingly, that a model that assumes that the volatility surface is flat (i.e., PBS mean) leads to severely biased prediction of option prices. However, we also see that the bias of PBS with smearing is–again in every case—considerably lower than the bias of standard PBS. This is exactly the result we set out to verify.
Turning to NLLS, we see that the bias of NLLS is also lower than that of standard PBS, but the bias of NLLS is larger than that of PBS with smearing in every case but one. Broadly, on the criterion of unbiasedness, PBS with smearing is the best method of the ones considered.
Clearly we are not only interested in the bias—we are also interested in precision. Standard measures of precision are MAE and MSE defined in (17) above. In Table 2, we see that the MSE of PBS with smearing is—yet again in every case–lower than that of standard PBS. NLLS, when used to predict in-sample, tends to have the smallest MAE and MSE. However, in the perhaps more important setting of out-of-sample prediction, PBS with smearing has the smallest MAE and MSE.
Overall, we may conclude that, of the prediction methods considered, PBS with smearing is preferred because it is the closest to being unbiased, and it also has the highest precision when used to predict out-of-sample.
Finally, we can use the Monte Carlo experiment to verify that NLLS estimates the incorrect IV surface, as postulated in Section 4. In Section 5, in the context of a single sample, the Hausman test was used to establish that the NLLS estimate was systematically different from the estimate from log-linear regression. In the context of the Monte Carlo study, a more direct test is possible. We apply Wald Testing procedure6 to test the null hypothesis that the parameters estimated by NLLS equal the “true” parameters of the IV equation which appear in (16). Such a test is performed for each replication, and then we compute the proportion of the tests that result in rejection of the null hypothesis. The results are presented in Table 3.
For good measure, we conduct a similar test (F-test) using the estimates from the log-linear regression model. The first column contains the proportion of rejections of this test, and the numbers in this column simply tells us that the F-test has actual size close to nominal size (0.05), whether or not the error is normally distributed. The second column contains the proportion of rejections of the Wald Test when NLLS is used. These numbers are clearly much higher, and tell us that in nearly half of the simulated samples, there was sufficient evidence to conclude that the NLLS estimates were significantly different from the true values. These numbers can be interpreted in terms of test power. Clearly power is limited by the low sample size of 52, and power could be increased by raising the sample size.
The numbers in the second column of Table 3 tell us that it is reasonably likely that the NLLS estimates will be significantly different from the “true” parameters of the IV equation, even with a modest sample size.

7. Conclusions

This paper has, for the first time, addressed the problem of bias in the standard application of the Practitioner Black-Scholes method. The bias arises from the fact that the method uses a non-linear transformation of predictions from a linear regression model. The method used to correct this bias is the smearing method introduced by Duan (1983). The smearing method is non-parametric and has been proved by Duan to yield consistent predictions under a range of distributional assumptions.
The Monte-Carlo method has been used to verify that the application of smearing to the PBS method does indeed correct the bias. The bias of the predictions from PBS with smearing is considerably lower than that of standard PBS, both in-sample and out-of-sample, and we have verified that this result is robust to distributional assumptions.
The Monte Carlo approach is clearly very useful for addressing the type of question of interest here. This is partly because the “true” model determining option prices is known, enabling clear measurement of the prediction bias from the models of interest.
It must be said that NLLS applied to option prices is another approach that performs well on the criteria considered. However, in Section 4 we raised a separate problem with NLLS—incoherency. This problem arises because the manner in which the random error term is introduced causes the option prices to be inconsistent with the implied volatilities appearing in the same model. This essentially means that the implied volatility surface estimated under NLLS is incorrect. This was verified in the Monte Carlo study. However, despite this problem, NLLS results in option price predictions that are close to being unbiased and relatively precise.
This leads us to the recommendation that if the implied volatility surface is the focus of the analysis, NLLS on option prices should be avoided, and log-linear regression should be used. If the focus is instead on accurate prediction of option prices, NLLS can reliably be used. However, log-linear PBS with smearing has been shown to perform even better under most of the situations considered.

Author Contributions

Conceptualisation and Methodology: P.G.M.; Data Curation, Software and Formal Analysis: Y.Y.; Writing: P.G.M. and Y.Y.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Data Availability Statement

The Data that support the findings of this study are openly available in Appendix A of the paper.

Appendix A. Data Used for Single Day Estimation in Section 5

Table A1. Data on 51 call options traded on 27 July 2000. C is price of option (mid-point between bid and ask); S is current price of underlying (S&P 500 index); K is strike price; rf is risk-free rate; tau is time to expiry in years; IV is implied volatility. Data Source: https://optionmetrics.com/data-products/.
Table A1. Data on 51 call options traded on 27 July 2000. C is price of option (mid-point between bid and ask); S is current price of underlying (S&P 500 index); K is strike price; rf is risk-free rate; tau is time to expiry in years; IV is implied volatility. Data Source: https://optionmetrics.com/data-products/.
tDateExpirydateCSKrftauiv
127 July 200016 December 200040.51449.6215500.05980.38904110.18459306
227 July 200016 September 20002.8751449.6216000.05980.1397260.16379156
327 July 200019 August 20008.1251449.6215000.05980.06301370.16414911
427 July 200016 December 200071449.6217000.05980.38904110.16543757
527 July 200019 August 200012.6251449.6214850.05980.06301370.16855563
627 July 200019 August 2000157.1251449.6213000.05980.06301370.29000379
727 July 200019 August 20001331449.6213250.05980.06301370.26658718
827 July 200019 August 200026.3751449.6214550.05980.06301370.18138442
927 July 200016 September 200014.751449.6215250.05980.1397260.17296152
1027 July 200016 September 200047.251449.6214500.05980.1397260.19113849
1127 July 200019 August 200018.6251449.6214700.05980.06301370.1738262
1227 July 200016 September 200036.1251449.6214700.05980.1397260.18486601
1327 July 200021 October 20006.251449.6216250.05980.23561640.16558332
1427 July 200019 August 2000109.1251449.6213500.05980.06301370.24189121
1527 July 200019 August 20001.6251449.6215450.05980.06301370.15661302
1627 July 200019 August 200011.1251449.6214900.05980.06301370.16827447
1727 July 200016 December 200031.751449.6215750.05980.38904110.18067305
1827 July 200019 August 200014.51449.6214800.05980.06301370.17055967
1927 July 200019 August 20003.68751449.6215250.05980.06301370.1615991
2027 July 200017 March 200161.3751449.6215750.05980.63835610.19218853
2127 July 200016 December 200013.51449.6216500.05980.38904110.17021512
2227 July 200019 August 20001.31251449.6215500.05980.06301370.15584699
2327 July 200016 September 200063.6251449.6214250.05980.1397260.20124521
2427 July 200016 September 200038.8751449.6214650.05980.1397260.18695045
2527 July 200019 August 200016.51449.6214750.05980.06301370.17231101
2627 July 200017 March 20015.1251449.6218500.05980.63835610.16320811
2727 July 200016 December 200062.6251449.6215000.05980.38904110.19390107
2827 July 200016 September 2000231449.6215000.05980.1397260.17839537
2927 July 200019 August 200031.8751449.6214450.05980.06301370.18350645
3027 July 200016 September 20001.51449.6216250.05980.1397260.1619217
3127 July 200019 August 2000351449.6214400.05980.06301370.18568564
3227 July 200016 December 20004.751449.6217250.05980.38904110.16223686
3327 July 200016 December 20003.1251449.6217500.05980.38904110.15914554
3427 July 200016 December 20009.51449.6216750.05980.38904110.16608682
3527 July 200016 September 200019.3751449.6215100.05980.1397260.17603378
3627 July 200019 August 20002.06251449.6215400.05980.06301370.15860899
3727 July 200016 December 200018.1251449.6216250.05980.38904110.17263715
3827 July 200019 August 200029.1251449.6214500.05980.06301370.18295443
3927 July 200016 September 200081.1251449.6214000.05980.1397260.20730415
4027 July 200021 October 200020.6251449.6215500.05980.23561640.17530438
4127 July 200016 September 200033.8751449.6214750.05980.1397260.1846462
4227 July 200016 September 2000100.6251449.6213750.05980.1397260.21527798
4327 July 200016 December 200024.251449.6216000.05980.38904110.1764836
4427 July 200019 August 200064.751449.6214000.05980.06301370.2074976
4527 July 200019 August 200031449.6215300.05980.06301370.15969375
4627 July 200021 October 200065.6251449.6214500.05980.23561640.19772039
4727 July 200016 September 20005.1251449.6215750.05980.1397260.16519108
4827 July 200016 September 20008.8751449.6215500.05980.1397260.16814606
4927 July 200019 August 200045.251449.6214250.05980.06301370.19296223
5027 July 200017 March 200135.6251449.6216500.05980.63835610.18229888
5127 July 200016 September 200041.51449.6214600.05980.1397260.18801008

Appendix B

The STATA code required to apply smearing to PBS is as follows. The first three commands perform the (log-linear) IV regression and obtain the fitted values (log_ivhat), and the residuals (uhat). The main part of the code is a loop over the sample (using the forvalues command). The predicted option price is stored in the variable pbs_smearing.
 
regress log_iv K K2 tau tau2 Ktau
 
predict log_ivhat, xb
predict uhat, resid
 
gen d1=.
gen d2=.
gen pbs_smearing=.
 
quietly{
 
local N=_N
forvalues j=1(1)‘N’ {
scalar log_ivhat_temp=log_ivhat in ‘j’
scalar S_temp=S in ‘j’
scalar K_temp=K in ‘j’
scalar r_temp=r in ‘j’
scalar tau_temp=tau in ‘j’
 
replace d1=(ln(S_temp/K_temp)+(r_temp+(exp(log_ivhat_temp+uhat))^2/2) ///
*tau_temp)/(exp(log_ivhat_temp+uhat)*sqrt(tau_temp))
 
replace d2=d1-(exp(log_ivhat_temp+uhat))*sqrt(tau_temp)
 
egen w=mean(S_temp*normal(d1)-exp(-r_temp*tau_temp)*K_temp*normal(d2))
 
quietly replace pbs_smearing=w if _n==‘j’
drop w
}
}

References

  1. Andreou, Panayiotis C., Chris Charalambous, and Spiros H. Martzoukos. 2014. Assessing the performance of symmetric and asymmetric implied volatility functions. Review of Quantitative Finance and Accounting 42: 373–97. [Google Scholar] [CrossRef]
  2. Bakshi, Gurdip, Charles Cao, and Zhiwu Chen. 1997. Empirical performance of alternative option pricing models. The Journal of Finance 52: 2003–49. [Google Scholar] [CrossRef]
  3. Christoffersen, Peter, and Kris Jacobs. 2004. The importance of the loss function in option valuation. Journal of Financial Economics 72: 291–318. [Google Scholar] [CrossRef] [Green Version]
  4. Duan, Naihua. 1983. Smearing estimate: A nonparametric retransformation method. Journal of the American Statistical Association 78: 605–10. [Google Scholar] [CrossRef]
  5. Duan, J., Genevieve Gauthier, J. Simonato, and Caroline Sasseville. 2006. Approximating the GJR-GARCH and EGARCH option pricing models analytically. Journal of Computational Finance 9: 41. [Google Scholar] [CrossRef]
  6. Dumas, Bernard, Jeff Fleming, and Robert E. Whaley. 1998. Implied volatility functions: Empirical tests. The Journal of Finance 53: 2059–106. [Google Scholar] [CrossRef]
  7. Hausman, Jerry A. 1978. Specification tests in econometrics. Econometrica 46: 1251–71. [Google Scholar] [CrossRef]
  8. Heston, Steven L., and Saikat Nandi. 2000. A closed form GARCH option pricing model. The Review of Financial Studies 13: 585–625. [Google Scholar] [CrossRef]
  9. Hull, John. 2011. Fundamentals of Futures and Options Markets, 7th ed. London: Pearson. [Google Scholar]
  10. Yin, Y. 2018. Flexible Option Valuation Methods. Unpublished M.Phil. thesis, University of East Anglia, Norwich, UK. [Google Scholar]
1
Examples of option pricing models for which a comparison against the benchmark of PBS is likely to be very useful are: Bakshi et al.’s (1997) SVSI-J model which allows volatility, interest rates, and jumps all to be stochastic; Heston and Nandi’s (2000) model built on the assumption of a GARCH process in the underlying index; and Duan et al.’s (2006) models built on even more general assumptions such as GJR-GARCH (TGARCH) and EGARCH.
2
The Black-Scholes formula (Hull 2011) also contains as arguments: current price of the underlying; strike price; time-to-expiry, risk-free rate. Since the focus here is on volatility, we suppress these arguments and express the option value as a function of only the volatility of the underlying, σ.
3
The log-linear form (5) is clearly a new and untested class of implied volatility function. However, since ln(·) is a monotonically increasing function, any non-monotonic pattern in the original IV function (such as the well-known “volatility smile”) is obviously reproduced (albeit with a different shape) when the log of IV is used. Other authors (e.g., Andreou et al. 2014) address the negativity problem by applying a lower bound on predicted IV, using for example m a x ( σ ( θ ^ ) , 0.01 ) in place of σ ( θ ^ ) in (4). On Monte Carlo evidence, we find that this approach tends to exacerbate biases in option values. For this reason, and also because the lower-bound correction is ad-hoc, we prefer the log-linear approach.
4
The data is extracted from OptionMetrics http://www.optionmetrics.com/.
5
For the positively skewed error, we simulate ε i = 0.0282 × ( χ 2 ( 3 ) 3 ) / 6 . For the negatively skewed error, we apply the same formula with the sign reversed. Note that these skewed distributions have the same mean and variance as the normal error N(0, 0.02822) used for the symmetric case.
6
The Wald test can be applied following estimation of a NLLS model in STATA by using the test command immediately after the nl command.
Figure 1. Predicted IV against strike price from Standard Practitioner Black-Scholes (PBS) (Model 2 in Table 1) and Log-linear PBS (Model 3 in Table 1). τ and τ2 fixed at sample means. Vertical line drawn at current underlying price.
Figure 1. Predicted IV against strike price from Standard Practitioner Black-Scholes (PBS) (Model 2 in Table 1) and Log-linear PBS (Model 3 in Table 1). τ and τ2 fixed at sample means. Vertical line drawn at current underlying price.
Jrfm 12 00157 g001
Figure 2. Histogram of residuals from log-linear model (Model 3 in Table 1). Normal density superimposed.
Figure 2. Histogram of residuals from log-linear model (Model 3 in Table 1). Normal density superimposed.
Jrfm 12 00157 g002
Table 1. Estimation Results. Implied volatility (IV) equations from various models: (1) Linear regression with IV as dependent variable and no explanatory variables; (2) Linear regression with IV as dependent variable and standard set of explanatory variables; (3) log-linear regression with log(IV) as dependent variable and standard set of explanatory variables; (4) non-linear least squares estimation of log(IV) equation with option price as the dependent variable. Hausman test tests for systematic difference between estimates from Models 3 and 4. Data is from 51 options traded on 27 July 2000 (See Appendix A for complete data set).
Table 1. Estimation Results. Implied volatility (IV) equations from various models: (1) Linear regression with IV as dependent variable and no explanatory variables; (2) Linear regression with IV as dependent variable and standard set of explanatory variables; (3) log-linear regression with log(IV) as dependent variable and standard set of explanatory variables; (4) non-linear least squares estimation of log(IV) equation with option price as the dependent variable. Hausman test tests for systematic difference between estimates from Models 3 and 4. Data is from 51 options traded on 27 July 2000 (See Appendix A for complete data set).
VariableModel 1Model 2Model 3Model 4
IV Mean OnlyIV (Standard PBS)Log(IV) (Log-Linear PBS)NLLS (Log IV)
K −0.00414 ***−0.0168 ***−0.00626 ***
(−8.91)(−8.85)(−4.55)
K2 0.00000131 ***0.00000517 ***0.00000159 **
(7.98)(7.72)(3.27)
τ 1.104 ***3.978 ***−0.256
(4.34)(3.83)(−0.38)
τ2 0.0262−0.136−0.421 **
(0.49)(−0.62)(−3.48)
K * τ −0.000668 ***−0.00222 **0.000594
(−3.76)(−3.06)(1.24)
constant0.182 ***3.426 ***11.72 ***4.003 ***
(50.33)(10.42)(8.73)(4.10)
σ0.02590.006920.0282
n51515151
R200.9360.954
Hausman χ2(5) (p-value) 516.0
(0.0000)
t statistics in parentheses; * p < 0.05, ** p < 0.01, *** p < 0.001.
Table 2. Monte Carlo Results. Models correspond to Models 1–4 in Table 1. The data generating process (DGP) combines Equation (15) and design matrix presented in Appendix A. Distribution of error is explained in footnote 5. There are 10,000 Replications for each result set. Means over 10,000 replications taken: bias (of predicted option prices); mean absolute error (MAE); mean squared error (MSE). Results were obtained using simulate command in STATA with random number seed 87654321.
Table 2. Monte Carlo Results. Models correspond to Models 1–4 in Table 1. The data generating process (DGP) combines Equation (15) and design matrix presented in Appendix A. Distribution of error is explained in footnote 5. There are 10,000 Replications for each result set. Means over 10,000 replications taken: bias (of predicted option prices); mean absolute error (MAE); mean squared error (MSE). Results were obtained using simulate command in STATA with random number seed 87654321.
(1) PBS MEAN(2) PBS IV (Standard PBS)(3) PBS LogIV SMEARING(4) NLLS
Symmetric error:
In sample:
BIAS0.09720.01270.0005−0.0031
MAE2.41020.67140.64280.6324
MSE9.57590.90860.82620.7435
Out of sample:
BIAS0.09370.0092−0.0033−0.0067
MAE2.41470.76400.74130.7681
MSE9.62381.27871.19511.2787
Negatively skewed error:
In sample:
BIAS0.09650.01290.0007−0.0028
MAE2.40810.62690.59830.5891
MSE9.51110.86200.77980.7017
Out of sample:
BIAS0.09790.01430.0018−0.0015
MAE2.41320.70960.68910.7118
MSE9.55511.20641.12901.1948
Positively skewed error:
In sample:
BIAS0.09740.01270.0005−0.0033
MAE2.41370.65100.61880.6113
MSE9.62820.96290.87860.7908
Out of sample:
BIAS0.09590.0112−0.0013−0.0049
MAE2.41760.73850.71190.7120
MSE9.67081.35161.26231.3573
Table 3. Further Monte Carlo Results. Proportion of replications in which null hypothesis is rejected by Wald Test. Null Hypothesis is that each parameter equals its true value under the DGP. Nominal test size 0.05, with 10,000 replications.
Table 3. Further Monte Carlo Results. Proportion of replications in which null hypothesis is rejected by Wald Test. Null Hypothesis is that each parameter equals its true value under the DGP. Nominal test size 0.05, with 10,000 replications.
Distributional AssumptionP(Reject H0) with Log-Linear RegressionP(Reject H0) with NLLS
Symmetric error0.0520.473
Negatively skewed error0.0530.396
Positively skewed error0.0540.399

Share and Cite

MDPI and ACS Style

Yin, Y.; Moffatt, P.G. Correcting the Bias in the Practitioner Black-Scholes Method. J. Risk Financial Manag. 2019, 12, 157. https://doi.org/10.3390/jrfm12040157

AMA Style

Yin Y, Moffatt PG. Correcting the Bias in the Practitioner Black-Scholes Method. Journal of Risk and Financial Management. 2019; 12(4):157. https://doi.org/10.3390/jrfm12040157

Chicago/Turabian Style

Yin, Yun, and Peter G. Moffatt. 2019. "Correcting the Bias in the Practitioner Black-Scholes Method" Journal of Risk and Financial Management 12, no. 4: 157. https://doi.org/10.3390/jrfm12040157

Article Metrics

Back to TopTop