Abstract
Modeling stock returns and option pricing in the presence of jumps remains a central challenge in financial economics. This paper employs a novel score-driven GARCH-jump model to analyze SSE (Shanghai Stock Exchange) 50 ETF returns and option pricing. The main findings are as follows. First, we use 50 ETF spot returns to estimate conditional volatility and jump intensity, and find that the SDSDJ (score-driven separate dynamic jumps) model significantly outperforms conventional GARCH-jump models in model fitting. Second, we evaluate both in-sample and out-of-sample pricing performance using data from 50 ETF options, and find that the SDSDJ model achieves the lowest in-sample pricing error among all benchmarks, while its simplified variant—the SDJ (score-driven jumps) model—delivers the most accurate out-of-sample results. Third, the superior pricing performance of both models is robust across different levels of moneyness and DTM (days-to-maturity).
MSC:
62M10; 91B84; 91G20
1. Introduction
Modeling sudden jumps in asset prices is a critical topic in finance. Jumps—typically referring to large, abrupt price changes due to significant news—help explain why asset returns often have fat-tailed distributions and why option-implied volatilities exhibit pronounced “smiles”. Empirical evidence in the developed markets such as the U.S. market shows that jumps, while infrequent, can account for a non-trivial share of return variability (on the order of 5∼7% of total variance [1]). In emerging markets such as China, stock price fluctuations tend to be even more extreme, and existing studies document episodes of abnormal jumps and higher overall volatility than in mature markets [2]. These characteristics make accurate jump modeling particularly important for the Chinese stock market, as it deepens our understanding of return dynamics and improves option valuation in this market.
The Chinese stock market did not introduce exchange-traded equity options until 2015, when the Shanghai Stock Exchange launched the SSE 50 ETF option. Since then, the Chinese options market has expanded rapidly: By 2021, the average daily trading volume reached approximately 2.6 million contracts, making the SSE 50 ETF option one of the most actively traded ETF options. The annual trading volume increased from about 23 million contracts in 2015 to more than 300 million in 2024, reflecting the growing demand for hedging, arbitrage, and speculative activities. In particular, the introduction of SSE 50 ETF options has also impacted the underlying market. For example, Arkorful et al. [3] found that the introduction of these options significantly decreases the volatility of the SSE 50 ETF itself, as option trading attracts informed traders and improves the flow of information on the spot market. The rapid development of the Chinese options market provides a rich data source and a timely opportunity to investigate return-jump and volatility modeling, as well as option pricing, under a new market regime.
Despite the rapid growth of the SSE 50 ETF options markets, most empirical research on return modeling and option pricing focuses on U.S. indices (such as the S&P 500). The Chinese market differs in several important ways that warrant dedicated investigation [4,5]. First, the Chinese equity market is dominated by retail investors and is periodically shaped by regulatory interventions, resulting in regime shifts and clusters of large price movements. Second, capital controls and other restrictions can cause liquidity shocks around policy events, for example, daily price limits may also induce discontinuities. Third, the SSE 50 ETF options are physically delivered contracts that are adjusted for dividends—a feature uncommon in standard index options. Furthermore, the number of available strike prices is limited, and the cost of short-selling the underlying assets is much higher than in U.S. markets. Finally, implied volatility in the SSE 50 ETF options market is generally higher than in the U.S. These distinctive market characteristics highlight the limitations of traditional models and motivate the usage of context-specific approaches to volatility and jump modeling in the Chinese equity market.
A wide range of models have been developed to capture asset volatility dynamics in both continuous and discrete-time frameworks. Under the classic continuous-time framework, the stochastic volatility model proposed by Black and Scholes [6] and Heston [7], laid the theoretical foundation for option pricing under diffusion processes. On the discrete-time side, GARCH models have been proven to be very successful in capturing time-varying volatility. Building on this success, Duan [8] pioneered the GARCH option pricing model, showing that a GARCH-based approach can parsimoniously track time-varying volatility and even explain certain systematic pricing biases of the Black–Scholes model. Subsequently, Heston and Nandi [9] derived a closed-form option valuation formula for a GARCH process, providing the first convenient analytical pricing formula for a stochastic volatility model that can be estimated and implemented solely based on observables. Recently, Mozumder et al. [10] proposed a practitioner version of Heston and Nandi’s model.
However, empirical findings document that asset returns are occasionally disturbed by large jumps triggered by news or unexpected events. A growing body of literature links jumps to the arrival of big news (e.g., [2,11,12,13,14]). For instance, using an extensive news dataset, Jeon et al. [14] show that stock price jumps (in both frequency and size) are strongly related to news flows and their content, especially for firms with higher media visibility, analyst coverage, and institutional ownership. These evidences reinforce the view that jumps are not merely statistical outliers but are often tied to fundamental information events. To accommodate these discontinuities in returns, researchers have extended volatility models to include jump components in returns. In discrete time, numerous GARCH-jump models have been proposed for equities and exchange rates (e.g., [11,15]), where the return innovation has a mixture of a GARCH-driven continuous part and a jump-driven discontinuous part. These models have also been adapted to option pricing. For example, Duan et al. [16] developed approximation methods for GARCH-jump models, while Christoffersen et al. [17] estimated a model with time-varying jump intensity to price S&P 500 options. A consistent finding in the literature is that incorporating jumps improves pricing accuracy, and models with dynamic jump intensity outperform their static counterparts. For example, Christoffersen et al. [18] and Christoffersen et al. [17] verify that models allowing for jumps (especially with time-varying intensity) outperform those assuming purely continuous diffusion.
While GARCH and GARCH-jump-based models have been highly useful and progressed significantly in modeling returns [19,20] or option pricing [10,20], they update volatility using heuristic rules (e.g., lagged squared returns), which raise robustness concerns. This has motivated the development of score-driven volatility models—also known as dynamic conditional score (DCS) models—by Creal et al. [21] and Harvey [22]. Score-driven models take a different route: time-varying parameters (such as volatility or jump intensity) are updated each period using the score (i.e., the gradient) of the conditional log-likelihood, rather than ad hoc functions of past residuals. This generalized filtering mechanism guarantees that updates incorporate new information exactly as prescribed by the likelihood. Score-driven models have been developed rapidly [23] due to their flexibility in modeling time-varying parameters—such as volatility, skewness, kurtosis, tail risk, and correlations—through efficient likelihood-based updating. Incorporating the score-driven approach into option pricing models may improve the accuracy of option pricing, since it typically improves the volatility modeling—an input that directly determines option prices. In view of this, Ballestra et al. [24] recently proposed a score-driven GARCH-jump model, i.e., the SDSDJ (score-driven separate dynamic jumps) model, where both the conditional variance and the conditional jump intensity evolve according to auto-regressive updates driven by scaled score innovations. Their empirical application to S&P 500 returns and options demonstrates an excellent in-sample fitting and significant out-of-sample improvements, surpassing traditional GARCH-jump benchmarks. In particular, the score-driven model can accurately capture returns’ jump clustering and volatility dynamics, resulting in more reliable pricing of the S&P 500 options. These results suggest that score-driven jump models can offer a powerful tool for option valuation.
Unlike the extensive studies on the U.S. market, relatively few studies have examined option pricing in the Chinese market. For example, Huang et al. [5] compared discrete-time models on SSE 50 ETF options and found that specifications incorporating realized-volatility measures outperform standard GARCH models based solely on daily returns. Yang [25] introduced a GARCH option pricing model that includes a double-exponential jump component, and empirical results show that the model improves the fit on SSE 50ETF returns and options and provides relatively better in- and out-of-sample pricing performance. Recently, Chi et al. [26] evaluated GARCH-type option pricing models in the Chinese SSE 50 ETF market and find that these models (and their extensions) deliver superior in-sample fit, GARCH-based forecasts even outperform option-implied volatility in predicting realized volatility out of sample. However, to the best of our knowledge, the performance of score-driven GARCH-jump models for the Chinese stock market has not yet been investigated. In light of the unique market characteristics discussed above, as well as the good performance of score-driven models in other contexts, it is natural to investigate whether these models can enhance return filtering and option pricing in the SSE 50 ETF market.
This paper employs a novel score-driven GARCH-jump model to fit SSE (Shanghai Stock Exchange) 50 ETF returns and option pricing. The main findings are as follows. First, we use 50 ETF spot returns to estimate conditional volatility and jump intensity, and find that the SDSDJ model significantly outperforms conventional GARCH-jump models in model fitting. Second, we evaluate both in-sample and out-of-sample pricing performance using data from 50 ETF options, and find that the SDSDJ model achieves the lowest in-sample pricing error among all benchmarks, while its simplified variant—the SDJ (Score-driven jumps) model—delivers the most accurate out-of-sample results. Third, the superior pricing performance of both models is robust across different levels of moneyness and DTM (days-to-maturity).
The marginal contributions of this paper are threefold. First, we investigate the performance of score-driven GARCH-jump models in modeling SSE 50 ETF returns and option pricing in the Chinese stock market. Our study not only enriches existing studies on option pricing in the emerging markets, such as the Chinese stock market [5,25,26], but also provides evidence on jump behavior in the Chinese stock market [2,25]. Second, we obtain a novel out-of-sample finding. Although the empirical studies in Ballestra et al. [24] have shown that the dynamic score-driven GARCH-jump model outperforms its static counterpart for the U.S. stock market, rigorous out-of-sample option pricing evidence remains scarce. Using data from the Chinese market, we find that the SDJ model performs better than the SDSDJ model for out-of-sample option pricing. This empirical result suggests that, in some cases, the reduced generalization ability of dynamic GARCH-jump modeling may outweigh its flexibility, thereby allowing more parsimonious models to achieve superior performance. Last but not least, we also enrich the empirical applications of score-driven models in return modeling and option pricing.
The remainder of this article is organized as follows. Section 2 provides the data and an overview of the SSE 50 ETF options market. Section 3 introduces the methodology, including the SDSDJ and SDJ models, used in this paper. Section 4 presents and discusses the empirical results. Section 5 concludes the paper.
2. The Data and Features of the SSE 50 ETF Option Market
The underlying asset of the SSE (Shanghai Stock Exchange) 50 ETF options is the 50 ETF issued by Hua Xia Fund Asset Management Company. Covering the 50 largest high-liquidity blue-chip stocks listed on the SSE, the SSE 50 ETF accounted for approximately of the total market value (i.e., over trillion RMB) of all 1037 equity ETFs in China by the end of 2024. In this study, we use the daily returns of the SSE 50 ETF from 23 February 2005 to 28 October 2024, and the daily settlement prices of the 50 ETF option contracts from 9 February 2015, to 28 October 2024, as the sample. The data is sourced from the Wind Database, which is a reliable and widely used financial database in China.
Figure 1a plots the log-return time series of the 50 ETF, and Panel A of Table 1 gives its descriptive statistics. Apparently, we can find that the 50 ETF returns are highly volatile; there exists evidence of time-varying jumps, that jumps tend to cluster over time, and that the returns are characterized by a sharp peak and fat tails.
Figure 1.
Log-return of SSE 50 ETF and trading volumn and open interest of 50 ETF options. (a) The log-return of 50 ETF; (b) trading volume and open interest of 50 ETF options, where the blue bars represent trading volume (in tens of thousands) and the red line represents open interest (in tens of thousands).
Table 1.
Descriptive statistics.
The 50 ETF option was introduced on 9 February 2015 as a European-style, physically settled option written on the 50 ETF. Panel B of Table 1 gives the number of options sorted by moneyness and DTM, and the basic contract terms are provided in Table 2. As illustrated in Figure 1b, both daily trading volume and open interest increased dramatically from 2015 to 2024. Panel A of Table 3 lists the proportion of trading volume by option category. The following can be observed: First, ATM (at-the-money) and OTM (out-of-the-money) options account for more than of the total trading volume, while ITM (in-the-money) options account for less than . Second, short- and medium-term options with DTM (days-to-maturity) of less than 60 days comprise over of the trading volume, whereas long-term options account for less than . Therefore, in line with the literature (e.g., [24]), we restrict our selection to ATM and OTM options with maturities of up to one year (250 days) for pricing, as shown in Panel B of Table 1.
Table 2.
Basic terms of the SSE 50 ETF option contracts.
Table 3.
Trading volume and implied volatility by moneyness and maturity.
Compared to their U.S. counterparts, SSE 50 ETF options exhibit several notable differences in contract specifications, regulatory practices, and market behavior. First, the number of strikes available is limited. At inception, the 50 ETF options offered only five strikes (one ATM, two OTM, and two ITM), which expanded to nine strikes (one ATM, four OTM, and four ITM) by early 2018. New strikes are added upon dividend adjustments, and seventeen such adjustments have been made since the launch of this product.
Second, option contracts are adjusted when dividends are paid on the underlying ETF. Specifically, if the ex-dividend price of the ETF is S and the cash dividend is d, the adjustment provides each holder with new contracts at a strike price of for each original contract at strike K. Take call options as an example: when dividends are paid, the stock price decreases and the value of the call option is eroded; by adjusting the exercise price, the call option buyer can be protected. This leads to the following pricing identity:
This mechanism effectively shields option holders from dividend-induced price drops—a feature not standard for index options such as SPX.
Finally, the Chinese market experiences higher price volatility [5]. For example, the average realized 10-day volatility of the 50 ETF from February 2015 to December 2024 was approximately , compared to about for the S&P 500 (SPX). Moreover, by February 2018, the volatility premium (defined as VIX minus realized volatility) for the 50 ETF was approximately higher than that of SPX options.
These observations highlight fundamental differences between the Chinese and U.S. options markets, suggesting that pricing models useful for the U.S. may not be directly applicable to the Chinese context.
3. The Methodology
3.1. The SDSDJ and SDJ Models
We first introduce the SDSDJ model. Let denote the daily closing price series of SSE 50 ETF, then the log-return is , including the dividend . Following Christoffersen et al. [17] and Ballestra et al. [24], we use affine building blocks to model the log-return as follows:
where r is the risk-free rate, is the conditional variance, and is the (expected) number of jumps per day. The terms and are risk premia for diffusion and jump risks under the physical measure, while the compensators and ensure that the conditional gross return has the expected value . Specifically, the diffusion shock is standard normal, and is defined as a compound Poisson process:
where is the number of jumps arriving between times and t, and the jump size is a sequence of independent and identically distributed normal random variables, , as shown in Equation (3).
From an economic perspective, Equation (2) decomposes returns into a predictable risk-compensation part and two orthogonal surprise components: a continuous shock of variance and a rare-event jump . This specification is consistent with the work of Maheu and Curdy [11], which proposes that an unobserved news process directs movements in prices, while the latent news process is assumed to consist of two distinct components—normal and unusual news events—which are identified by their effects on return volatility.
The key features of the SDSDJ model is that both the conditional volatility and conditional intensity are updated by a score-drivenfilter [21,22], which uses the score (the gradient of the predictive log-density) as the optimal direction in which to adjust latent states after each new observation. We use exponential links, and , to enforce and , and update
where and are (scaled) scores of the one-step predictive density, as shown in Equation (A3). and represent own-score weight for volatility and jump intensity, while score weight and allow cross-effects (e.g., a burst of jump-like price moves can temporarily raise diffusive volatility, and vice versa), and control persistence, and anchor long-run levels. The unknown parameter vector can be estimated by maximizing (Equation (A1)).
It is remarked that the SDJ model is a special case of the SDSDJ model, where the jump intensity is a constant () rather than time-varying (). This is useful when the jump intensity is believed to be stable over the sample.
3.2. Option Valuation
In order to price the SSE 50 ETF options, risk neutralization is needed to convert the physical measure (real-world measure, ) to the risk-neutral measure (equivalent martingale measure, ), which removes risk premia from returns and tilts the jump component to ensure discounted prices are martingales [17,24], then the log-return process and the iterative process under -measure are given by
where , is a compound Poisson process, whose jump intensity is and jump size follows , in which is the equivalent martingale measure (EMM) coefficient that capture the wedge between the physical and the risk-neutral measure. Under the -measure, and are the score functions given by Equation (A5).
Given current states and parameters, we obtain model-implied option values by simulating the risk-neutral return process and discounting expected payoffs as
where is the option price, is the payoff of the option at maturity, and is the information set available up to time t.
Fitting a dynamic model to a panel of option prices could be challenging, especially when the goal is to jointly estimate parameters using both returns and option prices [24]. Since it is important to consider both sources of information, we define a suitably weighted log-likelihood function that incorporates both the daily time series of log-returns and the panel of option contracts. While the log-likelihood for the daily returns, , is already provided in Equation (A1), modeling the option pricing errors requires additional assumptions. Formally, if we have N option prices, the option pricing error for the i-th contract, , for , is defined as the relative implied volatility loss function:
where and denote the market- and model-implied volatilities of the i-th option, respectively, where the latter is computed according to the popular Black–Scholes model. Following Christoffersen et al. [17] and Ornthanalai [27], the likelihood associated with the error on implied volatility is computed through using a Gaussian specification:
It is important to note that the number N of data points available for the option panel is different from the number T of daily returns. We follow Christoffersen et al. [17] and Ornthanalai [27], assign an equal weight to returns and option prices, and consider the following weighted joint (total) log-likelihood:
To assess the performance of the option pricing models, we follow Ballestra et al. [24], and employ the (percentage) IVRMSE (Implied Volatility Root Mean Square Error):
where N is the total number of option prices considered, and and denote the market and the model-implied volatilities of the i-th option, respectively.
3.3. Benchmark Models
To demonstrate the advantage of the score-driven GARCH-jump models, we choose the Black–Scholes [6], Heston and Nandi GARCH [9], GARCHJ (GARCH model with jumps [18]), and DVSDJ (dynamic volatility separate dynamic jumps [17]) as benchmark models.
Since the Black–Scholes model is so well known and widely used, we will not introduce it here. The Heston and Nandi GARCH model uses a closed-form solution as follows, rather than numerical methods (see Equation (A7)), where the underlying asset follows the following process:
where r is the continuously compounded interest rate for the time interval , is a standard normal disturbance, and is the conditional variance of the log return between and t.
The GARCHJ model incorporates static jumps (i.e., Poisson process) into the GARCH option pricing model, whose return process is the same as Equation (2) and the iterative process is given by
where is the conditional variance, and are the normal and jump components of shocks for returns, respectively, and are parameters to be estimated.
The DVSDJ model is the generalized version of the GARCHJ model, where the jump intensity varies over time. The return process of the DVSDJ model is the same as Equation (2), and the iterative process is given by
where is the conditional variance, is the dynamic jump intensity, and are the normal and jump components of shock, respectively, and are parameters to be estimated.
4. Empirical Results
4.1. In-Sample Fitting of Spot Returns
We use different GARCH-type models (H-N GARCH, GARCHJ, SDJ, DVSDJ, and SDSDJ model) to filter the conditional volatility and jump intensity of the 50 ETF spot returns. Since different models imply different conditional volatility and jump intensity for the same market, our objective is to identify which model achieves superior in-sample fitting.
Table 4 presents the maximum likelihood estimation results for these models. First, three different criteria—log-likelihood value, AIC, and BIC—are used to evaluate in-sample model fitting. From the table, we can find that, among all GARCH-type models, the novel SDSDJ model achieves the highest log-likelihood and the lowest AIC and BIC. Interestingly, the SDJ and DVSDJ models perform even worse than the plain vanilla Heston–Nandi GARCH model. Therefore, we demonstrate that score-driven GARCH-jump models provide better in-sample fitting. Second, as shown in the table, we find that none of the GARCH-type models produce significant coefficient estimates for , whereas the S&P 500 market exhibits a significant jump risk premium (e.g., ; see [24]). However, is significantly positive across models, indicating that investors are compensated for bearing the normal component of risk. Third, the jump size (denoted by ) of log-returns in the 50 ETF market is relatively small. As shown in the table, although all estimated values are negative—reflecting negatively skewed returns—their absolute magnitudes are small, in contrast to jump sizes of around observed in the S&P 500 market. Lastly, similar to the S&P 500 market, there exists significant interaction between the normal and jump components for the SSE 50 ETF market. The relative magnitudes of and reflect the interaction between jump intensity and conditional variance. If , the interaction is considered significant [24], which is also confirmed by the estimates in the table.
Table 4.
Maximum likelihood estimation results for the spot return of 50 ETF.
Figure 2 plots the filtered conditional volatility for different models. As illustrated in Panel (a)–Panel (e), the 50 ETF has experienced two episodes of extreme volatility: the 2008 financial crisis and the 2015 Chinese stock market crash. Although the traditional GARCH model captures all market turbulence, its volatility peaks are not pronounced, exhibiting a relatively smooth pattern, and the same applies to the static GARCHJ model (Panel (b)). In contrast, the SDJ model (Panel (e)) and the SDSDJ model (Panel (e)) provide more significant peaks in modeling the conditional volatility of the 50 ETF.
Figure 2.
This figure illustrates the time series plots of filtered conditional volatility. The sample period is from 23 February 2005 to 28 October 2024. (a) Heston and Nandi GARCH model; (b) GARCHJ model; (c) DVSDJ model; (d) SDJ model; (e) SDSDJ model.
Figure 3 presents the filtered jump intensity for different GARCH-jump models. Clearly, models with dynamic jump intensity (i.e., DVSDJ and SDSDJ) provide more significant peaks than the static models (i.e., GARCHJ and SDJ). One possible reason is that static models use constant jump intensities, making it difficult to distinguish between high- and low-jump intensity trading days. However, GARCH-jump models with dynamic jump models provide better estimates of the number of posterior jumps associated with market crises.
Figure 3.
This figure illustrates the time series plots of filtered conditional jump intensity. The sample period is from 23 February 2005 to 28 October 2024. (a) GARCHJ model; (b) DVSDJ model; (c) SDJ model; (d) SDSDJ model.
Finally, we propose a general criterion for evaluating the quality of these volatility models: to what extent can the model strip out the predictable part of the log-return and assign a truly random and unpredictable part into the normal component (). Time series models with stronger explanatory power should yield values that are closer to a standard normal distribution. Figure 4 illustrates the Q-Q plots of the normal innovation for each GARCH model. As shown in the figure, of traditional GARCH-type models deviates significantly from the standard normal distribution beyond , while of the SDSDJ model closely approximates a standard normal distribution within .
Figure 4.
This figure illustrates the Q-Q plots of the normal innovation . The sample period is from 23 February 2005 to 28 October 2024. (a) Heston and Nandi GARCH model; (b) GARCHJ model; (c) DVSDJ model; (d) SDJ model; (e) SDSDJ model.
4.2. Out-of-Sample Analysis for Spot Returns
To evaluate the predictive performance of the novel SDSDJ model and its rival models in 50 ETF spot returns, we follow Ballestra et al. [24] and forecast the 5% and 1% Value-at-Risk (VaR) from one to five days. We evaluate the predictive performance of these models across several crisis and relative stable periods by partitioning the dataset into five subperiods and conducting separate out-of-sample analyses: January 2005–August 2009 (including the 2008 global financial crisis), August 2009–December 2012 (a relatively stable period), January 2013–December 2017 (including the 2015 Chinese stock market crash), January 2018–December 2022 (including the COVID-19 pandemic), and December 2022–November 2024.
For each period, we estimate the model parameters using a rolling window with a width equal to of the total number of days in that period. That is, initially, we calibrate these models using the first of daily total returns from the SSE 50 ETF, reserving the remaining data for out-of-sample evaluations. Subsequently, for each day within the out-of-sample dataset, we re-estimate the parameters employing the latest daily return observations and generate forecasts for horizons ranging from one to five days.
Specifically, for the SDSDJ model, at each forecast date , we generate 100,000 simulated return paths for , with , based on the return specification in Equation (2). This involves randomly generating the innovations and , with and evolving according to Equation (4). We follow similar simulation procedures for the GARCHJ, DVSDJ, and SDJ models. We then calculate the VaR at significance levels of and by determining empirical quantiles from the 100,000 simulated outcomes of . Following Creal et al. [21], we evaluate the accuracy of the VaR forecast using the proportion-of-failures test proposed by Kupiec [28]. The null hypothesis for this out-of-sample test is defined as
H0.
The probability of model prediction failure equals .
Consequently, if the null hypothesis is rejected, it suggests that the model does not exhibit significant predictive power for VaR.
Table 5 reports the result of the likelihood ratio test. The empirical results indicate that the same model performs differently across high and low quantiles, and that market conditions also play a critical role in determining the model’s performance. For example, during the 2008 global financial crisis period (Panel A), at the level, the SDJ and SDSDJ models exhibit superior performance, each with only one rejection. In contrast, the GARCHJ and DVSDJ models are rejected five times each. At the level, however, each model demonstrates comparable and satisfactory performance. Interestingly, during the relatively stable period (Panel B), the SDSDJ model performs relatively better at but worse at , while the GARCHJ model exhibits the opposite pattern. During the 2015 Chinese stock market crash period (Panel C), all models perform relatively well at the level, but perform poorly at the level. The performances of these models during the COVID-19 pandemic (Panel D) period show a pattern similar to that observed in the 2008 crisis: the SDSDJ model performs best at , whereas the GARCHJ model performs best at . In the most recent period (Panel E), the SDSDJ model performs well at both the and levels.
Table 5.
Out-of-sample VaR prediction for spot returns.
Overall, the above empirical results suggest that the SDSDJ and SDJ models can effectively capture the extreme tail (e.g., ) of the conditional return distribution. By contrast, traditional GARCH-type models are better suited for modeling non-extreme returns (e.g., ) but yield notably relatively poorer out-of-sample fits when modeling extreme losses.
4.3. In-Sample Option Valuation
The GARCH-type models under -measure can be used to model the conditional volatility and jump intensity of spot returns, while those under -measure can be used to price options. Here, we apply several risk-neutralized GARCH-type models, including the SDSDJ and SDJ models to price the 50 ETF options from 23 February 2015 to 20 December 2024.
Table 6 presents the joint MLE estimation results using both spot returns and corresponding options data. Similar to the MLE estimation results for using only spot returns, the SDSDJ model achieves the highest log-likelihood value and the lowest AIC and BIC among all GARCH-type models. However, we find that the DVSDJ model performs very poorly over the time horizon, with a log-likelihood of only 21,823— lower than that of the SDSDJ model. This empirical result supports the idea that incorporating score dynamics improves the performance of GARCH-jump models in option pricing. Furthermore, we find that the jump sizes are all negative, the values of lie between 0 and 1, and the estimates of are all positive, indicating that the risk premium associated with the jump component is significantly positive. These findings are consistent with those of [17,18,24,27].
Table 6.
Joint MLE estimation results using both SSE 50 ETF spot returns and options data.
We also employ the IVRMSE (%) criterion (implied Volatility Root Mean Square Error) to evaluate the performance of these models. Table 7 reports the in-sample IVRMSE for different option pricing models, both overall and across various moneyness and maturities. Panel A indicates that, overall, the SDSDJ model delivers the best performance, with an IVRMSE of only , followed by the SDJ model () and the GARCHJ model (), while the DVSDJ model performs the worst (). Panel B shows that the pricing errors for OTM Put options are generally higher than those for OTM call options. Panel C reveals that short-term options tend to have higher pricing errors than long-term ones. As DTM increases, IVRMSE decreases—for example, the GARCHJ model performs better in the long term () than in the short term (), and similar patterns are observed for the other models. Our primary concern is whether the SDSDJ model outperforms traditional GARCH-type models across different maturities and moneyness levels. The empirical results confirm that the SDSDJ model exhibits robust and consistently superior performance, with IVRMSE ranging from to across maturities, and from to across moneyness categories. Therefore, the empirical results confirm the robustness of its performance with respect to the option moneyness and maturities.
Table 7.
In-sample IVRMSE (%) for different option pricing models.
4.4. Out-of-Sample Option Valuation
We also conduct out-of-sample analyses for option valuation. For each year, option contracts from the first half of that year are used to estimate the optimal parameters, which are then applied to price contracts in the second half of that year. The model-implied volatility is derived using the classic Black–Scholes formula. By comparing the model-implied and market-implied volatilities, we compute the out-of-sample IVRMSE (%) for each model.
Table 8 reports the out-of-sample IVRMSE (%) for different option pricing models, both overall, and across various moneyness and maturities. Panel A indicates that the SDJ model yields the lowest overall IVRMSE, while the SDSDJ model performs poorly out of sample. This may be attributed to the larger number of parameters, which can result in overfitting and consequently weaker out-of-sample performance, while more parsimonious models (such as Black–Scholes and SDJ) exhibit greater stability for out-of-sample option valuation performance. Panels B through D present results when the options sample is sorted by year, maturity, and DTM, respectively. We have the following discoveries. First, higher market volatility is usually associated with poorer out-of-sample model performance. For example, in 2020 and 2024, the IVRMSE for all models exceeds , while in 2022 and 2023, it can be as low as (for the SDJ model). This substantial variation highlights the sensitivity of GARCH-type models to market conditions. During crisis periods, the performance of option pricing models generally deteriorates; however, the SDJ model remains the most robust among them. Second, pricing accuracy is higher for at-the-money options compared to deep out-of-the-money options, and the SDJ model still performs best across all moneyness categories. Finally, all models exhibit better performance in pricing long-term options, which is consistent with the in-sample findings.
Table 8.
Out-of-sample IVRMSE (%) for different option pricing models.
To summarize, the empirical results on the out-of-sample option valuation in this section demonstrate that the score-driven GARCH-jump models outperform traditional approaches—including both GARCH-type models and the classic Black–Scholes model—in out-of-sample analyses for SSE 50 ETF options.
5. Conclusions
This paper employs a novel score-driven GARCH-jump model to analyze SSE 50 ETF returns and option pricing. Our analysis yields several key findings:
The SDSDJ (score-driven model with separate dynamic jumps) model achieves the highest log-likelihood and the lowest AIC and BIC values, indicating its superiority in-sample fitting to the 50 ETF return series. This model effectively captures both time-varying volatility and jump dynamics by utilizing score-driven filters.
According to the Kupiec likelihood ratio test, the GARCHJ (GARCH with jumps) model performs reasonably well in stable periods, but the SDSDJ and SDJ (score-driven model with jumps) models more accurately capture extreme losses in turbulent periods. This highlights the robustness of the SDSDJ framework for modeling fat-tailed returns and its value in risk management under stress.
The option pricing analysis shows that the SDSDJ model achieves the best in-sample performance across all GARCH-type models and has robust pricing accuracy across moneyness and maturities. However, its out-of-sample performance is less stable—likely due to overfitting, but its simplified variant—the SDJ model—delivers the most accurate out-of-sample results. These results imply a trade-off between model flexibility and generalizability, suggesting that while more flexible models like SDSDJ may offer better in-sample fits, more parsimonious models such as SDJ are better suited for practical out-of-sample option pricing under varying market conditions.
In summary, this study contributes to the literature on return modeling and option pricing in the Chinese equity markets using such score-driven GARCH-jump models. We document that the SDSDJ and SDJ models provide a unified framework for return modeling and option pricing, demonstrating robustness across different market conditions. These findings have important implications for regulators and practitioners, who may consider adopting the score-driven GARCH-type models for return modeling and option pricing, such as the recently introduced other stock index options in China; however, we leave this for future research.
Author Contributions
Conceptualization, M.S. and C.Z.; methodology, M.S. and C.Z.; software, M.S.; validation, M.S., C.Z. and W.K.H.; formal analysis, M.S. and C.Z.; resources, C.Z., Q.C. and W.K.H.; data curation, M.S.; writing—original draft preparation, M.S.; writing—review and editing, C.Z., Q.C. and W.K.H.; supervision, C.Z. and W.K.H. All authors have read and agreed to the published version of the manuscript.
Funding
This paper is jointly supported by the Humanity and Social Science Youth Foundation of Ministry of Education of China (Grant No. 18YJC790210); by the Innovation and Talent Base for Digital Technology and Finance (Grant No. B21038); by the Fundamental Research Funds for the Central Universities, Zhongnan University of Economics and Law (Grant No. 2722024BY008; 2722023EJ002); via the IDA Institute of Digital Assets, Bucharest University of Economic Studies, CF166/15.11.2022, contract number 760046/23.05.2023, financed under the Romania’s National Recovery and Resilience Plan, Apel nr. PNRR-III-C9-2022-I8; by a Huawei Distinguished Visiting Professorship for Wolfgang Karl Härdle at The University of Edinburgh; by the Yushan Fellowhsip of the National Yang Ming Chiao Tung University, Hsinchu, Taiwan; and the Marie Sklodowska-Curie Actions under the European Union’s Horizon Europe research and innovation program for the Industrial Doctoral Network on Digital Finance, acronym: DIGITAL, Project No. 101119635.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this study are available on request from the corresponding author due to privacy and legal restrictions.
Acknowledgments
We are grateful to Enzo D’Innocenzo for generously sharing part of the code used in this study.
Conflicts of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Abbreviations
The following abbreviations are used in this manuscript:
| GARCH | Generalized Auto-regressive Conditional Heteroskedasticity |
| H-N GARCH | Heston and Nandi GARCH |
| GARCHJ | GARCH with Jumps |
| DVSDJ | Dynamic Volatility with Separate Dynamic Jumps |
| SDJ | Score-Driven Model with Jumps |
| SDSDJ | Score-Driven Model with Separate Dynamic Jumps |
| AIC | Akaike Information Criterion |
| BIC | Bayesian Information Criterion |
| IVRMSE | Implied Volatility Root Mean Squared Error |
| MLE | Maximum Likelihood Estimation |
Appendix A. Mathematical Details of Score-Driven Models
Appendix A.1. Log-Likelihood and Score Functions
Appendix A.2. Likelihood-Ratio Backtest for VaR Exceedances [28]
Let and for . With and , test
Binomial likelihoods:
LR statistic:
We report and p-values. (Edge cases are flagged.)
Appendix A.3. Risk-Neutral Dynamics
Under the measure, the return and state updates are defined in Equation (5), in which the score terms are given by
where
Appendix A.4. Benchmark Specifications
The closed-form option pricing formula for the Heston–Nandi GARCH model [9] is
where C denotes the European call option price, S is the underlying asset price, K is the strike price, r is the risk-free interest rate, is the time to maturity, is the discount factor, is the Fourier integration variable, is the characteristic function under the Heston–Nandi GARCH model, i is the imaginary unit, and denotes the real part of a complex number.
References
- Huang, X.; Tauchen, G. The relative contribution of jumps to total price variance. J. Financ. Econom. 2005, 3, 456–499. [Google Scholar]
- Zhang, C.; Liu, Z.; Liu, Q. Jumps at ultra-high frequency: Evidence from the Chinese stock market. Pac. Basin Financ. J. 2021, 68, 101420. [Google Scholar] [CrossRef]
- Arkorful, G.B.; Chen, H.; Liu, X.; Zhang, C. The impact of options introduction on the volatility of the underlying equities: Evidence from the Chinese stock markets. Quant. Financ. 2020, 20, 2015–2024. [Google Scholar] [CrossRef]
- Yue, Y.; Gehricke, S.A.; Zhang, J.E.; Pan, Z. The implied volatility smirk in the Chinese equity options market. Pac. Basin Financ. J. 2021, 69, 101624. [Google Scholar] [CrossRef]
- Huang, Z.; Tong, C.; Wang, T. Which volatility model for option valuation in China? Empirical evidence from SSE 50 ETF options. Appl. Econ. 2019, 52, 1866–1880. [Google Scholar] [CrossRef]
- Black, F.; Scholes, M. The pricing of options and corporate liabilities. J. Political Econ. 1973, 81, 637–654. [Google Scholar] [CrossRef]
- Heston, S.L. A closed-form solution for options with stochastic volatility with applications to bond and currency options. Rev. Financ. Stud. 1993, 6, 327–343. [Google Scholar] [CrossRef]
- Duan, J.C. The GARCH option pricing model. Math. Financ. 1995, 5, 13–32. [Google Scholar] [CrossRef]
- Heston, S.L.; Nandi, S. A closed-form GARCH option valuation model. Rev. Financ. Stud. 2000, 13, 585–625. [Google Scholar] [CrossRef]
- Mozumder, S.; Frijns, B.; Talukdar, B.; Kabir, M.H. On practitioners closed-form GARCH option pricing. Int. Rev. Financ. Anal. 2024, 94, 103296. [Google Scholar] [CrossRef]
- Maheu, J.M.; McCurdy, T.H. News arrival, jump dynamics, and volatility components for individual stock returns. J. Financ. 2004, 59, 755–793. [Google Scholar] [CrossRef]
- Zhang, C.; Chen, H.; Peng, Z. Does Bitcoin futures trading reduce the normal and jump volatility in the spot market? Evidence from GARCH-jump models. Financ. Res. Lett. 2022, 47, 102777. [Google Scholar] [CrossRef]
- Zhang, C.; Ma, H.; Liao, X. Futures trading activity and the jump risk of spot market: Evidence from the bitcoin market. Pac. Basin Financ. J. 2023, 78, 101950. [Google Scholar] [CrossRef]
- Jeon, Y.; McCurdy, T.H.; Zhao, X. News as sources of jumps in stock returns: Evidence from 21 million news articles for 9000 companies. J. Financ. Econ. 2022, 145, 1–17. [Google Scholar] [CrossRef]
- Vlaar, P.J.; Palm, F.C. The message in weekly exchange rates in the European monetary system: Mean reversion, conditional heteroscedasticity, and jumps. J. Bus. Econ. Stat. 1993, 11, 351–360. [Google Scholar] [CrossRef]
- Duan, J.C.; Ritchken, P.; Sun, Z. Approximating GARCH-jump models, jump-diffusion processes, and option pricing. Math. Financ. 2006, 16, 21–52. [Google Scholar] [CrossRef]
- Christoffersen, P.; Jacobs, K.; Ornthanalai, C. Dynamic jump intensities and risk premiums: Evidence from S&P500 returns and options. J. Financ. Econ. 2012, 106, 447–472. [Google Scholar]
- Christoffersen, P.; Jacobs, K.; Ornthanalai, C.; Wang, Y. Option valuation with long-run and short-run volatility components. J. Financ. Econ. 2008, 90, 272–297. [Google Scholar] [CrossRef]
- Li, C.; Maheu, J.M. A multivariate GARCH-jump mixture model. J. Forecast. 2024, 43, 182–207. [Google Scholar] [CrossRef]
- Ballestra, L.V.; D’Innocenzo, E.; Guizzardi, A. A new bivariate approach for modeling the interaction between stock volatility and interest rate: An application to S&P500 returns and options. Eur. J. Oper. Res. 2024, 314, 1185–1194. [Google Scholar]
- Creal, D.; Koopman, S.J.; Lucas, A. A dynamic multivariate heavy-tailed model for time-varying volatilities and correlations. J. Bus. Econ. Stat. 2011, 29, 552–563. [Google Scholar] [CrossRef]
- Harvey, A.C. Dynamic Models for Volatility and Heavy Tails: With Applications to Financial and Economic Time Series; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
- Harvey, A.C. Score-driven time series models. Annu. Rev. Stat. Appl. 2022, 9, 321–342. [Google Scholar] [CrossRef]
- Ballestra, L.V.; D’Innocenzo, E.; Guizzardi, A. Score-driven modeling with jumps: An application to S&P500 returns and options. J. Financ. Econom. 2023, 22, 375–406. [Google Scholar]
- Yang, X. Good jump, bad jump, and option valuation. J. Futur. Mark. 2018, 38, 1097–1125. [Google Scholar] [CrossRef]
- Chi, Y.; Hao, W.; Zhang, Y. Volatility model applications in China’s SSE50 options market. J. Futur. Mark. 2022, 42, 1704–1720. [Google Scholar] [CrossRef]
- Ornthanalai, C. Levy jump risk: Evidence from options and returns. J. Financ. Econ. 2014, 112, 69–90. [Google Scholar] [CrossRef]
- Kupiec, P. Techniques for verifying the accuracy of risk measurement models. J. Deriv. 1995, 3, 73–84. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).