1. Introduction
Modeling sudden jumps in asset prices is a critical topic in finance. Jumps—typically referring to large, abrupt price changes due to significant news—help explain why asset returns often have fat-tailed distributions and why option-implied volatilities exhibit pronounced “smiles”. Empirical evidence in the developed markets such as the U.S. market shows that jumps, while infrequent, can account for a non-trivial share of return variability (on the order of 5∼7% of total variance [
1]). In emerging markets such as China, stock price fluctuations tend to be even more extreme, and existing studies document episodes of abnormal jumps and higher overall volatility than in mature markets [
2]. These characteristics make accurate jump modeling particularly important for the Chinese stock market, as it deepens our understanding of return dynamics and improves option valuation in this market.
The Chinese stock market did not introduce exchange-traded equity options until 2015, when the Shanghai Stock Exchange launched the SSE 50 ETF option. Since then, the Chinese options market has expanded rapidly: By 2021, the average daily trading volume reached approximately 2.6 million contracts, making the SSE 50 ETF option one of the most actively traded ETF options. The annual trading volume increased from about 23 million contracts in 2015 to more than 300 million in 2024, reflecting the growing demand for hedging, arbitrage, and speculative activities. In particular, the introduction of SSE 50 ETF options has also impacted the underlying market. For example, Arkorful et al. [
3] found that the introduction of these options significantly decreases the volatility of the SSE 50 ETF itself, as option trading attracts informed traders and improves the flow of information on the spot market. The rapid development of the Chinese options market provides a rich data source and a timely opportunity to investigate return-jump and volatility modeling, as well as option pricing, under a new market regime.
Despite the rapid growth of the SSE 50 ETF options markets, most empirical research on return modeling and option pricing focuses on U.S. indices (such as the S&P 500). The Chinese market differs in several important ways that warrant dedicated investigation [
4,
5]. First, the Chinese equity market is dominated by retail investors and is periodically shaped by regulatory interventions, resulting in regime shifts and clusters of large price movements. Second, capital controls and other restrictions can cause liquidity shocks around policy events, for example, daily price limits may also induce discontinuities. Third, the SSE 50 ETF options are physically delivered contracts that are adjusted for dividends—a feature uncommon in standard index options. Furthermore, the number of available strike prices is limited, and the cost of short-selling the underlying assets is much higher than in U.S. markets. Finally, implied volatility in the SSE 50 ETF options market is generally higher than in the U.S. These distinctive market characteristics highlight the limitations of traditional models and motivate the usage of context-specific approaches to volatility and jump modeling in the Chinese equity market.
A wide range of models have been developed to capture asset volatility dynamics in both continuous and discrete-time frameworks. Under the classic continuous-time framework, the stochastic volatility model proposed by Black and Scholes [
6] and Heston [
7], laid the theoretical foundation for option pricing under diffusion processes. On the discrete-time side, GARCH models have been proven to be very successful in capturing time-varying volatility. Building on this success, Duan [
8] pioneered the GARCH option pricing model, showing that a GARCH-based approach can parsimoniously track time-varying volatility and even explain certain systematic pricing biases of the Black–Scholes model. Subsequently, Heston and Nandi [
9] derived a closed-form option valuation formula for a GARCH process, providing the first convenient analytical pricing formula for a stochastic volatility model that can be estimated and implemented solely based on observables. Recently, Mozumder et al. [
10] proposed a practitioner version of Heston and Nandi’s model.
However, empirical findings document that asset returns are occasionally disturbed by large jumps triggered by news or unexpected events. A growing body of literature links jumps to the arrival of big news (e.g., [
2,
11,
12,
13,
14]). For instance, using an extensive news dataset, Jeon et al. [
14] show that stock price jumps (in both frequency and size) are strongly related to news flows and their content, especially for firms with higher media visibility, analyst coverage, and institutional ownership. These evidences reinforce the view that jumps are not merely statistical outliers but are often tied to fundamental information events. To accommodate these discontinuities in returns, researchers have extended volatility models to include jump components in returns. In discrete time, numerous GARCH-jump models have been proposed for equities and exchange rates (e.g., [
11,
15]), where the return innovation has a mixture of a GARCH-driven continuous part and a jump-driven discontinuous part. These models have also been adapted to option pricing. For example, Duan et al. [
16] developed approximation methods for GARCH-jump models, while Christoffersen et al. [
17] estimated a model with time-varying jump intensity to price S&P 500 options. A consistent finding in the literature is that incorporating jumps improves pricing accuracy, and models with dynamic jump intensity outperform their static counterparts. For example, Christoffersen et al. [
18] and Christoffersen et al. [
17] verify that models allowing for jumps (especially with time-varying intensity) outperform those assuming purely continuous diffusion.
While GARCH and GARCH-jump-based models have been highly useful and progressed significantly in modeling returns [
19,
20] or option pricing [
10,
20], they update volatility using heuristic rules (e.g., lagged squared returns), which raise robustness concerns. This has motivated the development of score-driven volatility models—also known as dynamic conditional score (DCS) models—by Creal et al. [
21] and Harvey [
22]. Score-driven models take a different route: time-varying parameters (such as volatility or jump intensity) are updated each period using the score (i.e., the gradient) of the conditional log-likelihood, rather than ad hoc functions of past residuals. This generalized filtering mechanism guarantees that updates incorporate new information exactly as prescribed by the likelihood. Score-driven models have been developed rapidly [
23] due to their flexibility in modeling time-varying parameters—such as volatility, skewness, kurtosis, tail risk, and correlations—through efficient likelihood-based updating. Incorporating the score-driven approach into option pricing models may improve the accuracy of option pricing, since it typically improves the volatility modeling—an input that directly determines option prices. In view of this, Ballestra et al. [
24] recently proposed a score-driven GARCH-jump model, i.e., the SDSDJ (score-driven separate dynamic jumps) model, where both the conditional variance and the conditional jump intensity evolve according to auto-regressive updates driven by scaled score innovations. Their empirical application to S&P 500 returns and options demonstrates an excellent in-sample fitting and significant out-of-sample improvements, surpassing traditional GARCH-jump benchmarks. In particular, the score-driven model can accurately capture returns’ jump clustering and volatility dynamics, resulting in more reliable pricing of the S&P 500 options. These results suggest that score-driven jump models can offer a powerful tool for option valuation.
Unlike the extensive studies on the U.S. market, relatively few studies have examined option pricing in the Chinese market. For example, Huang et al. [
5] compared discrete-time models on SSE 50 ETF options and found that specifications incorporating realized-volatility measures outperform standard GARCH models based solely on daily returns. Yang [
25] introduced a GARCH option pricing model that includes a double-exponential jump component, and empirical results show that the model improves the fit on SSE 50ETF returns and options and provides relatively better in- and out-of-sample pricing performance. Recently, Chi et al. [
26] evaluated GARCH-type option pricing models in the Chinese SSE 50 ETF market and find that these models (and their extensions) deliver superior in-sample fit, GARCH-based forecasts even outperform option-implied volatility in predicting realized volatility out of sample. However, to the best of our knowledge, the performance of score-driven GARCH-jump models for the Chinese stock market has not yet been investigated. In light of the unique market characteristics discussed above, as well as the good performance of score-driven models in other contexts, it is natural to investigate whether these models can enhance return filtering and option pricing in the SSE 50 ETF market.
This paper employs a novel score-driven GARCH-jump model to fit SSE (Shanghai Stock Exchange) 50 ETF returns and option pricing. The main findings are as follows. First, we use 50 ETF spot returns to estimate conditional volatility and jump intensity, and find that the SDSDJ model significantly outperforms conventional GARCH-jump models in model fitting. Second, we evaluate both in-sample and out-of-sample pricing performance using data from 50 ETF options, and find that the SDSDJ model achieves the lowest in-sample pricing error among all benchmarks, while its simplified variant—the SDJ (Score-driven jumps) model—delivers the most accurate out-of-sample results. Third, the superior pricing performance of both models is robust across different levels of moneyness and DTM (days-to-maturity).
The marginal contributions of this paper are threefold. First, we investigate the performance of score-driven GARCH-jump models in modeling SSE 50 ETF returns and option pricing in the Chinese stock market. Our study not only enriches existing studies on option pricing in the emerging markets, such as the Chinese stock market [
5,
25,
26], but also provides evidence on jump behavior in the Chinese stock market [
2,
25]. Second, we obtain a novel out-of-sample finding. Although the empirical studies in Ballestra et al. [
24] have shown that the dynamic score-driven GARCH-jump model outperforms its static counterpart for the U.S. stock market, rigorous out-of-sample option pricing evidence remains scarce. Using data from the Chinese market, we find that the SDJ model performs better than the SDSDJ model for out-of-sample option pricing. This empirical result suggests that, in some cases, the reduced generalization ability of dynamic GARCH-jump modeling may outweigh its flexibility, thereby allowing more parsimonious models to achieve superior performance. Last but not least, we also enrich the empirical applications of score-driven models in return modeling and option pricing.
The remainder of this article is organized as follows.
Section 2 provides the data and an overview of the SSE 50 ETF options market.
Section 3 introduces the methodology, including the SDSDJ and SDJ models, used in this paper.
Section 4 presents and discusses the empirical results.
Section 5 concludes the paper.
2. The Data and Features of the SSE 50 ETF Option Market
The underlying asset of the SSE (Shanghai Stock Exchange) 50 ETF options is the 50 ETF issued by Hua Xia Fund Asset Management Company. Covering the 50 largest high-liquidity blue-chip stocks listed on the SSE, the SSE 50 ETF accounted for approximately of the total market value (i.e., over trillion RMB) of all 1037 equity ETFs in China by the end of 2024. In this study, we use the daily returns of the SSE 50 ETF from 23 February 2005 to 28 October 2024, and the daily settlement prices of the 50 ETF option contracts from 9 February 2015, to 28 October 2024, as the sample. The data is sourced from the Wind Database, which is a reliable and widely used financial database in China.
Figure 1a plots the log-return time series of the 50 ETF, and Panel A of
Table 1 gives its descriptive statistics. Apparently, we can find that the 50 ETF returns are highly volatile; there exists evidence of time-varying jumps, that jumps tend to cluster over time, and that the returns are characterized by a sharp peak and fat tails.
The 50 ETF option was introduced on 9 February 2015 as a European-style, physically settled option written on the 50 ETF. Panel B of
Table 1 gives the number of options sorted by moneyness and DTM, and the basic contract terms are provided in
Table 2. As illustrated in
Figure 1b, both daily trading volume and open interest increased dramatically from 2015 to 2024. Panel A of
Table 3 lists the proportion of trading volume by option category. The following can be observed: First, ATM (at-the-money) and OTM (out-of-the-money) options account for more than
of the total trading volume, while ITM (in-the-money) options account for less than
. Second, short- and medium-term options with DTM (days-to-maturity) of less than 60 days comprise over
of the trading volume, whereas long-term options account for less than
. Therefore, in line with the literature (e.g., [
24]), we restrict our selection to ATM and OTM options with maturities of up to one year (250 days) for pricing, as shown in Panel B of
Table 1.
Compared to their U.S. counterparts, SSE 50 ETF options exhibit several notable differences in contract specifications, regulatory practices, and market behavior. First, the number of strikes available is limited. At inception, the 50 ETF options offered only five strikes (one ATM, two OTM, and two ITM), which expanded to nine strikes (one ATM, four OTM, and four ITM) by early 2018. New strikes are added upon dividend adjustments, and seventeen such adjustments have been made since the launch of this product.
Second, option contracts are adjusted when dividends are paid on the underlying ETF. Specifically, if the ex-dividend price of the ETF is
S and the cash dividend is
d, the adjustment provides each holder with
new contracts at a strike price of
for each original contract at strike
K. Take call options as an example: when dividends are paid, the stock price decreases and the value of the call option is eroded; by adjusting the exercise price, the call option buyer can be protected. This leads to the following pricing identity:
This mechanism effectively shields option holders from dividend-induced price drops—a feature not standard for index options such as SPX.
Finally, the Chinese market experiences higher price volatility [
5]. For example, the average realized 10-day volatility of the 50 ETF from February 2015 to December 2024 was approximately
, compared to about
for the S&P 500 (SPX). Moreover, by February 2018, the volatility premium (defined as VIX minus realized volatility) for the 50 ETF was approximately
higher than that of SPX options.
These observations highlight fundamental differences between the Chinese and U.S. options markets, suggesting that pricing models useful for the U.S. may not be directly applicable to the Chinese context.
4. Empirical Results
4.1. In-Sample Fitting of Spot Returns
We use different GARCH-type models (H-N GARCH, GARCHJ, SDJ, DVSDJ, and SDSDJ model) to filter the conditional volatility and jump intensity of the 50 ETF spot returns. Since different models imply different conditional volatility and jump intensity for the same market, our objective is to identify which model achieves superior in-sample fitting.
Table 4 presents the maximum likelihood estimation results for these models. First, three different criteria—log-likelihood value, AIC, and BIC—are used to evaluate in-sample model fitting. From the table, we can find that, among all GARCH-type models, the novel SDSDJ model achieves the highest log-likelihood and the lowest AIC and BIC. Interestingly, the SDJ and DVSDJ models perform even worse than the plain vanilla Heston–Nandi GARCH model. Therefore, we demonstrate that score-driven GARCH-jump models provide better in-sample fitting. Second, as shown in the table, we find that none of the GARCH-type models produce significant coefficient estimates for
, whereas the S&P 500 market exhibits a significant jump risk premium (e.g.,
; see [
24]). However,
is significantly positive across models, indicating that investors are compensated for bearing the normal component of risk. Third, the jump size (denoted by
) of log-returns in the 50 ETF market is relatively small. As shown in the table, although all estimated
values are negative—reflecting negatively skewed returns—their absolute magnitudes are small, in contrast to jump sizes of around
observed in the S&P 500 market. Lastly, similar to the S&P 500 market, there exists significant interaction between the normal and jump components for the SSE 50 ETF market. The relative magnitudes of
and
reflect the interaction between jump intensity and conditional variance. If
, the interaction is considered significant [
24], which is also confirmed by the estimates in the table.
Figure 2 plots the filtered conditional volatility for different models. As illustrated in Panel (a)–Panel (e), the 50 ETF has experienced two episodes of extreme volatility: the 2008 financial crisis and the 2015 Chinese stock market crash. Although the traditional GARCH model captures all market turbulence, its volatility peaks are not pronounced, exhibiting a relatively smooth pattern, and the same applies to the static GARCHJ model (Panel (b)). In contrast, the SDJ model (Panel (e)) and the SDSDJ model (Panel (e)) provide more significant peaks in modeling the conditional volatility of the 50 ETF.
Figure 3 presents the filtered jump intensity for different GARCH-jump models. Clearly, models with dynamic jump intensity (i.e., DVSDJ and SDSDJ) provide more significant peaks than the static models (i.e., GARCHJ and SDJ). One possible reason is that static models use constant jump intensities, making it difficult to distinguish between high- and low-jump intensity trading days. However, GARCH-jump models with dynamic jump models provide better estimates of the number of posterior jumps associated with market crises.
Finally, we propose a general criterion for evaluating the quality of these volatility models: to what extent can the model strip out the predictable part of the log-return and assign a truly random and unpredictable part into the normal component (
). Time series models with stronger explanatory power should yield
values that are closer to a standard normal distribution.
Figure 4 illustrates the Q-Q plots of the normal innovation
for each GARCH model. As shown in the figure,
of traditional GARCH-type models deviates significantly from the standard normal distribution beyond
, while
of the SDSDJ model closely approximates a standard normal distribution within
.
4.2. Out-of-Sample Analysis for Spot Returns
To evaluate the predictive performance of the novel SDSDJ model and its rival models in 50 ETF spot returns, we follow Ballestra et al. [
24] and forecast the 5% and 1% Value-at-Risk (VaR) from one to five days. We evaluate the predictive performance of these models across several crisis and relative stable periods by partitioning the dataset into five subperiods and conducting separate out-of-sample analyses: January 2005–August 2009 (including the 2008 global financial crisis), August 2009–December 2012 (a relatively stable period), January 2013–December 2017 (including the 2015 Chinese stock market crash), January 2018–December 2022 (including the COVID-19 pandemic), and December 2022–November 2024.
For each period, we estimate the model parameters using a rolling window with a width equal to of the total number of days in that period. That is, initially, we calibrate these models using the first of daily total returns from the SSE 50 ETF, reserving the remaining data for out-of-sample evaluations. Subsequently, for each day within the out-of-sample dataset, we re-estimate the parameters employing the latest daily return observations and generate forecasts for horizons ranging from one to five days.
Specifically, for the SDSDJ model, at each forecast date
, we generate 100,000 simulated return paths for
, with
, based on the return specification in Equation (
2). This involves randomly generating the innovations
and
, with
and
evolving according to Equation (
4). We follow similar simulation procedures for the GARCHJ, DVSDJ, and SDJ models. We then calculate the VaR at significance levels of
and
by determining empirical quantiles from the 100,000 simulated outcomes of
. Following Creal et al. [
21], we evaluate the accuracy of the VaR forecast using the proportion-of-failures test proposed by Kupiec [
28]. The null hypothesis for this out-of-sample test is defined as
H0. The probability of model prediction failure equals .
Consequently, if the null hypothesis is rejected, it suggests that the model does not exhibit significant predictive power for VaR.
Table 5 reports the result of the likelihood ratio test. The empirical results indicate that the same model performs differently across high and low quantiles, and that market conditions also play a critical role in determining the model’s performance. For example, during the 2008 global financial crisis period (Panel A), at the
level, the SDJ and SDSDJ models exhibit superior performance, each with only one rejection. In contrast, the GARCHJ and DVSDJ models are rejected five times each. At the
level, however, each model demonstrates comparable and satisfactory performance. Interestingly, during the relatively stable period (Panel B), the SDSDJ model performs relatively better at
but worse at
, while the GARCHJ model exhibits the opposite pattern. During the 2015 Chinese stock market crash period (Panel C), all models perform relatively well at the
level, but perform poorly at the
level. The performances of these models during the COVID-19 pandemic (Panel D) period show a pattern similar to that observed in the 2008 crisis: the SDSDJ model performs best at
, whereas the GARCHJ model performs best at
. In the most recent period (Panel E), the SDSDJ model performs well at both the
and
levels.
Overall, the above empirical results suggest that the SDSDJ and SDJ models can effectively capture the extreme tail (e.g., ) of the conditional return distribution. By contrast, traditional GARCH-type models are better suited for modeling non-extreme returns (e.g., ) but yield notably relatively poorer out-of-sample fits when modeling extreme losses.
4.3. In-Sample Option Valuation
The GARCH-type models under -measure can be used to model the conditional volatility and jump intensity of spot returns, while those under -measure can be used to price options. Here, we apply several risk-neutralized GARCH-type models, including the SDSDJ and SDJ models to price the 50 ETF options from 23 February 2015 to 20 December 2024.
Table 6 presents the joint MLE estimation results using both spot returns and corresponding options data. Similar to the MLE estimation results for using only spot returns, the SDSDJ model achieves the highest log-likelihood value and the lowest AIC and BIC among all GARCH-type models. However, we find that the DVSDJ model performs very poorly over the time horizon, with a log-likelihood of only 21,823—
lower than that of the SDSDJ model. This empirical result supports the idea that incorporating score dynamics improves the performance of GARCH-jump models in option pricing. Furthermore, we find that the jump sizes
are all negative, the values of
lie between 0 and 1, and the estimates of
are all positive, indicating that the risk premium associated with the jump component is significantly positive. These findings are consistent with those of [
17,
18,
24,
27].
We also employ the IVRMSE (%) criterion (implied Volatility Root Mean Square Error) to evaluate the performance of these models.
Table 7 reports the in-sample IVRMSE for different option pricing models, both overall and across various moneyness and maturities. Panel A indicates that, overall, the SDSDJ model delivers the best performance, with an IVRMSE of only
, followed by the SDJ model (
) and the GARCHJ model (
), while the DVSDJ model performs the worst (
). Panel B shows that the pricing errors for OTM Put options are generally higher than those for OTM call options. Panel C reveals that short-term options tend to have higher pricing errors than long-term ones. As DTM increases, IVRMSE decreases—for example, the GARCHJ model performs better in the long term (
) than in the short term (
), and similar patterns are observed for the other models. Our primary concern is whether the SDSDJ model outperforms traditional GARCH-type models across different maturities and moneyness levels. The empirical results confirm that the SDSDJ model exhibits robust and consistently superior performance, with IVRMSE ranging from
to
across maturities, and from
to
across moneyness categories. Therefore, the empirical results confirm the robustness of its performance with respect to the option moneyness and maturities.
4.4. Out-of-Sample Option Valuation
We also conduct out-of-sample analyses for option valuation. For each year, option contracts from the first half of that year are used to estimate the optimal parameters, which are then applied to price contracts in the second half of that year. The model-implied volatility is derived using the classic Black–Scholes formula. By comparing the model-implied and market-implied volatilities, we compute the out-of-sample IVRMSE (%) for each model.
Table 8 reports the out-of-sample IVRMSE (%) for different option pricing models, both overall, and across various moneyness and maturities. Panel A indicates that the SDJ model yields the lowest overall IVRMSE, while the SDSDJ model performs poorly out of sample. This may be attributed to the larger number of parameters, which can result in overfitting and consequently weaker out-of-sample performance, while more parsimonious models (such as Black–Scholes and SDJ) exhibit greater stability for out-of-sample option valuation performance. Panels B through D present results when the options sample is sorted by year, maturity, and DTM, respectively. We have the following discoveries. First, higher market volatility is usually associated with poorer out-of-sample model performance. For example, in 2020 and 2024, the IVRMSE for all models exceeds
, while in 2022 and 2023, it can be as low as
(for the SDJ model). This substantial variation highlights the sensitivity of GARCH-type models to market conditions. During crisis periods, the performance of option pricing models generally deteriorates; however, the SDJ model remains the most robust among them. Second, pricing accuracy is higher for at-the-money options compared to deep out-of-the-money options, and the SDJ model still performs best across all moneyness categories. Finally, all models exhibit better performance in pricing long-term options, which is consistent with the in-sample findings.
To summarize, the empirical results on the out-of-sample option valuation in this section demonstrate that the score-driven GARCH-jump models outperform traditional approaches—including both GARCH-type models and the classic Black–Scholes model—in out-of-sample analyses for SSE 50 ETF options.
5. Conclusions
This paper employs a novel score-driven GARCH-jump model to analyze SSE 50 ETF returns and option pricing. Our analysis yields several key findings:
The SDSDJ (score-driven model with separate dynamic jumps) model achieves the highest log-likelihood and the lowest AIC and BIC values, indicating its superiority in-sample fitting to the 50 ETF return series. This model effectively captures both time-varying volatility and jump dynamics by utilizing score-driven filters.
According to the Kupiec likelihood ratio test, the GARCHJ (GARCH with jumps) model performs reasonably well in stable periods, but the SDSDJ and SDJ (score-driven model with jumps) models more accurately capture extreme losses in turbulent periods. This highlights the robustness of the SDSDJ framework for modeling fat-tailed returns and its value in risk management under stress.
The option pricing analysis shows that the SDSDJ model achieves the best in-sample performance across all GARCH-type models and has robust pricing accuracy across moneyness and maturities. However, its out-of-sample performance is less stable—likely due to overfitting, but its simplified variant—the SDJ model—delivers the most accurate out-of-sample results. These results imply a trade-off between model flexibility and generalizability, suggesting that while more flexible models like SDSDJ may offer better in-sample fits, more parsimonious models such as SDJ are better suited for practical out-of-sample option pricing under varying market conditions.
In summary, this study contributes to the literature on return modeling and option pricing in the Chinese equity markets using such score-driven GARCH-jump models. We document that the SDSDJ and SDJ models provide a unified framework for return modeling and option pricing, demonstrating robustness across different market conditions. These findings have important implications for regulators and practitioners, who may consider adopting the score-driven GARCH-type models for return modeling and option pricing, such as the recently introduced other stock index options in China; however, we leave this for future research.