1. Introduction
Bond yields and their returns are not normally distributed, thus overruling the concept of symmetry around the mean. They retain non-normal features, such as fat tails, skewness, and volatility clustering (
Piazzesi, 2010).
ATSMs augmented with jumps or regime switching dynamics offer enhanced flexibility to capture these stylised facts observed in interest rate data (
Piazzesi, 2010;
Singleton, 2006). When used to fit the term structure of interest rates, these models often lead to a reduction in the Root Mean Squared Error (RMSE) and produce residuals that more closely approximate Gaussian white noise. This improvement reflects better model specification and a more accurate representation of interest rate dynamics. Standard Gaussian ATSMs often fall short in capturing abrupt market movements or structural shifts. Incorporating jumps or regime switching mechanisms allows the model to internalise these dynamics, thereby improving both in-sample fit and the statistical properties of residuals (
Dai & Singleton, 2000;
Duffie & Kan, 1996).
Jumps are popularly associated with scheduled macroeconomic news release, typically the Central Banks announcing the interest rates changes. ATSM class is not completely affine except for special cases as found in
Hamilton (
1989),
Dai and Singleton (
2000), and
Ang and Bekaert (
2002). This class of regime switching models exhibits commonly two regimes, high-persistence low-volatility and a low-persistence high-volatility (
Piazzesi, 2010).
The extension of affine diffusion models to AJD is accomplished by including a Levy type jump to them.
Duffie et al. (
2000) discusses the AJD in terms of no-arbitrage, analytical tractibility, and their impact regarding accuracy with which they price bonds and their derivatives.
Cont et al. (
2004) discusses various Levy processes, their nature, whether finite or infinite jumps, and their impact on the diffusion. The associated stylised facts for these AJD is also exhibited by volatilty skews and the behaviour of options. Another important aspect to the term structure modelling and pricing of jump models is the market price of risk, in particular, consideration for compensation from the jump risk. Regime switching models are expected to be free from arbitrage, analytically tractable, and offer closed-form solutions.
A growing body of literature considers the treatment of regime detection through the implementation of Markov-modulated models; see
Hu (
2022),
Qin et al. (
2024),
Ding et al. (
2025), among others for more details. Regarding jumps within regime switching models, we consider the approach involving a marked point process by
Landen (
2000). This procedure determines the intensity kernel and regime-based risk compensation. Our study seeks to link the jumps to regimes even though they arise from different circumstances. Our focus has more to do with the ability of jumps and regime switching models to fit the term structure of interest rates when non-Gaussian dynamics are present. The effectiveness of these models should be confirmed by a reduction of the Root Mean Square Errors (RMSEs) and improvement from non-Gaussian residuals shifting more towards Gaussian.
In this study, we consider a continuous-time modelling in regimes, contrary to previous work involving mostly discrete-time—
Dai et al. (
2007),
Ang and Timmermann (
2012), among others. We also consider a non-Gaussian approach, unlike the Gaussian assumption made by
Dai et al. (
2007).
Bandara and Munclinger (
2011) also follow the continuous-time approach but their estimation strategy differs from ours as they use the Kalman filter. We estimate the model using the UKF for its known capability to handle nonlinearity.
We follow the AJD from the empirical work by
Duffie et al. (
2000), and the canonical form of parameters by
Dai and Singleton (
2000), from which we adopt the closed-form and tractability in pricing of bonds and derivatives. The bond yields derived from these models are tested for fitting our data when jumps are introduced. The same process is extended to regime switching models from which we expect regime switching analogue to the AJD, with certain parameter restrictions. We consider an affine process
X whose characteristic function takes the form
where
is the state vector at time
t,
u is a complex or real vector argument in the characteristic function,
and
are complex valued functions, and ⊤ is a transpose operation for a matrix
. The parameters within
are expected to vary with regimes while those within
do not.
Our argument regarding the affine jump diffusion and subsequently the regime switching affine models is triggered by the definition and characterisation of the regular affine process from
Duffie et al. (
2002). A generator-based approach is then followed on how a Markov process
X evolves over time (
Duffie et al., 2002;
Keller-Ressel et al., 2011). In the context of regime switching models, the generator should also define the dynamics that characterise both the continuous-time and discrete-time stochastic dynamics. The final output is the regime-specific characteristic function and a set of regime-specific ordinary differential equations (ODE)s or Ricatti-type equations from which a
Duffie and Kan (
1996) style zero-coupon bond price
P maturing at time
; hence, the yield
is derived. Extension of the regular affine process to the regime switching formulation is the condition for a regime switching affine models (
Van Beek et al., 2020). In addition, we follow
Landen (
2000) by incorporating the kernel intensity for regimes to ensure that for analogous with the jump kernel intensity, the model enables the pricing of regime risk.
Our key contributions are threefold. First, we formally demonstrate that the regular affine process specification naturally extends RS-AJD models. Second, we employ the UKF for efficient state estimation and model calibration, which enables accurate yield curve fitting in the presence of latent factors and nonlinearities. Third, empirical results show that the fitted models produce residuals that closely approximate Gaussian white noise, as confirmed through residual diagnostics and Q-Q plots—indicating improved model specification and robustness in capturing interest rate dynamics. Unlike the three-factor yield curve model based on the Diebold-Li extension of Nelson–Siegel as recently applied by
Choudhary (
2022) on the SA yields, AJD and RS-AJD have an advantage of econometric tractability as they simultaneously study the term structure and time series dynamics whilst also incorporating the jumps and regime switches.
The remainder of the paper is structured as follows:
Section 2 discusses briefly a literature review;
Section 3 describes the model framework and estimation strategy;
Section 4 discusses data collection;
Section 5 discusses the scenarios;
Section 6 discusses the model implementation;
Section 7 discuses the analysis of results; and
Section 8 concludes.
2. Literature Review
The modelling of term structure of interest rates has undergone significant development over the past three decades, with ATSMs emerging as a dominant framework due to their analytical tractability and empirical flexibility. Early models such as
Vasicek (
1977) and
Cox et al. (
1981) provided foundational single-factor Gaussian and square-root processes, but these were insufficient to capture the richness of the yield curve observed in empirical data. The extension to multi-factor affine models was formalised by
Duffie and Kan (
1996), who derived the zero-coupon bond equation as an exponentially affine function of the state variable
X.
Dai and Singleton (
2000) introduced the specification analysis for ATSMs, which determines whether they are well-defined for bond pricing, “admissibility”. They defined the
family of affine models, where
N is the number of factors and
denotes the number of volatility components. Within this structure, the
models represent the purely Gaussian and homoskedastic affine class, where the state variable
X does not affect its volatility at time
t.
models with
are characterised by the assumption that one of the
Xs determines the conditional volatility of all three state variables. This specification introduces the short rate dynamics such as mean reversion, central tendency, and stochastic volatility, leading to a shift in the normal distribution.
Affine term structure models permit deviations from normality primarily through the incorporation of stochastic volatility and the introduction of jump components. Distinguishing between these sources of non-normality using only first and second moments—such as mean and variance—is inherently challenging, as both features can produce similar effects on volatility clustering and heavy-tailed behaviour in returns. Regime switching models also produce non-normal return distributions and are consistent with empirical evidence of nonlinearities in the conditional first moments of financial time series. By allowing parameters such as drift or volatility to shift across latent regimes, these models can capture structural breaks, asymmetries, and time-varying risk premia that are otherwise difficult to model within standard linear frameworks; see
Piazzesi (
2010) and references therein.
Singleton (
2006) investigated the recurrent non-normal features observed in yield return distributions, particularly during periods of peak or quite financial market volatility. These deviations from Gaussianity are often attributed to time-varying volatility and sudden, infrequent price movements. The latter is typically modelled via classical jump processes, such as Poisson jumps, or more generally through Lévy-type processes. Other approaches include drawing shocks from mixtures of distributions or introducing regime switching structures that allow for discrete shifts in model parameters. Jump processes capture abrupt discontinuities, whereas regime switching models emphasise shifts between persistent economic states.
Singleton (
2006) further explored how these mechanisms interact with stochastic volatility—initially in discrete-time settings and subsequently within continuous-time affine frameworks that incorporate both stochastic volatility and jump components. Together, these enhancements aimed to replicate the observed skewness, kurtosis, and tail behaviour in bond yield and return distributions, especially across different market conditions.
Duffie et al. (
2000) presented an AJD that provides an analytical treatment of a class of transforms, including various Laplace and Fourier transforms as special cases, that allow a solution for a range of valuation and econometric problems. This AJD is a process for which the drift vector, instantaneous covariance matrix, and jump intensities all have affine dependence on the state vector
X. They extended the existing literature on affine asset pricing models by deriving a closed-form expression for an extended transform of an AJD process
X, and showed that this transform leads to analytically tractable pricing relations for a wide variety of valuation problems.
Piazzesi (
2000) applies the extended transform differently in the treatment of term structure models with releases of macro-economic information and with central-bank interest rate targeting.
Regime switching affine models should inherit similar characteristics to AJD, including tractability and close-form solutions. The work of
Van Beek et al. (
2020) addressed this issue by evaluating the prevalence of the regular affine process and parameter restrictions within regime switching. They prove that the joint process of the Markov chain and the conditionally affine part is a process with an affine structure on an enlarged state space, conditionally on the starting state of the Markov chain. The process leads to regime-specific characteristic functions and the solution of ODE as it is the case with the regular affine process. Essentially, a pricing solution based on an affine process is extended to a regime switching affine process without sacrificing the analytical tractability of the affine process. They consider among several empirical option pricing applications, a diffusion type short rate model of
Landen (
2000), which falls within the category of hidden Markov models. In this model, the underlying Markov process is assumed to have a stochastic differential driven by Wiener processes and a marked point process. They also consider the bond valuation application of
Elliot and Siu (
2013) on short rate models, who establish a Markov-modulated exponential-affine bond price formula with coefficients given in terms of fundamental matrix solutions of linear matrix differential equations.
Zhu et al. (
2015) developed Feynman–Kac formulas for a class of regime switching jump diffusion processes. In these models, the jump part is driven by a Poisson random measure associated with a general Lévy process and the switching part depends on the jump diffusion processes.
In this paper we adapt the regular affine process by
Duffie et al. (
2002) and their formulations together with those from
Keller-Ressel et al. (
2011). The AJD part is extended to the regime part accordingly to evaluate the regime-specific formulation of the characteristic function, the generators, and the bond pricing equation. Our objective is to model first, the AJD, followed by the regime switching models in the realm of the regular affine process. In contrast with
Dai et al. (
2007) who assume a Gaussian distribution and discrete-time process, we assume that the yields and their returns are non-normal.
A growing body of literature highlights the limitations of standard Kalman filtering in the context of nonlinear and non-Gaussian dynamics, particularly in fixed income modelling.
Christoffersen et al. (
2013) show that the UKF significantly outperforms the Extended Kalman Filter (EKF) in affine term structure models due to its ability to better handle nonlinearities without requiring Jacobian computations; see also
Hirsa (
2024). Their findings support the UKF as a viable estimation tool for fixed income applications.
More recently,
Doshi et al. (
2024) demonstrate the effectiveness of modelling financial variables within an AJD environment estimated using nonlinear filtering techniques, the UKF, while
B. Wu et al. (
2024) propose a multi-factor, mean-reverting affine framework for pricing volatility-based instruments such as variance swaps. These contributions confirm the relevance of AJD in capturing the structural complexity and tail behaviour of financial time series. The affine property of the jump diffusion state dynamics is preserved conditional on each regime, satisfying the standard affine admissibility conditions. The model therefore retains a RS-AJD structure governed by a latent Markov chain. Consequently, the application of the UKF—previously validated in standard AJD and multi-factor nonlinear environments—naturally extends to the regime switching context. Our study contributes to the literature by combining these frameworks and empirically demonstrating the implementation of RS-AJD and estimation and filtration via UKF.
Chourdakis (
2002) explains the need for estimation methods that can accommodate both jumps and regime switching in continuous-time models, noting that traditional Gaussian-based filters may fail to capture the irregularities introduced by such dynamics. Although developed outside of finance, the Generalized Maximum-likelihood UKF (GM-UKF) introduced in the robust control literature
Liu et al. (
2017) offers further motivation by demonstrating how robust filtering approaches can mitigate the impact of non-Gaussian noise—precisely the type of issue that arises in affine jump diffusion models with regime switching. Building on these insights, we adopt a UKF-based estimation strategy tailored to capture the nonlinear, regime-dependent, and non-Gaussian features of the term structure under jump risk and switching dynamics.
3. Model Establishment
Affine processes are a class of Markov chain characterised by the property that their logarithmic characteristic functions are affine in the initial state. They are typically defined through their infinitesimal generators, which involve parameters such as drift, diffusion, and jump intensity functions, each depending affinely on the current state. Under suitable regularity conditions, the evolution of the characteristic function is governed by a system of Riccati-type ODEs parameterised by time and initial conditions (
Duffie et al., 2002;
Keller-Ressel et al., 2011).
A natural extension of this framework is the regime switching affine process, in which the model parameters—and hence the generator—are modulated by an underlying finite-state Markov chain. This regime-specific structure enables the model to capture structural changes in the dynamics, such as shifts in volatility, mean reversion, or jump activity, driven by the regime variable. The process is expected to exhibit the affine behaviour, conditioned on the current regime, while preserving analytical tractability through a system of ODEs that characterise the evolution of the characteristic function under each regime (
Van Beek et al., 2020).
The Markov-modulated component builds upon the marked point process framework of
Landen (
2000), adapted here to ATSMs with jumps and regime dependence.
Proposition 1 (Affine Process)
. Let be a time-homogeneous Markov process with state space , adapted to a filtered probability space , and satisfying the Feller property (a central regularity condition in the theory of affine processes, particularly when ensuring existence, uniqueness, and well-behaved semigroups). The process is said to be affine if there exist deterministic functionssuch that the conditional characteristic function of , given , admits the exponential-affine form: Proposition 2 (Affine Jump Diffusion)
. Let be a time-homogeneous Markov process satisfying the following SDE:where is a constant drift vector and a loading matrix;
is the volatility matrix;
is a non-negative affine process driving stochastic volatility;
is a standard Brownian motion;
is the jump Poisson random measure associated with ;
is the compensated jump measure;
is the Lévy measure, governing the distribution of jump sizes .
The diffusion term
introduces stochastic volatility while preserving the affine structure. This framework allows the derivation of bond price equations via generalised Riccati equations. As shown in
Duffie and Kan (
1996), models in the affine class admit bond price expressions that are exponentially affine in the state variables under an affine short rate specification.
3.1. Regime Switching Affine Processes
The classical AJD framework, as developed by
Duffie et al. (
2000), provides a powerful and tractable class of models where characteristic functions admit closed-form or semi-closed-form solutions. Empirical evidence suggests that financial markets exhibit distinct
regimes characterised by shifts in volatility, jump intensity, mean reversion levels (
Ang & Bekaert, 2002;
Ang & Timmermann, 2012;
Hamilton, 1989), among others. These regime shifts cannot be captured adequately by a single affine jump diffusion process.
The AJD is extended to RS-AJD as a continuous-time Markov chain on a specific state space that switches between the states. Importantly, this extension preserves the affine structure, thereby retaining analytical tractability while allowing for richer dynamics that reflect regime changes (
Van Beek et al., 2020).
We now extend Proposition 2 to its regime switching counterpart as follows:
Proposition 3 (Regime Switching Affine Jump Diffusion (RS-AJD)). Let be a continuous-time, finite-state Markov chain with generator matrix , where .
Let satisfy the stochastic differential equation:where is a drift vector in regime , and is a loading matrix in regime ;
is the regime-dependent volatility matrix;
is a non-negative stochastic process governing volatility;
is a standard Brownian motion;
is the compensated jump measure under regime ;
is the regime-specific Lévy measure governing jump intensity and size distribution;
denotes the support of jump sizes ξ.
The regime process is a pure jump Markov chain with jump measure on , and compensator:where denotes the Dirac measure at point . The compensated jump measure of Z is defined byThe stochastic integral representation of isand the semimartingale decomposition of isThus, the full process is a Markov semi-martingale with regime-dependent drift, diffusion, and jump components. 3.2. Generator of the Regime Switching Affine Process
The infinitesimal generator provides a compact way to describe the local behaviour of a Markov process (
Duffie et al., 2000). In our model, the joint process
is a RS-AJD, with
Fix
, and let
be a twice continuously differentiable function. The conditional infinitesimal generator
of the state process
under fixed regime
z acts on
f as
where
The volatility scaling matrix describing state-dependent volatility scaling under regime
z is specified as
where
is the constant term (intercept) for the i-th component of the volatility scaling in regime z.
is the loading (sensitivity) of the i-th volatility component to the j-th state variable , under regime z.
indexes the volatility components, i.e., the diagonal entries of .
indexes the state variables.
This generator captures the combined effects of drift, diffusion, and jumps under a given regime. The jump intensity and the Lévy measure may vary with the regime, allowing for flexible modelling of rare or discontinuous events.
3.2.1. Joint Generator for
To describe the full process
, we combine the regime-specific generator
with the generator matrix
of the finite-state Markov chain
, where
for
and
. Then for any function
, the joint infinitesimal generator
is given by
The first term
describes the evolution of
under a fixed regime
z, while the second term accounts for regime switching jumps governed by the generator matrix
Q.
1The affine structure of the RS-AJD model ensures that expectations of exponential-affine functions—such as characteristic functions or bond prices—can be computed from a system of ODEs. Specifically, for a short rate of the form
with regime-dependent parameters
,
, pricing a zero-coupon bond reduces to solving a system of regime-specific Riccati equations derived from the generator
.
This generator-based formulation plays a central role in deriving semi-analytical solutions for prices, expectations, and moment-generating functions in a regime-specific and computationally tractable manner.
Under the risk-neutral measure
, define the zero-coupon bond price as
Let
Then
f satisfies the partial integro-differential equation (PIDE)
We postulate the affine
with deterministic functions
,
, and terminal conditions
Subsrituting the affine (
11) into the PIDE, applying the generator
, and matching coefficients in powers of
x yields a system of coupled ODEs (Riccati equations) for
:
Here
is the affine function of
corresponding to regime
z, and the sum over
reflects regime transitions through the Markov generator
Q.
are the terminal conditions.
3.2.2. Bond Yields and Term Structure
From the solution of
and
, the zero-coupon bond price in regime
z takes the exponential-affine form (
Landen, 2000;
L. Wu & Zeng, 2004)
when the current regime is
z. More generally, if a regime is not observed or transitions are probabilistic, priced value becomes a regime-weighted mixture:
where
or an equivalent regime probability.
The continuously compounded yield is
which retains an affine form in the state vector
and depends on regime probabilities.
Given that the process is a semi-martingale, it admits a well-defined stochastic integral formulation, allowing us to work within a no-arbitrage pricing framework. For tractability, we assume that the market price of risk is zero, implying that the physical measure coincides with the risk-neutral measure . Under this assumption, the model dynamics can be interpreted directly under , and no change-of-measure adjustment is necessary for pricing.
3.3. Joint Dynamics of State and Regime Processes
Following the specification of
Landen (
2000) and
L. Wu and Zeng (
2004), we consider a joint affine jump diffusion system for the latent state process
and the discrete-valued regime process
. These processes evolve in continuous-time and interact dynamically through their drift, volatility, and jump intensities.
3.3.1. State Process
The dynamics characterised in Proposition 3 can alternatively be expressed in the form of a SDE for the state process
, which evolves under a regime-dependent affine jump diffusion specification:
where
is a compensated marked point process driving jumps in .
denotes the jump size in the state space.
3.3.2. Regime Process
The evolution of the regime
is modelled via a marked point process:
where
is the mark space of regime transitions.
for , mapping the transition to the new regime index.
is a compensated marked point process governing regime jumps.
3.3.3. State-Dependent Regime Transition Intensity
The stochastic intensity of regime switching is allowed to depend on the current state
. The compensator (or intensity measure) of the regime-jump process is given by
where
denotes a transition from regime i to j.
is a measurable state-dependent transition intensity.
is a reference measure over the mark space .
We follow
L. Wu and Zeng (
2004) in assuming that the transition intensity is exponentially affine in the state
where
and
are parameters specific to the transition
. This choice ensures both tractability and consistency with the affine term structure framework.
Equation (
17) describes a multi-factor affine jump diffusion process for the latent state variables
, whose parameters depend on the prevailing regime
. Simultaneously, (
18) captures endogenous regime transitions via a marked jump process, where the intensity of switching from regime
i to
j is driven by the current economic state
through an exponential-affine function (
20).
This joint specification allows regime switching to be triggered endogenously by macro-financial conditions, thereby embedding jump risks and regime-dependent dynamics within a unified affine framework.
3.4. Estimation Framework
We consider modelling the term structure of interest rates using a nonlinear state-space framework, where observed yields are driven by latent factors . This section outlines the estimation strategy based on the UKF for two affine models, the AJD and the RS-AJD.
3.4.1. State-Space Modelling and UKF Estimation of Affine Yield Curve Models
UKF have been used together with the AJD to address both the filtration problem and parameter estimation. Recent advances in the literature (
Cheng et al., 2024;
Christoffersen et al., 2013;
Deng, 2023;
Lv et al., 2025) emphasise the need for flexible estimation frameworks that can accommodate nonlinearities and deviations from Gaussian assumptions.
Christoffersen et al. (
2013) show that the UKF offers significant improvements over the EKF when applied to affine term structure models, due to its ability to more accurately approximate nonlinear transformations without requiring Jacobians. Their findings support the UKF as a practical and robust estimation strategy in fixed income applications.
In light of this, and the fact that our models incorporate both jumps and regime switching dynamics, we adopt the UKF to estimate latent states and parameters. The UKF is particularly suitable here, as it efficiently handles the nonlinear, non-Gaussian features that arise from jump components and regime-dependent behaviour.
2 3.4.2. State (Transition) Equation
The state variables transition equations take the Euler–Maruyama approximation form of respective SDEs. The latent factor process
follows a discrete-time approximation of a jump diffusion process:
where
is the drift function.
is the diffusion (volatility) function.
is standard Gaussian noise.
is the jump component, where and the jump sizes are i.i.d. and independent of and .
In the UKF implementation, the jump component is approximated as Gaussian with
which is incorporated into the process noise covariance.
The transition function used in the UKF is
with total process noise covariance
3.4.3. Measurement Equation
The observed yield vector
is modelled as an affine function of the latent state:
where
is a vector of intercept terms.
is the factor loading matrix.
is measurement noise.
The measurement function for the UKF is:
3.4.4. Latent Factors
In both the AJD and RS-AJD specifications, the latent state vector is defined as
where
,
, and
represent the level, slope, and curvature factors of the yield curve, respectively. These three latent components capture the dominant cross-sectional movements in yields across maturities and provide an economically interpretable representation of the term structure dynamics.
3.4.5. UKF in Regime Switching Jump Diffusion (RS-AJD) Model
To capture the structural changes in yield dynamics, we extend the model to include a hidden regime index
, which evolves as a first-order Markov chain. The latent state evolves according to:
where the regime-specific components are:
is an affine drift function in regime k,
is an affine diffusion function in regime k,
is the jump term in regime k,
is the jump sizes,
is a standard Gaussian innovation.
The Markov transition rule for the regime index is
The measurement equation remains
3.4.6. UKF Implementation
The UKF approximates the filtering problem by propagating a set of sigma points through the nonlinear state and measurement functions. The method is described fully in
Hirsa (
2024). We implement it in this study for both the AJD and RS-AJD models. The UKF is a nonlinear extension of the classical Kalman filter that avoids the need for linearising the system dynamics—as done in the Extended Kalman Filter. Instead, it uses a deterministic sampling approach based on sigma points, which are carefully chosen points around the mean of the state distribution. These points are propagated through the nonlinear functions—state transition and measurement functions—to capture the posterior mean and covariance up to second-order accuracy.
As discussed in
Christoffersen et al. (
2013), the UKF is particularly well-suited for yield curve modelling when the measurement equation is affine—linear in latent states, but the state dynamics are nonlinear. The sigma points provide a computationally efficient means of approximating the distribution of the latent states without resorting to more complex simulation-based methods such as particle filtering. This balance between accuracy and computational tractability makes the UKF attractive for large-scale term structure models, particularly when estimating latent factor dynamics under jump diffusion or regime switching specifications.
The approach is also relatively easy to implement and tune, with hyperparameters (e.g., ) controlling the spread and weighting of the sigma points, thus offering robustness against non-Gaussian features or local nonlinearities in the system. The UKF estimates the latent states by recursively applying a prediction-update cycle based on a deterministic set of sigma points. The following steps outline the UKF implementation for both the AJD and RS-AJD models:
Initialization: Set initial state estimate , covariance matrix , and model parameters . Given the observed yield data , begin filtering from .
Sigma Point Generation: At time
, construct
sigma points
from the current estimate
and covariance
:
where
and
are UKF tuning parameters.
Prediction Step: Propagate each sigma point through the nonlinear transition function
, which may depend on the regime
:
Compute the predicted mean and covariance:
where
is the process noise covariance (including diffusion and jumps), and
are UKF weights.
Measurement Prediction: Pass predicted sigma points through the measurement function:
Then compute
Update Step: Compute the Kalman gain and update state estimates:
Iterate: Repeat the above steps for all to obtain the filtered state estimates and covariances .
: Latent factor vector.
: Observed yield vector.
: Filtered state estimate.
: Filtered state covariance.
: Regime-dependent process noise.
R: Measurement noise covariance.
A detailed pseudocode of this procedure is provided in
Appendix B.
3.4.7. Parameter Estimation via Maximum Likelihood
There are four parameters that are affine in
X for the AJD model according to the formulation in Proposition 2 and the measurement (
23). For the regime switching model, we assume two regimes
resulting in two sets of four each
parameters. We describe the parameters for each model as follows:
AJD model: The parameter vector
consists of four latent process parameters drift
, diffusion volatility
, jump intensity
, and jump size standard deviation
, along with noise parameters
A,
B, and measurement noise covariance
R.
RS-AJD model: The parameter vector
includes separate process parameters for each regime
, capturing regime-dependent dynamics.
These parameters are estimated by maximising the UKF-based log-likelihood, which can be expressed as
where
is the observed yield vector at time
t,
is the UKF one-step-ahead prediction,
is the predicted observation covariance, and
is the dimension of the observation vector.
For the RS-AJD model, the likelihood is computed conditional on the regime sequence, with parameters applied according to the regime at each time step.
Optimization is performed using gradient-based local algorithms, e.g., L-BFGS-B, or global methods, Differential Evolution (DE), depending on initialisation and model complexity.
4. Data Collection
We retrieved the weekly South African (SA) Government bond prices and yields from the Thomson Reuters database for the period October 2015 to September 2024, over 3 months, 5, 10, 12, 20, 25, and 30 year maturities. This sample encompasses a range of market and monetary policy events with episodes of heightened volatility and discontinuous yield adjustments. It provides the necessary variation to identify both the continuous and jump components of the term structure as well as distinct interest rate regimes. The choice of weekly data is informed by weekly auctions held by the Treasury. Technically, weekly frequency is suitable for smoothing transitory market noise while retaining sensitivity to regime shifts and jump events.
SA bonds have fewer short maturities but instead their bond portfolios consist of longer maturities. For the out-sample forecast, we used one year weekly bond yields for the period October 2023 to Septermber 2024.
The yield surface in
Figure 1 presents both the cross-section and time series of yields. It demonstrates a clear upward trend in yield levels from 2015 to 2024. In the earlier years (2015–2016), yields were comparatively low, averaging around 8%, but began rising steadily, reaching a local peak around 2020. By 2024, yields remain elevated compared to the beginning of the period, suggesting a steep rate increase.
Along the maturity dimension, the term structure typically exhibits an upward-sloping pattern from short to intermediate maturities, with yields increasing steeply between 0 and 5 years. A modest hump is visible around the 15-year maturity mark, beyond which yields stabilise or slightly decline, indicating a flattening yield curve at the long end.
6. Model Implementation
We begin with the AJD model as the baseline single-regime specification, which captures jump dynamics in the short rate process within a continuous-time framework. This model is then extended to a RS-AJD framework, allowing for parameters such as drift, volatility, and jump intensity to vary across two latent regimes () governed by a generator matrix .
Both models are discretised using the Euler–Maruyama scheme to approximate the continuous-time stochastic differential equations for numerical simulation and likelihood evaluation. For the models, we assume the presence of the stylised facts for asset returns and non-Gaussianity (
Cont, 2001). As a result, we estimate the parameters, hidden states, and regime paths using the UKF.
Estimation involves jointly inferring the model parameters and the regime transition matrix from observed yield curve data across maturities. We compare the fit and forecasting performance of (i) non-Gaussian AJD models with jumps and (ii) the regime switching RS-AJD extensions. This hierarchical modelling approach balances tractability with flexibility, enabling us to capture regime-dependent shifts in the term structure dynamics that are crucial for accurate pricing, risk assessment, and empirical analysis.
8. Conclusions
In this study, jumps and regime switching dynamics were introduced within the affine term structure framework to address the empirically observed non-normality in bond yields and their returns. The resulting models were calibrated using the UKF, and their performance was assessed through RMSE metrics, residual diagnostics, and visual inspection of fitted versus observed yield curves.
The inclusion of jumps and regimes was justified by a clear improvement in model fit, evidenced by lower in-sample and out-of-sample RMSE values and the model’s ability to reproduce key stylised facts, such as skewness and excess kurtosis. Residual diagnostics including Q-Q plots, autocorrelation, and distributional measures showed a movement toward Gaussianity, supporting the notion that the models capture critical nonlinearities and discontinuities in the yield process.
Some limitations remain. The jump component was modelled using a simple Poisson process with constant intensity and alternative Lévy-based jump processes—the variance gamma, normal inverse Gaussian, or Merton jump processes were not considered. Additionally, the marked point process was implemented through a kernel-compensated formulation, which, although analytically tractable, may not fully capture the complex jump dynamics observed in the data. Future work could explore these richer jump specifications to potentially improve model fit and better characterise the jump behaviour.
Despite a strong post-calibration fit from both models, residuals exhibit statistically significant autocorrelation, suggesting missing dynamics. We hypothesise that this may be due to simplified jump modelling and fixed regime transitions. Inherent tensions between a cross-section and time series of interest rates could be another possible cause for this misspecification. Future work could extend the framework to incorporate Lévy-type jumps and stochastic regime transitions, and employ advanced filtering methods—such as expectation maximisation (EM) and the Particle filter—to improve state estimation.