Next Article in Journal
Modeling and Forecasting Time-Series Data with Multiple Seasonal Periods Using Periodograms
Previous Article in Journal
Dynamic Interaction Between Microfinance and Household Well-Being: Evidence from the Microcredit Progressive Model for Sustainable Development
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Explosive Episodes and Time-Varying Volatility: A New MARMA–GARCH Model Applied to Cryptocurrencies

by
Alain Hecq
* and
Daniel Velasquez-Gaviria
Department of Quantitative Economics, School of Business and Economics, Maastricht University, 6211 LK Maastricht, The Netherlands
*
Author to whom correspondence should be addressed.
Econometrics 2025, 13(2), 13; https://doi.org/10.3390/econometrics13020013
Submission received: 30 December 2024 / Revised: 11 February 2025 / Accepted: 18 March 2025 / Published: 24 March 2025

Abstract

:
Financial assets often exhibit explosive price surges followed by abrupt collapses, alongside persistent volatility clustering. Motivated by these features, we introduce a mixed causal–noncausal invertible–noninvertible autoregressive moving average generalized autoregressive conditional heteroskedasticity (MARMA–GARCH) model. Unlike standard ARMA processes, our model admits roots inside the unit disk, capturing bubble-like episodes and speculative feedback, while the GARCH component explains time-varying volatility. We propose two estimation approaches: (i) Whittle-based frequency-domain methods, which are asymptotically equivalent to Gaussian likelihood under stationarity and finite variance, and (ii) time-domain maximum likelihood, which proves to be more robust to heavy tails and skewness—common in financial returns. To identify causal vs. noncausal structures, we develop a higher-order diagnostics procedure using spectral densities and residual-based tests. Simulation results reveal that overlooking noncausality biases GARCH parameters, downplaying short-run volatility reactions to news ( α ) while overstating volatility persistence ( β ). Our empirical application to Bitcoin and Ethereum enhances these insights: we find significant noncausal dynamics in the mean, paired with pronounced GARCH effects in the variance. Imposing a purely causal ARMA specification leads to systematically misspecified volatility estimates, potentially underestimating market risks. Our results emphasize the importance of relaxing the usual causality and invertibility assumption for assets prone to extreme price movements, ultimately improving risk metrics and expanding our understanding of financial market dynamics.
JEL Classification:
C22; C14; C58

1. Introduction

Financial returns often exhibit relatively weak autocorrelation in levels but strong and persistent volatility, a pattern that poses notable challenges for conventional linear time series models. Although ARCH-type approaches capture time-varying volatility, standard ARMA processes assume that all autoregressive and moving average roots lie strictly outside the unit circle, thereby excluding the possibility of forward-looking or explosive dynamics. Recent research has emphasized that certain economic or financial time series—particularly those prone to speculative bubbles or local explosions—may benefit from noncausal autoregressive components, wherein some roots reside inside the unit disk (Aguirre & Lobato, 2024; Blasques et al., 2024; Giancaterini & Hecq, 2022; Gouriéroux & Zakoïan, 2017; Hall & Jasiak, 2024; Lanne & Saikkonen, 2011; Lobato & Velasco, 2022; Lof & Nyberg, 2017). In response, the mixed causal–noncausal invertible–noninvertible autoregressive moving average (MARMA) model extends traditional ARMA by allowing both causal and noncausal (as well as invertible and noninvertible) factors in the same framework, thus permitting local explosions consistent with bubble-like phenomena.
A distinctive feature of the MARMA model is that it admits multiple ARMA representations with identical second-order properties. Under Gaussianity, this multiplicity renders the true placement of roots (inside vs. outside the unit disk) unidentifiable through second moments alone, as the resulting ARMA forms share the same variance–covariance structure and spectral density (Breidt et al., 2001). Identification only becomes possible by exploiting higher-order probability structures, including autocorrelations of squared residuals or third- and fourth-order spectral densities. In this sense, noncausality reveals itself through richer nonlinear dynamics that standard causal ARMA models cannot replicate.
Although MARMA models capture nonlinear behavior arising from noncausal feedback, they do not, by themselves, generate the well-known volatility clustering phenomenon commonly observed in financial markets (Breidt et al., 2001; Engle, 1982). For instance, the magnitude of asset returns often displays strong autocorrelation, so that large shocks tend to be followed by further volatility. These dynamics are typically addressed by GARCH processes (Bollerslev, 1986), which model conditional variances as functions of past shocks and past conditional variances. Accordingly, a combined MARMA–GARCH structure can better reproduce the main stylized facts in financial returns: weak autocorrelation in levels, local explosiveness when bubbles arise, and persistent conditional volatility.
Yet introducing a GARCH component into the MARMA framework raises theoretical and practical challenges. Much of the existing MARMA literature assumes i.i.d. errors, while GARCH fundamentally presumes conditional heteroskedasticity in these errors. Although Meitz and Saikkonen (2013) study noninvertible ARMA models with ARCH errors, more general MARMA–GARCH processes with potentially noncausal terms remain relatively unexplored. In this paper, we fill that gap by detailing estimation and identification procedures for MARMA–GARCH processes—including cases where the process is noninvertible, noncausal, or both.
We compare two estimation approaches. The first is Whittle estimation in the frequency domain, which is asymptotically equivalent to Gaussian maximum likelihood for stationary processes with finite variance. The second is time-domain maximum likelihood, which can be more flexible in heavy-tailed situations. While both methods provide consistent and asymptotically normal estimators of the MARMA parameters under stationarity, they differ in their requirements for the GARCH component. Maximum likelihood only demands finite second moments, whereas Whittle estimation requires the existence of up to eight moments—a significant constraint in the presence of high kurtosis or skewness. Our Monte Carlo evidence confirms that maximum likelihood outperforms Whittle estimation for GARCH parameters when faced with heavy-tailed or skewed innovations.
An additional insight from our analysis is that erroneously imposing a purely causal (or purely invertible) representation for the mean may distort GARCH parameter estimates. Specifically, forcing roots outside the unit disk when the true process is partly noncausal can lead to a lower estimated α , implying an understated immediate reaction to new information, and a higher estimated β , suggesting an exaggerated persistence of volatility. Such parameter biases compromise both the fit of the model and the reliability of derived risk measures, such as Value at Risk (VaR).
To illustrate these concepts, we apply the MARMA–GARCH model to daily prices of two major cryptocurrencies, Bitcoin and Ethereum. After detrending the data (via cubic splines or Fourier filtering), we find that noncausal specifications dominate purely causal alternatives and yield more plausible estimates of conditional volatility during bubble-like episodes. In particular, the MARMA( 0 , 1 , 0 , 0 ) structure accommodates future-dependent feedback, while the GARCH(1,1) component captures persistent volatility clustering. By contrast, imposing a causal AR(1) in the mean tends to inflate β and attenuate α , underestimating the susceptibility of volatility to immediate shocks.
The remainder of this paper proceeds as follows: Section 2 develops the MARMA–GARCH model, describes our proposed estimation and identification strategies, and addresses practical issues surrounding data detrending. Section 3 presents a Monte Carlo study and an empirical application to cryptocurrencies, highlighting how noncausal modeling affects GARCH parameter estimates and risk metrics. Finally, Section 4 discusses the main findings and concludes.

2. Materials and Methods

2.1. MARMA( r , s , r , s )–GARCH( p , q ) Processes

Let { Y t : t Z } be a strictly stationary univariate process defined on a probability space ( Ω , F , P ) . We denote it as a mixed causal–noncausal invertible–noninvertible autoregressive moving average generalized autoregressive conditional heteroskedasticity process, abbreviated as
MARMA ( r , s , r , s ) GARCH ( p , q ) .
Specifically, it satisfies
ϕ + ( L ) ϕ * ( L 1 ) Y t = θ + ( L ) θ * ( L 1 ) ε t ,
where
ε t = η t σ t , η t i . i . d . F ( ζ ) , and σ t 2 = ω + α ( L ) ε t 2 + β ( L ) σ t 2 .
The innovations { η t } are assumed i.i.d. with
E [ η t ] = 0 , E [ η t 2 ] = 1 , E [ η t 4 ] < .
Hence, { ε t } is a martingale difference sequence satisfying
E [ ε t ] = 0 , E [ ε t 2 ] = σ t 2 , E [ ε t 3 ] = κ 3 , E [ ε t 4 ] = μ 4 , κ 4 = μ 4 3 σ t 4 .
Throughout, L denotes the backward shift operator, i.e., L k Y t = Y t k for k { 0 , ± 1 , ± 2 , } . Define the following lag polynomials:
ϕ + ( L ) = 1 j = 1 r ϕ j + L j ,
θ + ( L ) = 1 + i = 1 r θ i + L i ,
and their corresponding lead polynomials,
ϕ * ( L 1 ) = 1 k = 1 s ϕ k * L k ,
θ * ( L 1 ) = 1 + m = 1 s θ m * L m .
We collect all parameters in the vector
ϑ = ϕ 1 + , , ϕ r + ; ϕ 1 * , , ϕ s * ; θ 1 + , , θ r + ; θ 1 * , , θ s * C r + s + r + s .
If r = 0 and s = 0 , then the above reduces to a classical causal–invertible ARMA( r , s ) model.
From (1), an equivalent rearrangement is
( 1 j = 1 r ϕ j + L j k = 1 s ϕ k * L k + j = 1 r k = 1 s ϕ j + ϕ k * L j k ) Y t = 1 + i = 1 r θ i + L i + m = 1 s θ m * L m + i = 1 r m = 1 s θ i + θ m * L i m ε t .
We assume any common roots between ϕ + ( z ) ϕ * ( z ) and θ + ( z ) θ * ( z ) are canceled. In the complex domain (substituting L z ), these polynomials become
ϕ + ( z ) = 1 j = 1 r ϕ j + z j , ϕ * ( z ) = 1 k = 1 s ϕ k * z k ,
θ + ( z ) = 1 + i = 1 r θ i + z i , θ * ( z ) = 1 + m = 1 s θ m * z m .
All these factors are assumed to have zeros outside the unit disk. They may be factorized as
ϕ + ( z ) = = 1 r 1 m z ,
ϕ * ( z ) = = r + 1 r + s 1 m z ,
θ + ( z ) = h = 1 r 1 n h z ,
θ * ( z ) = h = r + 1 r + s 1 n h z .
Observe that any root m (or n h ) lying outside the unit circle can equivalently be represented by its reciprocal m 1 (or n h 1 ) inside the unit circle. Flipping a root from outside to inside the unit disk effectively switches a causal factor to a noncausal one (or an invertible factor to a noninvertible one), but does not affect second-order properties such as the autocorrelation function or spectrum. Hence, for Gaussian processes, there exist 2 r + s + r + s alternative factorizations corresponding to the same second-order structure, making them observationally indistinguishable via second-order statistics alone. Under non-Gaussianity, however, higher-order moments and cumulants break this equivalence, thus providing identifiability of the particular causal–noncausal and invertible–noninvertible roots (Hecq & Velasquez-Gaviria, 2024).
  • GARCH Component.
For the GARCH( p , q ) part, let
α ( L ) = u = 1 p α u L u , p 0 ,
β ( L ) = v = 1 q β v L v , q > 0 ,
with ω > 0 and α u , β v > 0 . These restrictions ensure σ t 2 0 . Hence, (2) may be written more explicitly as
σ t 2 = ω + u = 1 p α u ε t u 2 + v = 1 q β v σ t v 2 .
As shown in Engle (1982) and Bollerslev (1986), a squared GARCH( p , q ) sequence can be rewritten as an ARMA(max ( p , q ) , q ) model. From (16), we obtain
ε t 2 = 1 max ( p , q ) α + β ε t 2 = ω + v t = 1 q β v t ,
where
v t = ε t 2 σ t 2 = σ t 2 η t 2 1 .
Since { v t } is a martingale difference (assuming E [ ε t 2 ] < ), it follows that { ε t 2 } satisfies an ARMA(max ( p , q ) , q ) representation:
α + β ( L ) ε t 2 = ω + β ( L ) v t .
Though { v t } is uncorrelated, it is not independent. This equivalence allows GARCH parameters to be estimated via standard ARMA-based techniques.

2.1.1. Causal and Invertible Representation

A MARMA( r , s , r , s ) process typically admits multiple ARMA( p , q ) representations (with p = r + s and q = r + s ) sharing the same second-order structure (Hecq & Velasquez-Gaviria, 2024). A convenient representation for some purposes is the causal and invertible (often termed “pseudo-causal and invertible”) form, in which all roots lie strictly outside the unit disk. Even if the true process is noncausal or noninvertible, this representation yields errors that are uncorrelated (though not necessarily independent) (Breidt et al., 2001).
A common causal–invertible representation is
1 j = 1 r + s ϕ j L j Y t = 1 + i = 1 r + s θ i L i ε t * ,
with parameter vector
β = ϕ 1 , , ϕ r + s , θ 1 , , θ r + s R r + s + r + s .
Causality means there exists an absolutely summable filter { π j } such that
Y t = j = 0 π j ε t j * ,
and invertibility means there is a similar filter { π j * } such that
ε t * = j = 0 π j * Y t j .
In practice, fitting this causal–invertible form (often all-pass under genuine noncausality) can simplify the subsequent extraction of the correct factors and identification of their roots (Hecq & Velasquez-Gaviria, 2024). Misallocating the roots, however, may yield residuals (while uncorrelated) whose squares exhibit dependence differing from the correct GARCH specifications.

2.1.2. Spectral Representations

The spectral representation of a MARMA( r , s , r , s )–GARCH( p , q ) process splits naturally into MARMA and GARCH components. The MARMA part has second-order properties equivalent to those of the corresponding ARMA( p , q ) (Hecq & Velasquez-Gaviria, 2024), while the GARCH component corresponds to an ARMA(max ( p , q ) , q ) model for { ε t 2 } (Bollerslev, 1986).
Let { Y t } be the MARMA( r , s , r , s )–GARCH( p , q ) process satisfying (1). Define z = e i ω . The transfer function for the conditional mean is
ψ ϑ , ω = θ + ( z ) θ * ( z ) ϕ + ( z ) ϕ * ( z ) = 1 + i = 1 r θ i + z i + m = 1 s θ m * z m + i = 1 r m = 1 s θ i + θ m * z i m 1 j = 1 r ϕ j + z j k = 1 s ϕ k * z k + j = 1 r k = 1 s ϕ j + ϕ k * z j k .
The k-th order spectral density is given by
S k ϑ , κ k ; ω 1 , , ω k 1 = κ k ( 2 π ) k 1 = 1 k 1 ψ ( ϑ , ω ) ψ ϑ , = 1 k 1 ω ,
where κ k is the k-th order cumulant of { Y t } , and typically ω = 2 π / T .
Likewise, the transfer function for the conditional variance component (i.e., the ARMA form of { ε t 2 } ) is
g ξ , ω = β ( z ) ( α + β ) ( z ) = 1 v = 1 q β v z v 1 u = 1 max ( p , q ) ( α u + β u ) z u .
Its k-th order spectral density is
S k v ξ , κ k v ; ω 1 , , ω k 1 = κ k v ( 2 π ) k 1 = 1 k 1 g ( ξ , ω ) g ξ , = 1 k 1 ω ,
where κ k v denotes the k-th order cumulant of { v t } = { ε t 2 σ t 2 } .
  • Higher-Order Periodogram.
A sample analog of the k-th order spectral density is the k-th order periodogram:
I k ( ω 1 , , ω k 1 ) = 1 ( 2 π ) k 1 T = 1 k 1 d T ( ω ) d T = 1 k 1 ω ,
where
d T ( ω ) = t = 1 T Y t e i ω t
is the discrete Fourier transform of { Y t } . For k = 2 , (23) reduces to the ordinary periodogram.

2.2. Parameter Estimation and Identification

We now outline a two-stage procedure for estimating and identifying the MARMA ( r , s , r , s ) structure and the associated GARCH ( p , q ) parameters. In the first stage, we fit the mean equation using a causal–invertible ARMA ( p , q ) representation of the MARMA process, factorize its polynomials, and select among the resulting 2 r + s + r + s candidate representations via higher-order diagnostics. In the second stage, we fit the GARCH ( p , q ) dynamics, comparing Whittle-based and maximum likelihood approaches.

2.2.1. Two-Step Approach

  • First Step (MARMA Identification).
    • Fit a Causal–Invertible ARMA ( p , q ) . Estimate the orders p and q (where p = r + s and q = r + s ) using information criteria (AIC, BIC) and significance tests. Treat the process as if it were causal and invertible, which allows for a standard ARMA( p , q ) fit in the frequency or time domain (e.g., Whittle or Gaussian maximum likelihood).
    • Redistribute Roots via Factorization. Once the ARMA( p , q ) fit is obtained, factorize its polynomials and systematically explore all 2 r + s + r + s possible root allocations, corresponding to distinct causal/noncausal and invertible/noninvertible configurations. In practice, this step involves flipping certain roots inside/outside the unit disk (see Section 2.1).
    • Select Final Representation. For each candidate, compute residual diagnostics such as higher-order spectral measures (Hecq & Velasquez-Gaviria, 2022, 2024; Velasco & Lobato, 2018) or a portmanteau-type test on { ε t } and { ε t 2 } . The representation with the weakest remaining serial dependence (in both levels and squares) is chosen as the final MARMA ( r , s , r , s ) specification.
  • Second Step (GARCH(p,q) Estimation).
    • Estimate GARCH ( p , q ) . Using the final residual series { ε t } from the identified MARMA model, estimate the parameters { ω , α u , β v } of a GARCH ( p , q ) process (commonly GARCH ( 1 , 1 ) in many financial applications).
    • Whittle vs. MLE. We compare frequency-domain Whittle estimation (which typically requires higher-order moment assumptions) against time-domain maximum likelihood methods (which may hold under weaker moment conditions). Our Monte Carlo experiments suggest that MLE often exhibits smaller bias and variance, although Whittle remains a practical alternative if the required moments exist.

2.2.2. Estimation and Identification of the MARMA( r , s , r , s ) Process

We begin with the causal–invertible ARMA( p , q ) representation that shares second-order properties with the MARMA ( r , s , r , s ) process; see Section 2.1 and Fries and Zakoïan (2019). Suppose we have observations { Y t } t = 1 T . Under mild regularity conditions (stationarity, finite fourth moments), Whittle estimation of the ARMA parameters is asymptotically equivalent to maximizing the Gaussian likelihood, because (i) the discrete Fourier transforms of { Y t } converge in distribution to complex normal variables, and (ii) the periodogram converges in probability to the underlying second-order spectral density.
  • Whittle Estimation.
Let
β 0 = ϕ 0 , 1 , , ϕ 0 , r + s , θ 0 , 1 , , θ 0 , r + s
denote the true parameter vector in the causal–invertible ARMA( p , q ) representation. Recall from (21) that S 2 ( β , κ 2 , ω ) denotes the second-order (power) spectral density of { Y t } . The Whittle criterion is
W ( β ) = = 1 T / 2 log S 2 ( β , κ 2 ; ω ) + I 2 ( ω ) S 2 ( β , κ 2 ; ω ) ,
where I 2 ( ω ) is the usual periodogram at frequency ω . The Whittle estimator is
β ^ = arg min β B W β .
Provided E [ Y t 4 ] < , standard theorems (e.g., Hannan, 1973) imply
β ^ p β 0 , and T β ^ β 0 d N 0 , V ( β 0 ) ,
where the asymptotic covariance matrix can be approximated by the inverse of
I ( β 0 ) = 1 4 π π π 1 S 2 ( β 0 , κ 2 , ω ) 2 S 2 ( β 0 , κ 2 , ω ) β S 2 ( β 0 , κ 2 , ω ) β d ω .
  • Redistributing Roots.
After obtaining β ^ , we factorize the fitted polynomials and move each root inside or outside the unit disk, as described in (10)–(13), thereby generating all 2 r + s + r + s candidates consistent with β ^ . We select the final model (i.e., the correct allocation of causal/noncausal and invertible/noninvertible factors) by finding the root configuration that most reduces higher-order correlation in the residuals (see Section 2.1 and, e.g., Velasco and Lobato (2018) for third/fourth-order diagnostics). Formally, one may minimize W ( β ) over this discrete set of admissible factorizations or rely on higher-order tests outlined below.

2.2.3. Spectral Identification Function

To incorporate higher-order dynamics, Velasco and Lobato (2018) propose combining the third- and fourth-order spectral densities into a single loss function. For k = 3 or 4, define
L k T ( ϑ ) = ( 2 π ) k 2 2 k T k 1 j = 1 T 1 I k ( ω 1 , , ω k 1 ) S k ϑ , κ k ( ϑ ) ; ω 1 , , ω k 1 2 = 1 k 1 ψ ( β , ω ) ψ β , = 1 k 1 ω 2 ,
where j = ( j 1 , , j k 1 ) indexes all relevant frequency combinations. Let
L T ( ϑ ) = L 3 T ( ϑ ) + L 4 T ( ϑ ) .
Minimizing L T ( ϑ ) over ϑ helps detect non-Gaussian structures (e.g., nonlinearity, noncausality) not captured by second-order spectral analysis. In the Gaussian case ( k = 2 ), one recovers the usual Whittle function.

2.2.4. Portmanteau-Type Tests

We also employ a simplified joint portmanteau test (cf. Dalla et al., 2020; Jasiak & Neyazi, 2023) that targets both linear and nonlinear dependencies:
ρ ε ( k ) = 0 and ρ ε 2 ( k ) = 0 ( k 1 ) ,
where ρ ε ( k ) is the autocorrelation of the residuals { ε t } and ρ ε 2 ( k ) is the autocorrelation of their squares. Define
J ε , ε 2 , k = T 2 T k ρ ^ ε ( k ) 2 + ρ ^ ε 2 ( k ) 2 ,
and the cumulative statistic
C ε , ε 2 , m = k = 1 m J ε , ε 2 , k .
Under the null hypothesis of i.i.d. residuals, J ε , ε 2 , k converges in distribution to χ 2 2 , and C ε , ε 2 , m to χ 2 m 2 . The preferred model is the one for which the null is not rejected at typical significance levels (or, among those rejected, yields the smallest value of C ε , ε 2 , m ).

2.2.5. Whittle Estimation for the GARCH( p , q ) Model

Whittle estimation of GARCH models was introduced by Giraitis et al. (2000), leveraging the ARMA(max ( p , q ) , q ) representation in (18). One key distinction is that Whittle methods for the GARCH process typically require higher-order moments: E [ Y t 8 ] < in the original treatment. Later, Mikosch and Straumann (2002) showed that for squared GARCH processes, existence of the fourth moment suffices, though convergence is often slower than under maximum likelihood.
  • Whittle Criterion for GARCH.
Let
ξ 0 = α 0 , 1 , , α 0 , p , β 0 , 1 , , β 0 , q
be the true GARCH( p , q ) parameter vector (excluding ω ). From (22), denote by S 2 v ( ξ , κ 2 v , ω ) the power spectrum of { ε t 2 } . The Whittle function becomes
W ξ = = 1 T / 2 log S 2 v ( ξ , κ 2 v ; ω ) + I 2 ( ω ) S 2 v ( ξ , κ 2 v ; ω ) ,
where I 2 ( ω ) is now the periodogram of { ε t 2 } . The Whittle estimator is
ξ ^ = arg min ξ Ξ W ξ .
Because this method uses only { ε t 2 } , the latent { σ t 2 } sequence need not be explicitly computed, and no initial values for σ t 2 are required. In practice, ω can be estimated via the sample variance:
var ( Y t ) = ω 1 u = 1 max ( p , q ) ( α u + β u ) .
However, if the mean { Y t } is noncausal or noninvertible, it remains an open question whether the classic Whittle approach to the GARCH component is fully robust, since the conditional variance depends only on past shocks, whereas the MARMA mean may incorporate lead terms.

2.2.6. Maximum Likelihood Estimation for the GARCH( p , q ) Model

Maximum likelihood (or quasi-maximum likelihood) estimation is the standard for GARCH models and requires weaker moment conditions than the Whittle approach (Francq & Zakoian, 2019). In particular, normal QMLE imposes only the existence of the second moment, whereas Whittle typically requires up to the fourth moment (Mikosch & Straumann, 2002).
  • Likelihood Function.
Let
ξ = α 1 , , α p , β 1 , , β q R p + q ,
excluding ω . Conditional on initial values, the log-likelihood (assuming Gaussian errors) is
log Lik T ( ξ ) = t = 1 T 1 2 log 2 π 1 2 log σ ˜ t 2 ( ξ ) ε t 2 2 σ ˜ t 2 ( ξ ) ,
where
σ ˜ t 2 ( ξ ) = ω + u = 1 p α u ε t u 2 + v = 1 q β v σ ˜ t v 2 .
The MLE is
ξ ^ T = arg max ξ Ξ log Lik T ( ξ ) .
Under stationarity and finite variance, ξ ^ T is consistent and asymptotically normal, with asymptotic covariance given by the inverse Fisher information. Empirical work often finds that MLE (or QMLE) outperforms Whittle if the moment requirements of the latter are not comfortably satisfied.
Because many financial time series (e.g., equity prices) appear nonstationary, one typically detrends or works with returns rather than levels. In the following section, we outline standard approaches for differencing or transforming { Y t } to achieve approximate stationarity before MARMA–GARCH modeling.

2.3. Detrending Methods

Many economic and financial time series exhibit trends that must be removed before modeling. Whether a trend is stochastic or deterministic has important implications for subsequent inference, and different detrending methods can alter the resulting dynamics (Aguirre & Lobato, 2024; Fries & Zakoïan, 2019; Hencic & Gouriéroux, 2015). In practice, formal unit-root tests (e.g., augmented Dickey–Fuller or Phillips–Perron) guide this decision, but their conclusions can be sensitive to lag-length choices and might not always reflect the true data-generating process.
When a series is believed to have a stochastic trend, simple differencing or a log-transformation is commonly applied. However, excessive differencing can remove meaningful low-frequency information and potentially introduce artificial serial dependence. Conversely, for a deterministic trend, one typically estimates a low-order polynomial (linear, quadratic, cubic) or employs more flexible smoothing (e.g., spline fitting). Each approach influences the residual dynamics that remain for MARMA–GARCH estimation.
The literature offers many possibilities: for instance, Hencic and Gouriéroux (2015) used a cubic polynomial trend, Fries and Zakoïan (2019) and Aguirre and Lobato (2024) relied on linear trends or simple mean-centering, while Hecq et al. (2017); Hecq and Voisin (2023) used seasonal filters (X–11 or Hodrick–Prescott). More recently, Blasques et al. (2023) proposed filtering that separately models a random-walk (fundamental) component and a bubble component.
  • Our Implementation.
Here, we discuss two flexible methods for detrending: cubic splines and Fourier detrending. Each seeks to remove low-frequency components, thus leaving a (approximately) stationary series suitable for MARMA–GARCH modeling. In both cases, the knot placements (for splines) or the number of frequencies removed (in Fourier detrending) are ultimately user choices and can influence the estimated dynamics. Consequently, we treat these methods as a preprocessing step before specifying and fitting the MARMA–GARCH model.

2.3.1. Cubic Splines

Cubic splines provide a smooth, piecewise polynomial fit to the observed data. Let { Y t } t = 1 T be the original series. A cubic spline  S ( · ) is a function defined by separate cubic polynomials on each sub-interval (or “knot”):
S ( Y ) = a t + b t Y Y t + c t Y Y t 2 + d t Y Y t 3 , Y [ Y t , Y t + 1 ] ,
for t = 1 , , T 1 . The coefficients { a t , b t , c t , d t } are chosen so that the spline and its first two derivatives are continuous at each knot. Common boundary conditions are S ( Y 1 ) = 0 and S ( Y T ) = 0 .
If one partitions { Y t } into n knots, the spline coefficients can be estimated by solving a tridiagonal system:
h t 1 6 M t 1 + h t 1 + h t 3 M t + h t 6 M t + 1 = Y t + 1 Y t h t Y t Y t 1 h t 1 ,
where M t = S ( Y t ) and h t = Y t + 1 Y t . Once { M t } are found, the polynomial coefficients follow straightforwardly:
a t = Y t , b t = Y t + 1 Y t h t h t 3 2 M t + M t + 1 , c t = M t , d t = M t + 1 M t 3 h t .
The fitted spline defines a smooth trend t ( Y ) . The detrended observations are then
Y t ( detrend ) = Y t t ( Y t ) .
In our numerical examples, we follow Hall and Jasiak (2024) by using a second-order spline with one knot every 30 observations, but in practice, the choice of knot frequency can be refined using graphical diagnostics or information criteria.

2.3.2. Fourier Detrending

Fourier detrending filters out low-frequency components of the discrete Fourier transform (DFT). Let
d T ( ω ) = t = 1 T Y t e i t ω
be the DFT of { Y t } , with frequencies ω = 2 π T for = 0 , 1 , , T 1 . Suppose one identifies = 1 , , D (and by symmetry, = T D + 1 , , T ) as the “lowest” and “highest” frequencies capturing long-run or trend variations. Following Lobato and Velasco (2022), we zero out these components:
d T ( ω ) = 0 for = 1 , , D and = T D + 1 , , T .
Then, an inverse DFT yields the detrended series:
Y t ( detrend ) = 1 T = 0 T 1 d T ( ω ) e i ω t ,
where ( · ) denotes the real part. The integer D controls how many frequencies are removed, and hence determines the amount of low-frequency content retained in { Y t ( detrend ) } . Typically, one inspects the periodogram or employs domain knowledge to choose D.
Both methods—cubic splines and Fourier filtering—serve to remove gradual trends or low-frequency cycles. In subsequent sections, we apply these detrended data to the MARMA–GARCH estimation procedures described above (Section 2.1 and Section 2.2). As with any filtering, the choice of knot frequency or frequency cutoffs can subtly alter the remaining dynamics; hence, we emphasize that careful empirical checking, including residual diagnostics, is essential to ensure an appropriate trend-removal strategy for a given application.

3. Results

3.1. Simulation Study

In this section, we investigate and compare frequency-domain (Whittle) and time-domain (maximum likelihood) estimation methods for the GARCH component of a MARMA ( r , s , r , s ) –GARCH ( p , q ) model. While the properties of Whittle estimation for MARMA processes have been studied by Hecq and Velasquez-Gaviria (2024), our focus here is on the GARCH parameters.
  • Simulation Design.
To generate a sample { Y t } t = 1 T from a MARMA ( r , s , r , s ) –GARCH ( p , q ) process, we follow these steps:
  • Generate Innovations: Draw an i.i.d. sequence { η t } of length T from a chosen distribution F ( ζ ) (e.g., Student’s t, skew-t, or exponential), with E [ η t ] = 0 and Var ( η t ) = 1 .
  • Simulate GARCH( p , q ) Variance: Initialize σ 1 2 > 0 . Recursively compute
    σ t 2 = ω + u = 1 p α u η t u 2 σ t u 2 + v = 1 q β v σ t v 2 , t = 2 , , T ,
    ensuring { σ t 2 } remains strictly positive.
  • Form GARCH Innovations:
    ε t = η t σ t , t = 1 , , T .
  • Fourier Representation of Residuals: Compute the DFT of { ε t } :
    d ε ( ω j ) = t = 1 T ε t e i t ω j , ω j = 2 π j T , j = 0 , 1 , , T 1 .
  • MARMA Transfer Function: Select parameters ϑ for the MARMA ( r , s , r , s ) part. The frequency-domain transfer function is
    ψ ( ϑ , ω ) = θ + ( z ) θ * ( z ) ϕ + ( z ) ϕ * ( z ) , z = e i ω .
  • Construct the MARMA Process: Multiply the DFT of the innovations by the transfer function:
    d T ( ω j ) = d ε ( ω j ) × ψ ( ϑ , ω j ) .
  • Inverse DFT (Real Part):
    Y t = 1 T j = 0 T 1 d T ( ω j ) e i t ω j , t = 1 , , T .
    This final step yields { Y t } , a realization from the specified MARMA ( r , s , r , s ) –GARCH ( p , q ) process, provided stationarity and positivity conditions are met.
Figure 1 displays a single simulated trajectory of a MARMA(1,1,0,0)–GARCH(1,1) process, including its conditional variance { σ t 2 } and the resulting time series { Y t } . The conditional variance in the top panel exhibits clusters of high volatility and strong responses to shocks. The bottom panel reveals nonlinear dynamics in { Y t } , such as localized explosions and volatility bursts, reminiscent of real-world financial markets.

3.1.1. Whittle vs. Maximum Likelihood for Pure GARCH( p , q )

We first assess the estimation accuracy for a martingale difference GARCH(1,1) process (i.e., no mean structure). Table 1 reports the bias and RMSE of the Whittle and maximum likelihood (ML) estimators for parameters ω , α , β . We consider several distributions (exponential, Student’s t with different degrees of freedom, and skew-t) and three sample sizes.
The ML estimator achieves lower bias and RMSE than the Whittle estimator in all cases, and this gap becomes more pronounced for heavier-tailed distributions (Student’s t with lower degrees of freedom or skew-t). Whittle’s heavier moment requirements appear to render it more sensitive to large outliers. Overall, ML estimation delivers more reliable GARCH parameter estimates, a finding that aligns with common practices in financial econometrics.

3.1.2. Estimating MARMA( r , s , r , s )–GARCH( p , q )

We next compare how parameter estimates differ when fitting a causal ARMA representation (e.g., AR(2) for a MARMA(1,1,0,0)) versus the true noncausal specification. In particular, Table 2 considers a data-generating process (DGP) of MARMA(1,1,0,0)–GARCH(1,1), but estimates parameters under both (i) AR(2)–GARCH(1,1) and (ii) MARMA(1,1,0,0)–GARCH(1,1). Although both mean specifications generate uncorrelated residuals, higher-order statistics differ, affecting the estimated GARCH parameters.
When the mean is incorrectly constrained to a causal AR(2) form (left panel), { ε t } are uncorrelated but contain higher-order dependencies that invalidate the correct GARCH structure (Hecq & Velasquez-Gaviria, 2024). This mismatch translates into biased estimates of ω , α , β . By contrast, the correctly specified MARMA(1,1,0,0) representation (right panel) leads to parameter estimates that converge accurately on the true values.
Figure 2 illustrates this difference more clearly. Estimates from the AR(2)–GARCH(1,1) representation (left boxplots) systematically overestimate ω and α while underestimating β . In financial terms, this translates to higher perceived long-term volatility and an exaggerated immediate reaction to shocks, but less persistence. In contrast, the correct MARMA(1,1,0,0)–GARCH(1,1) fit (right boxplots) recovers nearly unbiased estimates and narrower spreads with a growing sample size.
These Monte Carlo experiments produce two key findings:
  • Whittle vs. MLE for GARCH: Maximum likelihood estimation more accurately recovers the GARCH parameters, particularly in heavy-tailed environments, while Whittle estimation exhibits larger bias and RMSE.
  • Importance of Correct MARMA Specification: Using a purely causal ARMA representation when the data are partially noncausal can distort the higher-order structure and lead to biased GARCH estimates. Correctly assigning the roots to causal/noncausal polynomials (as in a MARMA model) mitigates this issue.
These points are particularly relevant in financial applications, where heavy tails and bubbles or local explosiveness can invalidate causal ARMA assumptions. In such contexts, careful attention to model specification and estimation methods is crucial for reliable volatility forecasts and risk assessments.

3.2. Empirical Applications in Cryptocurrencies

We next illustrate the proposed MARMA–GARCH framework using daily data for two prominent cryptocurrencies, Bitcoin (BTC) and Ethereum (ETH), over the sample period January 2021–July 2024. These assets have been noted for their extreme volatility (Cheah & Fry, 2015; Corbet et al., 2017; Hafner, 2018), heavy tails and clustering in returns (Ardia et al., 2019), and bubble-like local explosions (Hall & Jasiak, 2024). As a result, cryptocurrencies provide a natural testing ground for models that allow for noncausal (forward-looking) dynamics.
Figure 3 plots the daily USD prices of Bitcoin and Ethereum. Both assets experienced significant peaks during 2021—reaching approximately USD 60,000 (BTC) and USD 4000 (ETH)—fueled by institutional adoption. Regulatory constraints, including a direct ban in China, contributed to a sharp drawdown by mid-2021, reinforcing the perception that these markets can be subject to speculative bubbles (Bazán-Palomino, 2022). Further volatility ensued in 2022 amid macroeconomic uncertainties (notably inflationary pressures and geopolitical tensions), with partial recoveries in late 2023 tied to central bank policy signals and a growing array of crypto-derivative products.
Although changes in regulation, macro news, and investor sentiment have been widely cited as drivers of cryptocurrency volatility, these markets also appear to exhibit persistent departures from pure random-walk behavior. Recurrent episodes of rapid price surges followed by abrupt declines suggest potential noncausal features—i.e., future expectations or bubble-like behavior—making this domain suitable for our MARMA–GARCH methodology.
  • Detrending.
Both BTC and ETH prices are nonstationary, exhibiting long-run growth punctuated by sharp local explosions. As detailed in Section 2.3, we use two detrending techniques to remove low-frequency components while preserving short-run dynamics:
  • Cubic Splines: Following Hall and Jasiak (2024), we employ a piecewise cubic spline with knots every 30 days. Additional experiments (e.g., quadratic vs. cubic, varying knot frequencies) suggested that a 30-day cubic spline balances flexibility with parsimony for these data.
  • Fourier Filtering: We remove the lowest and highest 32 frequency components in the discrete Fourier transform of { Y t } , corresponding to low-frequency trends and ultra-long cycles (Lobato & Velasco, 2022). This approach effectively zeroes out the first (and last) 32 harmonics before an inverse transform.
Figure 4 and Figure 5 depict the resulting detrended series for Bitcoin and Ethereum under both spline and Fourier filtering. Table 3 contrasts descriptive statistics of the raw and detrended data. While both methods yield mean-zero series, the Fourier-detrended versions manifest higher kurtosis, indicating that subtracting low-frequency oscillations may amplify outlier effects at shorter horizons. Notably, both detrending approaches leave intact the marked skewness and heavy tails often reported in cryptocurrency markets (Ardia et al., 2019).
  • Fitting MARMA–GARCH.
Once detrended, each series displays weak autocorrelation at the first or second lag. For both Bitcoin and Ethereum, AIC/BIC selection suggests an AR(1) model for the conditional mean if one imposes a purely causal structure. However, to allow for potential noncausality, we invert the root(s) of the AR polynomial and compare all possible causal–noncausal configurations (see Section 2.1 and Section 2.2).
In all cases, the data favor a fully noncausal representation, MARMA ( 0 , 1 , 0 , 0 ) . Concretely, the estimated parameter ϕ ^ * 0.825 indicates strong feedback from future expectations or bubble-like dynamics (Hall & Jasiak, 2024). The residuals from this MARMA(0,1,0,0) fit (in both frequency and time domains) pass higher-order spectral diagnostics (Velasco & Lobato, 2018) and a portmanteau-type test on raw and squared residuals.
  • GARCH(1,1) Estimation.
We then fit a GARCH(1,1) model to the residuals { ε ^ t } , comparing the following:
MARMA ( 0 , 1 , 0 , 0 ) GARCH ( 1 , 1 ) versus AR ( 1 ) GARCH ( 1 , 1 ) .
Table 4 summarizes the estimation results under both spline and Fourier detrending. Three primary findings emerge:
  • Long-Run Volatility (ω): The noncausal MARMA–GARCH model consistently yields higher estimates of ω than the causal AR(1)–GARCH(1,1). Hence, ignoring noncausality may understate the baseline variance level.
  • Reactivity to Shocks (α): The noncausal specification produces larger α , suggesting that these cryptocurrencies exhibit more immediate responses to new information when future expectations (or bubble-like feedback) are accounted for.
  • Volatility Persistence (β): Reflecting the stronger short-run reactivity, β is lower in the MARMA–GARCH model, implying reduced persistence once the noncausal feedback is captured.
Figure 6 compares the conditional volatility estimates for BTC and ETH. The MARMA(0,1,0,0)–GARCH(1,1) model (red lines) reacts more sharply during volatile episodes (2021 and 2022), yet tends to be lower in tranquil periods. By contrast, the AR(1)–GARCH(1,1) model (black lines) smooths out these extremes, potentially understating tail risks in real-time risk management contexts.
  • Diagnostic Checks.
Table 4 also reports Ljung–Box test p-values (on raw residuals) and McLeod–Li test p-values (on squared residuals) for both the MARMA–GARCH and AR–GARCH fits. Across multiple lags, neither model shows significant serial correlation in the final residuals, indicating that both representations adequately capture second-order dependencies. However, only the MARMA–GARCH specification is consistent with the higher-order spectral tests, reinforcing the conclusion that noncausal dynamics are present.
Although we obtain broadly consistent results under both the spline and Fourier methods, our analysis reveals that detrending can alter higher-order moments and thus affect the identification of noncausal roots. In practice, it remains crucial to verify that the chosen method for removing low-frequency components does not inadvertently remove important cyclical or bubble-like behaviors. For the Bitcoin and Ethereum data examined here, a 30-day spline and a low-frequency cutoff of 32 harmonics yield quite similar outcomes.
These empirical findings highlight that allowing for noncausal (forward-looking) dynamics in cryptocurrency prices can substantially affect the estimated volatility process. Specifically, in MARMA(0,1,0,0)–GARCH(1,1) models, shocks are met with higher immediate volatility responses and a somewhat lower persistence of volatility clustering, relative to a standard AR(1)–GARCH(1,1). This may be interpreted as evidence of speculative or bubble elements, where future price expectations significantly amplify immediate volatility reactions. From a risk-management perspective, such models potentially offer more realistic scenarios of rapid volatility escalation in cryptocurrency markets—an important consideration for both institutional investors and regulators.
Allowing for noncausal (forward-looking) components in cryptocurrency price dynamics leads to a larger estimated immediate impact of shocks ( α ) and higher overall volatility levels ( ω ), while reducing persistence ( β ). In practical terms, a MARMA–GARCH approach may provide more realistic volatility forecasts during bubble-like expansions and subsequent crashes, helping financial practitioners better gauge tail risk. Of course, model identification remains sensitive to the detrending method and the assumption of stationarity after trend removal; thus, careful robustness checks are recommended.
Our analysis suggests that both Bitcoin and Ethereum exhibit strong noncausal features and that GARCH estimates can be substantially biased if one imposes a purely causal ARMA structure. By contrast, MARMA–GARCH captures higher-order dynamics and more pronounced reactivity to new information. The results confirm the practical relevance of noncausal modeling in markets prone to speculative cycles, with repercussions for volatility forecasting and risk management.

4. Discussion and Conclusions

This paper has introduced a new MARMA–GARCH model that accounts for bubble-like (explosive) episodes in financial time series together with time-varying conditional volatility. The central motivation is to allow for noncausal (forward-looking) components in the mean dynamics while simultaneously capturing ARCH-type effects in the variance. This joint specification becomes particularly relevant during speculative booms and busts, when prices can exhibit abrupt run-ups and subsequent collapses.
A key contribution lies in showing how to factorize the AR and MA polynomials in a manner that places one or more roots inside the unit disk, thereby capturing the noncausal dimension. By relaxing the typical ARMA restriction that all roots lie outside the unit circle, the MARMA model can generate local explosiveness or feedback from future expectations. This capacity to accommodate bubble-like behavior is rare in standard time series approaches, which generally assume causality. In addition, the GARCH component models time-varying volatility in a way that remains valid under noncausal mean dynamics.
With regard to estimation, two broad approaches were proposed. The first is a frequency-domain method based on the Whittle likelihood, which is asymptotically equivalent to maximizing the Gaussian likelihood in the presence of finite variance. The second is time-domain maximum likelihood, which can be particularly advantageous for heavy-tailed or skewed return distributions. While both methods deliver broadly consistent results for the MARMA portion of the model (assuming stationarity and finite variance), the simulation evidence indicates that time-domain maximum likelihood provides more robust estimation of the GARCH parameters in the presence of high kurtosis or significant skewness. This finding reflects the greater flexibility of maximum likelihood estimation in accommodating non-Gaussian features—a salient feature in many financial contexts.
An important insight is that a misspecified mean equation can bias the estimated parameters of the GARCH process. In particular, forcing the mean to be a causal AR(1) when it is, in fact, partially or fully noncausal leads to underestimating the immediate response to shocks (the α parameter) and overstating the degree of volatility persistence (the β parameter). Consequently, the model may fail to capture swift volatility reactions to new information and systematically underestimate tail risk. Such biases have material consequences for practitioners and regulators who rely on these models for real-time risk management, portfolio hedging, and broader policy decisions.
Applying the MARMA–GARCH framework to two major cryptocurrencies, Bitcoin and Ethereum, highlights its empirical relevance. After detrending the data using both cubic splines and Fourier filters, we observe that noncausal specifications consistently outperform purely causal representations. The noncausal character, embodied in a MARMA( 0 , 1 , 0 , 0 ) structure, captures bubble-like episodes that standard ARMA models treat as anomalous volatility clusters. The associated GARCH estimates reveal stronger shock responsiveness (higher α ) and lower persistence ( β ) under the noncausal model, suggesting that bubble dynamics alter the interplay between incoming news and volatility. Such findings resonate with the high levels of speculation and price explosiveness often witnessed in crypto markets.
Notwithstanding these strengths, several caveats merit attention. First, both Whittle and Gaussian likelihood methods require at least finite second (and ideally higher) moments. Markets with extremely heavy tails could violate such assumptions. Second, the detection of noncausal roots hinges on the presumption that the data become stationary post detrending. Different detrending choices (e.g., polynomial, spline, or frequency-based) can shift the low-frequency components and affect the subsequent root factorization. Third, the method could be extended to more complex scenarios, including structural breaks, vector autoregressions, or regime-switching volatility, all of which may be essential in certain market environments.
Looking ahead, the MARMA–GARCH approach invites further research on the role of anticipatory or bubble-related behavior in a broader class of assets, including equities, commodities, and housing markets. Its capacity to depict forward-looking price formation can potentially elucidate herding phenomena, investor sentiment cycles, and other behavioral dynamics that conventional ARMA–GARCH models may fail to capture. From a methodological standpoint, future investigations might explore distributional assumptions beyond standard parametric families, such as robust filters or mixture distributions, to accommodate the pronounced skewness and kurtosis frequently encountered in high-volatility markets.
  • Practical and Societal Implications.
The findings in this study have notable implications for financial practitioners, regulatory authorities, and market participants more generally. By explicitly modeling the noncausal behavior, the MARMA–GARCH framework enhances the detection of bubble-like episodes and sharp downturns, thus improving the short-run forecasts of volatility during turmoil periods. This refinement is crucial in risk management applications, where underestimation of volatility can lead to inadequate capital reserves and margin requirements. Regulators may find value in an approach that isolates forward-looking or speculative episodes, thereby enabling more effective macroprudential oversight. Finally, a better grasp of bubble mechanisms and extreme volatility also bears on societal concerns: overly leveraged investors, pension funds, and retail traders can be severely impacted by rapid price collapses, and the systemic repercussions of such events can extend to broader economic stability. By offering a more nuanced model of how future expectations amplify shock impacts, the MARMA–GARCH framework contributes to a deeper, more informed discourse on the interplay between speculation, volatility, and financial market integrity.

Author Contributions

Conceptualization, A.H. and D.V.-G.; methodology, D.V.-G.; validation, A.H.; formal analysis, D.V.-G.; investigation, D.V.-G.; writing—original draft preparation, D.V.-G.; writing—review and editing, A.H. and D.V.-G.; supervision, A.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All data used in this study were obtained from Yahoo Finance and are publicly available at https://finance.yahoo.com.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Aguirre, A., & Lobato, I. N. (2024). Evidence of non-fundamentalness in OECD capital stocks. Empirical Economics, 67(2), 761–772. [Google Scholar]
  2. Ardia, D., Bluteau, K., & Rüede, M. (2019). Regime changes in Bitcoin GARCH volatility dynamics. Finance Research Letters, 29, 266–271. [Google Scholar] [CrossRef]
  3. Bazán-Palomino, W. (2022). Interdependence, contagion and speculative bubbles in cryptocurrency markets. Finance Research Letters, 49, 103132. [Google Scholar]
  4. Blasques, F., Koopman, S. J., & Mingoli, G. (2023). Observation-driven filters for time-series with stochastic trends and mixed causal non-causal dynamics (Tech. Rep.). Tinbergen Institute Discussion Paper. [Google Scholar]
  5. Blasques, F., Koopman, S. J., Mingoli, G., & Telg, S. (2024). A novel test for the presence of local explosive dynamics (Tech. Rep.). Tinbergen Institute Discussion Paper. [Google Scholar]
  6. Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31(3), 307–327. [Google Scholar]
  7. Breidt, F. J., Davis, R. A., & Trindade, A. A. (2001). Least absolute deviation estimation for all-pass time series models. The Annals of Statistics, 29(4), 919–946. [Google Scholar]
  8. Cheah, E.-T., & Fry, J. (2015). Speculative bubbles in Bitcoin markets? An empirical investigation into the fundamental value of Bitcoin. Economics Letters, 130, 32–36. [Google Scholar] [CrossRef]
  9. Corbet, S., Meegan, A., Larkin, C., Lucey, B., & Yarovaya, L. (2017). Exploring the dynamic relationships between cryptocurrencies and other financial assets. Economics Letters, 165, 28–34. [Google Scholar] [CrossRef]
  10. Dalla, V., Giraitis, L., & Phillips, P. C. (2020). Robust tests for white noise and cross-correlation. Econometric Theory, 38(5), 913–941. [Google Scholar]
  11. Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica: Journal of the Econometric Society, 50, 987–1007. [Google Scholar]
  12. Francq, C., & Zakoian, J.-M. (2019). GARCH models: Structure, statistical inference and financial applications. John Wiley & Sons. [Google Scholar]
  13. Fries, S., & Zakoïan, J.-M. (2019). Mixed causal-noncausal AR processes and the modelling of explosive bubbles. Econometric Theory, 35(6), 1234–1270. [Google Scholar]
  14. Giancaterini, F., & Hecq, A. (2022). Inference in mixed causal and noncausal models with generalized Student’s t-distributions. Econometrics and Statistics, 33, 1–12. [Google Scholar] [CrossRef]
  15. Giraitis, L., Robinson, P. M., & Surgailis, D. (2000). A model for long memory conditional heteroscedasticity. Annals of Applied Probability, 10, 1002–1024. [Google Scholar]
  16. Gouriéroux, C., & Zakoïan, J.-M. (2017). Local explosion modelling by non-causal process. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(3), 737–756. [Google Scholar] [CrossRef]
  17. Hafner, C. M. (2018). Hedging properties of Bitcoin: A GARCH-extended analysis. Finance Research Letters, 30, 151–155. [Google Scholar]
  18. Hall, M. K., & Jasiak, J. (2024). Modelling common bubbles in cryptocurrency prices. Economic Modelling, 139, 106782. [Google Scholar] [CrossRef]
  19. Hannan, E. J. (1973). The asymptotic theory of linear time-series models. Journal of Applied Probability, 10(1), 130–145. [Google Scholar] [CrossRef]
  20. Hecq, A., Telg, S., & Lieb, L. (2017). Do seasonal adjustments induce noncausal dynamics in inflation rates? Econometrics, 5(4), 48. [Google Scholar] [CrossRef]
  21. Hecq, A., & Velasquez-Gaviria, D. (2022). Spectral estimation for mixed causal-noncausal autoregressive models. arXiv, arXiv:2211.13830. [Google Scholar] [CrossRef]
  22. Hecq, A., & Velasquez-Gaviria, D. (2024). Spectral estimation for mixed causal-noncausal autoregressive models. Econometric Reviews, 1–24. [Google Scholar] [CrossRef]
  23. Hecq, A., & Voisin, E. (2023). Predicting bubble bursts in oil prices during the COVID-19 pandemic with mixed causal-noncausal models. In Essays in honor of Joon Y. Park: Econometric methodology in empirical applications (Vol. 45, pp. 209–233). Emerald Group Publishing Limited. [Google Scholar]
  24. Hencic, A., & Gouriéroux, C. (2015). Noncausal autoregressive model in application to bitcoin/USD exchange rates. In Econometrics of risk (pp. 17–40). Springer. [Google Scholar]
  25. Jasiak, J., & Neyazi, A. M. (2023). GCov-Based Portmanteau Test. arXiv, arXiv:2312.05373. [Google Scholar]
  26. Lanne, M., & Saikkonen, P. (2011). Noncausal autoregressions for economic time series. Journal of Time Series Econometrics, 3(3). [Google Scholar]
  27. Lobato, I. N., & Velasco, C. (2022). Single step estimation of ARMA roots for nonfundamental nonstationary fractional models. The Econometrics Journal, 25(2), 455–476. [Google Scholar]
  28. Lof, M., & Nyberg, H. (2017). Noncausality and the commodity currency hypothesis. Energy Economics, 65, 424–433. [Google Scholar]
  29. Meitz, M., & Saikkonen, P. (2013). Maximum likelihood estimation of a noninvertible ARMA model with autoregressive conditional heteroskedasticity. Journal of Multivariate Analysis, 114, 227–255. [Google Scholar] [CrossRef]
  30. Mikosch, T., & Straumann, D. (2002). Whittle estimation in a heavy-tailed GARCH (1, 1) model. Stochastic Processes and Their Applications, 100(1–2), 187–222. [Google Scholar] [CrossRef]
  31. Velasco, C., & Lobato, I. N. (2018). Frequency domain minimum distance inference for possibly noninvertible and noncausal ARMA models. The Annals of Statistics, 46(2), 555–579. [Google Scholar]
Figure 1. Simulated MARMA(1,1,0,0)–GARCH(1,1) Trajectory. True parameters: ϕ + = 0.2 , ϕ * = 0.9 , ω = 0.01 , α = 0.15 , β = 0.8 ; innovations follow a Student’s t with 4.5 degrees of freedom. Top panel shows σ t 2 ; bottom panel shows Y t .
Figure 1. Simulated MARMA(1,1,0,0)–GARCH(1,1) Trajectory. True parameters: ϕ + = 0.2 , ϕ * = 0.9 , ω = 0.01 , α = 0.15 , β = 0.8 ; innovations follow a Student’s t with 4.5 degrees of freedom. Top panel shows σ t 2 ; bottom panel shows Y t .
Econometrics 13 00013 g001
Figure 2. Boxplots of GARCH parameter estimates ( ω , α , β ) for a MARMA(1,1,0,0)–GARCH(1,1) DGP with Student’s t(4.5) innovations. Left: AR(2)–GARCH(1,1) representation; Right: MARMA(1,1,0,0)–GARCH(1,1) representation. Whiskers denote approximate quartiles; dashed horizontal lines show true parameter values.
Figure 2. Boxplots of GARCH parameter estimates ( ω , α , β ) for a MARMA(1,1,0,0)–GARCH(1,1) DGP with Student’s t(4.5) innovations. Left: AR(2)–GARCH(1,1) representation; Right: MARMA(1,1,0,0)–GARCH(1,1) representation. Whiskers denote approximate quartiles; dashed horizontal lines show true parameter values.
Econometrics 13 00013 g002
Figure 3. Daily USD prices of Bitcoin and Ethereum (January 2021–July 2024).
Figure 3. Daily USD prices of Bitcoin and Ethereum (January 2021–July 2024).
Econometrics 13 00013 g003
Figure 4. Bitcoin detrended series (cubic spline and Fourier).
Figure 4. Bitcoin detrended series (cubic spline and Fourier).
Econometrics 13 00013 g004
Figure 5. Ethereum detrended series (cubic spline and Fourier).
Figure 5. Ethereum detrended series (cubic spline and Fourier).
Econometrics 13 00013 g005
Figure 6. Estimated conditional volatility for Bitcoin and Ethereum. MARMA(0,1,0,0)–GARCH(1,1) (red) vs. AR(1)–GARCH(1,1) (black).
Figure 6. Estimated conditional volatility for Bitcoin and Ethereum. MARMA(0,1,0,0)–GARCH(1,1) (red) vs. AR(1)–GARCH(1,1) (black).
Econometrics 13 00013 g006
Table 1. Bias and RMSE for GARCH(1,1) Parameter Estimates Under Various Distributions. True parameters: ω = 0.1 ,   α = 0.15 ,   β = 0.85 . Sample sizes: T = 1000 ,   2000 ,   5000 . Results use Whittle and Maximum Likelihood estimation. Cell shading reflects absolute magnitude of each estimate (white = near 0, red = larger bias or RMSE).
Table 1. Bias and RMSE for GARCH(1,1) Parameter Estimates Under Various Distributions. True parameters: ω = 0.1 ,   α = 0.15 ,   β = 0.85 . Sample sizes: T = 1000 ,   2000 ,   5000 . Results use Whittle and Maximum Likelihood estimation. Cell shading reflects absolute magnitude of each estimate (white = near 0, red = larger bias or RMSE).
BiasRMSE
Whittle EstimationMaximum LikelihoodWhittle EstimationMaximum Likelihood
T ω α β ω α β ω α β ω α β
Exp(1) Innovations
10000.089−0.030−0.0300.0200.006−0.0200.2880.0730.1820.0670.0630.081
20000.076−0.028−0.0160.0100.006−0.0110.1630.0490.1000.0300.0280.035
50000.073−0.024−0.0160.0020.000−0.0020.1150.0320.0750.0100.0110.013
Student’s t (8) Innovations
10000.055−0.018−0.0150.014−0.001−0.0090.1360.0390.0920.0340.0260.036
20000.043−0.011−0.0130.0070.001−0.0050.0640.0300.0500.0150.0120.015
50000.039−0.008−0.0120.001−0.0010.0000.0420.0160.0310.0060.0050.006
Student’s t (4.5) Innovations
10000.103−0.023−0.0390.0200.003−0.0170.2480.0540.1520.0460.0390.051
20000.079−0.024−0.0200.0120.004−0.0120.1200.0330.0720.0220.0220.026
50000.054−0.024−0.0080.0030.000−0.0030.0630.0190.0450.0070.0080.009
skew-t (4.5,1.5) Innovations
10000.128−0.022−0.0510.0210.007−0.0220.3270.0640.1660.0490.0500.062
20000.107−0.021−0.0330.0080.007−0.0110.1970.0340.0950.0210.0240.027
50000.103−0.019−0.0370.0040.004−0.0050.0970.0250.0650.0090.0110.012
Table 2. Parameter Estimates for the MARMA(1,1,0,0)–GARCH(1,1) process. True parameters: ϕ + = 0.2 , ϕ * = 0.7 , ω = 0.01 , α = 0.15 , β = 0.85 . Each row gives mean estimates (top row) and standard deviations of estimates (bottom row) across repeated simulations for different sample sizes (T) and innovation distributions.
Table 2. Parameter Estimates for the MARMA(1,1,0,0)–GARCH(1,1) process. True parameters: ϕ + = 0.2 , ϕ * = 0.7 , ω = 0.01 , α = 0.15 , β = 0.85 . Each row gives mean estimates (top row) and standard deviations of estimates (bottom row) across repeated simulations for different sample sizes (T) and innovation distributions.
AR(2)–GARCH(1,1) FitMARMA(1,1,0,0)–GARCH(1,1) Fit
T ϕ 1 ϕ 2 ω α , β ϕ + ϕ * ω α , β
Exp(1) Innovations
10000.893−0.1430.0230.358, 0.5420.6750.2180.0120.108, 0.828
(0.048)(0.047)(0.009)(0.087, 0.116)(0.064)(0.089)(0.009)(0.060, 0.093)
20000.896−0.1420.0220.359, 0.5490.6860.2110.0110.106, 0.836
(0.036)(0.036)(0.006)(0.060, 0.077)(0.048)(0.065)(0.005)(0.041, 0.054)
50000.899−0.1410.0210.359, 0.5580.6950.2040.0110.104, 0.843
(0.025)(0.024)(0.004)(0.040, 0.049)(0.030)(0.041)(0.003)(0.028, 0.033)
Student’s t (8) Innovations
10000.896−0.1410.0130.115, 0.8200.6850.2110.0120.102, 0.836
(0.040)(0.039)(0.006)(0.032, 0.055)(0.059)(0.076)(0.006)(0.031, 0.054)
20000.900−0.1410.0110.115, 0.8280.6940.2060.0110.100, 0.844
(0.032)(0.030)(0.004)(0.021, 0.034)(0.038)(0.053)(0.004)(0.021, 0.033)
50000.899−0.1400.0110.115, 0.8300.6980.2010.0100.101, 0.846
(0.020)(0.019)(0.002)(0.013, 0.021)(0.024)(0.033)(0.002)(0.013, 0.020)
Student’s t (4.5) Innovations
10000.897−0.1400.0160.180, 0.7360.6850.2120.0130.111, 0.825
(0.047)(0.047)(0.009)(0.050, 0.088)(0.066)(0.089)(0.008)(0.053, 0.078)
20000.898−0.1420.0140.177, 0.7510.6890.2090.0120.108, 0.834
(0.037)(0.037)(0.005)(0.036, 0.053)(0.049)(0.068)(0.005)(0.040, 0.050)
50000.901−0.1420.0130.175, 0.7560.6950.2050.0110.104, 0.842
(0.026)(0.025)(0.003)(0.021, 0.032)(0.032)(0.044)(0.003)(0.024, 0.032)
skew-t (4.5,1.5) Innovations
10000.897−0.1440.0170.222, 0.6830.6780.2190.0120.111, 0.825
(0.050)(0.048)(0.009)(0.063, 0.103)(0.067)(0.090)(0.010)(0.065, 0.094)
20000.897−0.1430.0160.217, 0.6990.6840.2130.0120.109, 0.832
(0.039)(0.038)(0.005)(0.040, 0.063)(0.051)(0.070)(0.006)(0.046, 0.061)
50000.899−0.1400.0150.214, 0.7080.6950.2040.0110.104, 0.843
(0.028)(0.027)(0.003)(0.025, 0.037)(0.036)(0.049)(0.003)(0.028, 0.036)
Table 3. Descriptive Statistics of Original and Detrended Prices (Bitcoin and Ethereum).
Table 3. Descriptive Statistics of Original and Detrended Prices (Bitcoin and Ethereum).
MeanStd. Dev.MinMaxSkewnessKurtosis
Bitcoin
Price38,384.6515,015.4915,787.2873,083.500.4732.188
Cubic Splines0.0002190.83−8347.698712.280.0344.210
Fourier0.0002177.26−19,449.8812,595.56−0.87413.362
Ethereum
Price2324.60888.75730.374812.090.6802.439
Cubic Splines0.000172.25−858.581003.890.1936.543
Fourier0.000175.12−1433.181162.61−0.43413.595
Table 4. Estimated parameters for the noncausal model (MARMA(0,1,0,0)–GARCH(1,1)) and the causal model (AR(1)–GARCH(1,1)) for Bitcoin and Ethereum. The table also reports the Ljung–Box test p-values for the residuals and the McLeod–Li test p-values for the standardized residuals, under two different detrending methods (Cubic Splines and Fourier).
Table 4. Estimated parameters for the noncausal model (MARMA(0,1,0,0)–GARCH(1,1)) and the causal model (AR(1)–GARCH(1,1)) for Bitcoin and Ethereum. The table also reports the Ljung–Box test p-values for the residuals and the McLeod–Li test p-values for the standardized residuals, under two different detrending methods (Cubic Splines and Fourier).
BitcoinEthereum
Cubic SplinesFourierCubic SplinesFourier
NoncausalCausalNoncausalCausalNoncausalCausalNoncausalCausal
Estimated Parameters
ϕ + , ϕ * 0.8115
(0.0197)
0.8115
(0.0197)
0.8341
(0.0157)
0.8341
(0.0157)
0.8243
(0.1580)
0.8243
(0.1580)
0.8395
(0.0154)
0.8395
(0.0154)
ω 0.0048
(0.0017)
0.0029
(0.0010)
0.0035
(0.0016)
0.0031
(0.0011)
0.0053
(0.0021)
0.0056
(0.0017)
0.0065
(0.0020)
0.0057
(0.0018)
α 0.0952
(0.0179)
0.0564
(0.0106)
0.0717
(0.0215)
0.0593
(0.0117)
0.0937
(0.0195)
0.0828
(0.0147)
0.1142
(0.0208)
0.0856
(0.0156)
β 0.9037
(0.0171)
0.9406
(0.0102)
0.9274
(0.0210)
0.9382
(0.0111)
0.9053
(0.0187)
0.9137
(0.0141)
0.8847
(0.0297)
0.9116
(0.0149)
Residuals: Ljung–Box Test p-values
Lag(1)0.73200.62340.35710.60330.59670.65330.47240.8152
Lag(2)0.36110.23000.31240.19200.36020.13750.26360.1378
Lag(3)0.24950.11770.17520.16630.15430.15060.07680.0861
Lag(4)0.17990.11930.16080.13930.42880.10740.25140.1522
Lag(5)0.26800.18010.25240.22100.10060.22170.12960.2971
Standardized Residuals: McLeod–Li Test p-values
Lag(1)0.57150.46750.20730.36900.76950.77690.64910.9418
Lag(2)0.77970.46350.43480.48820.87570.81250.89820.9142
Lag(3)0.38720.25290.30590.35320.71370.56040.54300.5474
Lag(4)0.53660.39330.44880.50990.84780.67030.69430.6602
Lag(5)0.65400.34780.51910.51790.62480.49270.70150.4562
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hecq, A.; Velasquez-Gaviria, D. Explosive Episodes and Time-Varying Volatility: A New MARMA–GARCH Model Applied to Cryptocurrencies. Econometrics 2025, 13, 13. https://doi.org/10.3390/econometrics13020013

AMA Style

Hecq A, Velasquez-Gaviria D. Explosive Episodes and Time-Varying Volatility: A New MARMA–GARCH Model Applied to Cryptocurrencies. Econometrics. 2025; 13(2):13. https://doi.org/10.3390/econometrics13020013

Chicago/Turabian Style

Hecq, Alain, and Daniel Velasquez-Gaviria. 2025. "Explosive Episodes and Time-Varying Volatility: A New MARMA–GARCH Model Applied to Cryptocurrencies" Econometrics 13, no. 2: 13. https://doi.org/10.3390/econometrics13020013

APA Style

Hecq, A., & Velasquez-Gaviria, D. (2025). Explosive Episodes and Time-Varying Volatility: A New MARMA–GARCH Model Applied to Cryptocurrencies. Econometrics, 13(2), 13. https://doi.org/10.3390/econometrics13020013

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop