1. Introduction
Consider a multivariate time series of length n and dimension r, supposed to be generated by an array process. There is now a vast body of literature on ARMA and VARMA models with time-dependent or time-varying coefficients. For a recent review of VARMA models, including their time-dependent variants, see [1].
The history of time-dependent models for time series started with the works of [2,3,4,5,6]. Several of them, [2,4,5], and also [7,8,9,10,11,12], focused on the temporal aspects, while others [3,6] were more interested in the spectral point of view.
We will not consider the numerous studies related to time-dependent spectral approaches, except for one that has attracted considerable attention: the theory of locally stationary processes (LSP) due to Dahlhaus (see [13,14,15,16,17]). We will not repeat the many contributions on that subject here, since they are nicely summarized in [18], and a few more recent references are mentioned in [19]. See also [20,21,22,23,24,25,26,27,28,29,30].
Other approaches somewhat related to ours include [31] (Chapter 17), which treats tdVAR models by Gaussian maximum likelihood (but does not discuss asymptotic properties); the generalized autoregressive score (GAS) models of [32]; tests of parameter constancy against deterministically time-varying parameters, e.g., ref. [33] (Section 6.3) and references therein, generalized to VAR models in [34]; a smooth online parameter estimation approach [35]; deep-learning approaches [36]; state-space methods [37]; a non-parametric approach [38]; and an explicit representation for ARMA recursions with either deterministically or stochastically varying coefficients [39].
While a few of the above-mentioned papers treat multivariate time-dependent models, like [25,31,36], they are mostly tdVAR models and, apparently, never tdVMA or tdVARMA models, with the notable exception of [38], which is semi-parametric and uses a kernel-density estimator. Ref. [40] explains that, although in theory a tdMA process can be written as an infinite tdAR process, it is not efficient to fit a high-order model. Since the first edition of [41] in 1970 and the practical studies that followed, it is well known that most time series in fields like economics, sociology, tourism, agriculture, and energy are better fitted by ARIMA models, possibly on transformed data, than by AR models. Based on 13,238 monthly series taken from the Industrial Short-Term Indicator section of the EUROSTAT database on 15 Member States of the European Union, plus a few series from the United States and Japan, ref. [42] observed that the airline model, i.e., the seasonal ARIMA(0,1,1)(0,1,1)₁₂ model on log-transformed data, was best-fitting for 61% of the series, whereas only 2% were best fitted by an autoregression. There is no reason why this primacy of moving averages would not extend to multivariate time series and time-dependent models, although there is no empirical study at this stage. A few of the previously mentioned papers on univariate ARMA models with time-dependent coefficients include MA coefficients, like [7,10,11,12]. In the multivariate case, ref. [17] treats time-varying or time-dependent VARMA (tdVARMA) models in the context of local stationarity, but only for Gaussian processes and under the assumption that all the eigenvalues of the true spectral density matrix are uniformly bounded from below, a condition that is difficult to verify in practice. More recently, ref. [38] has provided results for a semi-parametric estimator of tdVARMA models.
In a recent paper [43], estimation results were produced for a wide class of vector ARMA array processes with time-dependent coefficients, denoted tdVARMA, which includes as special cases both locally stationary processes [18] and cyclically time-dependent processes [11]. The assumptions are rather general but complex at first sight, so it is worthwhile to demonstrate their applicability. Previously, ref. [44] had already treated examples of cyclically tdVARMA stochastic processes. Ref. [43] can also be seen as a generalization to multivariate processes of [12], which was devoted to univariate ARMA models with time-dependent coefficients, thereby generalizing the autoregressive moving average (ARMA) models popularized by Box and Jenkins [41]. In [12], the two cases where the coefficients depend only on time t, or on both t and the series length n, were considered, with an accent on the former. The case where the coefficients depend on t but not on the series length n was generalized to VARMA models by [44]. Ref. [12] contained very simple univariate examples where the theoretical assumptions for the asymptotic properties were checked. We will see that developing simple examples is much more complex in a multivariate setting.
Previously, ref. [45] had shown that, when specialized to VARMA models with constant coefficients, these assumptions coincide with those required for the standard asymptotic properties of parametric estimation of such models. The problem is that the assumptions in [43] are complex. For instance, that paper contains remarks addressing requests from reviewers who could not believe that these assumptions would hold, despite the provided proofs. Consequently, we consider here relatively simple array processes and check analytically a representative sample of the assumptions for bivariate processes. By a representative sample, we mean that the other assumptions can be verified using the same arguments. To simplify, we restrict ourselves to first-order processes: tdVAR(1), when p = 1 and q = 0, and tdVMA(1), when p = 0 and q = 1. Even for these two special cases, we drastically limit the form of time-dependency and the number m of parameters. We believe, however, that verifying the assumptions of [43] in these two cases is exemplary for more complex models. These models were used, for specific values of the parameters, in a simulation study presented in [43], so the present paper completes that information. Not only does estimation give the results predicted by the theory for sufficiently large series, but we will also see that the values selected for the parameters in those simulation experiments fulfill the requirements.
We do not consider high-dimensional models in this paper for two reasons: first, there is little hope of offering simple examples beyond the bivariate case; second, since the coefficient matrices are r × r, the number m of parameters grows with r², to which the parameters of the error covariance matrix must be added, quickly exhausting the degrees of freedom left by the series length. Moreover, the dimension of the information matrix (defined below) grows with the number of parameters, possibly implying serious computational problems with matrix inversion. A solution would be to extend the sparse identification and estimation approach proposed for VARMA models by [46] and implemented by [47] in the R package bigtime 0.2.3 for R 3.6.0 and above. This is done by using sparsity-inducing convex regularizers. It works even for large-scale VARMA models, under suitable regularity conditions.
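As a rough illustration of the idea of sparsity-inducing regularization (and not of the actual bigtime implementation or of the algorithm of [46]), the following Python sketch fits a sparse VAR(1) by equation-wise Lasso regression; the penalty weight lam and all numerical values are hypothetical.

```python
import numpy as np
from sklearn.linear_model import Lasso

def sparse_var1_fit(x, lam=0.05):
    """Equation-by-equation Lasso fit of a VAR(1): x_t ~ A x_{t-1}.

    x: (n, r) array of observations; returns the (r, r) coefficient
    matrix A, with small entries shrunk exactly to zero by the penalty.
    """
    y, z = x[1:], x[:-1]            # targets and lagged regressors
    a = np.empty((x.shape[1], x.shape[1]))
    for i in range(x.shape[1]):     # one Lasso regression per component
        a[i] = Lasso(alpha=lam, fit_intercept=False).fit(z, y[:, i]).coef_
    return a

# Toy usage: recover a sparse 5-dimensional VAR(1) coefficient matrix.
rng = np.random.default_rng(0)
a_true = np.diag(np.full(5, 0.6))
a_true[0, 4] = 0.3                  # one off-diagonal link only
x = np.zeros((500, 5))
for t in range(1, 500):
    x[t] = a_true @ x[t - 1] + rng.normal(size=5)
print(np.round(sparse_var1_fit(x), 2))
```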
This article is organized as follows. In Section 2, we introduce the general marginally heteroscedastic tdVARMA array model with the main notations, describe the crucial assumptions under which [43] proved the asymptotic properties of the Gaussian maximum likelihood estimator, and present approximations of the true information matrix. Section 3 contains our results: we consider a tdVAR(1) process in Section 3.1 and a tdVMA(1) process in Section 3.2. In both cases, after progressively reducing the number of parameters, we prove analytically that the assumptions can be verified, provide the constraints on the true values, and present simulation results with non-Gaussian errors to assess the quality of the estimates and of their standard errors. Section 3.3 and Section 3.4 are short attempts at generalization; of course, the model complexity does not permit a complete analytical treatment there. The paper ends with a discussion of the results in Section 4. Three appendices provide details about Section 3.1 and Section 3.2.
3. Results
To begin with, we consider two first-order models: the tdVAR(1) model, where p = 1 and q = 0, and the tdVMA(1) model, where p = 0 and q = 1. To make the treatment practical, and to allow for a fully analytical development, we quickly move to bivariate processes, i.e., the case where r = 2 in (1). Then, we consider a tdVARMA(1,1) model for which we provide an expression for the coefficients in the moving average representation of the first-order derivatives of the residuals. Finally, for the general tdVARMA(p,q) model, we only provide indications on sufficient conditions, to demonstrate how one can proceed.
3.1. Treatment of a tdVAR(1) Model
In this section, we consider a tdVAR(1) model, first in the general case before taking r = 2. The model is a first-order autoregression with time-dependent coefficient matrices, say A_t, and with an error process whose standardized innovations have all the moments of the order required by the theory and whose scale factors are bounded. It can be checked that the coefficients of the pure moving average representation at lag k are the products A_t A_{t-1} ... A_{t-k+1}, and that the pure autoregressive decomposition of the process reduces to the model itself: the coefficient at lag one is A_t and the coefficients at higher lags are zero.
To be more specific, assume a bivariate process such that the elements of the matrices A_t are linear functions of time and the diagonal elements of the scale factors are exponential functions of time. More precisely, we suppose that the coefficients are given by (9) and (10).
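Before checking the assumptions, a minimal simulation sketch of such a bivariate tdVAR(1) may help fix ideas; the slopes, intercepts, and heteroscedasticity rates below are illustrative placeholders, not the values used in the paper.

```python
import numpy as np

def simulate_td_var1(n, rng):
    """Simulate a bivariate tdVAR(1): x_t = A_t x_{t-1} + g_t eps_t.

    A_t has entries linear in centered, rescaled time, and g_t is
    diagonal with exponential functions of time (illustrative values).
    """
    x = np.zeros((n + 1, 2))
    for t in range(1, n + 1):
        u = (t - (n + 1) / 2) / (n - 1)        # centered, rescaled time
        a = np.array([[0.4 + 0.3 * u, 0.2],    # upper-triangular A_t,
                      [0.0, -0.3 + 0.5 * u]])  # as assumed in Section 3.1.2
        g = np.diag([np.exp(0.2 * u), np.exp(-0.1 * u)])  # heteroscedasticity
        x[t] = a @ x[t - 1] + g @ rng.standard_normal(2)
    return x[1:]

x = simulate_td_var1(400, np.random.default_rng(1))
print(x.mean(axis=0), x.std(axis=0))
```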
Remark 1. We have taken linear functions of time for illustrative purposes, but it should be clear that the theory works in full generality. The case of a linear function of time with the divisor appearing in (10) is compatible with Dahlhaus' LSP theory. This is why we consider array processes instead of stochastic processes. We will come back to this in the discussion.
We now start examining the typical assumptions of the theory stated in Section 2.2.
3.1.1. Assumptions (i) and (ii)
Assumption (i) is clearly satisfied. It is obvious that the rescaled times lie in a bounded interval for all t and n; this is why we have preferred a slightly different denominator from the n used in the theory of locally stationary processes. By definition, the scale factors are matrices whose Frobenius norms are bounded uniformly in t and n, from below by a strictly positive number and also from above; hence, Assumption (ii) is satisfied.
3.1.2. Assumption (iii)
Only to simplify the analytical expressions, assume that one off-diagonal element of A_t is identically zero and that the other is a constant. This gives an upper-triangular form (facilitating the analytical treatment) that does not degenerate into a diagonal matrix, as the latter would imply uncorrelated components since the error covariance matrix is diagonal. To simplify the details further (although this is not necessary for the principles), instead of the full vector of parameters, we fix all but one linear coefficient and one heteroscedasticity coefficient, so that the vector of parameters to estimate contains one parameter of each kind, in addition to the constant off-diagonal element, whose presence ensures that the two components of the process are not independent. There are assumptions about the true value that will be stated later. Then, we can check by induction explicit expressions for the products of the matrices A_t and for their derivatives, generalizing relations in [44] (Appendix S1). To simplify the notations, we will henceforth omit the superscript (n) in the entries of these matrices.
From the definition of the pure moving average coefficients in (3), we deduce the expressions (12) and (13). We define a constant strictly smaller than 1 that bounds the absolute values of the relevant diagonal entries, and we therefore impose corresponding assumptions on the true values. From (13), we deduce a geometric bound for the first diagonal entry. It is more delicate for the other entries, for which the sum of the first term is also bounded geometrically, while, thanks to (12), an upper bound of the remaining term can be derived, hence a geometric bound there too (see the details in Appendix A.1). We could have used [54] to simplify the investigation.
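The geometric bounds above can also be checked numerically. The sketch below computes the Frobenius norms of the products A_t A_{t-1} ... A_{t-k+1} appearing in the pure MA coefficients and compares them with the geometric reference rho**k (same decay rate, up to a constant); the matrix family and all constants are the illustrative ones from the earlier sketch, not the paper's.

```python
import numpy as np

def a_matrix(t, n):
    """Illustrative upper-triangular A_t with entries linear in time."""
    u = (t - (n + 1) / 2) / (n - 1)
    return np.array([[0.4 + 0.3 * u, 0.2],
                     [0.0, -0.3 + 0.5 * u]])

n, t = 400, 300
# Largest spectral radius over all times: should be < 1 for the bounds.
rho = max(np.abs(np.linalg.eigvals(a_matrix(s, n))).max()
          for s in range(1, n + 1))
prod = np.eye(2)
for k in range(1, 21):                     # psi_{t,k} = A_t ... A_{t-k+1}
    prod = prod @ a_matrix(t - k + 1, n)
    print(k, np.linalg.norm(prod, "fro"), rho ** k)
```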
3.1.3. Assumption (iv)
First, to evaluate the elements of V in (4) with (5), we have to take the limit, for n tending to infinity, of the average over t of the expected products of derivatives of the residuals, which can be computed using (7). But in the tdVAR(1) case, using (2), this is more simply expressed directly in terms of the process. The most difficult case is the element corresponding to the two autoregressive parameters. It can be seen that the relevant expectation converges to a limit as t increases. We will now argue that its average over t tends to a limit as n tends to infinity: it is a sum of matrices whose entries, as shown in (16), behave like the terms of a geometric series (or of two sums of geometric series) for large k, using the bounds already presented for Assumption (iii), so that, for large t, their sum is finite, strictly positive, and convergent. The existence of the limit then follows from Theorem 2.5 of [55]. The two other cases are straightforward.
The computation of the element of V corresponding to the heteroscedasticity parameter involves only the second term of the definition in (5). The derivative of the scale factor with respect to that parameter yields a zero matrix, except for one diagonal element, and the corresponding product gives a matrix of zeros except for that element. We have, therefore, to take one-half of the limit, for n tending to infinity, of the average over t of the resulting scalar. However, that average is essentially the variance of a discrete uniform distribution over the rescaled times, and thus converges. We can check that the elements not discussed vanish. Note that the factor 1/12, the limit of that variance, appeared already in a similar univariate example shown by [12] (Example 3).
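A one-line numerical check of this limit, under the centered, rescaled-time convention used in our illustrative sketches, is immediate: the average of the squared rescaled times tends to the variance of a uniform distribution on [-1/2, 1/2], i.e., 1/12.

```python
import numpy as np

# Average of squared centered, rescaled times converges to 1/12 = 0.0833...
for n in (100, 1000, 10000):
    t = np.arange(1, n + 1)
    u = (t - (n + 1) / 2) / (n - 1)
    print(n, u.mean(), (u ** 2).mean())
```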
3.1.4. Assumption (v)
For a Gaussian process, the matrix W in (6) is directly available. For a Laplace or a Student distribution (the latter with at least 5 degrees of freedom), in particular, the entries of W are not smaller than those for a normal distribution [51] (Section 4); hence, Assumption (v) is satisfied.
3.1.5. Assumption (vi)
In order to verify Assumption (vi), consider, for example, the simplest parameter: we have to show that a certain fourth-order sum, suitably normalized, remains bounded. First, the inner expectation is bounded by a constant; then, the products of pure moving average coefficients are bounded geometrically, as above, so the sum over one index of the product is bounded by a constant. By exchanging the two outside summations, the double sum is bounded by a constant times the normalizing factor, which gives the required bound after division. The case of the other parameters is more delicate and will not be detailed, but the principle is identical.
Example 1. In [43] (Section 4.1.2), simulation results were shown for artificial Gaussian time series generated by a specific case of model Equations (8)–(10), under all the restrictions mentioned above and with given true values of the parameters; the off-diagonal element is kept constant, as above. First, note that these values satisfy the requirements stated in Section 3.1.2. Let us consider 1000 time series of length n obtained using a multivariate Laplace distribution, on the one hand, and a multivariate Student distribution with 5 degrees of freedom, on the other hand. The empirical estimation results are shown, respectively, in Table 1 and Table 2. In both cases, the estimates in column (a) (emp.est.) are close to the true value, and nearly always closer when n increases; the sample standard errors in column (b) (emp.s.e.) decrease with n and are very close to the averages across simulations of the estimated standard errors (obtained using the sandwich formula and estimates of V and W; see [52] for details) shown in column (c) (est.s.e.), and to the approximate theoretical standard errors in column (d) (theor.s.e.), also based on a sandwich formula but now on the two finite averages of (4) evaluated at the true value, as discussed in Section 2.3; the percentages of simulations where the hypothesis that the parameter equals its true value is rejected at the 5% level, in column (e) (% rej.), are, of course, close to 5%. The results look better for the Laplace than for the Student distribution, especially in column (d). As a consequence of these simulations, we see that, for n large enough, the estimates become close to the true value; the standard errors that are a by-product of estimation correspond broadly to the empirical results (although less well for the parameter related to heteroscedasticity) and coincide relatively well with the approximated values derived from the theory; and, finally, the level of the test that the parameter equals the true value is relatively close to 5%. Thanks to the sandwich correction, the standard errors are not underestimated: replacing the sandwich formula by the inverse of the estimated information matrix alone would yield values in columns (c) and (d) far from the empirical standard deviation. Finally, the results in [43] for a normal distribution in an otherwise similar setup are closer to the expected values than those shown here, with a much better agreement between the empirical, estimated, and theoretical standard errors.
3.2. Treatment of a tdVMA(1) Model
Moving average models are more difficult to study than autoregressive ones. We consider a tdVMA(1) model defined by (17), with the same notations as in Section 2. For any t, the pure autoregressive representation of the process gives the residuals as a weighted sum of past observations. Replacing those observations using (17), and given the treatment of the initial values, we obtain the residuals as a weighted sum of past errors, with weights that are products of successive moving average coefficient matrices, like before. Let us denote these products and their elements explicitly; hence, from (3), the derivatives of the residuals with respect to the parameters follow.
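To make the pure autoregressive representation concrete, the following sketch simulates a bivariate tdVMA(1) under the sign convention x_t = e_t - B_t e_{t-1} (our reading of (17)) and reconstructs the errors recursively via e_t = x_t + B_t e_{t-1}, starting from zero initial values; the coefficient family is an illustrative placeholder.

```python
import numpy as np

def b_matrix(t, n):
    """Illustrative diagonal B_t with exponential functions of time."""
    u = (t - (n + 1) / 2) / (n - 1)
    return np.diag([0.5 * np.exp(0.4 * u), -0.4 * np.exp(-0.3 * u)])

n, rng = 300, np.random.default_rng(2)
e = rng.standard_normal((n + 1, 2))                # e[0] is the presample error
x = np.array([e[t] - b_matrix(t, n) @ e[t - 1] for t in range(1, n + 1)])

# Invert the MA recursion: e_t = x_t + B_t e_{t-1}, assuming e_0 = 0.
e_rec = np.zeros((n + 1, 2))
for t in range(1, n + 1):
    e_rec[t] = x[t - 1] + b_matrix(t, n) @ e_rec[t - 1]

# The effect of the wrong initial value decays geometrically (|B_t| < 1),
# so the reconstruction error is negligible away from t = 1.
print(np.abs(e_rec[1:] - e[1:])[50:].max())
```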
To proceed in more detail, and in order to simplify the analytic computations (this is not necessary for numerical computations), we also assume that one of the coefficients is a constant. We can now examine the typical assumptions of the theory stated in Section 2.2.
3.2.1. Assumptions (i) and (ii)
To obtain nice analytic expressions, we suppose that the diagonal elements of the moving average matrices and of the scale factors are exponential functions of time. The off-diagonal elements of the error covariance matrix are supposed to be different from zero, so that the correlation between the two components of the process varies with time. More precisely, we suppose that the diagonal elements are exponentials in rescaled time with constant rates, subject to the bounds given below; each generic element (omitting its argument) is then the product of an initial value and an exponential factor. It is easy to see that Assumptions (i) and (ii) are verified.
3.2.2. Assumption (iii)
To simplify the example, since a correlation between the two components is already guaranteed by the presence of a non-diagonal covariance matrix, we will assume that the matrix defined in (18) is diagonal. To simplify the discussion further, similarly to what we did in Section 3.1, and in addition of course to the already introduced parameters, we keep only one parameter of each type, assuming that the other coefficients are fixed constants. Contrary to Section 3.1, we resort here to the results of [54], although the lengthier direct analysis is given in Appendix A.2. Indeed, a sufficient condition for (iii) is stated in [54]: it can be shown that the eigenvalues of the MA polynomial are the roots of a quadratic equation, and they should be smaller than 1 in absolute value, uniformly in t and n. Therefore, we assume that the true values of the moving average coefficients are smaller than 1 in absolute value, with a uniform margin, and we denote the corresponding bound as in (22). Assumption (iii) is, therefore, satisfied. In practice, we do not assume zero initial values for the process, but rather that it is invertible before time 1; this leads to additional conditions on the constants. If we were interested in forecasting beyond the end of the series, we should strengthen the conditions in (22) accordingly.
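A quick numerical check of this eigenvalue condition, for the illustrative diagonal B_t used above, scans all times (including a margin before time 1, to reflect invertibility before the start of the series) and verifies that the spectral radius stays below 1:

```python
import numpy as np

def b_matrix(t, n):
    """Illustrative diagonal B_t with exponential functions of time."""
    u = (t - (n + 1) / 2) / (n - 1)
    return np.diag([0.5 * np.exp(0.4 * u), -0.4 * np.exp(-0.3 * u)])

n = 300
radii = [np.abs(np.linalg.eigvals(b_matrix(t, n))).max()
         for t in range(-20, n + 1)]     # margin before time 1
print(max(radii))                        # should be < 1, with a margin
```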
3.2.3. Assumption (iv)
Now, we consider the existence of V in (4) with (5) for the 4-parameter model described in the previous paragraphs. Let us start with the elements corresponding to the moving average parameters. The formulas in [51] (Theorem 3) are applicable for computing the required terms. Given the expression of the coefficients in (20), the terms in these formulas are all products of powers of the exponential factors. See Appendix A.3 for indications on how to prove the existence of the limit of averages of these terms, and even how to compute them using integration.
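The integration argument can be previewed numerically: for an exponential function of rescaled time, the average over t is a Riemann sum converging to the corresponding integral over [-1/2, 1/2]. A minimal sketch, with an arbitrary illustrative rate c:

```python
import numpy as np

c = 0.7                                   # arbitrary illustrative rate
for n in (100, 1000, 10000):
    t = np.arange(1, n + 1)
    u = (t - (n + 1) / 2) / (n - 1)
    print(n, np.exp(c * u).mean())        # average over t
# Limiting integral of exp(c*u) over [-1/2, 1/2], in closed form:
print((np.exp(c / 2) - np.exp(-c / 2)) / c)
```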
Using (7), the elements of V corresponding to the moving average parameters are limits of averages over t of sums over the lag k of terms involving, on the one hand, elements of the derivatives of the pure autoregressive coefficients and, on the other hand, elements of the error covariance matrices. Given (21), each such term is a quadratic polynomial in the diagonal elements of the moving average matrices; more precisely, it can be written as a finite sum of products of integer powers of those elements, with constant coefficients. Consequently, each of these elements of V is the limit of an average over t of sums over k of terms of the kind shown in (24), whose factors are obtained using (A2) and/or (A3). The terms in (24) are not all positive, because of the signs of the constants. However, the diagonal elements are uniformly bounded for all t and n, and, therefore, the absolute value of each product can be bounded geometrically, given (22). We want to show that the corresponding triangular array converges to a constant (dependent on the indices i and j and on the parameters) as n tends to infinity. First, the absolute value of the kth term of (24) is bounded by a geometric term, so the limit of the sum of these terms is a convergent geometric series, ensuring convergence of the sequence in (24). Now, for the elements corresponding to the heteroscedasticity parameters, the entries are sums of terms that are proportional to powers of the exponential factors. We have defined the relevant quantities as elements of an inverse matrix, that is, as elements of the matrix of cofactors divided by the determinant. Hence, each entry is a finite sum of terms that are proportional to ratios of powers of the exponential factors, with integer exponents. As in Appendix A.3, the averages over t of such terms converge, and their limits can be evaluated using integrals. Therefore, we can invoke Theorem 2.5 of [55] to show that the limit exists and is finite. This proves the existence of V.
3.2.4. Assumption (v)
The existence of W in (4) with (6) can be studied similarly, using the fourth-order moments, which depend on the distribution of the errors; see [52] (Section 3.5).
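For completeness, the sandwich form mentioned in the examples can be sketched as follows, assuming estimates V_hat and W_hat of V and W are available; the 1/n scaling follows the usual asymptotic-normality convention and is our assumption here, not a quotation of [52].

```python
import numpy as np

def sandwich_se(v_hat, w_hat, n):
    """Standard errors from the sandwich covariance V^{-1} W V^{-1} / n."""
    v_inv = np.linalg.inv(v_hat)
    cov = v_inv @ w_hat @ v_inv / n
    return np.sqrt(np.diag(cov))

# Toy usage with arbitrary positive definite matrices (m = 3 parameters).
v_hat = np.array([[2.0, 0.3, 0.0], [0.3, 1.5, 0.2], [0.0, 0.2, 1.0]])
w_hat = 1.2 * v_hat      # e.g., inflated fourth moments for fat tails
print(sandwich_se(v_hat, w_hat, n=400))
```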
3.2.5. Assumption (vi)
It is also easy to check Assumption (vi), for example, for the moving average parameters. We have to show that the relevant fourth-order sum, suitably normalized, remains bounded. First, it can be shown that the inner expression is bounded geometrically; then, we are exactly in the same situation as in Section 3.1.
Example 2. In [43] (Section 4.1.2), simulation results were shown for artificial Gaussian or Student (with 5 degrees of freedom) time series generated by model Equations (17)–(20), with given true values of the parameters, but with a linear expression for the time-dependent coefficients, which we keep here instead of the exponential one that was easier for the analytic developments; the remaining coefficients are taken as constants. First, note that these values satisfy the requirements after a linear approximation of the exponentials. Let us consider 1000 time series of length n obtained using a multivariate Laplace distribution. The empirical estimation results are shown in Table 3. In all cases, the estimates in column (a) (emp.est.) are often close to the true value, and closer when n increases; the sample standard errors in column (b) (emp.s.e.) decrease with n and are very close to the averages across simulations of the estimated standard errors (obtained using the sandwich formula and estimates of V and W; see [52] for details) shown in column (c) (est.s.e.), and to the approximate standard errors in column (d) (theor.s.e.), also based on a sandwich formula but now on the two finite averages of (4) evaluated at the true value; the percentages of simulations where the hypothesis that the parameter equals its true value is rejected at the 5% level, in column (e) (% rej.), are close to 5%, at least for large n. Like in Example 1, for n large enough, the estimates become close to the true value; the estimated standard errors are close to the empirical results (here also for the parameter related to heteroscedasticity) and to the approximated values derived from the theory; and, finally, the percentages of rejection in the tests are slightly closer to 5%. Thanks to the sandwich correction, the standard errors are not underestimated: without it, the averages in columns (c) and (d) would be far from the empirical standard error shown in column (b). Note that the results are slightly better for a normal distribution of the errors. In Table 4, we present the results obtained in the normal case, as we had only shown part of them in [43]. The agreement between row (b) (emp.s.e.), rows (c1) and (c2) (est.s.e.), and row (d) (theor.s.e.) is better there. In the normal case, we have also computed an estimate of V using the integration approach in [55] (Procedure 6); see the details in Appendix A.3. The corresponding standard error for the largest n is in full agreement with the empirical, estimated, and theoretical approximation standard errors in the last column of Table 4.
3.3. Treatment of a tdVARMA(1,1) Model
Starting from here, we consider only homoscedastic models. Let the observation be an r-vector time series satisfying a tdVARMA(1,1) model, a special case of the model defined in (1) with p = q = 1. Hence, Assumptions (ii) and (v) have no object, since the error scale factors are constant. We will not cover the tdVARMA(1,1) model in detail, but simply show how the coefficients of the pure moving average representation can be computed. Following [44], in this special case, the coefficient at lag k is the product of the autoregressive matrices at times t down to t - k + 2, multiplied by the difference between the autoregressive and moving average matrices at time t - k + 1, where a product over an empty range is set to the identity matrix. The coefficients of the pure autoregressive form are analogous, with the roles of the two sets of matrices exchanged, so their derivatives with respect to the parameters can be written down explicitly. These results correct the findings presented in the univariate case by [12].
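A direct implementation of these coefficients, under the convention x_t = A_t x_{t-1} + e_t - B_t e_{t-1} (our reading of (1); the matrix families below are illustrative placeholders), is sketched here:

```python
import numpy as np

def psi(t, k, A, B):
    """Pure MA coefficient of a tdVARMA(1,1): psi_{t,0} = I and, for k >= 1,
    psi_{t,k} = A_t A_{t-1} ... A_{t-k+2} (A_{t-k+1} - B_{t-k+1})."""
    r = A(t).shape[0]
    if k == 0:
        return np.eye(r)
    prod = np.eye(r)
    for j in range(k - 1):                 # empty product (k = 1) gives I
        prod = prod @ A(t - j)
    return prod @ (A(t - k + 1) - B(t - k + 1))

# Illustrative (here constant) coefficient families.
A = lambda t: np.array([[0.5, 0.1], [0.0, -0.3]])
B = lambda t: np.array([[0.2, 0.0], [0.0, 0.4]])
print(psi(10, 3, A, B))    # here: A @ A @ (A - B), as with constant matrices
```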
From this expression, it is possible to check (iii) and (vi) relatively easily. Indeed, if we assume that the norms of the autoregressive matrices are uniformly bounded by a constant smaller than 1 and that the differences between the autoregressive and moving average matrices are uniformly bounded, for all t, then, since only a few factors involve sums and the others are bounded geometrically, the Frobenius norms of the pure moving average coefficients and of their derivatives are bounded by a decreasing exponential function of the lag. Of course, checking (iv), the existence of V, depends heavily on the parametrization. For example, if it exists, V will not be invertible if the autoregressive and moving average matrices coincide for all t (or even for most t).
3.4. Treatment of a More General tdVARMA Model
Ref. [45] indicates how to handle more general homoscedastic tdVARMA(p,q) models, with arbitrary orders p and q. Indeed, it is shown in [54] that (iii) and (vi) are valid if the determinants of the tdVAR and tdVMA polynomials do not vanish on or inside the unit circle, uniformly in t and n. Of course, this is only a sufficient condition. That argument was used in Section 3.2.2 to simplify the treatment, whereas Appendix A.2 does not use it for the same model and is therefore lengthier. Again, (ii) and (v) have no object, while (i) and (iv) depend on the specific parametrization.
4. Discussion
The results presented in Section 3.1 and Section 3.2 confirm that the theory presented in [43] is applicable and that its assumptions can be checked analytically, at least in the two simple bivariate tdVAR(1) and tdVMA(1) models. The results have already been exploited in the simulation experiments in [43] (Sections 4.1.2 and 4.2.2), as the values of the parameters used there meet the conditions stated here. The treatment of other models by analytical methods is certainly more challenging, although we discussed it in some detail for a tdVARMA(1,1) model. For more complex models, the approach of [54] can also be considered, as it allows us to put the model in tdVAR form, albeit with a higher dimension.
As mentioned in Remark 1, we have taken linear functions of time for illustrative purposes, but it should be clear that the theory works in full generality. First, a linear function of time with the divisor appearing in (10) is compatible with Dahlhaus' LSP theory (see [16]), since the (Frobenius norm of the) coefficient can easily be bounded from above by 1. In any case, linearity can be considered a first attempt if constancy cannot be retained. Second, in [19], the authors have shown a univariate tdAR process of order 1 where the coefficient can be greater than 1 during an interval of time that shrinks to 0 as n increases. We can have the same behavior here, meaning that the process does not need to be locally stationary. What is essential is that the coefficients of the pure tdMA representation are bounded by a decreasing exponential function of the lag. Third, the tdAR coefficient does not need to be differentiable with respect to time. The theory in [43] remains valid with any number of breaks, provided they do not add too many parameters; for instance, there can be a periodicity of 2, with one coefficient value for t odd and another for t even. Indeed, [43] is a generalization of [12], where such an example is shown. We can even have a periodic behavior, even with incommensurable periods for the different matrix entries, as shown by [44].
The lag of one can be replaced with another integer without any change: for instance, for quarterly data, we can replace it with 4, or with 12 for monthly data. The results in Section 3.1 and Section 3.2 are easily adapted. Because the inference is infill, meaning that more and more observations are assumed to be made between the first and last ones, the LSP theory is no longer valid in the case of periodicity. On the contrary, the inference in [43,44] is of the outfill, or increasing-domain, type, that is to say, more years are supposed to be observed, therefore preserving the period of the periodic behavior. Ref. [19] has also shown a tdAR(1) process with a coefficient that varies linearly with time but where the heteroscedasticity is a periodic function of time. Finally, ref. [19] indicates that the so-called multiplicative seasonal ARMA models of [41] cannot be generalized in the LSP framework for the same reason. On the contrary, there is no problem working with those models in our context. These remarks, made here for the tdVAR(1) model, extend to the tdVMA(1) model and to all tdVARMA models as well.
It is possible to obtain the coefficients of the pure MA representations by simple recursive relations (see [54] for algorithms). Ref. [39] has even proposed an explicit solution, although it is limited to univariate processes. With these coefficients, everything can be computed, at least for finite n, and the assumptions can be checked. Of course, when facing a finite multivariate time series, it is not straightforward to suggest a tdVARMA model. What we propose in practice (see the examples treated in [43,53]) is to first fit a VARMA model, and then add slopes for the linearly varying coefficients and, possibly, heteroscedasticity, before removing non-significant parameters one by one. At this time, there is no large-scale study on real data using time-dependent multivariate models. Still, the results for univariate models in [21] (with marginal heteroscedasticity only) and [56] seem promising.
We have specified in the introduction that our approach is not adequate for high-dimensional tdVARMA models, i.e., when r is greater than a very small integer, because the number of parameters grows too quickly. These parameters comprise the entries in the VAR and VMA polynomials, but also the coefficients of polynomials in time if we extend the linear dependence in (10) or (20) to a polynomial dependence, for instance. In principle, the concept in [46] should be valid in our framework of time-dependent VARMA models, including the cases of polynomial dependence, multiple breaks, or threshold models with multiple regimes, although bigtime [47] would have to be changed substantially for that purpose. This is left for future research.
It should be noted that, although it is interesting to see that the assumptions of [43] can be verified, they are not particularly helpful when facing a real multivariate time series. Unfortunately, verifying the same assumptions using the data (one realization of length n) is impossible. The same criticism, however, holds for the entire literature on models with time-dependent coefficients.