1. Introduction
Multivariate space–time datasets frequently arise in environmental science, meteorology, geophysics, and many other fields. Examples include studying the impact of soil greenhouse gas fluxes on global warming potential, or analyzing temperature–precipitation relationships under climate change (see [1,2,3], among others). Typically, temporal data are collected at regularly spaced intervals, in contrast to spatial data, which are often recorded at irregular locations such as weather stations. With the increasing availability and complexity of such datasets, it is essential to develop efficient models that capture their intricate dependence structures.
This paper focuses on constructing valid covariance matrix functions that jointly incorporate spatial and temporal information for multivariate random fields. While the spatial statistics literature includes various spatial models, few account for discrete-time dependencies, despite time series playing a crucial role in most environmental and geophysical processes. Traditional approaches often rely on separable space–time covariance structures, which assume the overall covariance is the product of purely spatial and purely temporal components. While computationally convenient, these models ignore space–time interactions that are often fundamental to the underlying physical processes. An increasing body of work has highlighted the importance of nonseparable models. For example, ref. [4] introduced nonseparable stationary spatio-temporal covariance functions, and subsequent generalizations for both stationary and nonstationary processes were developed in [5,6,7], among others. Applications to environmental data such as air pollution are explored in [8,9], while ref. [10] incorporates an inflated gamma distribution to model precipitation trends with zero inflation. However, most of these models are constructed under the assumption of continuous time. In practice, time is typically observed on a discrete, regular grid, whereas spatial locations are distributed more irregularly. Although some models incorporate discrete time through stochastic differential equations or spectral methods (e.g., [11,12]), these approaches often lack closed-form expressions for the covariance structure. Whereas ref. [13] deals with the univariate case, in this work we derive explicit covariance matrix functions for multivariate space–time processes with discrete temporal components, where the temporal margins follow well-established autoregressive moving average (ARMA) models. Leveraging the rich theoretical foundation of ARMA processes along with classical spatial modeling, we aim to build flexible, interpretable, and computationally feasible models.
In many modern scientific applications, such as geosciences, environmental monitoring, and economics, large numbers of variables are observed simultaneously. These variables are often correlated, and borrowing information from related (secondary) variables can improve the prediction of a primary variable, especially when the latter is sparsely observed. For simplicity, spatial variables are often modeled separately, ignoring cross-variable dependencies. A key contribution of this work is the development of multivariate spatial covariance structures that capture both within-variable spatial dependence and cross-variable covariances, while also incorporating discrete-time information. This enables more accurate predictions through co-kriging across a wide range of applications. While previous efforts have been made in this direction, many are limited to purely spatial or continuous-time settings, or they rely on Bayesian frameworks. Notable contributions include [14,15,16,17], among others. For example, multivariate Poisson-lognormal spatial models have improved predictions in traffic safety studies [18], and recent works have established kriging formulas [19] and copula-based models [20] for multivariate spatial data. We aim to integrate parameter interpretability from analytic model expressions into a unified space–time framework that facilitates multivariate fitting and co-kriging.
On a global scale, many spatial datasets are collected using spherical coordinates. Euclidean-based distances and covariance structures can become distorted on the sphere, especially over large distances, making spherical modeling critical in geophysical and atmospheric sciences. Recent advances include constructions of isotropic positive definite functions on spheres [21], covariance functions for stationary and isotropic Gaussian vector fields [22], and isotropic variogram matrix functions expressed through ultraspherical polynomials [23]. Drawing from these approaches, we also extend some of our discrete-time multivariate spatio-temporal models to spherical domains to ensure validity across both Euclidean and spherical spaces.
We aim to develop a flexible multivariate spatio-temporal modeling framework that incorporates discrete-time structure, spatial correlation (in both Euclidean and spherical spaces), and cross-variable dependencies. Specifically, we consider a $p$-variate space–time random field $\{\mathbf{Z}(\mathbf{s}, t) = (Z_1(\mathbf{s}, t), \ldots, Z_p(\mathbf{s}, t))^\top\}$ with covariance matrix function $\mathbf{C}(\mathbf{s}_1, \mathbf{s}_2; t_1, t_2) = [C_{ij}(\mathbf{s}_1, \mathbf{s}_2; t_1, t_2)]_{i,j=1}^{p}$, where each entry $C_{ij}(\mathbf{s}_1, \mathbf{s}_2; t_1, t_2) = \operatorname{cov}\{Z_i(\mathbf{s}_1, t_1), Z_j(\mathbf{s}_2, t_2)\}$ for $\mathbf{s}_1, \mathbf{s}_2 \in \mathbb{S}^d$ or $\mathbb{R}^d$ and $t_1, t_2 \in \mathbb{Z}$, where $\mathbb{S}^d$ and $\mathbb{R}^d$ denote the $d$-dimensional unit sphere and Euclidean space, respectively. The process is stationary in both space and time if $\mathrm{E}\,\mathbf{Z}(\mathbf{s}, t)$ is constant for all $(\mathbf{s}, t)$ and $\mathbf{C}$ depends only on the spatial lag $\mathbf{s}_1 - \mathbf{s}_2$ and the temporal lag $t = t_1 - t_2$. We then denote the spatial and temporal margins as $\mathbf{C}(\mathbf{s}_1, \mathbf{s}_2; 0)$ and $\mathbf{C}(\mathbf{0}; t)$, respectively, following [24]. In practice, analyzing multivariate space–time data often begins with marginal exploration, applying time series models to study temporal behavior and multivariate spatial analysis to capture cross-variable structure. Given the substantial research advances in both areas, combining their strengths provides a robust foundation for model development, selection, and estimation.
The remainder of this paper is organized as follows. In Section 2, we propose several classes of multivariate spatio-temporal covariance matrix functions whose discrete-time margins follow ARMA models, and derive necessary and sufficient conditions for these functions to define valid covariance matrices. Section 3 extends the models to incorporate general ARMA margins. In Section 4, we apply our models to Kansas weather data to demonstrate their performance in spatio-temporal prediction compared to traditional methods.
2. Moving-Average-Type Temporal Margin
We begin constructing the foundation of our overall framework by examining the covariance structure corresponding to a first-order moving average (MA(1)) model in the discrete temporal margin:
$$\mathbf{C}(\mathbf{s}_1, \mathbf{s}_2; t) = \begin{cases} \mathbf{C}_0(\mathbf{s}_1, \mathbf{s}_2), & t = 0, \\ \mathbf{C}_1(\mathbf{s}_1, \mathbf{s}_2), & |t| = 1, \\ \mathbf{0}, & |t| \geq 2. \end{cases} \qquad (1)$$
It is straightforward to verify that Equation (1) satisfies the defining properties of an MA(1) process at a fixed spatial location. Notably, this structure does not rely on the assumption of temporal stationarity. The main challenge in proving the validity of Equation (1) lies in its nature as a discrete space–time matrix function that varies across different time scales, making it more complex than simply verifying a static covariance matrix. Theorem 8 in [25] offers useful insights that support the proof of the following Theorem 1 (see Appendix A).
Theorem 1. Let $\mathbf{C}_0(\mathbf{s}_1, \mathbf{s}_2)$ and $\mathbf{C}_1(\mathbf{s}_1, \mathbf{s}_2)$, $\mathbf{s}_1, \mathbf{s}_2 \in \mathbb{S}^d$ or $\mathbb{R}^d$, be $p \times p$ matrix functions, and let $\mathbf{C}_1$ be symmetric, i.e., $\mathbf{C}_1(\mathbf{s}_1, \mathbf{s}_2) = \mathbf{C}_1(\mathbf{s}_2, \mathbf{s}_1)$. Then, the function in Equation (1) is a covariance matrix function on $\{\mathbb{S}^d \text{ or } \mathbb{R}^d\} \times \mathbb{Z}$ if and only if the following two conditions are satisfied:
- (i) $\mathbf{C}_0(\mathbf{s}_1, \mathbf{s}_2) + 2\,\mathbf{C}_1(\mathbf{s}_1, \mathbf{s}_2)$ is a covariance matrix function on $\mathbb{S}^d$ or $\mathbb{R}^d$,
- (ii) $\mathbf{C}_0(\mathbf{s}_1, \mathbf{s}_2) - 2\,\mathbf{C}_1(\mathbf{s}_1, \mathbf{s}_2)$ is a covariance matrix function on $\mathbb{S}^d$ or $\mathbb{R}^d$.
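To illustrate Theorem 1 numerically, the following sketch is our own illustration (not from the paper): it takes an exponential spatial margin for $\mathbf{C}_0$, sets $\mathbf{C}_1 = c\,\mathbf{C}_0$ as one admissible choice, and checks conditions (i) and (ii) by computing eigenvalues on a finite set of locations.

```python
# A minimal numerical check of Theorem 1 (our illustration): with an
# exponential spatial margin C0 and C1 = c * C0, conditions (i) and (ii)
# require C0 + 2*C1 and C0 - 2*C1 to be positive semi-definite on any
# finite set of locations, which here holds iff |c| <= 1/2.
import numpy as np

rng = np.random.default_rng(0)
sites = rng.uniform(0, 100, (25, 2))                  # 25 planar locations
h = np.linalg.norm(sites[:, None] - sites[None], axis=-1)

a = 0.05                                              # spatial scale
rho = np.array([[1.0, 0.7], [0.7, 1.0]])              # valid collocated corr
C0 = np.kron(rho, np.exp(-a * h))                     # bivariate spatial margin
for c in (0.3, 0.7):
    ok = all(np.linalg.eigvalsh(C0 + s * 2 * c * C0).min() > -1e-10
             for s in (+1, -1))
    print(f"c = {c}: conditions (i)-(ii) hold? {ok}")
```

Any other choice of $\mathbf{C}_1$ can be screened the same way, since conditions (i) and (ii) only require eigenvalue checks of purely spatial covariance matrices.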
This theorem reduces the verification of a complex space–time problem to that of a purely spatial covariance model. Building upon this foundational structure, we are now prepared to incorporate a broader range of spatial covariance margins to enrich the class of admissible models. Specifically, we integrate the widely used Matérn-type spatial covariance function into our framework and derive the full set of parameter conditions required to ensure validity. In Theorem 2, we begin with a parsimonious Matérn structure in which all smoothness parameters are assumed to be equal, as specified in Equation (4) below. Theorem 3 of [14] provides necessary and sufficient conditions, under various settings, for Equation (4) to define a valid covariance matrix. These results offer important insights that inform the conditions of the theorem and corollary that follow.
Theorem 2. Let the spatial margin in Equation (2) follow the parsimonious multivariate Matérn model (4), in which all smoothness parameters are equal. A sufficient condition for the matrix function in Equation (2) to be a correlation matrix function on $\{\mathbb{S}^d \text{ or } \mathbb{R}^d\} \times \mathbb{Z}$ is that the constant $c$ satisfies inequality (3); under a further condition on the parameters, (3) is also necessary. The constants appearing in (3) are determined by the Matérn parameters and the cross-correlation coefficients.
The following theorem generalizes the parsimonious Matérn covariance structure by relaxing the constraint that all smoothness parameters must be equal, as in Equation (4). Following [14], we assume that the spatial margin in Theorem 3 is a general multivariate Matérn covariance function. In addition, the choice of $c$ is assumed to satisfy the conditions of Theorem 2 in [13], ensuring that the main diagonal elements of the resulting matrix structure are valid univariate correlation functions.
Theorem 3. Let $\mathbf{C}_s$ in Equation (5) be a general multivariate Matérn correlation matrix function with smoothness parameters $\nu_{ij}$ and scale parameters $a_{ij}$, and let $\widetilde{\mathbf{C}}_s$ be defined like $\mathbf{C}_s$, with its parameters replaced by their tilde counterparts. A necessary and sufficient condition for the matrix function in Equation (5) to be a correlation matrix function on $\{\mathbb{S}^d \text{ or } \mathbb{R}^d\} \times \mathbb{Z}$ is that the constant $c$ satisfies inequality (6). In fact, from [14], $\mathbf{C}_s$ is a valid covariance matrix function if and only if inequality (8) holds. Under certain conditions, the minimum of the left-hand side of inequality (8) can be equal to zero, which leads to the following corollary.
Corollary 1. The necessary and sufficient condition for Equation (5) to be a correlation matrix function reduces to a simpler bound on $c$ in two special cases, (a) and (b), in which the Matérn parameters satisfy additional equality constraints. The proofs of the theorems and the corollary are deferred to Appendix A.
It is well known that setting the smoothness parameter $\nu = 1/2$ in the Matérn covariance function yields the exponential form. This leads to the following example:
Example 1. Let the parameters be as in Theorem 3, and take all smoothness parameters equal to $1/2$. Then the matrix function reduces to one of exponential type, and a necessary and sufficient condition for it to be a stationary correlation matrix function on $\{\mathbb{S}^d \text{ or } \mathbb{R}^d\} \times \mathbb{Z}$ is that the constant $c$ satisfies inequality (6), with the constants there evaluated at smoothness $1/2$.
3. ARMA-Type Temporal Margin
In the previous section, we considered the spatio-temporal covariance structure with a moving average of order one (MA(1)) as the temporal margin. In this section, we extend the covariance matrix to more general cases involving other autoregressive moving average (ARMA) temporal margins.
The following theorem establishes necessary and sufficient conditions for a valid spatio-temporal covariance matrix with ARMA-type temporal dependence. As before, it assumes a common smoothness parameter across all components.
Theorem 4. Let the spatial margins in Equation (12) follow the parsimonious multivariate Matérn model, with all smoothness parameters equal. A sufficient condition for the matrix function in Equation (12) to be a correlation matrix function on $\{\mathbb{S}^d \text{ or } \mathbb{R}^d\} \times \mathbb{Z}$ is that the constant $c$ satisfies inequality (13); under a further condition on the parameters, (13) is also necessary.
We now extend this theorem to allow varying smoothness parameters $\nu_{ij}$. As in the preceding section, we follow [14] and assume that both spatial margins $\mathbf{C}_s$ and $\widetilde{\mathbf{C}}_s$ below are general multivariate Matérn covariance functions. Furthermore, we assume that the choice of $c$ satisfies the conditions in Theorem 4 of [13], ensuring that the main diagonal elements of the resulting matrix structure are valid univariate correlation functions.
Theorem 5. Let the spatial margins $\mathbf{C}_s$ and $\widetilde{\mathbf{C}}_s$ in Equation (15) be general multivariate Matérn covariance functions. A necessary and sufficient condition for the matrix function in Equation (15) to be a correlation matrix function on $\{\mathbb{S}^d \text{ or } \mathbb{R}^d\} \times \mathbb{Z}$ is that the constant $c$ satisfies inequality (16), where $\widetilde{\mathbf{C}}_s$ is defined like $\mathbf{C}_s$, with its scale and smoothness parameters replaced by their tilde counterparts. Incorporating different smoothness and scale values into the model allows for more detailed spatial parameterization, enabling a more precise capture of spatial trends. Once again, the condition in this theorem can be simplified in several special cases:
Corollary 2. The necessary and sufficient condition for Equation (15) to be a correlation matrix function reduces to a simpler bound on $c$ in two special cases, (a) and (b), analogous to those of Corollary 1. The proof of this corollary is similar to that of Corollary 1.
The temporal margin in both theorems is a linear combination of valid correlation matrices. This structure encompasses a family of valid spatio-temporal correlation functions with stationary AR(1) (first-order autoregressive), AR(2), and ARMA(2,1) temporal margins, as illustrated in the sketch below. The parameters $a_{ij}$ and $\nu_{ij}$, for $i, j = 1, \ldots, p$, can be interpreted as the spatial scaling and smoothness parameters, respectively. The autoregressive and moving-average coefficients govern the temporal dynamics, while $c$ serves as a mixing parameter balancing the two components.
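For intuition about these temporal margins, the following sketch (assuming the statsmodels library is available; the coefficient values are illustrative, not estimates from the paper) computes theoretical ACFs for the MA(1), AR(1), AR(2), and ARMA(2,1) families mentioned above.

```python
# Theoretical autocorrelations of candidate temporal margins. Note the
# statsmodels convention: ar = [1, -phi1, ...] and ma = [1, theta1, ...].
import numpy as np
from statsmodels.tsa.arima_process import arma_acf

margins = {
    "MA(1), theta=0.4":      ([1.0], [1.0, 0.4]),
    "AR(1), phi=0.6":        ([1.0, -0.6], [1.0]),
    "AR(2), phi=(0.5,0.2)":  ([1.0, -0.5, -0.2], [1.0]),
    "ARMA(2,1)":             ([1.0, -0.5, -0.2], [1.0, 0.4]),
}
for name, (ar, ma) in margins.items():
    print(f"{name:22s}", np.round(arma_acf(ar, ma, lags=6), 3))
```

The MA(1) margin cuts off after lag 1, while the AR-type margins decay geometrically, which is the behavior the mixing constant $c$ balances against the spatial components.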
To apply the proposed parametric models, one may first use time series techniques to fit ARMA models at each spatial location (see the sketch below). This step helps determine the appropriate ARMA order and provides starting values for the autoregressive and moving-average coefficients and for the mixing constant $c$. Final parameter estimation can then be performed using either maximum likelihood estimation or the weighted least squares method of [26] (see also Equation (22) in [27]). For the spatial component, standard procedures in spatial statistics can be employed to estimate initial values for the scale and smoothness parameters and the cross-correlation parameters. For instance, one can use the fitted parameters from the marginal spatial and cross-correlation functions at different time lags as starting points. Additional insights into the temporal structure can be obtained using tools such as the autocorrelation function (ACF), partial autocorrelation function (PACF), and information criteria like AIC and BIC. Since the temporal margin can initially be analyzed independently, this step provides useful guidance for model selection. Ultimately, the choice of the final model should be guided by space–time fitting criteria, which are generally robust to small variations in the marginal temporal model. Simplicity is also an important consideration in final model selection. Therefore, the proposed models, along with the stepwise estimation strategy, offer a practical and flexible approach by decomposing the complex spatio-temporal modeling problem into two more manageable steps. The proposed framework also provides an intuitive path toward modeling multivariate spatio-temporal processes, where each spatial location may follow an ARMA-type temporal process. One benefit of this approach is that it allows the multivariate MA(1) process to be approximated by analyzing marginal trends. Since the spatial correlation structure can differ across variables and time lags, it is often beneficial to estimate the trend separately at each time lag to obtain more accurate initial values. These components can then be integrated into a unified model, which is subsequently refined using joint estimation.
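As a sketch of the prescreening step above (assuming statsmodels; the helper name and the simulated series standing in for a station's anomalies are ours, not from the paper), one could select the ARMA order by AIC at each location as follows.

```python
# Per-location ARMA order selection by AIC, used only to obtain starting
# values for the joint space-time fit.
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

def prescreen_location(series, max_p=2, max_q=2):
    """Return (aic, (p, q)) for the ARMA order with the lowest AIC."""
    best = None
    for p in range(max_p + 1):
        for q in range(max_q + 1):
            if p == 0 and q == 0:
                continue
            fit = ARIMA(series, order=(p, 0, q)).fit()
            if best is None or fit.aic < best[0]:
                best = (fit.aic, (p, q))
    return best

# Illustrative use on a simulated MA(1) anomaly series:
rng = np.random.default_rng(1)
eps = rng.standard_normal(801)
x = eps[1:] + 0.4 * eps[:-1]          # MA(1) with theta = 0.4
aic, order = prescreen_location(x)
print("selected ARMA order:", order)
```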
For the data application presented in the next section, parameter estimation was conducted using the least squares method from [7] and the techniques developed in [27]. Extending these techniques to accommodate general ARMA($p$,$q$) temporal margins would require further theoretical development of the results presented here. However, such extensions remain computationally feasible, particularly when using Cressie's weighted least squares approach. We leave the exploration of more complex temporal margins for future research.
4. Data Example: Kansas Daily Temperature Data
This dataset is sourced from the National Oceanic and Atmospheric Administration (NOAA) and includes observations from 105 weather stations across Kansas. For our real-data application, we focus on two highly correlated variables: daily maximum and minimum temperatures recorded over 8030 days, from 1 January 1990 to 31 December 2011, across all 105 counties. To reduce short-term variability and obtain a more stable pattern, we compute weekly averages from the daily temperature data, resulting in 1144 weeks of average maximum and minimum temperatures, which we use as our raw dataset. We divide the dataset into training and testing sets: the first 800 weeks (approximately the first fifteen years) are used for training, and the remaining 344 weeks (the last seven years) are used for testing. To detrend and deseasonalize the data, we follow the procedure outlined in [27] by subtracting the overall mean weekly temperature for each calendar week. Specifically:
Let $X_{y,w,i}$ be the weekly average temperature in year $y$, week $w$, and location $i$; let $\bar{X}_{w,i} = \frac{1}{n}\sum_{y=1}^{n} X_{y,w,i}$ be the average temperature for week $w$ at location $i$ across the $n$ training years; and let $R_{y,w,i}$ be the weekly value at location $i$ with the seasonal mean removed, defined as
$$R_{y,w,i} = X_{y,w,i} - \bar{X}_{w,i}.$$
This deseasonalization step removes the dominant annual signal and yields weekly anomalies, which reveal the underlying MA(1) correlation pattern.
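A minimal sketch of this step (assuming the weekly averages are stored as a weeks-by-locations array; the layout and function name are our assumptions) is:

```python
# Weekly anomalies obtained by removing the per-calendar-week mean.
import numpy as np

def deseasonalize(weekly, weeks_per_year=52):
    """weekly: array of shape (n_weeks, n_locations) of weekly averages.
    Returns anomalies R with the calendar-week mean removed per location."""
    n_weeks, _ = weekly.shape
    anomalies = np.empty_like(weekly, dtype=float)
    for w in range(weeks_per_year):
        idx = np.arange(w, n_weeks, weeks_per_year)  # same calendar week each year
        anomalies[idx] = weekly[idx] - weekly[idx].mean(axis=0)
    return anomalies

# Illustrative use: 800 training weeks at 105 stations.
rng = np.random.default_rng(0)
weekly = (15 * np.sin(2 * np.pi * np.arange(800) / 52)[:, None]
          + rng.standard_normal((800, 105)))
R = deseasonalize(weekly)
print(R.shape, np.allclose(R.mean(axis=0), 0.0, atol=1.0))
```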
We then compute the autocorrelation function (ACF) and cross-correlation function (CCF) of the detrended minimum and maximum temperature series across the 105 counties using the training period. Figure 1 and Figure 2 display the ACFs of average maximum and minimum temperatures for all locations, as well as for three randomly chosen stations. Based on the ACF and CCF plots, both variables exhibit a pattern consistent with a moving average process of order one (MA(1)), supporting the use of a spatio-temporal model with an MA(1) temporal margin.
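These diagnostics can be computed with standard tools; a sketch with surrogate data (assuming statsmodels; the series below are simulated, not the Kansas data) is:

```python
# Empirical ACF and CCF of anomalies at one station, used to diagnose the
# MA(1)-type temporal margin.
import numpy as np
from statsmodels.tsa.stattools import acf, ccf

rng = np.random.default_rng(2)
e = rng.standard_normal(801)
tmax_anom = e[1:] + 0.4 * e[:-1]                         # MA(1)-like surrogate
tmin_anom = 0.8 * tmax_anom + 0.2 * rng.standard_normal(800)

print("ACF(tmax):", np.round(acf(tmax_anom, nlags=3), 3))  # cuts off after lag 1
print("CCF(tmax, tmin):", np.round(ccf(tmax_anom, tmin_anom)[:3], 3))
```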
The next step is to compute the space–time correlations from the detrended data for model fitting. Since the data include many location pairs at similar distances, it is difficult to extract stable spatial trends across time lags. To reduce noise, we apply spatial binning with 4-km bins: we average the empirical spatial correlations within each 4-km bin and discard any empty bins (see the sketch below). The binned correlations serve as the input data for further model fitting. We use least squares optimization to fit the empirical spatial correlations for minimum temperature, maximum temperature, and their cross-correlation at lag zero. These fits provide suitable initial values for the PMM, SMM, and Cauchy models introduced below.
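A sketch of the binning step (with synthetic pair distances and correlations standing in for the real station pairs) is:

```python
# Bin empirical spatial correlations into 4-km distance bins, dropping
# empty bins, before least squares fitting.
import numpy as np

def bin_correlations(dists_km, corrs, width_km=4.0):
    """dists_km, corrs: 1-D arrays over station pairs. Returns bin centers
    and the mean empirical correlation within each nonempty bin."""
    edges = np.arange(0.0, dists_km.max() + width_km, width_km)
    which = np.digitize(dists_km, edges) - 1
    centers, means = [], []
    for b in range(len(edges) - 1):
        mask = which == b
        if mask.any():                      # discard empty bins
            centers.append(edges[b] + width_km / 2)
            means.append(corrs[mask].mean())
    return np.array(centers), np.array(means)

# Illustrative use with synthetic pair distances and correlations:
rng = np.random.default_rng(3)
d = rng.uniform(0, 400, 5000)
r = np.exp(-d / 150) + 0.05 * rng.standard_normal(5000)
centers, binned = bin_correlations(d, r)
print(centers[:3], np.round(binned[:3], 3))
```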
Guided by this exploratory analysis, an MA(1)-type temporal margin is a suitable choice for applying Theorem 3. While the correlation in Theorem 3 approaches 1 as the distance tends to zero, real-world data often exhibit a nugget effect, which must be accounted for. Incorporating the nugget effect into the structure of Equation (5), we formulate the proposed model, referred to as the PMM (Partially Mixed Model), whose covariance matrix function is given in Equation (19). Its parameters are estimated with Cressie's weighted-least-squares optimization method [26] (Algorithm 1):
Algorithm 1 Estimation Procedure
1. Initialize the parameters with the starting values from the marginal fits;
2. Set the iteration counter $k = 0$;
3. Repeat:
   (a) compute the predicted covariances in Equation (19) at the current parameter values across all distance bins;
   (b) calculate the weighted sum of squares $W_k$ between the empirical and predicted correlations;
   (c) update the parameters by minimizing $W_k$ using the L-BFGS-B algorithm;
   (d) set $k \leftarrow k + 1$;
4. Until convergence: $|W_k - W_{k-1}| < \epsilon$, for a small threshold $\epsilon$.
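A compact sketch of Algorithm 1 for a single spatial margin (the full PMM has more parameters and cross-terms; the Matérn form, the bin-count weights, and all numbers below are our simplification) is:

```python
# Cressie-style weighted least squares for a Matern-type spatial margin
# with a nugget, minimized with L-BFGS-B.
import numpy as np
from scipy.optimize import minimize
from scipy.special import gamma as G, kv

def matern(h, a, nu):
    """Matern correlation with scale a > 0 and smoothness nu > 0."""
    h = np.asarray(h, dtype=float)
    out = np.ones_like(h)
    pos = h > 0
    x = a * h[pos]
    out[pos] = (2 ** (1 - nu) / G(nu)) * x ** nu * kv(nu, x)
    return out

def wls_loss(theta, h, emp, counts):
    a, nu, nug = theta
    model = (1 - nug) * matern(h, a, nu)   # nugget discounts off-zero lags
    return np.sum(counts * (emp - model) ** 2)

# Illustrative inputs: binned distances, empirical correlations, bin counts.
h = np.arange(2.0, 200.0, 4.0)
rng = np.random.default_rng(4)
emp = 0.95 * matern(h, 1 / 60, 2.5) + 0.02 * rng.standard_normal(h.size)
counts = rng.integers(20, 200, h.size)

fit = minimize(wls_loss, x0=[1 / 40, 1.0, 0.05], args=(h, emp, counts),
               method="L-BFGS-B", bounds=[(1e-4, 1.0), (0.1, 5.0), (0.0, 0.5)])
print(np.round(fit.x, 4))
```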
Finally, the estimated PMM parameters satisfy the conditions in Theorem 3, with all smoothness parameters set to 2.5, ensuring that Equation (19) is a valid covariance matrix function; otherwise, the involved covariance matrix is not invertible, and co-kriging cannot be performed. Next, we apply the purely spatial multivariate Matérn model (SMM) proposed in [14], with the nugget effect incorporated, for comparison.
In addition, we compare the performance of the continuous-time separable Cauchy model, as proposed in [14], likewise with the nugget effect incorporated (see the sketch below for one standard parameterization).
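For reference, a sketch of a separable Cauchy-type space–time correlation with a nugget (one standard parameterization of the Cauchy family; the paper's exact form is given by its display equation, and all parameter values here are illustrative) is:

```python
# Separable space-time correlation: the product of spatial and temporal
# Cauchy factors, plus a nugget at the origin.
import numpy as np

def cauchy_sep(h, u, a=0.02, alpha=1.0, b=0.5, beta=1.0, nugget=0.05):
    """Correlation at spatial lag h (km) and temporal lag u (weeks)."""
    h, u = np.asarray(h), np.asarray(u)
    spatial = (1.0 + (a * h) ** 2) ** (-alpha)
    temporal = (1.0 + (b * u) ** 2) ** (-beta)
    corr = (1.0 - nugget) * spatial * temporal
    return corr + nugget * ((h == 0) & (u == 0))

print(np.round(cauchy_sep(np.array([0.0, 50.0, 100.0]), 1), 3))
```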
Figure 3 and Figure 4 show the fitted PMM, SMM, and Cauchy models at time lags of 0 and 1 for maximum temperature, minimum temperature, and their cross space–time correlations. In Figure 3, the PMM model fits the empirical correlations better than the SMM and Cauchy models, capturing the underlying structure more accurately. In Figure 4, for the maximum and minimum temperature correlations at lag 1, the PMM better captures the correlation patterns, while the Cauchy model performs slightly better for the cross-correlation. Across both figures, the dispersion of the empirical correlations increases at long distances, as seen in the first panel of Figure 3. This pattern aligns with real-world expectations, where correlation typically decreases with distance, and it also contributes to reduced model fitting performance. Figure 5 shows that at time lag 2, all correlations are near zero, highlighting the MA(1) temporal structure in the data; the PMM correlations at lag 2 are exactly zero by construction.
After fitting the PMM, SMM, and Cauchy models on the training data, the next step is to perform co-kriging for prediction on the testing data, as described below. The response variable at a target location $\mathbf{s}_0$ and time $t_0$ is estimated as a weighted linear combination of the observations of both variables,
$$\hat{Y}(\mathbf{s}_0, t_0) = \sum_{i} \lambda_i\, Y(\mathbf{s}_i, t_i) + \sum_{j} \mu_j\, X(\mathbf{s}_j, t_j),$$
where the weight vectors $\boldsymbol{\lambda}$ and $\boldsymbol{\mu}$ are obtained by solving the co-kriging system whose coefficient matrix collects the fitted covariance matrices across all distances at each time lag for the primary variable $Y$, the secondary variable $X$, and their cross-covariances. In the PMM and Cauchy models, co-kriging is performed using the minimum and maximum temperatures across all locations at time lags 0 and 1 as input data, while the SMM model uses only the lag-0 (purely spatial) information.
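A minimal sketch of the prediction step (simple co-kriging for a zero-mean anomaly field, with made-up covariance values; the function name is ours) is:

```python
# Solve Sigma @ weights = c0 and form the prediction. Sigma stacks the
# within- and cross-variable covariances of all observations used.
import numpy as np

def cokrige(obs, Sigma, c0):
    """obs: stacked observed anomalies of both variables (length m).
    Sigma: m x m covariance matrix among the observations.
    c0: length-m covariances between observations and the target.
    Returns the co-kriging prediction and its weights."""
    weights = np.linalg.solve(Sigma, c0)
    return weights @ obs, weights

# Illustrative 2-variable, 2-site example with a valid covariance matrix:
Sigma = np.array([[1.0, 0.6, 0.8, 0.5],
                  [0.6, 1.0, 0.5, 0.8],
                  [0.8, 0.5, 1.0, 0.6],
                  [0.5, 0.8, 0.6, 1.0]])
c0 = np.array([0.7, 0.4, 0.6, 0.3])
obs = np.array([1.2, 0.8, 1.0, 0.5])
pred, w = cokrige(obs, Sigma, c0)
print(round(pred, 3), np.round(w, 3))
```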
In addition, we consider a traditional time series modeling approach. Since standard time series prediction functions in R do not support forecasting with fixed parameters, we developed a custom implementation using the innovations algorithm described in [4]. We fit the time series model on the training data for maximum temperatures at all 105 stations. Specifically, for each station, we estimated the moving-average coefficient and the innovation variance, the key components of a moving average process of order one (MA(1)), and used them to generate predictions on the testing data.
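A sketch of the fixed-parameter MA(1) predictor via the innovations recursions (our Python illustration of the standard Brockwell-Davis recursions, in place of the original R implementation, run on a simulated series) is:

```python
# One-step-ahead MA(1) prediction with theta and sigma2 held fixed.
import numpy as np

def ma1_innovations_predict(x, theta, sigma2):
    """Returns (xhat, next_pred), where xhat[n] predicts x[n]."""
    n = len(x)
    g0, g1 = sigma2 * (1 + theta ** 2), sigma2 * theta   # gamma(0), gamma(1)
    xhat = np.zeros(n + 1)
    v = g0                                               # v_0
    for i in range(n):
        th = g1 / v                                      # theta_{n,1}
        v = g0 - th ** 2 * v                             # v_n
        xhat[i + 1] = th * (x[i] - xhat[i])
    return xhat[:-1], xhat[-1]

# Illustrative check on a simulated MA(1) series:
rng = np.random.default_rng(5)
e = rng.standard_normal(1001)
x = e[1:] + 0.4 * e[:-1]
preds, next_pred = ma1_innovations_predict(x, theta=0.4, sigma2=1.0)
print("pred. error std vs raw std:",
      np.std(x - preds).round(3), np.std(x).round(3))
```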
Finally, predictions were obtained for the testing period, and the root mean squared error (RMSE) and the 95% data interval were computed for each method to assess predictive performance.
Table 1 reports the model performance across all counties for maximum temperature.
The percentage of stations with the lowest RMSE shows that the PMM model outperforms the others at most locations, achieving the lowest RMSE at 93.3% of the 105 stations. This demonstrates the model's broad applicability and consistency across different locations. Consequently, the PMM model also produces the lowest average RMSE across all locations. While this difference may seem small, it is important for de-seasonalized weekly average temperature data, where fluctuations are limited, making even small improvements both statistically and practically meaningful; see [28]. Moreover, the PMM model proves to be more reliable at individual stations, consistently providing better local predictions. This suggests that the model captures more complex spatial structures than a simple MA(1) temporal margin or a purely spatial correlation margin such as the SMM model. The comparison models also perform strongly: all models share the same initial values, obtained from the marginal fits, so even slight improvements over them are meaningful. Based on this analysis, the proposed PMM model demonstrates consistently better predictive performance, particularly when the temporal margin of the space–time process is properly modeled using an MA(1) structure. The PMM model can also perform well when the primary variable has a large number of missing values, by leveraging information from the secondary variable, which univariate time series models cannot utilize. Furthermore, in real-world applications involving complex spatio-temporal data, model selection can be challenging; the PMM stands out as a convenient choice because the marginal spatial and temporal correlations alone suggest the appropriate structure. These results suggest that incorporating both strongly correlated spatial components and discrete-time dependence improves overall predictive accuracy.