Article

Varying-Coefficient Additive Models with Density Responses and Functional Auto-Regressive Error Process

1 Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA 98109, USA
2 School of Statistics and Data Science, Shanghai University of Finance and Economics, Shanghai 200433, China
3 Department of Mathematics and Statistics, McMaster University, Hamilton, ON L8S 4L8, Canada
* Author to whom correspondence should be addressed.
Entropy 2025, 27(8), 882; https://doi.org/10.3390/e27080882
Submission received: 11 July 2025 / Revised: 15 August 2025 / Accepted: 19 August 2025 / Published: 20 August 2025
(This article belongs to the Section Information Theory, Probability and Statistics)

Abstract

In many practical applications, data collected over time often exhibit autocorrelation, which, if unaccounted for, can lead to biased or misleading statistical inferences. To address this issue, we propose a varying-coefficient additive model for density-valued responses, incorporating a functional auto-regressive (FAR) error process to capture serial dependence. Our estimation procedure consists of three main steps, utilizing spline-based methods after mapping density functions into a linear space via the log-quantile density transformation. First, we obtain initial estimates of the bivariate varying-coefficient functions using a B-spline series approximation. Second, we estimate the error process from the residuals using spline smoothing techniques. Finally, we refine the estimates of the additive components by adjusting for the estimated error process. We establish theoretical properties of the proposed method, including convergence rates and asymptotic behavior. The effectiveness of our approach is further demonstrated through simulation studies and applications to real-world data.

1. Introduction

Density data or, more broadly, distributional data, are increasingly encountered across a wide range of scientific and applied research domains. Notable examples include the distributions of cross-sectional or intraday stock returns [1,2], mortality rate densities [3], and the distributions of intrahub connectivity in neuroimaging studies [4,5]. In many settings, such density functions are observed sequentially over time, forming what we refer to in this paper as a density time series. A motivating example is presented in Figure 1. Panel (a) displays a density time series of the global COVID-19 mortality rate (‰) over a 100-day period, based on data collected from 22 January 2020 to 15 April 2021. Panel (b) offers a complementary view by plotting the densities on three selected days, highlighting the temporal evolution of the distributional patterns. In this work, we study a regression framework in which the response consists of a density time series, while the predictors are scalar covariates. This setting allows for the exploration of how scalar factors influence the dynamic evolution of entire distributions over time.
Unlike conventional functional data, density functions do not form a linear space due to inherent constraints, such as nonnegativity and the requirement that they integrate to one. These restrictions pose significant challenges for directly applying standard functional data analysis techniques to random densities. To address this, several approaches have been proposed, which can be broadly grouped into two categories. The first approach involves transforming densities into a Hilbert space through suitable continuous and invertible mappings, thereby overcoming the nonlinear structure of the density space. For example, Petersen and Müller [6] introduced two such transformations, the log-hazard transformation and the log-quantile density (LQD) transformation, that map probability densities to an unrestricted space of square-integrable functions. Building on this, Han et al. [7] employed the LQD transformation to model density responses within an additive functional-to-scalar regression framework. Similarly, Kokoszka et al. [1] developed two methods for forecasting density functions derived from cross-sectional and intraday financial returns, using compositional data analysis and a modified log-quantile transformation combined with functional principal component (FPC) analysis and exponential smoothing techniques. The second category of methods takes a geometric perspective by defining appropriate metrics on the space of probability distributions. For instance, Talská et al. [8] used an infinite-dimensional extension of Aitchison geometry to construct a density-on-scalar linear regression model within Bayes-Hilbert spaces. Meanwhile, Petersen and Müller [3] studied Fréchet regression in general metric spaces equipped with the Wasserstein metric. Extending this line of work, Chen et al. [9] leveraged the geometry of tangent bundles in Wasserstein space to propose distribution-on-distribution regression models and developed auto-regressive extensions for distribution-valued time series. Additionally, Zhang et al. [10] explored auto-regressive models of order p for density-valued time series using the Wasserstein metric through a different methodological framework.
Let $\mathcal{F}$ denote the space of density functions $f$ defined on a common support $\mathcal{U}$. Without loss of generality, we assume that $\mathcal{U} = [0,1]$. Given a transformation $\Psi: \mathcal{F} \to L^2$, the conditional Fréchet mean of a random density $f$, given a covariate $X \in \mathbb{R}^d$, is defined as
$$\mu(\cdot \mid X) = \underset{d \in \mathcal{F}}{\arg\min}\; E\left( \|\Psi(f) - \Psi(d)\|_2^2 \mid X \right),$$
where the expectation $E$ is taken with respect to the joint distribution of $(X, f)$.
This is equivalent to the following formulation:
$$\Psi(\mu(\cdot \mid X))(u) = E\left( \Psi(f)(u) \mid X \right), \quad 0 \le u \le 1,$$
leading to the fact that
$$\mu(s \mid X) = \Psi^{-1}\left( E\left( \Psi(f)(u) \mid X \right) \right)(s), \quad 0 \le s \le 1.$$
The data considered in this article consist of a density time series $d_t$, observed sequentially over time, along with associated scalar predictors $(X_t, Z_t)$. To facilitate the analysis of density functions, we employ the LQD transformation $\Psi: \mathcal{F} \to L^2$, where $\mathcal{F}$ denotes the space of density functions $d$ satisfying the moment condition $\int_{\mathbb{R}} u^2 d(u)\,du < \infty$. For each $d_t \in \mathcal{F}$, let $F_t(y)$ be the corresponding cumulative distribution function with support on $[0,1]$, and let $Q_t(u)$ denote the associated quantile function. The quantile density function is given by $q_t(u) = Q_t'(u) = \frac{d}{du} F_t^{-1}(u)$ for $u \in [0,1]$. Then, the LQD transformation of $d_t$ is defined as
$$\Psi(d_t)(u) = \log\left\{ \frac{d}{du} F_t^{-1}(u) \right\}, \quad u \in [0,1].$$
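To make the transformation concrete, the following minimal sketch (ours, not the authors' implementation) computes the LQD representation of a density tabulated on a grid, using the identity $\frac{d}{du}F^{-1}(u) = 1/d(Q(u))$; the grid sizes, interpolation scheme, and clipping threshold are illustrative assumptions.

```python
import numpy as np

def lqd_transform(density, grid, n_out=101):
    # Map a density tabulated on `grid` to its LQD representation
    # Psi(d)(u) = log q(u) = -log d(Q(u)), using dF^{-1}/du = 1/d(Q(u)).
    incr = 0.5 * (density[1:] + density[:-1]) * np.diff(grid)
    cdf = np.r_[0.0, np.cumsum(incr)]
    cdf /= cdf[-1]                          # enforce F(1) = 1 numerically
    u = np.linspace(0.0, 1.0, n_out)
    Q = np.interp(u, cdf, grid)             # quantile function Q(u)
    d_at_Q = np.interp(Q, grid, density)    # d(Q(u))
    return -np.log(np.clip(d_at_Q, 1e-10, None))
```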
In this study, we propose a varying-coefficient additive model with a functional auto-regressive error process to estimate the conditional expectation E ( Ψ ( d t ) | X t , Z t ) . Under this framework, the density function d t can be expressed as
$$d_t = \Psi^{-1}\left( E(\Psi(d_t) \mid X_t, Z_t) \right) + \delta_{t1},$$
where $\delta_{t1}$ represents the regression error.
In addition to $\delta_{t1}$, a second source of error commonly arises from the estimation of the density function $d_t$. Specifically, in most practical settings, the density $d_t$ is not directly observed. Instead, only a finite sample $Y_{t1}, \ldots, Y_{tn_t} \sim d_t$ is available at each time point $t$, leading to an estimated density $\hat{d}_t$ given by
$$\hat{d}_t = d_t + \delta_{t2},$$
where $\delta_{t2}$ denotes the error due to density estimation. Throughout this article, we assume that the sample size $n_t = n$ is fixed across time.
Following the approach of [6], we estimate $d_t$ using a modified kernel density estimator that addresses boundary effects. The estimator is defined as
$$\hat{d}_t(y) = \sum_{i=1}^{n} K\left( \frac{y - Y_{ti}}{h} \right) w(y, h) \Big/ \sum_{i=1}^{n} \int_0^1 K\left( \frac{s - Y_{ti}}{h} \right) w(s, h)\, ds,$$
where $K$ is a symmetric kernel function with bandwidth $h < 1/2$ and the weight function $w(y, h)$ is designed to correct for boundary bias. Specifically, $w(y, h)$ is given by
$$w(y,h) = \left( \int_{-y/h}^{1} K(u)\,du \right)^{-1} I\{y \in [0,h)\} + \left( \int_{-1}^{(1-y)/h} K(u)\,du \right)^{-1} I\{y \in (1-h,1]\} + I\{y \in [h, 1-h]\}.$$
We assume that the kernel $K$ is of bounded variation, symmetric about 0, and satisfies the following conditions: $\int_0^1 K(u)\,du > 0$; $\int_{\mathbb{R}} |u| K(u)\,du$, $\int_{\mathbb{R}} K^2(u)\,du$, and $\int_{\mathbb{R}} |u| K^2(u)\,du$ are finite. Therefore, when fitting the regression model using the estimated density $\hat{d}_t$ in place of the true $d_t$, the model can be written as
$$\hat{d}_t = \Psi^{-1}\left( E(\Psi(d_t) \mid X_t, Z_t) \right) + \delta_{t1} + \delta_{t2}.$$
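For readers who want to reproduce the preprocessing, here is a minimal sketch of the boundary-corrected estimator above; the Epanechnikov kernel and the numerical integration rule are our assumptions, not choices made in the paper.

```python
import numpy as np
from scipy.integrate import quad

def K(u):
    # Epanechnikov kernel: symmetric, bounded variation, support [-1, 1]
    return 0.75 * max(1.0 - u * u, 0.0)

def w(y, h):
    # Boundary weight: inverse of the kernel mass retained inside [0, 1]
    if y < h:                                  # left boundary region
        mass, _ = quad(K, -y / h, 1.0)
        return 1.0 / mass
    if y > 1.0 - h:                            # right boundary region
        mass, _ = quad(K, -1.0, (1.0 - y) / h)
        return 1.0 / mass
    return 1.0                                 # interior: no correction

def kde_boundary(sample, y_grid, h, n_int=401):
    # Numerator: kernel-weight products; the denominator normalizes the
    # estimate to integrate to one over [0, 1].
    s_grid = np.linspace(0.0, 1.0, n_int)
    num = np.zeros(len(y_grid))
    denom = 0.0
    for Y in sample:
        num += np.array([K((y - Y) / h) * w(y, h) for y in y_grid])
        vals = np.array([K((s - Y) / h) * w(s, h) for s in s_grid])
        denom += np.trapz(vals, s_grid)
    return num / denom
```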
The key contribution of this article lies in the integration of density time series modeling with a functional auto-regressive (FAR) error process, a direction that has not been previously studied in the literature. A common assumption in regression analysis, including functional and density-based models, is the independence of random errors; however, this assumption is often violated in time-indexed data, where observations naturally exhibit serial dependence. By explicitly incorporating a FAR(1) structure into the error process, our approach effectively captures the temporal correlation inherent in density time series, thereby enhancing both the flexibility and accuracy of the model.
Many real-world phenomena are characterized by time-evolving densities that exhibit strong temporal dependencies [11,12,13,14]. In these settings, the observed distribution at a given time point is not independent of previous distributions, but rather influenced by them through complex temporal dynamics. This phenomenon, commonly referred to as serial dependence, cannot be adequately modeled under the assumption of independent errors. Ignoring such dependencies often leads to biased estimation, underestimated variability, and invalid inference, as documented in numerous empirical and theoretical studies.
While FAR models have been widely explored as standalone tools for modeling functional time series data [11,12,13,14,15,16,17], their use as an error structure within a regression model for density-valued responses remains largely underdeveloped. This article addresses this methodological gap by embedding a FAR(1) process into the error term of a varying-coefficient additive regression framework tailored to density time series. This novel integration enables the model to more faithfully capture both the structured signal and the dynamic residual behavior present in such data.
In addition to this modeling innovation, we make several theoretical contributions. Specifically, we develop a new estimation procedure that accommodates both the infinite-dimensional nature of the response and the temporal dependence in the errors. We further derive the asymptotic normality of the proposed estimator, which requires nontrivial extensions of existing techniques in functional data analysis and Hilbert space theory. This allows for valid statistical inference and construction of confidence intervals in practice.
In summary, this work contributes a new class of models for density-valued time series with auto-regressive error dynamics, bridging gaps between functional time series, density regression, and auto-regressive modeling. The proposed framework provides both a theoretical foundation and a practical tool for analyzing complex time-evolving distributional data in a wide range of applications.
The remainder of this article is organized as follows: Section 2 presents the methodology for constructing a varying-coefficient additive model with a density response, incorporating a functional auto-regressive (FAR) error process. In Section 3, we propose a three-step estimation procedure for the bivariate varying-coefficient components within the model. Section 4 establishes the theoretical properties of the proposed model and discusses related inferential results. Section 5 reports Monte Carlo simulation studies that evaluate the efficiency and robustness of our approach. In Section 6, we demonstrate the practical utility of the model through applications to COVID-19 mortality data and U.S. income distribution data. Finally, Section 7 offers concluding remarks, and the Supplementary Material contains detailed proofs of the theoretical results.

2. Model Setup

In this article, we focus on modeling density responses. Due to the inherent constraints of density functions, namely nonnegativity and integration to one, we work with their representations after applying the log-quantile density (LQD) transformation.
Our primary goal is to estimate the conditional expectation $E(\Psi(d_t)(u) \mid x)$ through the transformed density functions, expressed as
$$E(\Psi(d_t)(u) \mid x) = \sum_{m=1}^{k} z_{t,m}\, g_m(u, x_{t,m}), \quad 0 \le u \le 1,$$
which leads to the proposed varying-coefficient additive model with density responses and functional auto-regressive error process FAR(p) (DVCA-FAR):
$$f_t(u) = \Psi(d_t)(u) = \sum_{m=1}^{k} z_{t,m}\, g_m(u, x_{t,m}) + \varepsilon_t(u), \quad 0 \le u \le 1,\ 1 \le t \le T,$$
where the error process $\varepsilon_t(u)$ follows a functional auto-regressive process of order $p$:
$$\varepsilon_t(u) = \int \gamma_1(s,u)\, \varepsilon_{t-1}(s)\, ds + \cdots + \int \gamma_p(s,u)\, \varepsilon_{t-p}(s)\, ds + e_t(u).$$
In this framework, the random density $d_t(\cdot) \in \mathcal{F}$ serves as the response variable, and $\Psi: \mathcal{F} \to L^2$ denotes the LQD transformation. Each density is associated with two sets of $k$-dimensional covariates, $\mathbf{x}_t = (x_{t,1}, \ldots, x_{t,k})^\tau$ and $\mathbf{z}_t = (z_{t,1}, \ldots, z_{t,k})^\tau$, with supports $S_x$ and $S_z$, respectively. Without loss of generality, we assume $S_x = S_z = [0,1]$. In this article, the covariate $\mathbf{x}_t$ can represent $\mathbf{z}_t$ or the rescaled time index $t/T$.
The bivariate functions $g_m(\cdot, x_m)$ capture the effects of the covariates $\mathbf{z}$, while the kernel functions $\gamma_l(\cdot, \cdot)$ are smooth and satisfy the integrability condition $\int\!\!\int \gamma_l^2(s,u)\, du\, ds < \infty$. The innovation process $e_t(u)$ consists of independent and identically distributed random functions with zero mean $E(e_t(u)) = 0$ and covariance function $\mathrm{Cov}(e_t(u), e_t(s)) = \sigma_t^2(u,s)$.
When the density functions are estimated, denoted by $\hat{d}_t$, we write $\hat{f}_t = \Psi(\hat{d}_t)$. The DVCA-FAR model then takes the form
$$\hat{f}_t(u) = \sum_{m=1}^{k} z_{t,m}\, g_m(u, x_{t,m}) + \varepsilon_t(u) + \varepsilon_{ft}(u), \quad 0 \le u \le 1,$$
where $\varepsilon_{ft}(u)$ represents the additional random error introduced by transforming the estimated density.
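As an illustration of the error model, the following sketch simulates a FAR(1) error process on a grid by approximating the kernel integral with a Riemann sum; the kernel, the simplified one-term innovation law, and the burn-in length are illustrative assumptions (the kernel shown matches the Case 1 design of Section 5).

```python
import numpy as np

def simulate_far1_errors(T, grid, gamma, innovation, burn_in=50):
    # eps_t(u) = int gamma(s, u) eps_{t-1}(s) ds + e_t(u), with the
    # integral approximated by a Riemann sum on `grid`.
    du = grid[1] - grid[0]
    G = gamma(grid[:, None], grid[None, :])    # G[i, j] = gamma(s_i, u_j)
    eps, path = np.zeros(len(grid)), []
    for t in range(T + burn_in):
        eps = eps @ G * du + innovation(grid)
        if t >= burn_in:
            path.append(eps.copy())
    return np.asarray(path)                    # (T, len(grid)) error curves

# Example with the Case 1 kernel gamma_1(s, u) = 0.2*u*s from Section 5
rng = np.random.default_rng(0)
grid = np.linspace(0.0, 1.0, 101)
innov = lambda g: 0.02 * rng.standard_normal() * np.sin(np.pi * g)
errors = simulate_far1_errors(100, grid, lambda s, u: 0.2 * u * s, innov)
```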

3. Three-Step Estimation Methodology

We propose a three-step estimation procedure to estimate the varying-coefficient functions in the presence of a functional auto-regressive error structure. In the first step, we apply B-spline smoothing to obtain initial estimates of the bivariate varying-coefficient functions, ignoring the temporal dependence in the error process. In the second step, using the initial estimates and the transformed response, we estimate the error component. Specifically, the order and structure of the functional auto-regressive (FAR) process are determined using the sequential testing procedure proposed by Kokoszka and Reimherr [16]. In the final step, after removing the estimated FAR error from the response, we refine the estimation of the varying-coefficient functions using the spline-based method to obtain improved results.

3.1. Initial Estimation of Bivariate Varying-Coefficient Function

To begin, we estimate the bivariate varying-coefficient functions $g_m(u, x_m)$, for $m = 1, \ldots, k$, by applying a tensor product B-spline approximation, ignoring the temporal structure in the error term.
Let $\{B_0(u), \ldots, B_{N_0}(u)\}$ denote a set of B-spline basis functions of order $q$ with $L_0$ interior knots, defined on the domain $u \in [0,1]$, so that $N_0 + 1 = L_0 + q$. Similarly, for each $m = 1, \ldots, k$, let $\{B_{0,m}(x_m), \ldots, B_{N_m,m}(x_m)\}$ be a set of B-spline basis functions of order $q$ for the covariate $x_m$, with $L_m$ interior knots, so that $N_m + 1 = L_m + q$. Let $b_{j,m}^*(x_m)$ denote the normalized version of the B-spline basis function $B_{j,m}(x_m)$, and define the scaled basis $b_r(u) = N_0^{1/2} B_r(u)$.
The tensor product of the B-spline basis functions is given by
$$b_{r,j,m}(u, x_m) = b_r(u)\, b_{j,m}^*(x_m), \quad 1 \le r \le N_0,\ 1 \le j \le N_m,\ 1 \le m \le k.$$
Using this basis, the function $g_m(u, x_m)$ can be approximated as
$$g_m(u, x_m) \approx \sum_{r=1}^{N_0} \sum_{j=1}^{N_m} \lambda_{r,j,m}\, b_{r,j,m}(u, x_m), \quad 1 \le m \le k,$$
where λ r , j , m are the spline coefficients.
The least squares estimator of $g_m(u, x_m)$ is then given by
$$\tilde{g}_m(u, x_m) = \sum_{r=1}^{N_0} \sum_{j=1}^{N_m} \tilde{\lambda}_{r,j,m}\, b_{r,j,m}(u, x_m), \quad 1 \le m \le k,$$
where the vector of estimated coefficients $\tilde{\lambda} = (\tilde{\lambda}_{1,1,1}, \ldots, \tilde{\lambda}_{N_0,N_k,k})^\tau$ is an $(N_0 \sum_{m=1}^{k} N_m)$-dimensional parameter vector obtained by solving
$$\tilde{\lambda} = \underset{\lambda}{\arg\min} \sum_{t=1}^{T} \sum_{i=1}^{n} \left[ \hat{f}_t(u_i) - \sum_{m=1}^{k} z_{t,m} \sum_{r=1}^{N_0} \sum_{j=1}^{N_m} \lambda_{r,j,m}\, b_{r,j,m}(u_i, x_{t,m}) \right]^2.$$
Theoretical properties of this estimator are established in Theorem 1, which shows that the initial estimators $\tilde{g}_m(u, x_m)$ are uniformly consistent under suitable regularity conditions.
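A minimal sketch of this first step is given below. It builds open-uniform tensor-product B-spline design matrices and solves the least squares problem directly; the knot layout, spline degree, and basis sizes are our assumptions, and, unlike the normalized and scaled basis used in the text, no normalization is applied.

```python
import numpy as np
from scipy.interpolate import BSpline

def bspline_design(x, n_basis, degree=3):
    # Open-uniform B-spline design matrix on [0, 1]; knot layout is an
    # illustrative assumption, not the paper's choice.
    interior = np.linspace(0.0, 1.0, n_basis - degree + 1)
    t = np.r_[np.zeros(degree), interior, np.ones(degree)]
    x = np.clip(x, 0.0, 1.0 - 1e-12)  # keep evaluation inside the knot span
    return BSpline.design_matrix(x, t, degree).toarray()

def initial_fit(f_hat, u_grid, x, z, n_u=8, n_x=8):
    # Step 1: least squares for sum_m z_{t,m} g_m(u, x_{t,m}) with
    # tensor-product B-splines, ignoring serial dependence in the errors.
    # f_hat: (T, n) transformed response curves; x, z: (T, k) covariates.
    T, n = f_hat.shape
    k = x.shape[1]
    Bu = bspline_design(u_grid, n_u)                       # (n, n_u)
    rows = []
    for t in range(T):
        blocks = []
        for m in range(k):
            Bx = bspline_design(np.array([x[t, m]]), n_x)  # (1, n_x)
            # tensor product b_r(u_i) * b_j(x_{t,m}), scaled by z_{t,m}
            blocks.append(z[t, m] * np.kron(Bu, Bx))
        rows.append(np.hstack(blocks))
    D = np.vstack(rows)                                    # (T*n, k*n_u*n_x)
    coef, *_ = np.linalg.lstsq(D, f_hat.ravel(), rcond=None)
    return coef.reshape(k, n_u, n_x)                       # lambda_{r,j,m} by m
```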

3.2. Estimation of FAR Error Process

With the initial estimates $\tilde{g}_m(u, x_m)$ obtained, we proceed to estimate the FAR error process. To do so, define the residuals as
$$\tilde{\varepsilon}_t(u) = f_t(u) - \sum_{m=1}^{k} z_{t,m}\, \tilde{g}_m(u, x_{t,m}), \quad 1 \le t \le T.$$
Let $\rho_t(u) = \sum_{l=1}^{p} \int \gamma_l(s,u)\, \varepsilon_{t-l}(s)\, ds$ denote the additive component in the FAR(p) error process (3). Then, the FAR process can be written as
$$\varepsilon_t(u) = \rho_t(u) + e_t(u),$$
where e t ( u ) is a zero-mean innovation term.
Let $\{B_0(u), B_1(u), \ldots, B_N(u)\}$ be a set of B-spline basis functions of order $q$ with $L$ interior knots, such that $N + 1 = L + q$. Define the tensor product of the B-spline basis as
$$b_{r,j}(u, s) = B_r(u)\, B_j(s), \quad 1 \le r, j \le N.$$
Using this basis, the FAR kernel functions $\gamma_l(\cdot, \cdot)$ are approximated as
$$\gamma_l(s, u) \approx \sum_{r=1}^{N} \sum_{j=1}^{N} \mu_{r,j,l}\, b_{r,j}(u, s), \quad 1 \le l \le p.$$
The vector of spline coefficients $\mu = (\mu_{1,1,1}, \ldots, \mu_{N,N,p})^\tau \in \mathbb{R}^{pN^2}$ is obtained by minimizing the following squared error criterion:
$$\hat{\mu} = \underset{\mu}{\arg\min} \sum_{t=p+1}^{T} \sum_{i=1}^{n} \left[ \tilde{\varepsilon}_t(u_i) - \sum_{l=1}^{p} \sum_{r=1}^{N} \sum_{j=1}^{N} \mu_{r,j,l} \int b_{r,j}(u_i, s)\, \tilde{\varepsilon}_{t-l}(s)\, ds \right]^2.$$
The estimated FAR kernel functions and the additive component of the error process are then given by
$$\hat{\gamma}_l(s, u) = \sum_{r=1}^{N} \sum_{j=1}^{N} \hat{\mu}_{r,j,l}\, b_{r,j}(u, s), \quad 1 \le l \le p,$$
and
$$\hat{\rho}_t(u) = \sum_{l=1}^{p} \sum_{r=1}^{N} \sum_{j=1}^{N} \hat{\mu}_{r,j,l} \int b_{r,j}(u, s)\, \tilde{\varepsilon}_{t-l}(s)\, ds, \quad 0 \le u \le 1,\ p+1 \le t \le T,$$
respectively.
Since the order p of the FAR error process is typically unknown in practice, we employ the sequential testing procedure proposed by [16] to determine the optimal order p. The details of this procedure are provided in Section 3.4.2.
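The following sketch illustrates the second step under the same assumptions, reusing `bspline_design` from the previous sketch: the lag integrals are computed by the trapezoidal rule and the coefficients $\mu_{r,j,l}$ are obtained by ordinary least squares.

```python
import numpy as np

def fit_far_kernels(resid, grid, p, n_basis=8):
    # Step 2: spline least squares for the FAR(p) kernels
    # gamma_l(s, u) ~ sum_{r,j} mu_{r,j,l} B_r(u) B_j(s).
    # resid: (T, n) residual curves evaluated on `grid`.
    T, n = resid.shape
    B = bspline_design(grid, n_basis)          # from the Step 1 sketch
    X_rows, y_rows = [], []
    for t in range(p, T):
        # c[l, j] = int B_j(s) resid_{t-l}(s) ds, by the trapezoidal rule
        c = np.stack([np.trapz(resid[t - 1 - l][:, None] * B, grid, axis=0)
                      for l in range(p)])      # (p, n_basis)
        # feature[(i), (l, j, r)] = c[l, j] * B_r(u_i)
        X_rows.append(np.kron(c.ravel()[None, :], B))
        y_rows.append(resid[t])
    mu, *_ = np.linalg.lstsq(np.vstack(X_rows), np.concatenate(y_rows),
                             rcond=None)
    mu = mu.reshape(p, n_basis, n_basis)       # indexed as mu[l, j, r]
    # gamma_l on the grid: gammas[l, a, b] = gamma_l(s_a, u_b)
    gammas = np.einsum('ljr,br,aj->lab', mu, B, B)
    return gammas
```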

3.3. Improved Estimation of Bivariate Varying-Coefficient Function

With an estimate of the FAR error component obtained, we now refine the estimation of the varying-coefficient functions $g_m(u, x_m)$ by removing the estimated serial dependence (8) from the response $f_t(u)$.
Define the adjusted response function as
$$f_t^c(u) = f_t(u) - \sum_{l=1}^{p} \int \gamma_l(s,u)\, \varepsilon_{t-l}(s)\, ds, \quad 0 \le u \le 1,\ p+1 \le t \le T,$$
and its empirical estimate $\hat{f}_t^c(u)$ as
$$\hat{f}_t^c(u) = f_t(u) - \sum_{l=1}^{p} \int \hat{\gamma}_l(s,u)\, \tilde{\varepsilon}_{t-l}(s)\, ds, \quad 0 \le u \le 1,\ p+1 \le t \le T.$$
From the model specification, we have
$$f_t^c(u) = \sum_{m=1}^{k} z_{t,m}\, g_m(u, x_{t,m}) + e_t(u), \quad 0 \le u \le 1,$$
which allows us to re-estimate $g_m(u, x_m)$ by repeating the same spline-based procedure as described in Section 3.1, but now applied to the corrected responses $\hat{f}_t^c(u)$.
The improved spline approximation estimates take the form
$$\hat{g}_m(u, x_m) = \sum_{r=1}^{N_0} \sum_{j=1}^{N_m} \hat{\lambda}_{r,j,m}\, b_{r,j,m}(u, x_m), \quad 1 \le m \le k,$$
where the coefficient vector $\hat{\lambda} = (\hat{\lambda}_{1,1,1}, \ldots, \hat{\lambda}_{N_0,N_k,k})^\tau$ is an $(N_0 \sum_{m=1}^{k} N_m)$-dimensional vector obtained by minimizing
$$\hat{\lambda} = \underset{\lambda}{\arg\min} \sum_{t=p+1}^{T} \sum_{i=1}^{n} \left[ \hat{f}_t^c(u_i) - \sum_{m=1}^{k} z_{t,m} \sum_{r=1}^{N_0} \sum_{j=1}^{N_m} \lambda_{r,j,m}\, b_{r,j,m}(u_i, x_{t,m}) \right]^2.$$
Theoretical guarantees for this refined estimator are provided in Theorems 2 and 3, which establish its uniform convergence and asymptotic normality under regularity conditions. In addition, simulation results reported in Section 5 demonstrate that the improved estimator $\hat{g}_m(u, x_m)$ achieves greater efficiency and accuracy compared to the initial estimator $\tilde{g}_m(u, x_m)$.
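A short sketch of the final step, reusing `initial_fit` from the Step 1 sketch: subtract the estimated serial component from each response curve and refit.

```python
import numpy as np

def refined_fit(f_hat, resid, gammas, grid, x, z, p, **spline_kw):
    # Step 3: subtract the estimated serial component
    # sum_l int gamma_l(s, u) resid_{t-l}(s) ds, then rerun the Step 1
    # spline fit on the corrected responses (defined for t >= p+1 only).
    T = f_hat.shape[0]
    f_corr = f_hat[p:].copy()
    for t in range(p, T):
        for l in range(p):
            f_corr[t - p] -= np.trapz(gammas[l] * resid[t - 1 - l][:, None],
                                      grid, axis=0)
    return initial_fit(f_corr, grid, x[p:], z[p:], **spline_kw)
```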

3.4. Implementation

3.4.1. Selection of Bandwidth

In empirical applications, it is necessary to estimate the underlying density functions before model fitting. This step requires selecting an appropriate bandwidth for the modified kernel density estimator. In this section, we adopt a leave-one-out cross-validation (LOOCV) strategy to select the optimal bandwidth.
Specifically, the bandwidth $h$ is chosen to minimize the following mean squared error (MSE) criterion:
$$CV(h) = \frac{1}{nT} \sum_{t=1}^{T} \sum_{i=1}^{n} \left[ \hat{d}_t(y_{ti}) - \hat{d}_t^{(-i)}(y_{ti}) \right]^2,$$
where, for each $i = 1, \ldots, n$, $\hat{d}_t^{(-i)}(y_{ti})$ denotes the density estimate of $d_t(y_{ti})$ with bandwidth $h$ at the point $y_{ti}$, using all observations from time point $t$ except the $i$-th one.
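A direct, if brute-force, sketch of this criterion follows, reusing `kde_boundary` from the earlier sketch; comparing the full-sample and leave-one-out estimates at each held-out point is our reading of the criterion above.

```python
import numpy as np

def loocv_bandwidth(samples, grid, h_grid):
    # samples: list of T arrays, each holding the n draws at one time point.
    # For each candidate h, compare the full-sample estimate with the
    # leave-one-out estimate at every held-out data point.
    scores = []
    for h in h_grid:
        err, count = 0.0, 0
        for Y in samples:
            full = kde_boundary(Y, Y, h)   # estimator sketched earlier
            for i in range(len(Y)):
                loo = kde_boundary(np.delete(Y, i), np.array([Y[i]]), h)
                err += (full[i] - loo[0]) ** 2
                count += 1
        scores.append(err / count)
    return h_grid[int(np.argmin(scores))]
```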

3.4.2. Identifying the Order of the FAR Process

To determine the order $p$ of the FAR error process, we apply the sequential testing procedure proposed by [16]. The method frames FAR modeling as a fully functional linear regression with dependent regressors and systematically tests whether increasing the order improves the model fit.
The procedure tests the following nested hypotheses:
$$H_{0,p}: \{\varepsilon_t\} \text{ follows a FAR}(p) \quad \text{vs.} \quad H_{a,p+1}: \{\varepsilon_t\} \text{ follows a FAR}(p+1), \quad p = 0, 1, 2, \ldots.$$
Here, FAR(0) corresponds to an independent and identically distributed process. The testing begins at $p = 0$ and continues sequentially, stopping when $H_{0,p}$ is not rejected, at which point the selected model order is taken to be $p$. See [16] for the full theoretical development.
To construct the test statistic, define the following components. Let
$$\eta_j(s) = \sum_{l=1}^{p} \tilde{\varepsilon}_{j-l}\big(sp - (l-1)\big)\, I_l(s), \qquad \varphi(s, u) = p \sum_{l=1}^{p} \gamma_l\big(sp - (l-1), u\big)\, I_l(s),$$
where $I_l$ is the indicator function of the interval $[(l-1)/p, l/p]$. Denote
$$\hat{C}_\eta(s, u) = \frac{1}{T} \sum_{j=1}^{T} \big(\eta_j(s) - \bar{\eta}(s)\big)\big(\eta_j(u) - \bar{\eta}(u)\big)$$
as the empirical covariance operator of $\{\eta_j\}$, where $\bar{\eta}(s)$ is the sample mean function. Let $\{\hat{x}_j\}$ and $\{\hat{\lambda}_j\}$ be the eigenfunctions and corresponding decreasingly ordered eigenvalues of $\hat{C}_\eta$, respectively. We retain only the first $q_\eta$ eigenfunctions for dimensionality reduction. Similarly, for the functional responses $\{\pi_j\}$, define the eigenpairs $\{\hat{y}_j\}$ and the corresponding truncation number $q_\pi$ analogously.
For the product space $L^2([0,1] \times [0,1])$, define the projections
$$\eta(j,k) = \langle \eta_j, \hat{x}_k \rangle, \qquad \pi(j,m) = \langle \pi_j, \hat{y}_m \rangle, \qquad \psi(k,m) = \langle \varphi, \hat{x}_k \hat{y}_m \rangle.$$
Denote the matrices $\eta = [\eta(j,k)]_{T \times q_\eta}$, $\pi = [\pi(j,m)]_{T \times q_\pi}$, and $\psi = [\psi(k,m)]_{q_\eta \times q_\pi}$, for $j = 1, \ldots, T$, $k = 1, \ldots, q_\eta$, and $m = 1, \ldots, q_\pi$.
Next, construct the matrix $\hat{A} \in \mathbb{R}^{q_\eta \times q_\eta}$ with entries
$$\hat{A}(k, k') = \langle \hat{x}_{k,p}, \hat{x}_{k',p} \rangle, \quad \text{where } \hat{x}_{k,p}(s) = \hat{x}_k\left( \frac{s + p - 1}{p} \right), \quad 0 \le s \le 1.$$
Define the orthonormal eigenvectors $\hat{\beta}_k$ with corresponding ordered eigenvalues $\hat{\xi}_1 \ge \cdots \ge \hat{\xi}_{q_\eta}$ by $\hat{A} \hat{\beta}_k = \hat{\xi}_k \hat{\beta}_k$, $1 \le k \le q_\eta$. Define the matrix $\hat{B} = [\hat{\beta}_1, \ldots, \hat{\beta}_{q^*}]$, where
$$q^* = \max\left\{ k \in \{1, \ldots, q_\eta\} : \|\hat{z}_{k,p}\|^2 \ge 0.9p \right\}, \quad \text{with } \hat{z}_{k,p}(s) = \sum_{i=1}^{q_\eta} \hat{\beta}_{k,i}\, \hat{x}_{i,p}(s).$$
Finally, following [16], the test statistic is constructed as
$$\hat{\tau}_p = \frac{1}{T}\, \mathrm{vec}[\hat{B}^\tau \hat{\psi}]^\tau \left\{ (I_{q_\pi} \otimes \hat{B}^\tau)(\hat{C} \otimes \hat{\Lambda})(I_{q_\pi} \otimes \hat{B}) \right\}^{-1} \mathrm{vec}[\hat{B}^\tau \hat{\psi}],$$
where $\hat{\Lambda} = \mathrm{diag}(\hat{\lambda}_1, \ldots, \hat{\lambda}_{q_\eta})$ and $\hat{C} = \frac{1}{T}(\pi - \eta \hat{\psi})^\tau (\pi - \eta \hat{\psi})$. Under $H_{0,p}$, the test statistic $\hat{\tau}_{p+1}$ asymptotically follows a chi-squared distribution with $q_\pi q^*$ degrees of freedom.
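Implementing the full statistic requires the projections and matrices above; the sketch below shows only the sequential stopping logic, with `far_test_statistic` as a hypothetical stand-in returning the statistic $\hat{\tau}_{p+1}$ and its degrees of freedom $q_\pi q^*$.

```python
import numpy as np
from scipy.stats import chi2

def select_far_order(resid, grid, alpha=0.05, p_max=5):
    # Sequential rule: test H_{0,p} (FAR(p)) against H_{a,p+1} (FAR(p+1))
    # for p = 0, 1, 2, ... and stop at the first non-rejection.
    # `far_test_statistic` is a hypothetical stand-in for the statistic of
    # Kokoszka and Reimherr [16]; only the stopping logic is shown here.
    for p in range(p_max + 1):
        stat, df = far_test_statistic(resid, grid, p)  # hypothetical helper
        if chi2.sf(stat, df) > alpha:                  # H_{0,p} not rejected
            return p
    return p_max
```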

4. Theoretical Results

In this section, we investigate the asymptotic properties of both the initial and improved estimators of the bivariate varying-coefficient functions $g_m(u, x_m)$. We also establish the consistency of the estimator for the order $p$ of the functional auto-regressive (FAR) process. All technical proofs are deferred to the Supplementary Materials.
Throughout the remainder of this paper, for any fixed interval $[a, b]$, we denote the space of functions that are $l$-times continuously differentiable on $[a, b]$ by $C^{(l)}[a,b] = \{g \mid g^{(l)} \in C[a,b]\}$. Let $\mathrm{Lip}([a,b], C) = \{g : |g(x) - g(x')| \le C|x - x'|,\ \forall x, x' \in [a,b]\}$ denote the class of Lipschitz-continuous functions with Lipschitz constant $C > 0$. Let $S_{x_m}$ and $S_{z_m}$ denote the supports of $x_m$ and $z_m$, respectively. Then, the supports of the covariate vectors $\mathbf{x}$ and $\mathbf{z}$ are given by $S_x = \prod_{m=1}^{k} S_{x_m}$ and $S_z = \prod_{m=1}^{k} S_{z_m}$, respectively. The following regularity conditions are imposed to derive the asymptotic properties of the proposed estimators.
(A1)
For any density function $d \in \mathcal{F}$, $d$ is differentiable, and there exists a constant $M > 1$ such that $\|d\|_\infty$, $\|1/d\|_\infty$, and $\|d'\|_\infty$ are all bounded by $M$.
(A2)
(a) The kernel function $K$ is Lipschitz-continuous, bounded, and symmetric about zero; furthermore, $K \in \mathrm{Lip}([-1,1], L_k)$ for some constant $L_k > 0$. (b) The kernel function satisfies the following conditions: $\int_0^1 K(u)\,du > 0$, $\int_{\mathbb{R}} |u| K(u)\,du < \infty$, $\int_{\mathbb{R}} K^2(u)\,du < \infty$, and $\int_{\mathbb{R}} |u| K^2(u)\,du < \infty$.
(A3)
The covariates $x_{t,m}$, $z_{t,m}$, for $1 \le m \le k$, and the error functions $\varepsilon_t(u)$ satisfy the following moment conditions: for some $s > 2$,
$$\max_{1 \le t \le T} \max_{1 \le m \le k} E(|x_{t,m}|^{2s}) < \infty, \quad \max_{1 \le t \le T} \max_{1 \le m \le k} E(|z_{t,m}|^{2s}) < \infty, \quad \max_{1 \le t \le T} \sup_u E(|\varepsilon_t(u)|^{2s}) < \infty.$$
For each $t = 1, \ldots, T$, the covariance function of the error process, $\mathrm{Cov}(\varepsilon_t(s), \varepsilon_t(v)) = \Sigma_t(s, v)$, has finite nonincreasing eigenvalues $\lambda_1 \ge \lambda_2 \ge \cdots$ such that $\sum_j \lambda_j < \infty$.
(A4)
The varying-coefficient functions $g_m(u, x_m)$ are continuous over the domain $[0,1] \times [a_m, b_m]$ and are twice continuously partially differentiable with respect to $u$ and $x_m$, for each $1 \le m \le k$. Here, $[a_m, b_m]$ is a compact subset of the support $S_{x_m}$.
(A5)
The numbers of basis functions satisfy $N_0 \asymp (nT)^{1/6} \log nT$ and $N_m \asymp (nT)^{1/6} \log nT$, $1 \le m \le k$, and the bandwidth satisfies $h \asymp n^{-1/3}$, as $n, T \to \infty$.
Remark 1. 
Assumption (A1) is standard and ensures the well-posedness of density transformations. Assumption (A2) imposes mild conditions on the kernel function K ( · ) , which are satisfied by commonly used kernel functions such as the uniform and Epanechnikov kernels. The moment conditions in (A3) are essential for establishing the uniform convergence and other asymptotic properties of spline-based estimators. Assumption (A4) requires only moderate smoothness of the coefficient functions and is relatively weak compared to traditional nonparametric assumptions. Lastly, the growth conditions in (A5) are widely adopted in the literature on spline smoothing to ensure optimal convergence rates.
We begin by examining the uniform convergence of the initial estimator of the bivariate functions $g_m(u, x_m)$, as stated in Theorem 1.
Theorem 1. 
Assume that Assumptions (A1)–(A5) hold, and let $\tilde{g}_m(u, x_m)$ denote the initial estimator of $g_m(u, x_m)$, defined in Equation (5), for $m = 1, \ldots, k$. Then, as $n \to \infty$ and $T \to \infty$, we have
$$\sup_{u, x_m \in [0,1]} \left| \tilde{g}_m(u, x_m) - g_m(u, x_m) \right| = O_p\left( (nT)^{-1/3} \log(nT) + n^{-1/3} \right).$$
Theorem 2 characterizes the uniform convergence of the improved estimator of $g_m(u, x_m)$, and Theorem 3 describes the asymptotic properties of both the initial and improved estimators.
Theorem 2. 
Assume that Assumptions (A1)–(A5) hold, and that the order $p$ of the functional error process is known. Let $\hat{g}_m(u, x_m)$ denote the improved estimator of $g_m(u, x_m)$, as defined in Equation (9), for $m = 1, \ldots, k$. Then, as $n \to \infty$ and $T \to \infty$, it holds that
$$\sup_{u, x_m \in [0,1]} \left| \hat{g}_m(u, x_m) - g_m(u, x_m) \right| = O_p\left( (nT)^{-1/3} (\log(nT))^2 + n^{-1/3} \right).$$
To establish the asymptotic normality of the estimators, we introduce the following notation. Denote $\mathbf{b}(u, x_{t,m}) = (b_{1,1,m}(u, x_{t,m}), \ldots, b_{N_0,N_m,m}(u, x_{t,m}))^\tau$, $\mathbf{b}_z(u, x_{t,m}) = z_{t,m}\, \mathbf{b}(u, x_{t,m})$, $\mathbf{Bz}_{t,m} = (\mathbf{b}_z(u_1, x_{t,m}), \ldots, \mathbf{b}_z(u_n, x_{t,m}))^\tau_{n \times N_0 N_m}$, $\mathbf{B}_m = (\mathbf{Bz}_{1,m}^\tau, \ldots, \mathbf{Bz}_{T,m}^\tau)^\tau$, $\mathbf{B} = (\mathbf{B}_1, \ldots, \mathbf{B}_k)$, and $\mathbf{B}^* = \mathbf{B}/\sqrt{nT}$.
Let $A_m = (\mathbf{0}, \ldots, \mathbf{I}, \ldots, \mathbf{0})$ denote a $1 \times k$ block matrix whose $m$-th block is the identity matrix of size $N_0 N_m \times N_0 N_m$, with all other blocks being zero matrices of appropriate dimensions.
Theorem 3. 
Assume that Assumptions (A1)–(A5) hold, and let $\tilde{g}_m(u, x_m)$ and $\hat{g}_m(u, x_m)$ denote the initial and improved estimators of $g_m(u, x_m)$, as defined in Equations (5) and (9), respectively, for $m = 1, \ldots, k$. Then, as $n, T \to \infty$, for any $u \in (0,1)$ and $x_m \in [0,1]$, the following results hold:
(i) 
The initial estimator $\tilde{g}_m(u, x_m)$ is asymptotically normally distributed, i.e.,
$$\sqrt{nT}\, \left( C_m \Sigma_\varepsilon C_m^\tau \right)^{-1/2} \left( \tilde{g}_m(u, x_m) - g_m(u, x_m) \right) \xrightarrow{D} N(0, 1),$$
where $C_m = \mathbf{b}^\tau(u, x_m)\, E\left( A_m (\mathbf{B}^{*\tau} \mathbf{B}^*)^{-1} \mathbf{B}^{*\tau} \right)$ and $\Sigma_\varepsilon = (\Sigma_{t,s})_{1 \le t,s \le T}$ is the covariance matrix with $\Sigma_{t,s} = \mathrm{Cov}(\varepsilon_t, \varepsilon_s)$.
(ii) 
The improved estimator $\hat{g}_m(u, x_m)$ is asymptotically normally distributed, i.e.,
$$\sqrt{nT}\, \left( C_m \Xi_\varepsilon C_m^\tau \right)^{-1/2} \left( \hat{g}_m(u, x_m) - g_m(u, x_m) \right) \xrightarrow{D} N(0, 1),$$
where $\Xi_\varepsilon = \mathrm{diag}(\Xi_{t,t})_{1 \le t \le T}$ is the covariance matrix with $\Xi_{t,t}(u, s) = \sigma_t^2(u, s)$.

5. Numerical Study

In this section, we present two simulation studies designed to evaluate the performance of the proposed identification and estimation procedures for the additive model.

5.1. Case 1

This scenario assesses the estimation accuracy of the proposed method when the order of the auto-regressive error process is known, for finite $n$ and $T$. We consider a DVCA-FAR(1) model given by
$$f_t(u) = z_{t,1}\, g_1(u, x_{t,1}) + z_{t,2}\, g_2(u, x_{t,2}) + \varepsilon_t(u), \quad 0 \le u \le 1,$$
where the functional error process $\varepsilon_t(u)$ takes the form
$$\varepsilon_t(u) = \int \gamma_1(s, u)\, \varepsilon_{t-1}(s)\, ds + e_t(u), \quad 2 \le t \le T.$$
The bivariate varying-coefficient functions are specified as
$$g_1(u, x_{t,1}) = \sin(2\pi u)(2x_{t,1} - 1), \qquad g_2(u, x_{t,2}) = \sin(2\pi u)\sin(2\pi x_{t,2}),$$
and the coefficient function and innovation process are given by
$$\gamma_1(s, u) = 0.2us, \qquad e_t(u) = 0.2\eta_{t,1}\sin(\pi u) + \eta_{t,2}\sin(2\pi u),$$
with $\eta_{t,1} \sim N(0, 0.1^2)$, $\eta_{t,2} \sim N(0, 0.05^2)$, and $\eta_{t,1}$ independent of $\eta_{t,2}$, for $u \in [0,1]$.
The covariates are generated as follows: $z_{t,1} \sim N(0,1)$, $z_{t,2} \sim N(0, 0.5^2)$, and $(x_{t,1}, x_{t,2})^\tau = (\Phi(v_{t,1}), \Phi(v_{t,2}))^\tau$, $1 \le t \le T$, where $\Phi$ denotes the cumulative distribution function of the standard normal distribution and $v_{t,1}, v_{t,2}$ are independent standard normal variables.
To generate the response densities, for each given $Z = z$ and $X = x$, let $\alpha(u, x, z)$ be the additive predictor given by $\alpha(u, x, z) = \sum_{m=1}^{k} z_m\, g_m(u, x_m)$. The conditional quantile function $Q(\cdot \mid x, z)$ with the error process $\varepsilon(u)$, corresponding to the density $d_t$, is constructed as
$$Q(u \mid x, z) = F^{-1}(u \mid x, z) = \theta(x, z)^{-1} \int_0^u \exp\{\alpha(v, x, z) + \varepsilon(v)\}\, dv, \quad \text{where } \theta(x, z) = \int_0^1 \exp\{\alpha(v, x, z) + \varepsilon(v)\}\, dv.$$
Given this construction, we generate response samples by applying the quantile function to uniform random variables $\{U_{t,1}, \ldots, U_{t,n_t}\} \sim U(0,1)$, which are independent of $X_t$ and $Z_t$. Specifically, for each $1 \le t \le T$, we obtain the random samples $Y_t = \{Y_{t,j} = Q(U_{t,j} \mid X_t, Z_t) : 1 \le j \le n_t\}$, ensuring that $Y_{t,1}, \ldots, Y_{t,n_t} \sim d_t$, where $d_t$ denotes the response density. The transformed density, as used in model (12), is then defined as $f_t(u) = \Psi(d_t)(u)$. For simplicity, we assume that independent and identically distributed observations are available for each response distribution, i.e., $n_t = n$.
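A sketch of this generator for a single time point is given below; the trapezoidal CDF construction and the interpolation-based inversion are our numerical shortcuts, not the authors' exact implementation.

```python
import numpy as np

rng = np.random.default_rng(1)

def sample_from_density(alpha_vals, eps_vals, v_grid, n):
    # Build Q(u | x, z) = theta^{-1} int_0^u exp{alpha(v) + eps(v)} dv on a
    # grid, then push n uniforms through it (inverse-transform sampling).
    expo = np.exp(alpha_vals + eps_vals)
    Q = np.r_[0.0, np.cumsum(0.5 * (expo[1:] + expo[:-1]) * np.diff(v_grid))]
    Q /= Q[-1]                        # normalize so that Q(1) = 1
    U = rng.uniform(size=n)
    return np.interp(U, v_grid, Q)    # Q is tabulated at the grid points
```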
The simulation is conducted with $T = 100$ and $n = 100$, and results are averaged over 200 Monte Carlo replications. Figure 2 presents the true error process $\varepsilon(u)$ in panel (a) and its corresponding spline-based estimates in panel (b), demonstrating a high degree of accuracy in error recovery. Figure 3 provides a comparative view of the bivariate function estimates. Specifically, the left panel displays the true surfaces of $g_m(u, x_m)$, while the middle and right panels show the averages of the initial and improved estimates, respectively. The initial estimates are obtained without accounting for the FAR(1) error structure, whereas the improved estimates incorporate the estimated error process. To facilitate visual comparison, the surfaces are presented from two distinct viewing angles. This allows for a more comprehensive assessment of the estimation performance before and after error correction.
As illustrated in Figure 2, the proposed method achieves a highly accurate estimation of the error process. Moreover, the right panel of Figure 3 clearly demonstrates that incorporating the estimated FAR structure leads to substantially improved function estimates when compared to the initial results shown in the middle panel. These findings confirm the effectiveness of the proposed approach in refining the estimation of bivariate varying-coefficient functions by properly addressing the temporal dependence in the functional error process.
To further evaluate the performance of the proposed estimation procedure, we conduct simulations under varying sample sizes, specifically $T = 50, 100$ and $n = 50, 100$. The accuracy of the initial and improved estimators of $g_m(u, x_m)$ is assessed using the root mean squared error (RMSE), defined as
$$\mathrm{RMSE}(\dot{g}_m) = \frac{1}{T} \sum_{t=1}^{T} \left\{ \frac{1}{n} \sum_{i=1}^{n} \left| \dot{g}_m(u_i, x_{t,m}) - g_m(u_i, x_{t,m}) \right|^2 \right\}^{1/2},$$
where $\dot{g}_m$ denotes either the initial estimate $\tilde{g}_m$ or the improved estimate $\hat{g}_m$.
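Evaluated on the grid $\{(u_i, x_{t,m})\}$, the criterion reduces to a short array computation; the array layout below is an assumption of this sketch.

```python
import numpy as np

def rmse(g_est, g_true):
    # g_est, g_true: (T, n) fitted and true values of one component g_m
    # on the evaluation grid {(u_i, x_{t,m})}.
    return np.mean(np.sqrt(np.mean((g_est - g_true) ** 2, axis=1)))
```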
Table 1 presents the average root mean square errors (RMSEs) along with their standard deviations, calculated over 200 Monte Carlo replications for both the initial and improved estimators of $g_m(u, x_m)$. The results demonstrate a clear trend: the RMSEs decrease as both the number of time points $T$ and the number of observations per curve $n$ increase. More importantly, the improved estimators consistently outperform the initial estimators, yielding substantially lower RMSEs across all settings. This improvement is anticipated, as the initial estimates are obtained without adjusting for the auto-regressive error structure, which introduces bias and additional variability into the estimation process. In contrast, the improved estimates incorporate the estimated error component, leading to more accurate and reliable results.
To offer a deeper understanding of the relative performance between the two estimation strategies, we also compare their biases and standard deviations, taking the first setting in the simulation study as a representative example.
Table 2 presents the average bias and standard deviation of both the initial and improved estimators of $g_m(u, x_m)$. The results further confirm the superiority of the improved approach, indicating that across all combinations of sample size, both the bias and standard deviation of the improved estimators are markedly smaller than those of the initial estimators. Furthermore, both metrics exhibit a decreasing trend as the sample size increases, highlighting the consistency and efficiency of the improved estimation method. These findings provide strong empirical support for the theoretical result that the improved estimator possesses a smaller asymptotic variance–covariance matrix, thereby offering enhanced precision and robustness in practical applications.

5.2. Case 2

Case 2 is designed to evaluate the efficiency of identifying the auto-regressive order of the functional error process. The response densities are also generated from model (12), but now the error process follows a FAR(2) structure, with the second-order coefficient function specified as $\gamma_2(s, u) = \frac{1}{4}us^2$. All other simulation settings remain consistent with those in Case 1.
Table 3 reports the empirical power of the testing procedure used to determine the order of the FAR error process across various sample sizes and significance levels. The results clearly show that the test’s power increases as both the number of time points T and the number of observations per curve n grow. In particular, the power approaches one when T and n reach 100, especially when testing the null hypothesis of independent and identically distributed (i.i.d.) errors. This indicates that the test becomes highly reliable with larger sample sizes. While the power is somewhat lower when testing the null hypothesis of FAR(1) against FAR(2), this is expected due to the inherent difficulty in distinguishing between these closely related models. Additionally, the test maintains an appropriate size when assessing FAR(2) against higher-order alternatives, confirming its accuracy and practical feasibility for identifying the correct order of the functional error process. Overall, these findings demonstrate the robustness and effectiveness of the proposed testing algorithm in diverse settings.
To further explore the impact of correctly identifying the auto-regressive order p on estimation accuracy, Table 4 presents the average RMSEs of the bivariate varying-coefficient functions. The observed pattern closely parallels the results in Case 1, reinforcing the validity and reliability of the model’s identification and estimation procedures. This evidence highlights the critical role that the accurate determination of the auto-regressive order plays in improving estimation precision. The consistent RMSE patterns across different sample sizes and scenarios underline the model’s robustness in effectively accounting for the error structure, thus providing precise and reliable estimates of the bivariate varying-coefficient functions.

5.3. Case 3

To further examine the performance of the proposed estimation approach and identification procedure under different scenarios, we consider the coefficient function and innovation process
$$\gamma_1(s, u) = 0.2us, \qquad e_t(u) = 0.2\eta_{t,1}\sin(\pi u) + \eta_{t,2}\sin(2\pi u),$$
with $\eta_{t,1} \sim \mathrm{Gamma}(3, 2)$, $\eta_{t,2} \sim t(5)$, and $\eta_{t,1}$ independent of $\eta_{t,2}$, for $u \in [0,1]$. All other simulation settings remain consistent with those in Case 2.
Table 5 summarizes the power performance of the proposed testing approach under a setting where the innovation process $e_t(u)$ follows a non-Gaussian distribution with increased variability. The outcomes indicate that, as in Case 2, the test remains capable of identifying the correct order of the FAR process, even in the presence of more complex error structures. Although the overall power is somewhat reduced relative to the Gaussian innovation case of Case 2, particularly in distinguishing closely related models such as FAR(1) versus FAR(2), the test still demonstrates satisfactory performance, especially as the sample size increases. Notably, when both the number of time points $T$ and the number of observations per curve $n$ are large (e.g., 100), the power approaches unity, confirming that the test remains reliable under more challenging, non-ideal conditions.
Table 6 further examines how accurately identifying the auto-regressive order influences the estimation quality of the bivariate varying-coefficient functions. Despite the non-Gaussian error distribution and larger noise fluctuations leading to visibly higher RMSEs—both for the initial and refined estimates—consistent improvements are observed when the correct FAR order is utilized. These improvements closely mirror the trends seen in Case 2, reaffirming the stability and practical value of the estimation procedure. The results suggest that, although the estimation becomes inherently more difficult under heavy-tailed or heteroskedastic error conditions, the proposed methods remain applicable and beneficial in terms of both model identification and estimation refinement.

6. Real Data Analysis

In this section, we illustrate the feasibility and effectiveness of the proposed estimation procedure through the analysis of two real-world datasets. By applying our methodology to empirical data, we demonstrate its practical capability to capture the underlying patterns and dependencies present in complex data. This analysis not only serves to validate the performance of the estimation approach but also underscores its broad applicability across diverse domains. Moreover, it highlights the model’s flexibility and robustness in handling intricate, time-dependent, and non-Euclidean data structures, thus emphasizing its value as a versatile tool for real-world applications.

6.1. COVID-19 Data

On 11 March 2020, the World Health Organization (WHO) officially declared COVID-19, a contagious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a global pandemic. The rapid and widespread transmission of the virus presented unprecedented challenges to global public health, prompting countries worldwide to implement lockdowns and other measures aimed at controlling the spread of the disease. As of 15 August 2021, WHO reports indicated a staggering 221,885,822 confirmed cases and 4,583,539 deaths spanning nearly all countries, underscoring the extensive and profound impact of the pandemic. Given the scale of this crisis, it is crucial for international health organizations and research institutions to continuously monitor the evolving global trends of COVID-19. Such monitoring enables timely, accurate analysis that supports effective public health responses, informs medical treatment strategies, and guides prevention and control measures for future outbreaks. Understanding the epidemic’s dynamics through data-driven modeling is therefore essential for shaping informed policy decisions and improving health outcomes worldwide amid this ongoing global crisis.
To illustrate this point, we focus on the mortality rate as a key indicator for tracking the global trend of the COVID-19 pandemic. The mortality rate is defined as the ratio of the cumulative number of deaths each day to the total population of each country, serving as a critical measure of the disease’s lethality and spread. Importantly, the calculation of mortality rates inherently involves temporal dependence, as daily figures are based on previous days’ data. Consequently, the mortality rates, and thus the global epidemic trend, exhibit temporal autocorrelation.
The data on COVID-19-related deaths, essential to our analysis, are sourced from the publicly accessible Johns Hopkins University repository. This resource provides a dynamic tracking map offering comprehensive insights into global pandemic trends. The dataset, available at https://www.jhu.edu/ (accessed on 25 July 2021), covers the period from 22 January 2020 through 15 April 2021. Additionally, the most recent population data required to compute mortality rates for each country are obtained from the World Bank’s online platform, accessible at https://data.worldbank.org (accessed on 17 September 2021). These publicly available datasets form a valuable foundation for monitoring the pandemic’s progression and conducting rigorous statistical and epidemiological analyses to better understand the disease’s behavior across regions.
Because the timing of outbreaks varies between countries and regions, we standardize the time scale by defining day zero as the date when each country reached 100 cumulative confirmed COVID-19 cases. Our analysis considers daily cumulative death data from 189 countries over the subsequent 100-day period. At each time point $t$, we estimate the density function of the mortality rate, denoted $\hat{d}_t(y)$, using data from these countries. Figure 1a displays the estimated densities of the global mortality rate (‰) across the 100 days, with data from up to 189 countries at each time point. Figure 1b presents an alternative view by showing the estimated densities on three selected days. From these visualizations, it is clear that the mortality rate densities remain well-defined throughout the observed period. Moreover, a temporal dependency among the distributions is clearly observable, suggesting the presence of an auto-regressive structure in the data, which supports the hypothesis of a functional auto-regressive (FAR) error process.
The main objective of this analysis, based on the COVID-19 data, is to identify the FAR process underlying the mortality rate and estimate its component functions. For the sake of simplicity, we begin by considering a special case where the covariate $z$ is constant (set to 1) and $x$ represents the rescaled time variable $t/T$. The model is specified as
$$\hat{f}_t(u) = \Psi(\hat{d}_t)(u) = g_1(u, x_{t,1}) + \varepsilon_t(u), \quad 1 \le t \le 100,$$
where $\varepsilon_t(u) = \sum_{l=1}^{p} \int \gamma_l(u, s)\, \varepsilon_{t-l}(s)\, ds + e_t(u)$ and $x_{t,1}$ denotes the time scale $t/T$.
Using the initial spline estimate of $g_1(u, x_1)$, we apply the testing algorithm to determine the order $p$ of the FAR process. Table 7 reports the corresponding p-values under different hypotheses. The results provide strong evidence of significant autocorrelation in the data, supporting a first-order functional auto-regressive process, FAR(1). These findings empirically confirm the presence of temporal dependencies in the COVID-19 mortality rates, further justifying the use of a FAR error structure to capture the evolving epidemic dynamics.
Figure 4 presents a heat map of the estimated bivariate function $\hat{g}_1(u, x_1)$, obtained after accounting for the functional error process and determining the auto-regressive order. The heat map reveals a relatively stable temporal pattern, in which the function initially attains a minimum at lower values of $u$, gradually increases, and reaches a maximum at later time points. This pattern reflects the underlying dynamics of the COVID-19 mortality rate over successive days. The observed correlation between consecutive days further supports the notion that the global mortality rate exhibits substantial temporal dependence, consistent with the nature of the mortality measure derived from prior daily data.
To evaluate the uncertainty associated with these estimates, we conducted a residual-based bootstrap analysis. Specifically, we first fitted the model to obtain the estimated coefficient surface $\hat{g}_1(u, x_1)$ and the residual functions. Then, using the estimated functional auto-regressive operator from the FAR(1) error process, we recursively generated bootstrap residual samples by resampling the innovation functions with replacement. For each of the 500 bootstrap replications, new response functions were constructed by adding the bootstrap residuals to the fitted values based on $\hat{g}_1(u, x_1)$. The entire estimation procedure was repeated on each bootstrap dataset to obtain bootstrap replicates of the coefficient surface, and the variability among these replicates was used to calculate point-wise standard errors and confidence intervals. The resulting standard errors were generally small across the domain, typically ranging between 0.04 and 0.07, indicating stable estimates throughout. The 95% confidence intervals for the bivariate varying-coefficient surface consistently excluded zero along the increasing temporal trend, confirming its statistical significance. Moreover, the bootstrap results showed that the identified pattern, a minimum at early $u$ values followed by a gradual rise, was robust across replications, indicating that the observed dynamic is not an artifact of noise but reflects a meaningful underlying structure. Overall, these findings reinforce the necessity of incorporating a functional auto-regressive process to accurately model the temporal structure of the mortality rate; the clear and significant progression observed in the heat map further validates the model's ability to capture the global evolution of the pandemic over time.
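For concreteness, a condensed sketch of this bootstrap scheme (under a FAR(1) fit, with the refitting step omitted) might look as follows; the array shapes and the innovation-recovery step are our assumptions.

```python
import numpy as np

def far1_residual_bootstrap(fitted, resid, gammas, grid, n_boot=500, seed=0):
    # fitted: (T, n) fitted response curves; resid: (T, n) residual curves;
    # gammas[0]: estimated FAR(1) kernel on the grid, gammas[0][a, b] =
    # gamma_1(s_a, u_b). Refitting the model on each draw (omitted here)
    # yields point-wise standard errors for the coefficient surfaces.
    rng = np.random.default_rng(seed)
    T = resid.shape[0]
    # recover innovations e_t = eps_t - int gamma_1(s, u) eps_{t-1}(s) ds
    innov = np.array([resid[t] - np.trapz(gammas[0] * resid[t - 1][:, None],
                                          grid, axis=0) for t in range(1, T)])
    draws = []
    for _ in range(n_boot):
        e_star = innov[rng.integers(0, T - 1, size=T)]  # resample innovations
        eps_star = np.zeros_like(resid)
        for t in range(1, T):
            eps_star[t] = np.trapz(gammas[0] * eps_star[t - 1][:, None],
                                   grid, axis=0) + e_star[t]
        draws.append(fitted + eps_star)
    return np.asarray(draws)          # (n_boot, T, n) bootstrap responses
```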

6.2. USA Income Data

Personal income statistics are essential for enabling governments to understand the interplay between national income, consumption, and saving. These statistics also serve as a valuable tool for assessing and comparing economic well-being across different regions or countries. In this study, we focus on the density time series of per capita personal income, defined as the total personal income of a region divided by its population. This metric offers a detailed perspective on the economic conditions within a region by capturing the distribution and evolution of income on a per-person basis over time. Analyzing such time series allows policymakers and researchers to gain insights into the long-term economic trends of a region, evaluate income disparities, and make more informed decisions regarding fiscal policies, social welfare programs, and strategies for economic development.
Income data for the United States are publicly accessible through the official website of the United States Bureau of Economic Analysis (http://www.bea.gov/, accessed on 16 October 2021). We consider the quarterly per capita personal income of all 50 states in the USA, spanning from the first quarter of 2010 through the fourth quarter of 2020, resulting in 44 time points, $t = 1, \ldots, 44$. At each quarter $t$, we estimate the density function of per capita income, $\hat{d}_t(y)$, based on these 50 observations. Given that quarterly personal income reflects broader national economic conditions, we incorporate two related covariates, 'GDP' (quarterly gross domestic product of the USA) and 'Population' (quarterly total population of the USA), both also available from the BEA (http://www.bea.gov/, accessed on 16 October 2021).
Traditionally, income curves are studied as panel data in economics, focusing on the relationship between consumers’ equilibrium points. As individual incomes fluctuate, the connections among these equilibrium points form trajectories that represent not only income growth but also increased consumer satisfaction. This perspective highlights the dynamic nature of income changes and their impact on well-being, offering valuable insights into consumer behavior over time.
In contrast, the income density curve, treated here as functional data, captures the distribution of income within a region or demographic group. It visually represents the shape and trends of income across different intervals, providing a more comprehensive view of the socio-economic environment. By examining income density curves, one can effectively observe income inequality within a population and identify key patterns of wealth distribution. Such curves are critical for economic research as they facilitate a deeper understanding of consumption behavior, socio-economic status, and the design of social policies.
Furthermore, income density curves are important tools for economic forecasting and analysis. By tracking changes in income distribution over time, economists can integrate insights about consumer preferences and consumption habits at different income levels. This approach enhances the ability to predict future economic conditions and shifts in consumption patterns, making income density curves indispensable for both microeconomic and macroeconomic analyses. Consequently, these curves play a crucial role in shaping policy decisions, economic planning, and our broader comprehension of economic well-being.
Figure 5a depicts the density time series of quarterly personal income across the 44 quarters. The density curves indicate a consistent pattern in the distribution of per capita income across states over the past decade. Specifically, there are relatively few individuals in the high-income and upper-middle-income brackets, a moderate number in the middle-income category, and a larger share in the lower-middle-income range.
To illustrate the temporal changes in income distribution more clearly, Figure 5b shows density curves at three distinct time points: the second quarter of 2015, the first quarter of 2017, and the third quarter of 2018. The curves reveal a gradual shift towards higher income levels over time, alongside a corresponding decrease in the peak density. This trend aligns with broader economic and technological progress in recent years. As the economy develops, the proportion of individuals in lower income brackets steadily decreases, while the middle-to-high income groups grow. As a result, income distribution is becoming more balanced, with an increasing share of the population moving into middle- and higher-income categories. This pattern reflects general trends of economic growth and income redistribution over the period.
We model the income density curves using the following DVCA-FAR model:
$$\hat{f}_t(u) = \Psi(\hat{d}_t)(u) = g_0(u, x_{t,0}) + z_{t,1}\, g_1(u, x_{t,1}) + z_{t,2}\, g_2(u, x_{t,2}) + \varepsilon_t(u), \quad 1 \le t \le 44,$$
where $\varepsilon_t(u) = \sum_{l=1}^{p} \int \gamma_l(u, s)\, \varepsilon_{t-l}(s)\, ds + e_t(u)$. Here, $z_{t,1}$ denotes the quarterly gross domestic product of the USA, $z_{t,2}$ denotes the quarterly total population of the USA, and $x_{t,0}, x_{t,1}, x_{t,2}$ represent the rescaled time variable $t/T$.
Following the initial spline-based estimation, we apply a testing procedure to determine the appropriate order p of the functional auto-regressive error process. The p-values in Table 8 indicate that an FAR(2) model best captures the autocorrelation structure in the error terms. This finding suggests that a second-order functional auto-regressive process effectively accounts for the dynamic dependencies within the income data.
Using the three-step estimation procedure, we estimate the bivariate varying-coefficient functions. Figure 6 presents heat maps of the three estimated functions, where $g_0(u, x_0)$ represents the baseline effect over time, $g_1(u, x_1)$ captures the influence of GDP, and $g_2(u, x_2)$ reflects the impact of population. The heat map of $g_0$ shows an alternating pattern over time between high and low values, indicating that individuals at both high and low per capita income levels experience similar effects, whereas those in the middle-income range tend to display the opposite trend. The function $g_1$, related to GDP, exhibits a consistent pattern across income levels $u$ that changes over time, with an initial peak followed by a dip and a subsequent rise toward the end of the period. Conversely, the effect of population, as shown in $g_2$, generally opposes the baseline pattern and remains relatively stable across both high- and low-income groups.
To evaluate the uncertainty associated with these estimated varying-coefficient surfaces, we conducted a residual-based bootstrap analysis analogous to that described previously. Specifically, we used the estimated FAR(2) operator and innovation residuals to generate 500 bootstrap samples, each time reconstructing the response functions and refitting the entire model. Point-wise standard errors and confidence intervals for all estimated coefficient functions $g_j(u, x_j)$ obtained from these replications show that the estimation uncertainty is generally low across most of the domain. For example, the standard errors of $g_0(u, x_0)$ remain below 0.25 in regions corresponding to the high- and low-income groups, supporting the stability of the observed alternating pattern over time. Similarly, $g_1(u, x_1)$, representing the GDP effect, exhibits standard errors typically under 0.38 near the temporal peaks and troughs, confirming that the initial peak, mid-period dip, and late-period rise are statistically significant features rather than random fluctuations. The function $g_2(u, x_2)$, reflecting the population effect, has slightly larger uncertainty in the mid-income range but remains stable and significant across the majority of the income spectrum. Overall, the 95% bootstrap confidence intervals exclude zero in these key regions, reinforcing the robustness and reliability of the identified macroeconomic influences on quarterly personal income distributions. Together, these findings highlight a significant dependence of quarterly personal income distributions in the United States on prior values, with the dynamics closely linked to key macroeconomic factors such as GDP and demographic changes represented by population growth.

7. Conclusions

Sequentially collected data often exhibit autocorrelation, which must be properly addressed to ensure accurate statistical modeling. At the same time, the analysis of non-Euclidean data structures, such as probability density functions, has gained increasing attention in modern statistical research. To address these challenges, we propose a varying-coefficient additive model with density-valued responses, incorporating a functional auto-regressive (FAR) error process to capture temporal dependence. Given the intrinsic nonlinearity and geometric constraints of density functions, we begin by applying the transformation method proposed by Petersen and Müller [6] to map the density functions into a linear Hilbert space, enabling the use of conventional regression techniques. We then develop a three-step estimation procedure for the varying-coefficient components. In the first step, we employ B-spline series approximations to obtain preliminary estimates of the bivariate varying-coefficient functions, initially ignoring the functional error structure. In the second step, we determine the order of the FAR process using the test statistic introduced by Kokoszka and Reimherr [16], based on the residuals obtained from the first step. In the final step, we account for the FAR error process and construct refined estimators for the varying-coefficient functions by removing the estimated auto-correlated components and reapplying the B-spline estimation. We provide theoretical justification for the proposed procedure by establishing convergence rates and asymptotic properties for both the initial and refined estimators. The effectiveness of the proposed method is further demonstrated through comprehensive simulation studies and applications to two real-world datasets. The results underscore the importance of addressing temporal dependence in density-valued data and validate the accuracy and efficiency of our approach.
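As a rough illustration of this pipeline, the sketch below implements a stripped-down version of the three steps on the transformed scale, using tensor-product cubic B-splines and, for brevity, an FAR(1) error fit obtained by least squares on the grid. The basis dimensions, the kernel estimator, and all function names are simplifying assumptions rather than the exact procedure analyzed in the paper.

```python
import numpy as np
from scipy.interpolate import BSpline

def bspline_design(x, n_basis=6, degree=3):
    # Cubic B-spline design matrix on [0, 1] with equally spaced interior knots.
    inner = np.linspace(0.0, 1.0, n_basis - degree + 1)
    knots = np.concatenate([np.zeros(degree), inner, np.ones(degree)])
    x = np.clip(np.asarray(x, dtype=float), 0.0, 1.0 - 1e-12)
    return BSpline(knots, np.eye(n_basis), degree, extrapolate=False)(x)

def three_step_fit(Y, X, u_grid, n_basis=6):
    # Y: T x n_grid matrix of LQD-transformed densities evaluated on u_grid;
    # X: T x M matrix of scalar covariates, each scaled to [0, 1].
    T, n_grid = Y.shape
    M = X.shape[1]
    du = u_grid[1] - u_grid[0]
    Bu = bspline_design(u_grid, n_basis)                 # n_grid x n_basis

    def design(t):
        # Tensor-product basis rows at time t: [B(x_{t,m}) (x) B(u)] over m.
        return np.hstack([np.kron(bspline_design(X[t, m:m + 1], n_basis), Bu)
                          for m in range(M)])

    D = np.vstack([design(t) for t in range(T)])

    # Step 1: initial least squares spline fit, ignoring error dependence.
    theta0, *_ = np.linalg.lstsq(D, Y.ravel(), rcond=None)
    resid = (Y.ravel() - D @ theta0).reshape(T, n_grid)

    # Step 2: estimate the FAR kernel from the residual curves
    # (order 1 for brevity: eps_t ~ int gamma(u, s) eps_{t-1}(s) ds + e_t).
    A, *_ = np.linalg.lstsq(resid[:-1] * du, resid[1:], rcond=None)
    Gamma_hat = A.T                                      # eps_t ~ Gamma_hat @ eps_{t-1} * du

    # Step 3: remove the estimated auto-correlated component and refit.
    ar_part = np.zeros_like(resid)
    ar_part[1:] = (Gamma_hat @ resid[:-1].T).T * du
    theta1, *_ = np.linalg.lstsq(D, (Y - ar_part).ravel(), rcond=None)
    return theta0, theta1, Gamma_hat
```

In practice, the number of basis functions would be chosen to balance approximation bias and variance, and the FAR order would be selected by the testing procedure described above rather than fixed at one.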
This work opens several avenues for future research. While our model establishes the relationship between density-valued responses and scalar predictors within a varying-coefficient additive framework, the growing prevalence of complex, high-dimensional data calls for further methodological extensions. In particular, although an FAR structure is assumed for the temporal dependence, consistent estimation may still be possible under simpler or approximate structures, analogous to working correlations in generalized estimating equations (GEE); exploring such alternatives could yield more flexible and efficient modeling. Moreover, while the estimation method proposed in this article combines well-established techniques, its tailored integration for density time series with functional auto-regressive errors addresses challenges unique to this setting. The least squares-based estimator is not optimal in the classical sense because of the temporally dependent functional errors, yet it remains theoretically justified and practically effective: it achieves consistency and asymptotic normality while accommodating the model's structural complexity. Developing more efficient estimation approaches, such as methods that explicitly incorporate the error dependence structure, is therefore a promising direction, as is the application of additional theoretical tools and frameworks that could further strengthen the asymptotic properties. Finally, a limitation of the current approach is that it models the distribution of responses pooled across units at each time point and thus does not capture individual unit trajectories over time; development within specific countries or states cannot be directly traced. Extending the model to include unit-specific effects or hierarchical structures, such as functional mixed-effects models, would allow within-unit temporal dynamics to be tracked while still accounting for autocorrelation. Future studies will aim to develop such extensions and explore their theoretical and empirical properties in greater depth.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/e27080882/s1. The supplementary materials provide a detailed proof of the theoretical results. Refs. [18,19] are cited in the supplementary materials.

Author Contributions

Conceptualization, Z.H., T.L. and J.Y.; data curation, Z.H.; formal analysis, Z.H.; funding acquisition, T.L. and J.Y.; methodology, Z.H., T.L. and J.Y.; project administration, T.L., J.Y. and N.B.; supervision, T.L., J.Y. and N.B.; writing—original draft, Z.H.; writing—review and editing, Z.H., T.L., J.Y. and N.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Li and You. Li’s research is supported by grants from the Humanities and Social Science Fund of the Ministry of Education of China (No. 21YJA910001). You’s research is supported by grants from the National Natural Science Foundation of China (NSFC) (No. 11971291).

Data Availability Statement

The original datasets employed in this study are publicly accessible from the official website of Johns Hopkins University at https://www.jhu.edu/ (accessed on 25 July 2021), the World Bank’s online platform at https://data.worldbank.org/ (accessed on 17 September 2021), and the United States Bureau of Economic Analysis at http://www.bea.gov/ (accessed on 16 October 2021).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kokoszka, P.; Miao, H.; Petersen, A.; Shang, H.L. Forecasting of density functions with an application to cross-sectional and intraday returns. Int. J. Forecast. 2019, 35, 1304–1317.
  2. Sen, R.; Ma, C. Forecasting density function: Application in finance. J. Math. Financ. 2015, 5, 433–447.
  3. Petersen, A.; Müller, H. Fréchet regression for random objects with Euclidean predictors. Ann. Stat. 2019, 47, 691–719.
  4. Petersen, A.; Chen, C.; Müller, H. Quantifying and visualizing intraregional connectivity in resting-state functional magnetic resonance imaging with correlation densities. Brain Connect. 2019, 9, 37–47.
  5. Saha, A.; Banerjee, S.; Kurtek, S.; Narang, S.; Lee, J.; Rao, G.; Martinez, J.; Bharath, K.; Rao, A.; Baladandayuthapani, V. DEMARCATE: Density-based magnetic resonance image clustering for assessing tumor heterogeneity in cancer. NeuroImage Clin. 2016, 12, 132–143.
  6. Petersen, A.; Müller, H. Functional data analysis for density functions by transformation to a Hilbert space. Ann. Stat. 2016, 44, 183–218.
  7. Han, K.; Müller, H.; Park, B. Additive functional regression for densities as responses. J. Am. Stat. Assoc. 2020, 115, 997–1010.
  8. Talská, R.; Menafoglio, A.; Machalová, J.; Hron, K.; Fiserová, E. Compositional regression with functional response. Comput. Stat. Data Anal. 2018, 123, 66–85.
  9. Chen, Y.; Lin, Z.; Müller, H. Wasserstein regression. J. Am. Stat. Assoc. 2023, 118, 869–882.
  10. Zhang, C.; Kokoszka, P.; Petersen, A. Wasserstein autoregressive models for density time series. J. Time Ser. Anal. 2022, 43, 30–52.
  11. Berhoune, K.; Bensmain, N. Sieves estimator of functional autoregressive process. Stat. Probab. Lett. 2018, 135, 60–69.
  12. Chen, Y.; Chua, W.S.; Hardle, W. Forecasting limit order book liquidity supply-demand curves with functional autoregressive dynamics. Quant. Financ. 2019, 19, 1473–1489.
  13. Chen, Y.; Li, B. An adaptive functional autoregressive forecast model to predict electricity price curves. J. Bus. Econ. Stat. 2017, 35, 371–388.
  14. Kowal, D.R.; Matteson, D.S.; Ruppert, D. Functional autoregression for sparsely sampled data. J. Bus. Econ. Stat. 2019, 37, 97–109.
  15. Bosq, D. Linear Processes in Function Spaces: Theory and Applications; Springer Science & Business Media: New York, NY, USA, 2000.
  16. Kokoszka, P.; Reimherr, M. Determining the order of the functional autoregressive model. J. Time Ser. Anal. 2013, 34, 116–129.
  17. Xu, X.; Chen, Y.; Zhang, G.; Koch, T. Modeling functional time series and mixed-type predictors with partially functional autoregressions. J. Bus. Econ. Stat. 2022, 42, 349–366.
  18. Stone, C. The use of polynomial splines and their tensor products in multivariate function estimation. Ann. Stat. 1994, 22, 118–171.
  19. DeVore, R.; Lorentz, G. Constructive Approximation; Springer Science & Business Media: New York, NY, USA, 1993; Volume 303.
Figure 1. Densities of global COVID-19 mortality rates (‰) observed over a 100-day period. (a) Three-dimensional representation of the evolving density time series across the entire time span. (b) Density curves plotted for three selected days.
Figure 2. Average estimates of the FAR(1) error process ε(u) obtained from 200 Monte Carlo replications with sample size T = 100 and n = 100 observations. Panel (a) shows the true curves; panel (b) shows the spline-based estimates. Each color represents a curve corresponding to an individual simulated subject.
Figure 3. Average estimates of the bivariate functions g_m(u, x_m), m = 1, 2. Left panels: true density surfaces; middle panels: initial spline-based estimates; right panels: improved estimates after adjusting for the error structure. The top two rows show g_1(u, x_1) from two different angles; the bottom row shows g_2(u, x_2).
Figure 4. Heat map of the bivariate varying-coefficient function g_1(u, x_1) in the model based on the COVID-19 mortality rate (‰) data.
Figure 5. Densities of national quarterly personal income in the USA over 44 quarters. (a) Three-dimensional view of the density time series over the entire period; (b) density curves at three selected quarters.
Figure 6. Heat maps of the bivariate varying-coefficient functions g_m(u, x_m), m = 0, 1, 2, based on the USA income data.
Table 1. Average RMSEs of both initial and improved estimators of the bivariate varying-coefficient additive functions g_m(u, x_m).

                   g_1(u, x_1)            g_2(u, x_2)
  T     n     Initial   Improved     Initial   Improved
  50    50    0.2247    0.1848       0.2139    0.1785
  50    100   0.1759    0.1325       0.1844    0.1521
  100   50    0.1826    0.1471       0.1732    0.1354
  100   100   0.1431    0.1164       0.1319    0.1057
Table 2. Average standard deviation (SD) and bias of both initial and improved estimators of the bivariate varying-coefficient additive functions g_m(u, x_m).

                          g_1(u, x_1)                       g_2(u, x_2)
                   Initial        Improved          Initial        Improved
  T     n       SD      Bias    SD      Bias     SD      Bias    SD      Bias
  50    50     0.205   0.147   0.168   0.104    0.219   0.137   0.183   0.117
  50    100    0.179   0.122   0.142   0.093    0.196   0.128   0.164   0.095
  100   50     0.174   0.136   0.151   0.082    0.187   0.131   0.158   0.086
  100   100    0.133   0.099   0.112   0.057    0.153   0.111   0.129   0.061
Table 3. Empirical power of the testing algorithm for determining the order of the FAR error process under different significance levels.

                H0: p = 0 vs. p ≥ 1    H0: p ≤ 1 vs. p ≥ 2    H0: p ≤ 2 vs. p ≥ 3
  T     n       α = 0.05   α = 0.10    α = 0.05   α = 0.10    α = 0.05   α = 0.10
  50    50       0.893      0.962       0.787      0.846       0.082      0.134
  50    100      0.931      0.985       0.824      0.893       0.073      0.125
  100   50       0.942      0.972       0.821      0.881       0.071      0.121
  100   100      0.985      1.000       0.889      0.935       0.064      0.113
Table 4. Average RMSEs of both initial and improved estimators of the bivariate varying-coefficient additive functions g_m(u, x_m).

                   g_1(u, x_1)            g_2(u, x_2)
  T     n     Initial   Improved     Initial   Improved
  50    50    0.2739    0.2438       0.2691    0.2235
  50    100   0.2264    0.1852       0.2157    0.1809
  100   50    0.2136    0.1817       0.2232    0.1761
  100   100   0.1729    0.1263       0.1816    0.1224
Table 5. Empirical power of the testing algorithm for determining the order of the FAR error process under different significance levels.

                H0: p = 0 vs. p ≥ 1    H0: p ≤ 1 vs. p ≥ 2    H0: p ≤ 2 vs. p ≥ 3
  T     n       α = 0.05   α = 0.10    α = 0.05   α = 0.10    α = 0.05   α = 0.10
  50    50       0.832      0.891       0.724      0.795       0.154      0.197
  50    100      0.876      0.932       0.776      0.843       0.136      0.162
  100   50       0.884      0.937       0.792      0.838       0.131      0.159
  100   100      0.923      0.951       0.854      0.893       0.089      0.127
Table 6. Average RMSEs of both initial and improved estimators of the bivariate varying-coefficient additive functions g_m(u, x_m).

                   g_1(u, x_1)            g_2(u, x_2)
  T     n     Initial   Improved     Initial   Improved
  50    50    0.5379    0.4906       0.5582    0.5072
  50    100   0.4631    0.4125       0.4824    0.4436
  100   50    0.4582    0.4207       0.4871    0.4320
  100   100   0.3965    0.3518       0.4241    0.3736
Table 7. p-values from the testing algorithm applied to identify the order of the functional error process based on the COVID-19 mortality rate data.

  Null hypothesis           p = 0     p ≤ 1
  Alternative hypothesis    p ≥ 1     p ≥ 2
  p-value                   0.000     0.194
Table 8. p-values from the testing algorithm for determining the order of the functional error process based on the USA income data.

  Null hypothesis           p = 0     p ≤ 1     p ≤ 2
  Alternative hypothesis    p ≥ 1     p ≥ 2     p ≥ 3
  p-value                   0.000     0.000     0.436