Estimation for Longitudinal Varying Coefficient Partially Nonlinear Models Based on QR Decomposition

Ge, Jiangcui; Zhou, Xiaoshuang; Wang, Cuiping

doi:10.3390/axioms14120875


by Jiangcui Ge ^1,2, Xiaoshuang Zhou ^2,* and Cuiping Wang ^3

^1 School of Mathematics, North University of China, Taiyuan 030051, China
^2 School of Mathematics and Big Data, Dezhou University, Dezhou 253023, China
^3 School of Mathematics and Statistics, Shandong University of Technology, Zibo 255022, China
^* Author to whom correspondence should be addressed.

Axioms 2025, 14(12), 875; https://doi.org/10.3390/axioms14120875

Submission received: 28 October 2025 / Revised: 21 November 2025 / Accepted: 26 November 2025 / Published: 28 November 2025

(This article belongs to the Special Issue Computational Statistics and Its Applications, 2nd Edition)


Abstract

To address the loss of estimation efficiency caused by multicollinearity and within-subject correlation in varying coefficient partially nonlinear models (VCPNLM) for longitudinal data, a method based on QR decomposition and the quadratic inference function (QIF) is proposed to obtain orthogonality-based estimators of the parametric components and the varying coefficient functions. QR decomposition removes the ill-conditioning of the design matrix, and the QIF adaptively weights the within-subject working correlation structures, so that the complex correlation structure of longitudinal data is captured effectively. Theoretical analysis establishes the asymptotic properties of the estimators, and simulation experiments verify the efficiency of the proposed estimation method.

Keywords:

varying coefficient partially nonlinear model; longitudinal data; QR decomposition; quadratic inference function

MSC:

62G05; 62G20

1. Introduction

In recent years, longitudinal data have attracted much attention due to their wide applications in domains like biomedicine, economics and social sciences. This kind of data results from repeatedly observing the same subject at various time intervals. As a result, multiple measurement results of the same subject exhibit temporal or spatial dependence, which makes the traditional independent and identically distributed assumption no longer applicable. To address this issue, the VCPNLM has become one of the research hotspots, as it combines the interpretability of parametric models with the flexibility of nonparametric models. Li and Mei [1] put forward a profile nonlinear least squares estimation approach to estimate the parameter vector and coefficient function vector of the VCPNLM, and further derived the asymptotic characteristics of the obtained estimators.

Important progress has been made in model estimation in existing studies. Yu et al. [2] constructed robust estimators for the partially linear additive model under functional data. Yan et al. [3] explored empirical likelihood inference for the partially linear errors-in-variables model under longitudinal data. Xiao and Liang [4] proposed a robust two-stage estimation and variable selection method for the VCPNLM based on mode regression. Zhao et al. [5] combined orthogonal estimation techniques with empirical likelihood to develop an orthogonality-based empirical likelihood inference method, which estimates the parametric and nonparametric components in a class of varying coefficient partially linear instrumental variable models for longitudinal data. Liang and Zeger [6] applied generalized estimating equations to longitudinal data analysis to handle within-subject correlation. For the varying coefficient partially nonlinear quantile regression model under randomly left-censored data, Xu et al. [7] introduced a three-stage weighted estimation method. A comprehensive overview of longitudinal data analysis methods can be found in Diggle et al. [8].

The VCPNLM has received widespread attention in statistical research due to its flexibility in capturing dynamic relationships between variables and adaptability to complex data structures [4,6,7]. It integrates the advantages of varying coefficient models and partially nonlinear models, making it a powerful tool for practical applications such as biomedical research, economic forecasting, and environmental monitoring. However, the complexity of the model structure and the characteristics of real-world data pose great challenges to accurate parameter estimation, which motivates the need for further improvements in existing methods.

Despite the notable advancements mentioned above, significant challenges still exist in the existing methods for estimating the VCPNLM. For instance, the estimation of nonlinear parameters is easily disturbed by nonparametric components and longitudinal correlation structures, leading to reduced estimation efficiency; when there is multicollinearity among explanatory variables, the ill-posed nature of the design matrix will further decrease the stability of estimation. Such issues severely limit the performance and practicality of the model in practical applications.

To address the above issues, existing studies have proposed a variety of improved methods. For example, Qu et al. [9] used the QIF to improve on the generalized estimating equation, and the resulting estimator retains good efficiency even when the working correlation structure is misspecified. Bai et al. [10] applied the QIF to longitudinal data and showed that the resulting estimators have good asymptotic properties. For semiparametric varying coefficient partially linear models, Tian et al. [11] proposed a penalized QIF. Schumaker [12] utilized B-spline basis functions to approximate the varying coefficient part, which improves computational efficiency. For linear models with randomly missing responses, Wei et al. [13] introduced a model averaging approach. Jiang et al. [14] proposed an estimation method for the VCPNLM based on the exponential squared loss function. Xiao and Chen [15] advanced a procedure for local bias-corrected cross-sectional nonlinear least squares estimation. By adopting an orthogonal projection method, Yang and Yang [16] developed smooth-threshold estimating equations for the VCPNLM. Additional studies focusing on this model include Refs. [7,17,18,19]. These investigations address various data scenarios and model settings and offer targeted estimation methods and inference strategies that supplement the research framework in this field.

This paper proposes an orthogonal estimation framework that integrates QR decomposition and the QIF. QR decomposition can effectively eliminate the ill-posed nature of the design matrix and improve numerical stability. The QIF method avoids the limitations of traditional generalized estimating equations by adaptively weighting the intra-group correlation structure, significantly enhancing estimation efficiency. This study not only provides a new theoretical tool for longitudinal data analysis but also offers a feasible solution to complex modeling problems in practical applications.

The rest of this paper is organized as follows: Section 2 introduces the model specification and estimation method, including the specific implementation of QR decomposition and the QIF; Section 3 presents the asymptotic properties of the estimators; Section 4 examines the performance of the proposed method through simulation experiments; Appendix A contains the proofs of the main results.

2. Models and Methods

Consider the VCPNLM introduced by Li and Mei [1]:

Y = X^{T} α(U) + g(Z, β) + ε,

(1)

where $Y \in R$ is the response variable; $X \in R^{p}$, $U \in R$, and $Z \in R^{r}$ are covariates; $α(U) = (α_{1}(U), α_{2}(U), \dots, α_{p}(U))^{T}$ is a vector of unknown smooth functions $α_{k}(U)$, $k = 1, 2, \dots, p$; $g(\cdot, \cdot)$ is a nonlinear function of known form; $β = (β_{1}, β_{2}, \dots, β_{q})^{T}$ is a q-dimensional unknown parameter vector; and $ε$ is the model error with mean zero and variance $σ^{2}$.

2.1. Estimation of Parameter Vector

Considering model (1) under longitudinal data, suppose the j-th observation of the i-th individual satisfies

Y_{ij} = X_{ij}^{T} α(U_{ij}) + g(Z_{ij}, β) + ε_{ij}, \quad i = 1, 2, \dots, n, \; j = 1, 2, \dots, n_{i},

(2)

where $ε_{ij}$ has mean zero, and $X_{ij} \in R^{p}$, $Z_{ij} \in R^{r}$, and $U_{ij} \in R$ are covariates.

Based on the idea of Schumaker [12], each unknown function $α_{k}(\cdot)$, $k = 1, 2, \dots, p$, is approximated through basis functions. The B-spline basis functions are denoted by the vector $B(u) = (B_{1}(u), B_{2}(u), \dots, B_{L}(u))^{T}$. The dimension L is defined as $L = K + M$, where K and M represent the number of interior knots and the order of the spline, respectively. Then $α_{k}(u)$ is approximately expressed as

α_{k}(u) \approx B(u)^{T} γ_{k}, \quad k = 1, 2, \dots, p;

(3)

here, $γ_{k} = (γ_{k1}, γ_{k2}, \dots, γ_{kL})^{T}$ is the coefficient vector for the B-spline basis functions. Model (2) can then be expressed as

Y_{ij} \approx W_{ij}^{T} γ + g(Z_{ij}, β) + ε_{ij}, \quad i = 1, 2, \dots, n, \; j = 1, 2, \dots, n_{i},

(4)

where $W_{ij} = (I_{p} \otimes B(U_{ij})) X_{ij}$, ⊗ is the Kronecker product, and $γ = (γ_{1}^{T}, γ_{2}^{T}, \dots, γ_{p}^{T})^{T}$. Let $Y_{i} = (Y_{i1}, Y_{i2}, \dots, Y_{in_{i}})^{T}$, $W_{i} = (W_{i1}, W_{i2}, \dots, W_{in_{i}})^{T}$, $g(Z_{i}, β) = (g(Z_{i1}, β), g(Z_{i2}, β), \dots, g(Z_{in_{i}}, β))^{T}$, and $ε_{i} = (ε_{i1}, ε_{i2}, \dots, ε_{in_{i}})^{T}$. Then model (4) is expressed as

Y_{i} \approx W_{i} γ + g(Z_{i}, β) + ε_{i}, \quad i = 1, 2, \dots, n.

(5)
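The construction of $W_{i}$ in Equations (3) to (5) can be sketched numerically. The following is a minimal illustration, assuming clamped knots on $[0, 1]$ and a hand-written Cox-de Boor recursion; the helper names `bspline_basis`, `design_row`, and `design_matrix`, as well as the sizes used at the end, are hypothetical and not taken from the paper.

```python
import numpy as np

def bspline_basis(u, interior_knots, order=4):
    """Evaluate the L = K + M B-spline basis functions at points u in [0, 1]
    with the Cox-de Boor recursion (order M = degree + 1, clamped knots)."""
    u = np.atleast_1d(np.asarray(u, dtype=float))
    t = np.concatenate([np.zeros(order), np.asarray(interior_knots, dtype=float), np.ones(order)])
    # Order-1 (piecewise constant) basis on half-open intervals [t_j, t_{j+1}).
    B = np.zeros((len(u), len(t) - 1))
    for j in range(len(t) - 1):
        B[:, j] = (t[j] <= u) & (u < t[j + 1])
    # Assign u == 1 to the last non-empty interval so the basis still sums to one.
    B[u == 1.0, len(t) - order - 1] = 1.0
    # Raise the order step by step.
    for k in range(2, order + 1):
        B_new = np.zeros((len(u), len(t) - k))
        for j in range(len(t) - k):
            left = t[j + k - 1] - t[j]
            right = t[j + k] - t[j + 1]
            if left > 0:
                B_new[:, j] += (u - t[j]) / left * B[:, j]
            if right > 0:
                B_new[:, j] += (t[j + k] - u) / right * B[:, j + 1]
        B = B_new
    return B  # shape (len(u), K + M)

def design_row(x_ij, b_ij):
    """W_ij = (I_p kron B(U_ij)) X_ij, a vector of length p * L."""
    p = len(x_ij)
    return np.kron(np.eye(p), b_ij.reshape(-1, 1)) @ x_ij

def design_matrix(X_i, U_i, interior_knots, order=4):
    """Stack the rows W_ij^T, j = 1, ..., n_i, into the n_i x (p * L) matrix W_i."""
    B_i = bspline_basis(U_i, interior_knots, order)
    return np.vstack([design_row(x, b) for x, b in zip(X_i, B_i)])

# Illustrative sizes (assumptions): n_i = 8, p = 1, K = 1, M = 4, so L = 5.
rng = np.random.default_rng(0)
X_i = rng.normal(size=(8, 1))
U_i = rng.uniform(size=8)
W_i = design_matrix(X_i, U_i, interior_knots=[0.5], order=4)
print(W_i.shape)
```

Note that $(I_{p} \otimes B(U_{ij})) X_{ij}$ equals the Kronecker product of $X_{ij}$ with $B(U_{ij})$, so `np.kron(x_ij, b_ij)` would give the same vector; the longer form is kept here to mirror the notation of Equation (4).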

We first state a fundamental result on the QR decomposition of full-column-rank matrices, which is essential for the subsequent steps.

Lemma 1 (QR Decomposition for Full-Column-Rank Matrices). Let $W \in R^{m \times k}$ be a matrix with full column rank k. Then there exist an orthogonal matrix $Q \in R^{m \times m}$ and an upper triangular matrix $R \in R^{k \times k}$ satisfying

W = Q \begin{pmatrix} R \\ 0 \end{pmatrix},

where $0$ is the $(m - k) \times k$ zero matrix. Moreover, Q can be partitioned as $Q = (Q_{1}, Q_{2})$, with $Q_{1} \in R^{m \times k}$ and $Q_{2} \in R^{m \times (m - k)}$, satisfying $Q_{2}^{T} Q_{1} = 0$.

We now proceed with the derivation from Equation (5). Suppose that, for all $i = 1, 2, \dots, n$, the matrices $W_{i}$ have full column rank. Their QR decompositions can then be expressed as

W_{i} = Q_{i} \begin{pmatrix} R_{i} \\ 0 \end{pmatrix}, \quad i = 1, 2, \dots, n,

(6)

where $Q_{i}$, $R_{i}$, and $0$ are defined as in Lemma 1. The matrix $Q_{i}$ can be partitioned as $Q_{i} = (Q_{i1}, Q_{i2})$, where $Q_{i1}$ is an $n_{i} \times pL$ matrix and $Q_{i2}$ is an $n_{i} \times (n_{i} - pL)$ matrix. Substituting this partition into (6) gives $W_{i} = Q_{i1} R_{i}$. Since the columns of an orthogonal matrix are mutually orthogonal, $Q_{i2}^{T} Q_{i1} = 0$, and hence $Q_{i2}^{T} W_{i} = Q_{i2}^{T} Q_{i1} R_{i} = 0$. Multiplying both sides of Equation (5) by $Q_{i2}^{T}$, we get

Q_{i2}^{T} Y_{i} \approx Q_{i2}^{T} g(Z_{i}, β) + Q_{i2}^{T} ε_{i}, \quad i = 1, 2, \dots, n.

(7)
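The projection step behind Equation (7) can be checked numerically. The sketch below uses a random matrix as a stand-in for $W_{i}$ and random vectors as stand-ins for $g(Z_{i}, β)$ and $ε_{i}$; the sizes and the seed are illustrative assumptions, and `numpy.linalg.qr` with `mode="complete"` is used to obtain the full orthogonal factor.

```python
import numpy as np

rng = np.random.default_rng(1)
n_i, k = 8, 5                        # n_i observations, k = p * L columns (assumed sizes)
W_i = rng.normal(size=(n_i, k))      # stand-in for the spline design matrix W_i

# mode="complete" returns the full n_i x n_i orthogonal factor Q_i.
Q_i, R_full = np.linalg.qr(W_i, mode="complete")
Q_i1, Q_i2 = Q_i[:, :k], Q_i[:, k:]  # Q_i2 spans the orthogonal complement of col(W_i)
R_i = R_full[:k, :]                  # upper triangular k x k block

gamma = rng.normal(size=k)
g_i = rng.normal(size=n_i)           # stand-in for g(Z_i, beta)
eps_i = 0.1 * rng.normal(size=n_i)   # stand-in for the error vector
Y_i = W_i @ gamma + g_i + eps_i

# Left-multiplying by Q_i2^T removes the spline part W_i @ gamma.
lhs = Q_i2.T @ Y_i
rhs = Q_i2.T @ g_i + Q_i2.T @ eps_i
print(np.allclose(Q_i2.T @ W_i, 0.0), np.allclose(lhs, rhs))
```

The projected response has only $n_{i} - pL$ components, so the step requires $n_{i} > pL$ for every subject.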

Model (7) is a regression model that contains only the unknown parameter vector $β$. Following Liang and Zeger [6], the generalized estimating equation for $β$ can be formulated as

\sum_{i=1}^{n} g^{(1)}(Z_{i}, β)^{T} Q_{i2} V_{i}^{-1} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β)) = 0,

(8)

where $g^{(1)}(Z_{i}, β) = (g^{(1)}(Z_{i1}, β), \dots, g^{(1)}(Z_{in_{i}}, β))^{T}$ with $g^{(1)}(Z_{ij}, β) = \partial g(Z_{ij}, β) / \partial β$, $V_{i} = Q_{i2}^{T} Σ_{i} Q_{i2}$, and $Σ_{i}$ is the covariance matrix of $Y_{i}$. Following Liang and Zeger [6], $V_{i}$ can also be expressed as $V_{i} = A_{i}^{\frac{1}{2}} R(ρ) A_{i}^{\frac{1}{2}}$, where $A_{i}$ is a diagonal matrix, $R(ρ)$ is a working correlation matrix, and $ρ$ is a correlation parameter. Since a consistent estimator of $ρ$ is not always available in practice, we adopt the QIF method and approximate the inverse working correlation matrix $R^{-1}(ρ)$ by a linear combination of several basis matrices, thereby avoiding direct specification of the correlation structure. Drawing on the classic specifications proposed by Qu et al. [9] and on the correlation structure of the data in this study, we select a set of basis matrices $M_{1}, M_{2}, \dots, M_{s}$ with corresponding coefficients $a_{1}, a_{2}, \dots, a_{s}$ that satisfy

R^{-1}(ρ) \approx a_{1} M_{1} + a_{2} M_{2} + \dots + a_{s} M_{s}.

(9)

Substituting (9) into (8), we obtain

\sum_{i=1}^{n} g^{(1)}(Z_{i}, β)^{T} Q_{i2} A_{i}^{-\frac{1}{2}} (a_{1} M_{1} + \dots + a_{s} M_{s}) A_{i}^{-\frac{1}{2}} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β)) = 0.

(10)

Define the extended score function as follows:

G_{n}(β) = \frac{1}{n} \sum_{i=1}^{n} η_{i}(β) ≜ \frac{1}{n} \sum_{i=1}^{n} \begin{pmatrix} g^{(1)}(Z_{i}, β)^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{1} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β)) \\ g^{(1)}(Z_{i}, β)^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{2} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β)) \\ \vdots \\ g^{(1)}(Z_{i}, β)^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{s} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β)) \end{pmatrix}.

(11)

Thus, the QIF for $β$ can be defined as

M_{n}(β) = G_{n}^{T}(β) Φ_{n}^{-1}(β) G_{n}(β),

(12)

where $Φ_{n}(β) = \frac{1}{n} \sum_{i=1}^{n} η_{i}(β) η_{i}^{T}(β)$. The estimator $\hat{β}$ of $β$ is obtained by minimizing the objective function $M_{n}(β)$:

\hat{β} = \arg\min_{β} M_{n}(β).

(13)
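To make Equations (11) to (13) concrete, the following sketch evaluates the QIF objective $M_{n}(β)$ on synthetic data. Several choices here are assumptions made only for illustration: $A_{i}$ is taken to be the identity, the basis matrices are the identity and an exchangeable-type matrix, $g$ has the form used in the simulation study of Section 4, and the function names `g`, `g_grad`, and `qif_objective` are hypothetical. It is not the authors' implementation.

```python
import numpy as np

def g(Z, beta):
    """g(Z, beta) = beta_1 * Z_1 + beta_2 * exp(Z_2), the form used in Section 4."""
    return beta[0] * Z[:, 0] + beta[1] * np.exp(Z[:, 1])

def g_grad(Z, beta):
    """n_i x q matrix of partial derivatives of g with respect to beta."""
    return np.column_stack([Z[:, 0], np.exp(Z[:, 1])])

def qif_objective(beta, Y, Z, Q2, basis_mats):
    """M_n(beta) = G_n^T Phi_n^{-1} G_n, with A_i = I (an assumption of this sketch)."""
    etas = []
    for Y_i, Z_i, Q2_i in zip(Y, Z, Q2):
        resid = Q2_i.T @ (Y_i - g(Z_i, beta))   # projected residual
        D = Q2_i.T @ g_grad(Z_i, beta)          # projected derivative matrix
        etas.append(np.concatenate([D.T @ M @ resid for M in basis_mats]))
    etas = np.array(etas)                       # n x (s * q) matrix of eta_i
    G = etas.mean(axis=0)                       # extended score G_n
    Phi = etas.T @ etas / len(etas)             # Phi_n
    return float(G @ np.linalg.solve(Phi, G))

# Synthetic data (sizes, seed and noise level are illustrative assumptions).
rng = np.random.default_rng(2)
n, n_i, k = 200, 8, 4
beta0 = np.array([0.8, -0.5])
d = n_i - k
M1 = np.eye(d)
M2 = np.ones((d, d)) - np.eye(d)                # exchangeable-type basis matrix
Y, Z, Q2 = [], [], []
for _ in range(n):
    W_i = rng.normal(size=(n_i, k))             # stand-in for the spline design matrix
    Z_i = rng.normal(size=(n_i, 2))
    Q_i, _ = np.linalg.qr(W_i, mode="complete")
    Y.append(W_i @ np.ones(k) + g(Z_i, beta0) + 0.5 * rng.normal(size=n_i))
    Z.append(Z_i)
    Q2.append(Q_i[:, k:])

m_true = qif_objective(beta0, Y, Z, Q2, [M1, M2])
m_far = qif_objective(beta0 + 1.0, Y, Z, Q2, [M1, M2])
print(m_true, m_far)
```

In practice $\hat{β}$ would be obtained by passing `qif_objective` to a numerical optimizer; the sketch only compares the objective at the true value with its value at a perturbed point.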

2.2. Estimation of the Coefficient Functions

The QIF handles the correlation in longitudinal data and improves estimation efficiency by constructing an extended set of estimating equations. Therefore, after obtaining the estimate $\hat{β}$ of the parameter vector $β$, we also use the QIF method to estimate the coefficient functions $α_{k}(u)$, $k = 1, 2, \dots, p$. Substituting the estimator $\hat{β}$ into Equation (5) results in

\tilde{Y}_{i} \approx W_{i} γ + ε_{i}, \quad i = 1, 2, \dots, n,

(14)

where $\tilde{Y}_{i} = Y_{i} - g(Z_{i}, \hat{β})$. Assume that the covariance matrix of $\tilde{Y}_{i}$ is $Ψ_{i}$, with structure $Ψ_{i} = A_{i}^{\frac{1}{2}} R(ρ) A_{i}^{\frac{1}{2}}$ following Liang and Zeger [6], where $A_{i}$ and $R(ρ)$ are defined as in the previous subsection. If $Ψ_{i}$ were known, the estimating equation would be

\sum_{i=1}^{n} W_{i}^{T} Ψ_{i}^{-1} (\tilde{Y}_{i} - W_{i} γ) = 0.

(15)

In practical applications, $Ψ_{i}$ is usually unknown. We therefore again adopt the QIF method and use $M_{1}, M_{2}, \dots, M_{s}$ to approximate $R^{-1}(ρ)$, which gives

\sum_{i=1}^{n} W_{i}^{T} A_{i}^{-\frac{1}{2}} (a_{1} M_{1} + a_{2} M_{2} + \dots + a_{s} M_{s}) A_{i}^{-\frac{1}{2}} (\tilde{Y}_{i} - W_{i} γ) = 0.

(16)

Define the extended score function as follows:

G_{n}(γ) = \frac{1}{n} \sum_{i=1}^{n} φ_{i}(γ) ≜ \frac{1}{n} \sum_{i=1}^{n} \begin{pmatrix} W_{i}^{T} A_{i}^{-\frac{1}{2}} M_{1} A_{i}^{-\frac{1}{2}} (\tilde{Y}_{i} - W_{i} γ) \\ W_{i}^{T} A_{i}^{-\frac{1}{2}} M_{2} A_{i}^{-\frac{1}{2}} (\tilde{Y}_{i} - W_{i} γ) \\ \vdots \\ W_{i}^{T} A_{i}^{-\frac{1}{2}} M_{s} A_{i}^{-\frac{1}{2}} (\tilde{Y}_{i} - W_{i} γ) \end{pmatrix}.

(17)

Thus, the QIF for $γ$ is defined as

Q_{n}(γ) = G_{n}^{T}(γ) C_{n}^{-1}(γ) G_{n}(γ),

(18)

where $C_{n}(γ) = \frac{1}{n} \sum_{i=1}^{n} φ_{i}(γ) φ_{i}^{T}(γ)$. The estimator $\hat{γ}$ of $γ$ is obtained by minimizing the objective function $Q_{n}(γ)$:

\hat{γ} = \arg\min_{γ} Q_{n}(γ).

(19)

The estimate of the coefficient function $α_{k}(u)$ is then

\hat{α}_{k}(u) = B(u)^{T} \hat{γ}_{k}, \quad k = 1, 2, \dots, p,

(20)

where $\hat{γ}_{k}$ is the subvector of $\hat{γ}$ corresponding to the k-th coefficient function.
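Because the extended score for $γ$ is linear in $γ$, the objective $Q_{n}(γ)$ has a closed-form minimizer once the weighting matrix is held fixed. The sketch below uses this in a two-step scheme: an identity weight first, then the weight $C_{n}$ evaluated at the first-step estimate. This two-step scheme is a simplification chosen for illustration rather than the procedure stated in the paper, $A_{i}$ is taken to be the identity, and the function name `qif_gamma_two_step` and all data settings are hypothetical.

```python
import numpy as np

def qif_gamma_two_step(Y_tilde, W, basis_mats):
    """Two-step approximate minimiser of Q_n(gamma) with A_i = I (assumption).
    Y_tilde[i] plays the role of Y_i - g(Z_i, beta_hat)."""
    # phi_i(gamma) = b_i - D_i @ gamma, with one block per basis matrix M_k.
    b = np.array([np.concatenate([W_i.T @ M @ y_i for M in basis_mats])
                  for y_i, W_i in zip(Y_tilde, W)])      # n x (s * pL)
    D = np.array([np.vstack([W_i.T @ M @ W_i for M in basis_mats])
                  for W_i in W])                         # n x (s * pL) x pL
    b_bar, D_bar = b.mean(axis=0), D.mean(axis=0)

    def solve(C):
        # Minimiser of (b_bar - D_bar g)^T C^{-1} (b_bar - D_bar g) for fixed C.
        CiD = np.linalg.solve(C, D_bar)
        Cib = np.linalg.solve(C, b_bar)
        return np.linalg.solve(D_bar.T @ CiD, D_bar.T @ Cib)

    gamma = solve(np.eye(len(b_bar)))    # step 1: identity weight
    phi = b - D @ gamma                  # phi_i at the step-1 estimate, n x (s * pL)
    C = phi.T @ phi / len(phi)           # C_n at the step-1 estimate
    return solve(C)                      # step 2: re-weighted solution

# Synthetic check (all sizes and values below are illustrative assumptions).
rng = np.random.default_rng(3)
n, n_i, pL = 200, 8, 3
gamma0 = np.array([1.0, -2.0, 0.5])
M1 = np.eye(n_i)
M2 = np.ones((n_i, n_i)) - np.eye(n_i)
W = [rng.normal(size=(n_i, pL)) for _ in range(n)]
Y_tilde = [W_i @ gamma0 + 0.5 * rng.normal(size=n_i) for W_i in W]
gamma_hat = qif_gamma_two_step(Y_tilde, W, [M1, M2])
print(np.round(gamma_hat, 3))
```

With $p$ coefficient functions, the estimated vector would be split into $p$ consecutive blocks of length $L$, and each block combined with $B(u)$ as in Equation (20).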

3. Main Conclusions

This section studies the asymptotic properties of the estimators $\hat{β}$ and $\hat{α}_{k}(u)$, $k = 1, 2, \dots, p$. Let $β_{0}$ and $α_{0}(\cdot)$ denote the true values of $β$ and $α(\cdot)$, respectively, let $γ_{0}$ denote the true value of $γ$, and let $α_{k0}(\cdot)$ and $γ_{k0}$ denote the k-th components of $α_{0}(\cdot)$ and $γ_{0}$, respectively. We first present some regularity conditions that are common in longitudinal data analysis:

(C1): The support $Ω$ of the random variable U is bounded, and its probability density function $f_{u} (\cdot)$ has continuous second-order derivatives.
(C2): The varying coefficient functions $α_{1} (u), \dots, α_{p} (u)$ are continuously differentiable of order r on $[0, 1]$ , where $r > 2$ .
(C3): For arbitrary Z, $g (Z, β)$ exhibits continuity with respect to $β$ , and $g (Z, β)$ has continuous partial derivatives of order r.
(C4): ${sup}_{i} ∥E (ε_{i} ε_{i}^{T})∥ < \infty$ holds, and there exists some $δ > 0$ satisfying $E ({∥ε_{i}∥}^{2 + δ}) < \infty$ .
(C5): The covariates $X_{i j}$ and $Z_{i j}$ are assumed to satisfy the following conditions: $sup_{i j} E \{{∥X_{i j}∥}^{4}\} < \infty$ , $sup_{i j} E \{{∥Z_{i j}∥}^{4}\} < \infty$ , $i = 1, 2, \dots, n, j = 1, 2, \dots, n_{i} .$
(C6): Let $t_{1}, \dots, t_{K}$ be interior knots on $[0, 1]$. Furthermore, let $t_{0} = 0$, $t_{K + 1} = 1$, and $ξ_{i} = t_{i} - t_{i - 1}$. Then a constant $C_{0}$ exists such that $\frac{\max \{ξ_{i}\}}{\min \{ξ_{i}\}} \leq C_{0}$ and $\max \{|ξ_{i + 1} - ξ_{i}|\} = o(K^{-1})$.
(C7): Define $P_{ik} = Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T}$. Then

$\frac{1}{n} \sum_{i=1}^{n} E \{g^{(1)}(Z_{i}, β)^{T} P_{ik} g^{(1)}(Z_{i}, β)\} \overset{P}{\to} D_{k}, \quad k = 1, 2, \dots, s,$

$\frac{1}{n} \sum_{i=1}^{n} E \{g^{(1)}(Z_{i}, β)^{T} P_{il} V_{i} P_{ik}^{T} g^{(1)}(Z_{i}, β)\} \overset{P}{\to} Ω_{lk}, \quad l, k = 1, 2, \dots, s.$

Here, $D_{k}$ and $Ω_{lk}$ are constant matrices, and $\overset{P}{\to}$ denotes convergence in probability. Define $Ω_{0} = (Ω_{lk})_{ps \times ps}$ and $Γ_{0} = (D_{1}^{T}, \dots, D_{s}^{T})^{T}$, and assume that $Ω_{0}$ and $Γ_{0}^{T} Ω_{0}^{-1} Γ_{0}$ are both invertible.

Conditions (C1) to (C5) are standard conditions for the VCPNLM, condition (C6) indicates that $t_{0}, \dots, t_{K + 1}$ is a quasi-uniform partition of the interval $[0, 1]$, and condition (C7) is used in the subsequent proofs.

Theorem 1. Under conditions (C1) to (C7), as $n \to \infty$,

\sqrt{n} (\hat{β} - β_{0}) \overset{L}{\to} N(0, Σ),

where $Σ = (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1}$ and $\overset{L}{\to}$ denotes convergence in distribution.
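As a small numerical illustration of Theorem 1, the sketch below computes $Σ = (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1}$ for toy inputs and turns it into normal-approximation intervals. The Wald-type interval is a standard construction assumed here for illustration; the paper does not state how its confidence intervals are built, and the function names and toy matrices are hypothetical.

```python
import numpy as np

def asymptotic_cov(Gamma, Omega):
    """Sigma = (Gamma^T Omega^{-1} Gamma)^{-1}, the limiting covariance in Theorem 1."""
    return np.linalg.inv(Gamma.T @ np.linalg.solve(Omega, Gamma))

def wald_interval(beta_hat, Sigma, n, z=1.959964):
    """Normal-approximation interval beta_hat_j +/- z * sqrt(Sigma_jj / n) (assumed construction)."""
    half = z * np.sqrt(np.diag(Sigma) / n)
    return beta_hat - half, beta_hat + half

# Toy inputs with s = 2 basis matrices and q = 2 parameters (assumptions).
Gamma0 = np.vstack([np.eye(2), np.eye(2)])   # stacked D_1, D_2
Omega0 = np.eye(4)
Sigma = asymptotic_cov(Gamma0, Omega0)       # equals 0.5 * I_2 for this toy input
lower, upper = wald_interval(np.array([0.8, -0.5]), Sigma, n=200)
print(Sigma)
print(lower, upper)
```

In an application, $Γ_{0}$ and $Ω_{0}$ would be replaced by sample versions evaluated at $\hat{β}$.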

Theorem 2. Under conditions (C1) to (C7), if the number of knots satisfies $K = O(n^{\frac{1}{2r + 1}})$, then as $n \to \infty$,

∥\hat{α}_{k}(\cdot) - α_{k0}(\cdot)∥ = O_{p}(n^{-\frac{r}{2r + 1}}), \quad k = 1, 2, \dots, p,

where $∥\cdot∥$ denotes the $L_{2}$ norm of a function.
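The orders in Theorem 2 can be tabulated for the sample sizes used later in the simulation study. The sketch below ignores the unknown constants hidden in the $O(\cdot)$ and $O_{p}(\cdot)$ terms, and the smoothness order $r = 3$ is an arbitrary illustrative choice satisfying $r > 2$; the function name `spline_rates` is hypothetical.

```python
def spline_rates(n, r):
    """Knot-count order n^{1/(2r+1)} and L2 rate n^{-r/(2r+1)} from Theorem 2 (constants omitted)."""
    return n ** (1.0 / (2 * r + 1)), n ** (-r / (2 * r + 1))

for n in (100, 150, 200):
    k_order, rate = spline_rates(n, r=3)
    print(n, round(k_order, 3), round(rate, 4))
```

The knot-count order grows slowly with $n$, while the error bound shrinks at a rate that approaches $n^{-1/2}$ as $r$ increases.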

4. Simulation Study

This section assesses the finite-sample performance of the proposed orthogonality-based estimation method, which combines QR decomposition and the QIF, for the VCPNLM through a Monte Carlo simulation study. We consider the following model:

Y = X^{T} α(U) + g(Z, β) + ε,

where the covariates $X$ and $Z = (Z_{1}, Z_{2})^{T}$ both follow normal distributions, $g(Z, β) = β_{1} Z_{1} + β_{2} \exp(Z_{2})$ with parameter vector $β = (β_{1}, β_{2})^{T} = (0.8, -0.5)^{T}$, $U \sim U(0, 1)$, and the coefficient function is $α(U) = \sin(2 π U)$. The error term $ε$ follows an AR(1) process, $ε_{i,j} = 0.5 ε_{i,j-1} + v_{i,j}$, where $v_{i,j} \sim N(0, 0.5 \sqrt{1 - 0.5^{2}})$.

The sample size is set to $n = 100, 150, 200$; for the i-th subject, the number of repeated measurements is $n_{i} \equiv 8$, and 1000 simulation runs are conducted for each case. The method combining QR decomposition and QIF proposed in this paper (OQIF) is compared with the profile nonlinear least squares method (PNLS) introduced by Li and Mei [1]. The results are summarized in Table 1 and Table 2.
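The data-generating design described above can be sketched as follows. Several details are assumptions because the text leaves them open: the covariates are drawn as standard normal, the stated innovation parameter $0.5 \sqrt{1 - 0.5^{2}}$ is read as a standard deviation, and the AR(1) process is started from its stationary distribution. The function names are hypothetical.

```python
import numpy as np

def simulate_subject(rng, n_i=8, beta=(0.8, -0.5), rho=0.5, innov_sd=None):
    """Draw one subject from Y = X * sin(2 pi U) + beta_1 Z_1 + beta_2 exp(Z_2) + eps."""
    if innov_sd is None:
        # Assumption: the stated value 0.5 * sqrt(1 - 0.5^2) is a standard deviation.
        innov_sd = 0.5 * np.sqrt(1 - rho ** 2)
    X = rng.normal(size=n_i)              # assumption: standard normal covariate
    Z = rng.normal(size=(n_i, 2))         # assumption: standard normal covariates
    U = rng.uniform(size=n_i)
    eps = np.empty(n_i)
    # Assumption: start the AR(1) errors from the stationary distribution.
    eps[0] = rng.normal(scale=innov_sd / np.sqrt(1 - rho ** 2))
    for j in range(1, n_i):
        eps[j] = rho * eps[j - 1] + rng.normal(scale=innov_sd)
    Y = X * np.sin(2 * np.pi * U) + beta[0] * Z[:, 0] + beta[1] * np.exp(Z[:, 1]) + eps
    return Y, X, U, Z

def simulate_dataset(n, seed=0):
    """Generate n independent subjects with a reproducible seed."""
    rng = np.random.default_rng(seed)
    return [simulate_subject(rng) for _ in range(n)]

data = simulate_dataset(100, seed=0)
print(len(data), data[0][0].shape)
```

Repeating `simulate_dataset` with 1000 different seeds for each of $n = 100, 150, 200$ would reproduce the structure of the Monte Carlo design, though not necessarily the exact numbers reported in the tables.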

As illustrated in Table 1, increasing the sample size leads to a reduction in both bias and standard deviation for both methods. Notably, the OQIF method demonstrates smaller bias and standard deviation, indicating superior estimation accuracy. Although larger sample sizes generally enhance the precision of both methods, the OQIF method outperforms in terms of bias control.

The results in Table 2 show that the mean length of the confidence intervals for the OQIF method is shorter than that for the PNLS method, demonstrating higher estimation efficiency. The coverage rate of the OQIF method is closer to the nominal 95%, whereas the coverage rate of the PNLS method, although it improves with the sample size, remains below 95%.

In conclusion, as the sample size grows from 100 to 200, the OQIF method demonstrates higher estimation accuracy and stronger robustness across all evaluation indicators. These trend analyses further confirm the theoretical advantages and practical application value of the OQIF method in handling nonlinear longitudinal data.

We further conducted 1000 simulations and plotted boxplots of the resulting 1000 RMSE values for $\hat{β}_{1}$ and $\hat{β}_{2}$, as shown in Figure 1 and Figure 2. The figures show the following. The RMSE values of $\hat{β}_{1}$ and $\hat{β}_{2}$ for both methods decrease as the sample size increases; however, the OQIF method already performs well with a small sample size (n = 100), while the PNLS method requires a larger sample size to achieve similar accuracy. The boxplots of the OQIF method are more symmetric and compact, indicating that the distribution of its estimators is closer to the normal distribution and has better statistical properties. Additionally, the OQIF method has noticeably fewer outliers than the PNLS method, demonstrating stronger robustness against abnormal data. This suggests that the overall performance of the OQIF method in parameter estimation is superior to that of the PNLS method, with more pronounced advantages in finite samples.

Next, we estimate the coefficient function $α(U)$ over 1000 simulation runs and draw boxplots of the RMSE of $\hat{α}(U)$ under different sample sizes, as shown in Figure 3.

As can be seen from Figure 3, as the sample size increases, the error distributions of both the OQIF method and the PNLS method become more concentrated, but the OQIF method has much smaller errors than the PNLS method. This further supports the advantage of the OQIF method for the VCPNLM.

Author Contributions

Conceptualization, J.G., X.Z. and C.W.; methodology, J.G., X.Z. and C.W.; software, J.G., and C.W.; validation, J.G., X.Z. and C.W.; data curation, J.G. and X.Z.; writing—original draft preparation, J.G.; writing—review and editing, X.Z.; funding acquisition, X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of Shandong Province (Grant No. ZR2022MA065).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

VCPNLM	Varying coefficient partially nonlinear model
QIF	Quadratic inference function
OQIF	The combination of QR decomposition and QIF
PNLS	Profile nonlinear least squares

Appendix A

To prove Theorems 1 and 2, we first present the following lemmas.

Lemma A1. Let $c \in R_{+}$, and let $ξ_{i}$, $i = 1, 2, \dots, n$, be a sequence of independent and identically distributed random variables. If $E(ξ_{i}) = 0$ and $E(ξ_{i}^{2}) < c < \infty$, then

\max_{1 \leq k \leq n} \left| \sum_{i=1}^{k} ξ_{i} \right| = O_{p}(\sqrt{n} \log n).

Proof. The result is proven in Lemma A2 of Zhao and Xue [20]. □

Lemma A2. Assume that conditions (C1) to (C7) hold. Then, as $n \to \infty$,

\sqrt{n} G_{n}(β_{0}) \overset{L}{\to} N(0, Ω_{0}),

where $Ω_{0}$ is given in condition (C7).

Proof. Let $R(U_{i}) = (X_{i1}^{T} r(U_{i1}), \dots, X_{in_{i}}^{T} r(U_{in_{i}}))^{T}$, $r(u) = (r_{1}(u), \dots, r_{p}(u))^{T}$, and $r_{k}(u) = α_{k}(u) - B(u)^{T} γ_{k}$. Combining this with $Q_{i2}^{T} W_{i} = 0$, we obtain

\begin{aligned} Q_{i2}^{T} Y_{i} &= Q_{i2}^{T} g(Z_{i}, β_{0}) + Q_{i2}^{T} W_{i} γ_{0} + Q_{i2}^{T} R(U_{i}) + Q_{i2}^{T} ε_{i} \\ &= Q_{i2}^{T} g(Z_{i}, β_{0}) + Q_{i2}^{T} R(U_{i}) + Q_{i2}^{T} ε_{i}. \end{aligned}

(A1)

Let $G_{n,k}(β)$ represent the k-th block of $G_{n}(β)$. Combining (A1) with Equation (11), we obtain

\begin{aligned} \sqrt{n} G_{n,k}(β_{0}) &= \frac{1}{\sqrt{n}} \sum_{i=1}^{n} g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β_{0})) \\ &= \frac{1}{\sqrt{n}} \sum_{i=1}^{n} g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} R(U_{i}) \\ &\quad + \frac{1}{\sqrt{n}} \sum_{i=1}^{n} g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} ε_{i}. \end{aligned}

(A2)

From conditions (C2) and (C5) and Corollary 6.21 of Schumaker [12], it follows that $∥R(U_{i})∥ = O(K^{-r})$. Combining this with Lemma A1, we obtain

\frac{1}{\sqrt{n}} \sum_{i=1}^{n} g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} R(U_{i}) = O_{p}(n^{-\frac{1}{2}} n^{\frac{1}{2}} \log n \, K^{-r}) = o_{p}(1).

(A3)

Let $ζ_{ik} = g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} (Y_{i} - g(Z_{i}, β_{0}))$. Since $ε_{i}$ has expectation 0 and covariance $V_{i}$ given $X_{i}$ and $Z_{i}$, we can conclude that $E(ζ_{ik} | X_{i}, Z_{i}) = 0$ and $cov(ζ_{ik} | X_{i}, Z_{i}) = g^{(1)}(Z_{i}, β_{0})^{T} P_{ik} V_{i} P_{ik}^{T} g^{(1)}(Z_{i}, β_{0})$, in which $P_{ik}$ is defined in condition (C7). Further, let $ζ_{i} = (ζ_{i1}^{T}, ζ_{i2}^{T}, \dots, ζ_{is}^{T})^{T}$. Then $E(ζ_{i} | X_{i}, Z_{i}) = 0$ and

cov(ζ_{i} | X_{i}, Z_{i}) = \begin{pmatrix} N_{i11} & \dots & N_{i1s} \\ \vdots & \ddots & \vdots \\ N_{is1} & \dots & N_{iss} \end{pmatrix},

(A4)

where

N_{ilk} = g^{(1)}(Z_{i}, β_{0})^{T} P_{il} V_{i} P_{ik}^{T} g^{(1)}(Z_{i}, β_{0}), \quad l, k = 1, 2, \dots, s.

Combining this with condition (C7) and the Law of Large Numbers, we conclude that

\frac{1}{n} \sum_{i=1}^{n} N_{ilk} \overset{P}{\to} Ω_{lk}.

(A5)

Furthermore, for any constant vector $a \in R^{sp}$ satisfying $a^{T} a = 1$, the expectation of $a^{T} ζ_{i}$ is 0 and

\sup_{i} E ∥a^{T} ζ_{i}∥^{2 + δ} \leq ∥a∥^{2 + δ} \sup_{i} E ∥ζ_{i}∥^{2 + δ} \leq C \sup_{i} E ∥ε_{i}∥^{2 + δ} < \infty,

where C is a positive constant. Therefore, $a^{T} ζ_{i}$ satisfies the Lyapunov condition, and we obtain

\frac{\sum_{i=1}^{n} a^{T} ζ_{i}}{(a^{T} \sum_{i=1}^{n} cov(ζ_{i}) a)^{\frac{1}{2}}} \overset{L}{\to} N(0, 1).

Combining Equations (A4) and (A5), we obtain

\frac{1}{n} \sum_{i=1}^{n} cov(ζ_{i}) \overset{P}{\to} Ω_{0}.

(A6)

Finally, combining this with Equations (A2) and (A3), we get

\sqrt{n} G_{n}(β_{0}) = \frac{1}{\sqrt{n}} \sum_{i=1}^{n} ζ_{i} + o_{p}(1) \overset{L}{\to} N(0, Ω_{0}).

(A7)

□

Lemma A3. Under conditions (C1) to (C7), we have $\dot{G}_{n}(β_{0}) \overset{P}{\to} -Γ_{0}$ and $Ω_{n}(β_{0}) \overset{P}{\to} Ω_{0}$, where $Ω_{n}(β)$ denotes the matrix written as $Φ_{n}(β)$ in Equation (12), and $Γ_{0}$ and $Ω_{0}$ are specified in condition (C7).

Proof. A direct calculation gives

\dot{G}_{n}(β_{0}) = -\frac{1}{n} \sum_{i=1}^{n} \begin{pmatrix} g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{1} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} g^{(1)}(Z_{i}, β_{0}) \\ \vdots \\ g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{s} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} g^{(1)}(Z_{i}, β_{0}) \end{pmatrix}.

(A8)

Combining condition (C7) and the Law of Large Numbers, it follows that

\frac{1}{n} \sum_{i=1}^{n} g^{(1)}(Z_{i}, β_{0})^{T} Q_{i2} A_{i}^{-\frac{1}{2}} M_{k} A_{i}^{-\frac{1}{2}} Q_{i2}^{T} g^{(1)}(Z_{i}, β_{0}) \overset{P}{\to} D_{k}, \quad k = 1, 2, \dots, s.

(A9)

Combining this with Equation (A8), we obtain $\dot{G}_{n}(β_{0}) \overset{P}{\to} -Γ_{0}$. As demonstrated in the proof of Lemma A2,

Ω_{n}(β_{0}) = \frac{1}{n} \sum_{i=1}^{n} ζ_{i} ζ_{i}^{T} + o_{p}(1) = \frac{1}{n} \sum_{i=1}^{n} \begin{pmatrix} ζ_{i1} ζ_{i1}^{T} & \dots & ζ_{i1} ζ_{is}^{T} \\ \vdots & \ddots & \vdots \\ ζ_{is} ζ_{i1}^{T} & \dots & ζ_{is} ζ_{is}^{T} \end{pmatrix} + o_{p}(1).

(A10)

Similar to the proof of Equation (A5), we get

\frac{1}{n} \sum_{i=1}^{n} ζ_{il} ζ_{ik}^{T} \overset{P}{\to} Ω_{lk}.

(A11)

Therefore, $Ω_{n}(β_{0}) \overset{P}{\to} Ω_{0}$. □

Lemma A4. Under conditions (C1) to (C7), we have

∥\dot{M}_{n}(β_{0}) - 2 \dot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0})∥ = O_{p}(n^{-1}),

and

∥\ddot{M}_{n}(β_{0}) - 2 \dot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{G}_{n}(β_{0})∥ = o_{p}(1).

Proof. According to the definition of $M_{n}(β)$, a direct calculation gives

\dot{M}_{n}(β_{0}) = 2 \dot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) + G_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{Ω}_{n}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}).

(A12)

It follows from Lemma A2 that $G_{n}(β_{0}) = O_{p}(n^{-\frac{1}{2}})$. Further, from condition (C7) and Lemma A3, we obtain

G_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{Ω}_{n}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) = O_{p}(n^{-1}).

(A13)

Combining Equations (A12) and (A13), we obtain

\dot{M}_{n}(β_{0}) = 2 \dot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) + O_{p}(n^{-1}).

(A14)

Similarly, a direct calculation gives

\ddot{M}_{n}(β_{0}) = 2 \dot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{G}_{n}(β_{0}) + R_{n},

(A15)

where

\begin{aligned} R_{n} &= 2 \ddot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) \\ &\quad - 4 \dot{G}_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{Ω}_{n}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) \\ &\quad + 2 G_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{Ω}_{n}(β_{0}) Ω_{n}^{-1}(β_{0}) \dot{Ω}_{n}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) \\ &\quad - G_{n}^{T}(β_{0}) Ω_{n}^{-1}(β_{0}) \ddot{Ω}_{n}(β_{0}) Ω_{n}^{-1}(β_{0}) G_{n}(β_{0}) \\ &= T_{1} + T_{2} + T_{3} + T_{4}. \end{aligned}

(A16)

Combining Lemmas A2 and A3, $T_{1}$ and $T_{2}$ are both $O_{p}(n^{-\frac{1}{2}})$, while $T_{3}$ and $T_{4}$ are both $O_{p}(n^{-1})$. Thus, $R_{n} = o_{p}(1)$. □

Proof of Theorem 1.

Since $\hat{β} = \arg\min_{β} M_{n} (β)$, we have ${\dot{M}}_{n} (\hat{β}) = 0$. A Taylor expansion of ${\dot{M}}_{n}$ around $β_{0}$ then gives

{\dot{M}}_{n} (β_{0}) + {\ddot{M}}_{n} (β^{*}) (\hat{β} - β_{0}) = 0,

(A17)

where $β^{*}$ lies between $\hat{β}$ and $β_{0}$. Thus, we have

\sqrt{n} (\hat{β} - β_{0}) = - \sqrt{n} {\ddot{M}}_{n}^{- 1} (β^{*}) {\dot{M}}_{n} (β_{0}) .

(A18)

Combining Lemma A3 and Lemma A4, we can obtain

{\ddot{M}}_{n}^{- 1} (β^{*}) = {[2 {\dot{G}}_{n}^{T} (β^{*}) Ω_{n}^{- 1} (β^{*}) {\dot{G}}_{n} (β^{*}) + o_{p} (1)]}^{- 1} = \frac{1}{2} {(Γ_{0}^{T} Ω_{0}^{- 1} Γ_{0})}^{- 1} + o_{p} (1),

(A19)

and

\sqrt{n} {\dot{M}}_{n} (β_{0}) = 2 {\dot{G}}_{n}^{T} (β_{0}) Ω_{n}^{- 1} (β_{0}) \sqrt{n} G_{n} (β_{0}) + o_{p} (1) = 2 Γ_{0}^{T} Ω_{0}^{- 1} \sqrt{n} G_{n} (β_{0}) + o_{p} (1) .

(A20)

Combining Equations (A18) to (A20) and Slutsky’s Theorem, we get

\begin{aligned} \sqrt{n} (\hat{β} - β_{0}) &= - [\frac{1}{2} {(Γ_{0}^{T} Ω_{0}^{- 1} Γ_{0})}^{- 1} + o_{p} (1)] [2 Γ_{0}^{T} Ω_{0}^{- 1} \sqrt{n} G_{n} (β_{0}) + o_{p} (1)] \\ &= - {(Γ_{0}^{T} Ω_{0}^{- 1} Γ_{0})}^{- 1} Γ_{0}^{T} Ω_{0}^{- 1} \sqrt{n} G_{n} (β_{0}) + o_{p} (1) \\ &\overset{L}{\to} N (0, Σ), \end{aligned}

(A21)

where $Σ = {(Γ_{0}^{T} Ω_{0}^{- 1} Γ_{0})}^{- 1}$. The leading minus sign does not affect the limit, because the normal distribution is symmetric about zero. □
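The form of Σ can be made explicit with a short worked step. The display below is a sketch that assumes $\sqrt{n} G_{n} (β_{0}) \overset{L}{\to} N (0, Ω_{0})$, which is the kind of limit one would expect Lemma A2 to supply; the matrix $A$ is shorthand introduced here only for this calculation.

```latex
\begin{aligned}
A &= (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1} Γ_{0}^{T} Ω_{0}^{-1}, \\
A \sqrt{n} G_{n}(β_{0}) &\overset{L}{\to} N(0, A Ω_{0} A^{T}), \\
A Ω_{0} A^{T}
  &= (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1} Γ_{0}^{T} Ω_{0}^{-1} Ω_{0} Ω_{0}^{-1} Γ_{0} (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1} \\
  &= (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1} (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0}) (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1}
   = (Γ_{0}^{T} Ω_{0}^{-1} Γ_{0})^{-1} = Σ .
\end{aligned}
```

The sandwich collapses to a single inverse because the weighting matrix $Ω_{0}^{-1}$ is the inverse of the limiting covariance of $\sqrt{n} G_{n} (β_{0})$ and $Ω_{0}$ is symmetric.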

Proof of Theorem 2.

Let $τ = n^{- \frac{r}{2 r + 1}}$ and $γ = γ_{0} + τ s$. We need to prove that, for any given $ε > 0$, there exists a constant C such that

P {sup_{∥s∥ = C} Q_{n} (γ) > Q_{n} (γ_{0})} < 1 - ε .

(A22)

Let $Δ (γ) = n K^{- 1} [Q_{n} (γ) - Q_{n} (γ_{0})]$. By Taylor’s formula, we can obtain

\begin{aligned} Δ (γ) &= n K^{- 1} [Q_{n} (γ_{0} + τ s) - Q_{n} (γ_{0})] \\ &= n K^{- 1} τ s^{T} Q_{n}^{\prime} (γ_{0}) + \frac{1}{2} n K^{- 1} τ^{2} s^{T} Q_{n}^{\prime\prime} (γ_{0}) s + \frac{1}{6} n K^{- 1} τ^{3} {(\frac{\partial}{\partial γ} {s^{T} Q_{n}^{\prime\prime} (γ^{*}) s})}^{T} s \\ &= J_{1} + J_{2} + J_{3}, \end{aligned}

(A23)

where $γ^{*}$ lies between $γ$ and $γ_{0}$. Noting that $K = O (n^{\frac{1}{2 r + 1}})$, we can obtain the following from the assumed conditions and some calculations:

$J_{1} = O_{p} (n τ K^{- 1}) ∥s∥ = o_{p} (1) ∥s∥$, $J_{2} = O_{p} (n τ^{2} K^{- 1}) {∥s∥}^{2} = O_{p} (1) {∥s∥}^{2}$, and $J_{3} = O_{p} (n τ^{3} K^{- 1}) {∥s∥}^{3} = o_{p} (1) {∥s∥}^{3}$. Thus, there is a sufficiently large constant C such that, when $∥s∥ = C$, $J_{2}$ dominates $J_{1}$ and $J_{3}$; therefore, Equation (A22) holds, and there further exists a maximum point $\hat{γ}$ satisfying

∥\hat{γ} - γ_{0}∥ = O_{p} (τ) = O_{p} (n^{- \frac{r}{2 r + 1}}),

(A24)

therefore, we get

∥{\hat{α}}_{k} (\cdot) - α_{k 0} (\cdot)∥ = O_{p} (n^{- \frac{r}{2 r + 1}}), k = 1, 2, \dots, p .

(A25)

□
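The orders stated for $J_{2}$ and $J_{3}$ depend on simple exponent arithmetic. As a rough check, the sketch below computes the exponent of n in the deterministic factor $n τ^{j} K^{- 1}$, assuming $τ = n^{- r / (2 r + 1)}$ and treating $K$ as exactly proportional to $n^{1 / (2 r + 1)}$ (the proof only states $K = O (n^{1 / (2 r + 1)})$). The function name `scaling_exponent` is a hypothetical helper, not something defined in the paper.

```python
from fractions import Fraction

def scaling_exponent(j, r):
    """Exponent of n in n * tau**j / K, with tau = n**(-r/(2r+1)) and K = n**(1/(2r+1))."""
    r = Fraction(r)
    d = 2 * r + 1
    # 1 (from n) - j*r/d (from tau**j) - 1/d (from 1/K)
    return 1 - j * r / d - 1 / d

if __name__ == "__main__":
    for r in (1, 2, 3):
        print(f"r={r}: J2 factor exponent = {scaling_exponent(2, r)}, "
              f"J3 factor exponent = {scaling_exponent(3, r)}")
```

For every r > 0 the factor in $J_{2}$ has exponent 0, so it stays bounded, while the factor in $J_{3}$ has exponent $- r / (2 r + 1) < 0$ and vanishes. With r = 2, for example, the $J_{3}$ exponent is −2/5, which matches the rate $n^{- r / (2 r + 1)}$ in (A24).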

References

Li, T.; Mei, C. Estimation and inference for varying coefficient partially nonlinear models. J. Stat. Plan. Inference 2013, 143, 2023–2037.
Yu, P.; Zhu, Z.; Shi, J.; Ai, X. Robust estimation for partial functional linear regression model based on modal regression. J. Syst. Sci. Complex. 2020, 33, 527–544.
Yan, L.; Tan, X.Y.; Chen, X. Empirical likelihood for partially linear errors-in-variables models with longitudinal data. Acta Math. Appl. Sin. Engl. Ser. 2022, 38, 664–683.
Xiao, Y.; Liang, L. Robust estimation and variable selection for varying-coefficient partially nonlinear models based on modal regression. J. Korean Stat. Soc. 2022, 51, 692–715.
Zhao, P.; Zhou, X.; Wang, X.; Huang, X. A new orthogonality empirical likelihood for varying coefficient partially linear instrumental variable models with longitudinal data. Commun. Stat.-Simul. Comput. 2020, 49, 3328–3344.
Liang, K.Y.; Zeger, S.L. Longitudinal data analysis using generalized linear models. Biometrika 1986, 73, 13–22.
Xu, H.X.; Fan, G.L.; Liang, H.Y. Quantile regression for varying-coefficient partially nonlinear models with randomly truncated data. Stat. Pap. 2024, 65, 2567–2604.
Diggle, P.J. Analysis of Longitudinal Data; Oxford University Press: Oxford, UK, 2002.
Qu, A.; Lindsay, B.G.; Li, B. Improving generalised estimating equations using quadratic inference functions. Biometrika 2000, 87, 823–836.
Bai, Y.; Fung, W.K.; Zhu, Z.Y. Penalized quadratic inference functions for single-index models with longitudinal data. J. Multivar. Anal. 2009, 100, 152–161.
Tian, R.; Xue, L.; Liu, C. Penalized quadratic inference functions for semiparametric varying coefficient partially linear models with longitudinal data. J. Multivar. Anal. 2014, 132, 94–110.
Schumaker, L. Spline Functions: Basic Theory; Wiley: Hoboken, NJ, USA, 1981.
Wei, Y.; Wang, Q.; Liu, W. Model averaging for linear models with responses missing at random. Ann. Inst. Stat. Math. 2021, 73, 535–553.
Jiang, Y.; Ji, Q.; Xie, B. Robust estimation for the varying coefficient partially nonlinear models. J. Comput. Appl. Math. 2017, 326, 31–43.
Xiao, Y.T.; Chen, Z.S. Bias-corrected estimations in varying-coefficient partially nonlinear models with measurement error in the nonparametric part. J. Appl. Stat. 2018, 45, 586–603.
Yang, J.; Yang, H. Smooth-threshold estimating equations for varying coefficient partially nonlinear models based on orthogonality-projection method. J. Comput. Appl. Math. 2016, 302, 24–37.
Qian, Y.; Huang, Z. Statistical inference for a varying-coefficient partially nonlinear model with measurement errors. Stat. Methodol. 2016, 32, 122–130.
Wang, X.; Zhao, P.; Du, H. Statistical inferences for varying coefficient partially non linear model with missing covariates. Commun. Stat.-Theory Methods 2021, 50, 2599–2618.
Zhou, Y.; Mei, R.; Zhao, Y.; Hu, Z.; Zhao, M. Orthogonality-based bias-corrected empirical likelihood inference for partial linear varying coefficient EV models with longitudinal data. J. Comput. Appl. Math. 2024, 443, 115751.
Zhao, P.; Xue, L. Empirical likelihood inferences for semiparametric varying-coefficient partially linear errors-in-variables models with longitudinal data. J. Nonparametric Stat. 2009, 21, 907–923.

Figure 1. The boxplots of 1000 RMSE values for ${\hat{β}}_{1}$ under the OQIF (A) and PNLS (B) methods.

Figure 2. The boxplots of 1000 RMSE values for ${\hat{β}}_{2}$ under the OQIF (A) and PNLS (B) methods.

Figure 3. The boxplots of 1000 RMSE for $\hat{α} (U)$.

Table 1. The bias and standard deviation (SD) of ${\hat{β}}_{1}$ and ${\hat{β}}_{2}$ obtained by different methods.

Method	Parameter	Bias ($n = 100$)	SD ($n = 100$)	Bias ($n = 150$)	SD ($n = 150$)	Bias ($n = 200$)	SD ($n = 200$)
OQIF	${\hat{β}}_{1}$	0.00257	0.02012	0.00123	0.01890	0.00079	0.01568
OQIF	${\hat{β}}_{2}$	−0.00390	0.03890	−0.00235	0.03568	−0.00157	0.02946
PNLS	${\hat{β}}_{1}$	0.01457	0.03012	0.01235	0.02789	0.00988	0.02346
PNLS	${\hat{β}}_{2}$	−0.02789	0.05890	−0.02457	0.05235	−0.02012	0.04457

Table 2. Confidence interval length and coverage probability comparison.

Method	Parameter	Length ($n = 100$)	Coverage ($n = 100$)	Length ($n = 150$)	Coverage ($n = 150$)	Length ($n = 200$)	Coverage ($n = 200$)
OQIF	${\hat{β}}_{1}$	0.07671	0.943	0.07190	0.947	0.05971	0.951
OQIF	${\hat{β}}_{2}$	0.14853	0.937	0.13551	0.941	0.11329	0.948
PNLS	${\hat{β}}_{1}$	0.11329	0.911	0.10065	0.918	0.08541	0.925
PNLS	${\hat{β}}_{2}$	0.20995	0.899	0.18773	0.905	0.15946	0.912


© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
