Abstract
To address the loss of estimation efficiency caused by multicollinearity and within-subject correlation in longitudinal data for varying coefficient partially nonlinear models (VCPNLM), a method based on QR decomposition and the quadratic inference function (QIF) is proposed to obtain orthogonality-based estimators of the parametric components and the varying coefficient functions. QR decomposition removes the ill-conditioning of the design matrix, while the QIF adaptively weights the within-group correlation structure and thereby captures the complex correlation structure of longitudinal data. The theoretical analysis establishes the asymptotic properties of the estimators, and simulation experiments verify the efficiency of the proposed method.
Keywords:
varying coefficient partially nonlinear model; longitudinal data; QR decomposition; quadratic inference function
MSC:
62G05; 62G20
1. Introduction
In recent years, longitudinal data have attracted much attention due to their wide applications in domains such as biomedicine, economics, and the social sciences. Such data arise from repeatedly observing the same subject at multiple time points. As a result, the measurements of the same subject exhibit temporal or spatial dependence, so the traditional assumption of independent and identically distributed observations no longer applies. To address this issue, the VCPNLM has become a research hotspot, as it combines the interpretability of parametric models with the flexibility of nonparametric models. Li and Mei [] put forward a profile nonlinear least squares approach to estimate the parameter vector and the coefficient function vector of the VCPNLM, and further derived the asymptotic properties of the resulting estimators.
Existing studies have made important progress in model estimation. Yu et al. [] constructed robust estimators for the partial functional linear regression model based on modal regression. Yan et al. [] explored empirical likelihood inference for the partially linear errors-in-variables model with longitudinal data. Xiao and Liang [] proposed a robust two-stage method for estimation and variable selection in the VCPNLM based on modal regression. Zhao et al. [] combined orthogonal estimation techniques with empirical likelihood to develop a new orthogonality-based empirical likelihood inference method for the parametric and nonparametric components of a class of varying coefficient partially linear instrumental variable models for longitudinal data. Liang and Zeger [] applied generalized estimating equations to longitudinal data analysis to handle within-subject correlation. For the varying coefficient partially nonlinear quantile regression model with randomly truncated data, Xu et al. [] introduced a three-stage weighted estimation method. A comprehensive overview of longitudinal data analysis methods can be found in Diggle et al. [].
The VCPNLM has received widespread attention in statistical research due to its flexibility in capturing dynamic relationships between variables and adaptability to complex data structures [,,]. It integrates the advantages of varying coefficient models and partially nonlinear models, making it a powerful tool for practical applications such as biomedical research, economic forecasting, and environmental monitoring. However, the complexity of the model structure and the characteristics of real-world data pose great challenges to accurate parameter estimation, which motivates the need for further improvements in existing methods.
Despite these advances, significant challenges remain in estimating the VCPNLM. For instance, the estimation of the nonlinear parameters is easily disturbed by the nonparametric components and the longitudinal correlation structure, which reduces estimation efficiency; when there is multicollinearity among the explanatory variables, the ill-conditioning of the design matrix further decreases the stability of estimation. Such issues limit the performance of the model in practical applications.
To address the above issues, existing studies have proposed a variety of improved methods. For example, Qu et al. [] used the QIF to improve on generalized estimating equations, and the resulting estimator retains good efficiency even when the working correlation structure is misspecified. Bai et al. [] applied penalized quadratic inference functions to single-index models with longitudinal data and showed that the resulting estimators have good asymptotic properties. For semiparametric varying coefficient partially linear models, Tian et al. [] proposed a penalized QIF. Following the spline theory of Schumaker [], B-spline basis functions can be used to approximate the varying coefficient part, which improves computational efficiency. For linear models with responses missing at random, Wei et al. [] introduced a model averaging approach. Jiang et al. [] proposed an estimation method for the VCPNLM based on the exponential squared loss function. Xiao and Chen [] developed a local bias-corrected profile nonlinear least squares estimation procedure. By adopting an orthogonality-projection method, Yang and Yang [] developed smooth-threshold estimating equations for the VCPNLM. Additional studies focusing on this model include Refs. [,,,]. These investigations address various data scenarios and model settings and offer targeted estimation methods and inference strategies that supplement the research framework in this field.
This paper proposes an orthogonal estimation framework that integrates QR decomposition and the QIF. QR decomposition removes the ill-conditioning of the design matrix and improves numerical stability. The QIF method avoids the limitations of traditional generalized estimating equations by adaptively weighting the intra-group correlation structure, which enhances estimation efficiency. This study provides a new theoretical tool for longitudinal data analysis and offers a feasible solution to complex modeling problems in practical applications.
The remainder of this paper is organized as follows. Section 2 introduces the model specification and the estimation method, including the specific implementation of QR decomposition and the QIF; Section 3 establishes the asymptotic properties of the estimators; Section 4 evaluates the proposed method through simulation experiments; Appendix A contains the proofs of the main results.
2. Models and Methods
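Consider the VCPNLM introduced by Li and Mei []:
$$Y = X^{T}\alpha(U) + g(Z, \beta) + \varepsilon, \tag{1}$$
where $Y$ is the response variable, $X$, $U$, and $Z$ are covariates, $\alpha(\cdot) = (\alpha_1(\cdot), \ldots, \alpha_p(\cdot))^{T}$ are unknown smooth functions, $g(\cdot, \cdot)$ is a nonlinear function with a known form, $\beta$ denotes a q-dimensional unknown parameter vector, and $\varepsilon$ is the model error with mean zero and variance $\sigma^{2}$.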
2.1. Estimation of Parameter Vector
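Considering model (1) under longitudinal data, suppose the j-th observation of the i-th individual satisfies
$$Y_{ij} = X_{ij}^{T}\alpha(U_{ij}) + g(Z_{ij}, \beta) + \varepsilon_{ij}, \tag{2}$$
where $\varepsilon_{ij}$ has mean zero, and $X_{ij}$, $U_{ij}$, and $Z_{ij}$ are covariates.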
Based on the idea of Schumaker [], the unknown function is approximated through basis functions. The B-spline basis functions are denoted by the vector . The dimension L is defined as , where K and M represent the number of interior knots and the order of the spline, respectively. Then is approximately expressed as
Here, are the coefficient vectors of the B-spline basis functions. Model (2) is then expressed as
Define , where ⊗ is the Kronecker product, and let , , , , . Then model (4) is expressed as
We first state a fundamental result on the QR decomposition of full-column-rank matrices, which is essential for the subsequent steps.
Lemma 1
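(QR Decomposition for Full-Column-Rank Matrices). Let $A$ be an $N \times k$ matrix with full column rank k. Then, an $N \times N$ orthogonal matrix $Q$ and a $k \times k$ upper triangular matrix $R$ exist, satisfying
$$A = Q \begin{pmatrix} R \\ 0 \end{pmatrix},$$
where $0$ is the $(N-k) \times k$ zero matrix. Moreover, Q can be partitioned as $Q = (Q_1, Q_2)$, with $Q_1 \in \mathbb{R}^{N \times k}$ and $Q_2 \in \mathbb{R}^{N \times (N-k)}$, satisfying $A = Q_1 R$.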
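The role of Lemma 1 in the estimation procedure can be checked numerically. The following is a minimal NumPy sketch, not the authors' implementation: the sizes `N` and `k`, the random matrix `W` standing in for a full-column-rank design matrix, and the vectors `gamma` and `h` are all illustrative assumptions. It verifies that the trailing block of the orthogonal factor annihilates the design matrix, so that premultiplying a response by its transpose removes the linear spline part.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: N stacked observations, k design columns.
N, k = 12, 4
W = rng.normal(size=(N, k))   # stands in for a full-column-rank design matrix

# mode="complete" returns the full N x N orthogonal factor Q and an N x k factor
# whose first k rows are upper triangular and whose remaining rows are zero.
Q, R_full = np.linalg.qr(W, mode="complete")
Q1, Q2 = Q[:, :k], Q[:, k:]   # partition Q = (Q1, Q2)
R = R_full[:k, :]             # k x k upper triangular block

# W = Q1 R, and the columns of Q2 are orthogonal to the column space of W.
recon_err = np.abs(Q1 @ R - W).max()
annihilation_err = np.abs(Q2.T @ W).max()

# For y = W @ gamma + h, premultiplying by Q2.T removes the W @ gamma term,
# whatever the value of gamma is.
gamma = rng.normal(size=k)    # illustrative spline coefficients
h = rng.normal(size=N)        # stands in for the nonlinear part g(Z, beta)
y = W @ gamma + h
projection_gap = np.abs(Q2.T @ y - Q2.T @ h).max()

print(recon_err, annihilation_err, projection_gap)
```

All three printed quantities should be at the level of floating-point rounding error, which is the numerical counterpart of the orthogonality property used to separate the parametric part from the spline part.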
We now proceed with the derivation from Equation (5). Suppose that for all , the matrices have full column rank, their QR decomposition can be expressed as
where the definitions of , R, and are the same as those in Lemma 1 above. The matrix can be divided into two parts as , where is a matrix and is a matrix. Substituting this partition of into (6) yields . By the properties of orthogonal matrices, holds, and hence is obtained. Multiplying both sides of Equation (5) by , we get
Model (7) is a regression model that contains only the unknown parameter vector and no longer involves the spline coefficients. Following Liang and Zeger [], the generalized estimating equation for can be formulated as
Here, , , is the covariance matrix of , and, following Liang and Zeger [], the structure of can be expressed as , where is a diagonal matrix, is a working correlation matrix, and is a correlation parameter. Since a consistent estimator of is not always available in practice, we adopt the QIF method, which approximates the inverse of the working correlation matrix by a linear combination of several basis matrices and thereby avoids specifying the correlation structure directly. Drawing on the classic specifications proposed by Qu et al. [] and on the correlation structure of the data in this study, we select a set of basis matrices with corresponding coefficients that satisfy
Substituting (9) into (8), we obtain
Define the extended score function as follows:
Thus, the QIF for can be defined as
where . The estimator of is then obtained by minimizing the objective function :
2.2. Estimation of the Coefficient Functions
The QIF effectively handles the correlation in longitudinal data and improves estimation efficiency by constructing a set of estimating equations. Therefore, after obtaining the initial estimates of the parameters , we also use the QIF method to estimate the coefficient functions.
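As a rough illustration of how a QIF objective is assembled for a working model that is linear in its coefficients, consider the sketch below. It is written under stated assumptions and is not the authors' code: the function name `qif_linear`, the identity marginal variance, the two basis matrices (the identity and the matrix with ones on the first off-diagonals, an AR(1)-type choice in the spirit of Qu et al. []), and all simulated quantities are hypothetical. The update is a Gauss-Newton type step that treats the weighting matrix as fixed within each iteration, a common simplification rather than an exact minimization.

```python
import numpy as np

def qif_linear(W, r, basis, n_iter=25):
    """QIF-type estimate for r_i = W_i @ gamma + e_i (illustrative sketch).

    W: (n, m, d) subject-level designs, r: (n, m) responses,
    basis: list of (m, m) basis matrices M_1, ..., M_K.
    """
    n, m, d = W.shape
    # D_i stacks W_i^T M_1, ..., W_i^T M_K, giving shape (n, K*d, m).
    D = np.concatenate([np.einsum("nmd,mk->ndk", W, M) for M in basis], axis=1)
    # G is minus the derivative of the averaged extended score w.r.t. gamma.
    G = np.einsum("nqm,nmd->qd", D, W) / n
    # Start from ordinary least squares on the pooled data.
    gamma = np.linalg.lstsq(W.reshape(n * m, d), r.reshape(n * m), rcond=None)[0]

    def moments(gam):
        resid = r - W @ gam                    # (n, m) residuals
        g = np.einsum("nqm,nm->nq", D, resid)  # extended scores, one row per subject
        return g.mean(axis=0), g.T @ g / n     # averaged score and its sample covariance

    for _ in range(n_iter):
        g_bar, C = moments(gamma)
        CinvG = np.linalg.solve(C, G)
        gamma = gamma + np.linalg.solve(G.T @ CinvG, CinvG.T @ g_bar)

    g_bar, C = moments(gamma)
    q_value = n * g_bar @ np.linalg.solve(C, g_bar)  # quadratic inference function value
    return gamma, float(q_value)

# Simulated example with AR(1)-correlated errors (all values are assumptions).
rng = np.random.default_rng(1)
n, m, d, rho = 200, 4, 2, 0.6
gamma_true = np.array([1.0, -0.5])
W = rng.normal(size=(n, m, d))
corr = rho ** np.abs(np.subtract.outer(np.arange(m), np.arange(m)))
errors = rng.normal(size=(n, m)) @ np.linalg.cholesky(corr).T
r = W @ gamma_true + errors

M1 = np.eye(m)
M2 = np.eye(m, k=1) + np.eye(m, k=-1)
gamma_hat, q_value = qif_linear(W, r, [M1, M2])
print(gamma_hat, q_value)
```

In the setting of this section, `r` would play the role of the response after subtracting the fitted nonlinear part, and `W` the role of the spline design matrix; the sketch only shows the mechanics of the extended score and the quadratic form.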
Substitute the estimator of into Equation (5), resulting in
where . Assume that the covariance matrix of is and, following Liang and Zeger [], that its structure is expressed as , where and are defined as in the previous subsection. Assuming is known, we construct the estimating equation
In practice, is usually unknown. We therefore again adopt the QIF method and use to approximate , which gives
Define the extended score function as follows:
Thus, the QIF for is defined as follows:
where . The estimator of is then obtained by minimizing the objective function :
Thus, the estimate of the coefficient function can be expressed as
where is the component corresponding to the k-th coefficient function in .
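To make the spline representation of a coefficient function concrete, the sketch below evaluates a clamped B-spline basis with the Cox-de Boor recursion and forms a fitted function as the basis times a coefficient vector. It is an illustration under assumptions: the helper `bspline_basis`, the equally spaced interior knots on [0, 1], the cubic order, and the target function sin(2πu) are hypothetical choices, and the coefficients are obtained here by plain least squares rather than by the QIF. With K interior knots and order M, the basis has K + M functions.

```python
import numpy as np

def bspline_basis(u, interior_knots, order, lower=0.0, upper=1.0):
    """Evaluate all B-spline basis functions of the given order at the points u.

    Returns an array of shape (len(u), K + order), K = number of interior knots.
    """
    u = np.atleast_1d(np.asarray(u, dtype=float))
    # Clamped knot vector: each boundary knot is repeated `order` times.
    t = np.concatenate([np.full(order, lower),
                        np.asarray(interior_knots, dtype=float),
                        np.full(order, upper)])
    # Order-1 functions: indicators of the half-open intervals [t_j, t_{j+1}).
    B = np.zeros((len(u), len(t) - 1))
    for j in range(len(t) - 1):
        B[:, j] = (t[j] <= u) & (u < t[j + 1])
    # Assign the right endpoint to the last non-empty interval.
    last = np.max(np.nonzero(t < upper)[0])
    B[u == upper, :] = 0.0
    B[u == upper, last] = 1.0
    # Cox-de Boor recursion from order 2 up to the requested order.
    for m in range(2, order + 1):
        B_new = np.zeros((len(u), len(t) - m))
        for j in range(len(t) - m):
            left_den = t[j + m - 1] - t[j]
            right_den = t[j + m] - t[j + 1]
            if left_den > 0:
                B_new[:, j] += (u - t[j]) / left_den * B[:, j]
            if right_den > 0:
                B_new[:, j] += (t[j + m] - u) / right_den * B[:, j + 1]
        B = B_new
    return B

# Hypothetical setup: K = 8 interior knots, cubic splines (order 4), so L = 12.
K, order = 8, 4
grid = np.linspace(0.0, 1.0, 200)
knots = np.linspace(0.0, 1.0, K + 2)[1:-1]
B = bspline_basis(grid, knots, order)

# Least-squares spline coefficients for an illustrative target function.
target = np.sin(2.0 * np.pi * grid)
gamma_k = np.linalg.lstsq(B, target, rcond=None)[0]
alpha_hat = B @ gamma_k       # fitted coefficient function on the grid
max_err = np.abs(alpha_hat - target).max()
print(B.shape, max_err)
```

Each row of `B` sums to one, the usual partition-of-unity property of B-splines, which is a convenient sanity check on the basis construction.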
3. Main Conclusions
This section studies the asymptotic properties of the estimators and , assuming that and are the true values of and , respectively, is the true value of , and and correspond to the k-th elements of and , respectively. First, we present some common regularity conditions in longitudinal data analysis as follows:
- (C1) The support of the random variable U is bounded, and its probability density function has continuous second-order derivatives.
- (C2) The varying coefficient functions are continuously differentiable of order r on , where .
- (C3) For arbitrary Z, exhibits continuity with respect to , and has continuous partial derivatives of order r.
- (C4) holds, and there exists some satisfying .
- (C5) The covariates and are assumed to satisfy the following conditions: , ,
- (C6) Let be interior nodes on . Furthermore, let then a constant exists such that:
- (C7) Define , then we have:
Here, and are constant matrices, and denotes convergence in probability. Define and ; additionally, assume that and are both invertible.
Conditions (C1) to (C5) are standard conditions for the VCPNLM, condition (C6) indicates that is a uniform partition sequence over the interval , and condition (C7) is used in the subsequent proofs.
Theorem 1.
Suppose that conditions (C1) to (C7) hold. Then, when , we have
where the matrix , and denotes convergence in distribution.
Theorem 2.
Suppose that conditions (C1) to (C7) hold and that the number of nodes . Then, when , it follows that
where denotes the norm of a function.
4. Simulation Study
This section assesses the finite-sample performance of the proposed orthogonality estimation method based on QR decomposition and QIF in VCPNLM through a Monte Carlo simulation study. We define the following model:
Here, the covariates both follow normal distributions, and is defined as , with the parameter vector . Additionally, , the coefficient function , the error term follows an AR(1) process, and its structure is: , where .
The sample size is set to ; for the i-th subject, the number of repeated measurements is , and 1000 simulation runs are conducted for each case. The method combining QR decomposition and QIF proposed in this paper (OQIF) is compared with the profile nonlinear least squares method (PNLS) introduced by Li and Mei []. The results are summarized in Table 1 and Table 2.
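A data-generating step of this kind can be sketched as follows. The sketch is illustrative only: the functional forms of the coefficient function and of the nonlinear part, the parameter values, the correlation parameter, and the numbers of subjects and repeated measurements are all hypothetical stand-ins, not the settings used in the study. What it does reproduce is the structure of the design, namely within-subject errors following a stationary AR(1) process with unit variance.

```python
import numpy as np

def ar1_errors(n, m, rho, rng):
    """Generate n independent error vectors of length m with corr(e_j, e_k) = rho**|j-k|."""
    e = np.empty((n, m))
    e[:, 0] = rng.normal(size=n)
    for j in range(1, m):
        # Stationary AR(1) recursion with unit marginal variance.
        e[:, j] = rho * e[:, j - 1] + np.sqrt(1.0 - rho**2) * rng.normal(size=n)
    return e

def simulate_dataset(n, m, rho, beta, rng):
    """Simulate one longitudinal data set from a VCPNLM-type model (hypothetical forms)."""
    X = rng.normal(size=(n, m))
    Z = rng.normal(size=(n, m))
    U = rng.uniform(size=(n, m))
    alpha = np.sin(2.0 * np.pi * U)       # hypothetical coefficient function
    g = beta[0] * np.cos(beta[1] * Z)     # hypothetical known nonlinear form
    eps = ar1_errors(n, m, rho, rng)
    Y = X * alpha + g + eps
    return {"Y": Y, "X": X, "Z": Z, "U": U}

rng = np.random.default_rng(2024)
data = simulate_dataset(n=100, m=5, rho=0.5, beta=np.array([1.0, 2.0]), rng=rng)
print(data["Y"].shape)
```

Repeating `simulate_dataset` with fresh random draws and applying each estimator to every replicate yields the Monte Carlo samples from which summary tables are built.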
Table 1.
The bias and standard deviation of and measured using distinct methods.
Table 2.
Confidence interval length and coverage probability comparison.
As shown in Table 1, increasing the sample size reduces both the bias and the standard deviation for both methods. The OQIF method yields smaller bias and standard deviation than the PNLS method, indicating higher estimation accuracy. Although larger sample sizes improve the precision of both methods, the OQIF method retains its advantage in bias control.
The results in Table 2 show that the mean confidence interval length of the OQIF method is shorter than that of the PNLS method, indicating higher estimation efficiency. The coverage rate of the OQIF method is closer to the nominal 95% level, whereas the coverage rate of the PNLS method, although it has improved, remains below 95%.
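The summary quantities reported in tables of this kind can be computed from the Monte Carlo replicates as in the sketch below. It assumes Wald-type 95% intervals of the form estimate ± 1.96 × standard error, which is an assumption made here for illustration since the construction of the intervals is not specified in the text; the function name `mc_summary` and the synthetic replicates are likewise hypothetical.

```python
import numpy as np

def mc_summary(estimates, std_errors, true_value, z=1.959964):
    """Bias, standard deviation, mean CI length and coverage over Monte Carlo runs."""
    estimates = np.asarray(estimates, dtype=float)
    std_errors = np.asarray(std_errors, dtype=float)
    lower = estimates - z * std_errors
    upper = estimates + z * std_errors
    return {
        "bias": float(estimates.mean() - true_value),
        "sd": float(estimates.std(ddof=1)),
        "mean_ci_length": float((upper - lower).mean()),
        "coverage": float(np.mean((lower <= true_value) & (true_value <= upper))),
    }

# Synthetic replicates: 1000 estimates of a parameter whose true value is 1,
# each reported with the correct standard error 0.1.
rng = np.random.default_rng(7)
est = rng.normal(loc=1.0, scale=0.1, size=1000)
se = np.full(1000, 0.1)
summary = mc_summary(est, se, true_value=1.0)
print(summary)
```

With correctly specified standard errors the empirical coverage should be close to the nominal level, so a coverage rate clearly below 95% points to intervals that are too short for the actual variability of the estimator.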
In conclusion, as the sample size grows from 100 to 200, the OQIF method demonstrates higher estimation accuracy and stronger robustness across all evaluation indicators. These trend analyses further confirm the theoretical advantages and practical application value of the OQIF method in handling nonlinear longitudinal data.
We further conducted 1000 simulations and plotted boxplots of the 1000 RMSE values for parameters and , as shown in Figure 1 and Figure 2. From the figures, we can observe the following: The RMSE values of and for both methods decrease as the sample size increases; however, the OQIF method already exhibits good performance with a small sample size (n = 100), while the PNLS method requires a larger sample size to achieve similar accuracy. The boxplots of the OQIF method are more symmetric and compact, indicating that the distribution of its estimators is closer to the normal distribution and has better statistical properties. Additionally, the OQIF method has significantly fewer outliers than the PNLS method, demonstrating stronger robustness against abnormal data. This suggests that the overall performance of the OQIF method in parameter estimation is superior to that of the PNLS method, with more pronounced advantages especially in cases of finite samples.
Figure 1.
The boxplots of 1000 RMSE values for under the OQIF (A) and PNLS (B) methods.
Figure 2.
The boxplots of 1000 RMSE values for under the OQIF (A) and PNLS (B) methods.
Next, we estimate the coefficient function over 1000 simulation runs and draw boxplots of the RMSE of under different sample sizes, as shown in Figure 3.
Figure 3.
The boxplots of 1000 RMSE for .
As can be seen from the above boxplots, with the increase in sample size, the error distributions of both the OQIF method and the PNLS method become more concentrated, but the OQIF method has much smaller errors than the PNLS method. This further verifies the superiority of the OQIF method in the VCPNLM.
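For a function-valued estimator, one common way to define the RMSE in each simulation run is the root of the mean squared difference between the estimated and the true function over a grid of evaluation points. The sketch below uses that definition as an assumption, since the exact formula is not stated in the text; the grid, the true function, and the shifted "estimate" are hypothetical.

```python
import numpy as np

def rmse_function(alpha_hat, alpha_true, grid):
    """RMSE between two functions evaluated on a common grid of points."""
    diff = alpha_hat(grid) - alpha_true(grid)
    return float(np.sqrt(np.mean(diff**2)))

grid = np.linspace(0.0, 1.0, 101)

def alpha_true(u):
    return np.sin(2.0 * np.pi * u)     # hypothetical true coefficient function

def alpha_hat(u):
    return alpha_true(u) + 0.1         # toy estimate with a constant error of 0.1

value = rmse_function(alpha_hat, alpha_true, grid)
print(value)
```

Collecting this value over all simulation runs gives the sample of RMSE values that a boxplot summarizes.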
Author Contributions
Conceptualization, J.G., X.Z. and C.W.; methodology, J.G., X.Z. and C.W.; software, J.G., and C.W.; validation, J.G., X.Z. and C.W.; data curation, J.G. and X.Z.; writing—original draft preparation, J.G.; writing—review and editing, X.Z.; funding acquisition, X.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Natural Science Foundation of Shandong Province (Grant No. ZR2022MA065).
Data Availability Statement
The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| VCPNLM | Varying coefficient partially nonlinear model |
| QIF | Quadratic inference function |
| OQIF | The combination of QR decomposition and QIF |
| PNLS | Profile nonlinear least squares |
Appendix A
To prove Theorems 1 and 2, we first present the following lemmas.
Lemma A1.
Let , and let , be a sequence of independent and identically distributed random variables. If it satisfies conditions and , we get .
Proof.
The result has been proven in Lemma A2 of Zhao and Xue []. □
Lemma A2.
Assume that conditions (C1) to (C7) hold. Then, when , we get
where is given in condition (C7).
Proof.
Let , , , combining with , we can obtain
Let represent the k-th component of , combining with Equation (11), we can obtain
Conclusion can be derived from conditions (C2), (C5), and Corollary 6.21 of Schumaker [], further combining with Lemma A1, we can obtain
Let , combining the conditions that the expectation of is 0 and the covariance is given and , we can conclude and , in which is defined in condition (C7). We further let , we can obtain and
where combining with condition (C7) and the Law of Large Numbers, we conclude that
Furthermore, for any constant vector satisfying the condition , the expectation of is 0 and , where C is a positive constant. Therefore, satisfies the Lyapunov condition, and thus we obtain: Further combining with Equations (A4) and (A5), we can obtain
Further combining with Equations (A2) and (A3), we get
□
Lemma A3.
Under the conditions (C1) to (C7), we have and , and and are specified in condition (C7).
Proof.
Lemma A4.
Under the conditions (C1) to (C7), we can obtain
and
Proof.
According to the definition of , we can obtain the following result through calculation:
It follows from Lemma A2 that: . Further, from condition (C7) and Lemma A3, we can obtain
Combining the Equations (A12) and (A13), we can obtain
Similarly, we can obtain the following through calculation:
where,
Combining Lemmas A2 and A3, we can obtain the following: and are both , while and are both , thus, . □
Proof of Theorem 1.
Proof of Theorem 2.
Let . That is, we need to prove that for arbitrarily given , we can find a constant C ensuring the following holds:
Let . By Taylor’s Formula, we can obtain
where lies between and . Noting that , we can obtain the following through the assumption conditions and some calculations: , , . Thus, there is a sufficiently large constant C ensuring that when , can control and ; therefore, Equation (A8) holds, and there further exists a maximum point satisfying
therefore, we get
□
References
- Li, T.; Mei, C. Estimation and inference for varying coefficient partially nonlinear models. J. Stat. Plan. Inference 2013, 143, 2023–2037.
- Yu, P.; Zhu, Z.; Shi, J.; Ai, X. Robust estimation for partial functional linear regression model based on modal regression. J. Syst. Sci. Complex. 2020, 33, 527–544.
- Yan, L.; Tan, X.Y.; Chen, X. Empirical likelihood for partially linear errors-in-variables models with longitudinal data. Acta Math. Appl. Sin. Engl. Ser. 2022, 38, 664–683.
- Xiao, Y.; Liang, L. Robust estimation and variable selection for varying-coefficient partially nonlinear models based on modal regression. J. Korean Stat. Soc. 2022, 51, 692–715.
- Zhao, P.; Zhou, X.; Wang, X.; Huang, X. A new orthogonality empirical likelihood for varying coefficient partially linear instrumental variable models with longitudinal data. Commun. Stat.-Simul. Comput. 2020, 49, 3328–3344.
- Liang, K.Y.; Zeger, S.L. Longitudinal data analysis using generalized linear models. Biometrika 1986, 73, 13–22.
- Xu, H.X.; Fan, G.L.; Liang, H.Y. Quantile regression for varying-coefficient partially nonlinear models with randomly truncated data. Stat. Pap. 2024, 65, 2567–2604.
- Diggle, P.J. Analysis of Longitudinal Data; Oxford University Press: Oxford, UK, 2002.
- Qu, A.; Lindsay, B.G.; Li, B. Improving generalised estimating equations using quadratic inference functions. Biometrika 2000, 87, 823–836.
- Bai, Y.; Fung, W.K.; Zhu, Z.Y. Penalized quadratic inference functions for single-index models with longitudinal data. J. Multivar. Anal. 2009, 100, 152–161.
- Tian, R.; Xue, L.; Liu, C. Penalized quadratic inference functions for semiparametric varying coefficient partially linear models with longitudinal data. J. Multivar. Anal. 2014, 132, 94–110.
- Schumaker, L. Spline Functions: Basic Theory; Wiley: Hoboken, NJ, USA, 1981.
- Wei, Y.; Wang, Q.; Liu, W. Model averaging for linear models with responses missing at random. Ann. Inst. Stat. Math. 2021, 73, 535–553.
- Jiang, Y.; Ji, Q.; Xie, B. Robust estimation for the varying coefficient partially nonlinear models. J. Comput. Appl. Math. 2017, 326, 31–43.
- Xiao, Y.T.; Chen, Z.S. Bias-corrected estimations in varying-coefficient partially nonlinear models with measurement error in the nonparametric part. J. Appl. Stat. 2018, 45, 586–603.
- Yang, J.; Yang, H. Smooth-threshold estimating equations for varying coefficient partially nonlinear models based on orthogonality-projection method. J. Comput. Appl. Math. 2016, 302, 24–37.
- Qian, Y.; Huang, Z. Statistical inference for a varying-coefficient partially nonlinear model with measurement errors. Stat. Methodol. 2016, 32, 122–130.
- Wang, X.; Zhao, P.; Du, H. Statistical inferences for varying coefficient partially non linear model with missing covariates. Commun. Stat.-Theory Methods 2021, 50, 2599–2618.
- Zhou, Y.; Mei, R.; Zhao, Y.; Hu, Z.; Zhao, M. Orthogonality-based bias-corrected empirical likelihood inference for partial linear varying coefficient EV models with longitudinal data. J. Comput. Appl. Math. 2024, 443, 115751.
- Zhao, P.; Xue, L. Empirical likelihood inferences for semiparametric varying-coefficient partially linear errors-in-variables models with longitudinal data. J. Nonparametric Stat. 2009, 21, 907–923.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).