Next Article in Journal
A Robust Controller of a Reactor Electromicrobial System Based on a Structured Fractional Transformation for Renewable Energy
Next Article in Special Issue
Hybrid Approach of Fractional Generalized Pareto Motion and Cosine Similarity Hidden Markov Model for Solar Radiation Forecasting
Previous Article in Journal
Investigation of the Oriented Structure Characteristics of Shale Using Fractal and Structural Entropy Theory
Previous Article in Special Issue
An Adaptive Generalized Cauchy Model for Remaining Useful Life Prediction of Wind Turbine Gearboxes with Long-Range Dependence
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation and Testing of Random Effects Semiparametric Regression Model with Separable Space-Time Filters

College of Mathematics and Statistics, Fujian Normal University, Fuzhou 350117, China
*
Author to whom correspondence should be addressed.
Fractal Fract. 2022, 6(12), 735; https://doi.org/10.3390/fractalfract6120735
Submission received: 25 October 2022 / Revised: 29 November 2022 / Accepted: 6 December 2022 / Published: 11 December 2022
(This article belongs to the Special Issue New Trends in Fractional Stochastic Processes)

Abstract

:
This paper focuses on studying a random effects semiparametric regression model (RESPRM) with separable space-time filters. The model cannot only capture the linearity and nonlinearity existing in a space-time dataset, but also avoid the inefficient estimators caused by ignoring spatial correlation and serial correlation in the error term of a space-time data regression model. Its profile quasi-maximum likelihood estimators (PQMLE) for parameters and nonparametric functions, and a generalized F-test statistic for checking the existence of nonlinear relationships are constructed. The asymptotic properties of estimators and asymptotic distribution of test statistic are derived. Monte Carlo simulations imply that our estimators and test statistic have good finite sample performance. The Indonesian rice farming data are used to illustrate our methods.

1. Introduction

A space-time panel dataset is a sample collected from a number of spatial units over time periods. Such data widely exist in various research fields. It is of great theoretical significance and practical value to conduct statistical modeling and analysis on space-time panel data. There are three branches of panel data which are defined as ordinary panel data, spatial panel data, and space-time panel data given the variety heterogeneities of existing panel datasets. The regression models established on basis of ordinary panel data are called ordinary panel data regression models. Ordinary panel data regression models date back to the 1950s when the pioneering paper (Bates et al. [1]) was published. After that, the theories and methods of ordinary panel data regression models have been greatly enriched and they are widely available in many fields, such as economics, management and environmental science, see Chamberlain [2], Baltagi [3], Arellano [4], Baldev and Baltagi [5], and Hsiao [6], among others. Panel data spatial regression models generated from the turn of this century when spatial econometrics literature has exhibited a particular interest in the specification and estimation of econometric relationships based on spatial panel data (Elhorst [7]). Panel data spatial regression models deal with the spatial correlations between different individual units in panel data models and their theories have been well developed and widely used, see Druska and Horrace [8]; Egger, Pfaffermayr, and Winner [9]; Kapoor, Kelejian, and Prucha [10]; Lee and Yu [11], etc.
Two problems hampering when modeling with space-time panel data are serial correlation and spatial correlation. Serial correlation lies between the observations of each spatial unit over time, and it is the domain of the voluminous time-series econometrics literature. Spatial correlation lies between the observations of the spatial units at each time period, and it is of the spatial econometrics literature, a subfield of econometrics for dealing with spatial interaction effects among geographical units such as individuals, firms, governments, etc. In order to overcome the defects that the panel data spatial regression models do not account for serial correlation and the ordinary panel data regression models account for neither spatial correlation nor serial correlation, people have tried to combine serial correlation and spatial correlations, and established panel data paramertic regression models with separable/nonseparable space-time filters, which are used to do researches on estimation, testing and empirical analysis. Elhorst [12] developed ML method of a panel data linear regression model with nonseparable space-time filters, while did not establish asymptotic properties of the estimators. Parent and LeSage [13] explored Markov Chain Monte Carlo methods of random effects panel data linear regression models with separable space-time filters, and the performance of the method was demonstrated by both Monte Carlo simulations and an applied illustration. Lee and Yu [14] studied MLE and its asymptotic properties of fixed and random effects panel data parametric regression models with separable space-time filters. Lee and Yu [15] provided QMLE and its asymptotic properties for a fixed effects panel data linear regression model with disturbances contained both separable space-time filters and nonseparable space-time filters. Baltagi et al. [16] derived several Lagrange Multiplier tests for panel data linear regression model with separable space-time fliters containing random effects. Cohen and Paul [17] analysed the influencing factors of public infrastructure investment on the costs and productivity of private enterprises on basis of 1982–1996 state-level U.S. manufacturing data by panel data parametric regression model with separable space-time filters.
All literature on panel data regression models with separable/nonseparable space-time filters mentioned above focuses on panel data parametric regression models. Although the theories and applications of these models have been well developed, they are often unrealistic in real situations for the reason that they fail to capture complex structure (e.g., nonlinearity) owing to lacking of flexibility. Moreover, the form of parametric regression models may be misspecified, and estimators based on misspecified models are able to cause inconsistency and even erroneous conclusions. Driven by these reasons, Zhao et al. [18] constructed semiparametric minimum average variance estimation method, and proposed a F-test statistic of partially linear single-index panel regression model with separable space-time filters, then proved the asymptotic properties of estimators and test statistic. Bai, Hu, and You [19] established weighted semiparametric least squares and weighted polynomial spline series estimation method for parametric and nonparametric component, respectively, of panel data partially linear varying-coefficient regression model with separable space-time filters, and then proved their asymptotic normalities.
In this paper, we study estimation and testing of random effects semiparametric regression model (RESPRM) with separable space-time filters. By allowing a nonparametric component in parametric regression model with separable space-time filters, it can simultaneously capture the linear and nonlinear effects of covariates, spatial correlation of error structure, serial correlation of remainder error structure, and individual random effects. To the best of our knowledge, there is no related literature on this model. In this paper, we aim to study its profile quasi-maximum likelihood estimation (PQMLE) and hypothesis test methods, and then conduct systematic studies of the asymptotic properties and small sample performance for estimators and test statistic. Furthermore, we illustrate the proposed estimation and testing methods by using a real dataset.
The remainder of this paper is organized as follows. Section 2 introduces the RESPRM with separable space-time filters, and estimators for the model and test statistic for nonparametric component are constructed. Section 3 mainly provides asymptotic properties of estimators, asymptotic distribution of test statistic and conditional assumptions. Section 4 presents the finite sample performance of estimates and F-test statistic through Monte Carlo simulations. Section 5 illustrates the proposed methods by an application of Indonesian rice farming data. Conclusions are summarized in Section 6. The proofs of some important lemmas and theorems are given in Appendix A.

2. The Model, Estimation, and Testing

2.1. The Model

The RESPRM with separable space-time filters can be specified as:
y i t = x i t β + m ( z i t ) + b i + ε i t ,
ε i t = ρ j = 1 N w i j ε j t + ν i t ,
ν i t = λ ν i , t 1 + e i t , 1 i N , 1 < t T ,
where y i t is an observation of the response variable for the ith spatial unit at the tth time period, x i t = ( x i t 1 , x i t 2 , , x i t p ) , and z i t = ( z i t 1 , z i t 2 , , z i t q ) are observations of p × 1 and q × 1 vectors of covariates, respectively, b i is an individual random effect which is assumed to be I I D ( 0 , σ b 2 ) , ε i t is an error term, ν i t is a remainder error term which is weakly stationary sequence, e i t is a random error term which is assumed to be I I D ( 0 , σ e 2 ) , β = ( β 1 , β 2 , , β p ) is a regression coefficient vector of x i t , m ( · ) is an unknown link function, w i j is the ( i , j ) th element of spatial weights matrix, ρ and λ are spatial correlation and series correlation coefficients, respectively. Furthermore, we denote θ 0 = ( ζ 0 , β 0 ) , ζ 0 = ( ρ 0 , λ 0 , σ b 0 2 , σ e 0 2 ) as the true parameter vector of θ = ( ζ , β ) and ζ = ( ρ , λ , σ b 2 , σ e 2 ) , respectively, and m 0 ( z ) as the true value of nonparametric function of m ( z ) .
We find that separable space-time filters in our model result in a trade-off between the spatial correlation and serial correlation coefficients. According to Shi and Lee [20], it is not hard to obtain that the stationarity requires not only | ρ | < 1 , | λ | < 1 , but also | λ | + | ρ | + | ρ λ | < 1 , i.e., parameter space Λ of λ and ρ , see Figure 1.

2.2. Estimation

Because that random effect b i could be regarded as time-invariant permanent spillovers (see Baltagi, Fingleton, and Pirotte [21]), the model (1) can be written in matrix form
Y = X β + M + η ,
where Y = ( y 11 , , y N T ) , X = ( x 11 , , x N T ) , M = ( m ( z 11 ) , , m ( z N T ) ) , b = ( b 1 , , b N ) , ν = ( ν 11 , , ν N T ) , η = l T b + ( I T B 1 ) ν , l T is a T × 1 vector of ones, B = I N ρ W , I N is an N × N identity matrix, W is an N × N spatial weights matrix. Then we obtain the variance matrix of transformed error structure η as follows
Ω = E ( η η ) = σ b 2 ( l T l T I N ) + σ e 2 [ V λ ( B 1 B 1 ) ] ,
Ω 1 = 1 d 2 ( 1 λ 2 ) V λ 1 l T l T V λ 1 Z 0 + [ V λ 1 1 d 2 ( 1 λ 2 ) V λ 1 l T l T V λ 1 ] ( 1 σ b 2 B B ) ,
| Ω | = | V λ | N | σ e 2 B 1 B 1 | T 1 | Z 0 1 | ,
where
V λ = 1 1 λ 2 1 λ λ T 1 λ 1 λ T 2 λ T 1 λ T 2 1 ,
Z 0 = [ d 2 ( 1 λ 2 ) σ b 2 I N + σ e 2 B 1 B 1 ] 1 , d 2 = l T λ l T λ = 1 + λ 1 λ + ( T 1 ) and l T λ = ( 1 + λ 1 λ , 1 , , 1 ) . Therefore, the log-likelihood function of the new transformed model (4) is defined in (5)
l n L ( θ ) = N T 2 l n ( 2 π ) 1 2 l n | Ω | 1 2 ξ ( θ ) Ω 1 ξ ( θ ) ,
where ξ ( θ ) = Y M X β .
It is difficult to obtain the QMLE by maximizing (5) because that M * is unknown. Thus, we will combine PQML method and Working Independence theory (Cai [22]) to estimate unknown parameters and nonparametric function of the model. Working independence assumption is often used to deal with nonparametric/semiparametric regression models which exist correlation structures. By ignoring the correlation structure entirely, and pretending as if the data within a cluster were independent, the asymptotically most efficient estimator of nonparametric function is obtained (Lin and Carroll [23]). It has been widely used in various correlation structures of nonparametric/semiparametric models, such as Fan et al. [24], Fan and Zhang [25], and Tang and Liu [26], among others.
The estimation steps are as follows:
Step 1. By supposing that β is known, the initial estimator of m ( z ) can be obtained. Let
K H ( z ) = d i a g ( k H ( z 11 z ) , , k H ( z 1 T z ) , , k H ( z N 1 z ) , , k H ( z N T z ) ) ,
Z ( z ) = ( Z 11 ( z ) , , Z 1 T ( z ) , , Z N 1 ( z ) , , Z N T ( z ) ) ,
where k H ( z ) = | H | 1 k ( H 1 z ) , k ( · ) is q × 1 kernel function, | H | is the determinant of H = d i a g ( h 1 , , h q ) , Z i t ( z ) = [ 1 , H 1 ( z i t z ) ] . In addition, denote m ^ I N ( z ) = a ^ 1 , then we have δ ^ = ( a ^ 1 , ( H a ^ 2 ) ) and
δ ^ = arg min a 1 , a 2 1 N T [ Y ˜ Z ( z ) δ ] K H ( z ) [ Y ˜ Z ( z ) δ ] ,
where Y ˜ = ( y ˜ 11 , , y ˜ 1 T , , y ˜ N 1 , , y ˜ N T ) , y ˜ i t = y i t x i t β , a 2 is the first derivative of m ( z ) . Let S ( z ) = [ Z ( z ) K H ( z ) Z ( z ) ] 1 Z ( z ) K H ( z ) , then δ ^ = S ( z ) Y ˜ and
m ^ I N ( z ) = s ( z ) Y ˜ ,
where s ( z ) = e 0 S ( z ) , e 0 = ( 1 , 0 , , 0 ) is a ( q + 1 ) × 1 vector.
Step 2. Substituting (6) into (5), the approximate log-likelihood function is given by
l n L ( θ , m ^ I N ( z ) ) = N T 2 l n ( 2 π ) 1 2 l n | Ω | 1 2 ( Y X β ) ( I S ) Ω 1 ( I S ) ( Y X β ) .
By maximizing (7), the initial estimators of β can be obtained as follows
β ^ I N = [ X ( I S ) Ω 1 ( I S ) X ] 1 X ( I S ) Ω 1 ( I S ) Y ,
Step 3. Substituting β ^ I N into (7), we get the concentrated log-likelihood function of ρ and λ as
ζ ^ = arg max ζ l n L ( ζ ) .
Step 4. Substituting (9) into (8), the ultimate estimator of β ^ is obtained, and further substituting β ^ into (6), the ultimate estimator of m ^ ( z ) is obtained.

2.3. Testing

By allowing nonparametric component in a parametric regression model with separable space-time filters, it provides us the ability to explore potential nonlinear relationship lies between covariates and response variable. In order to check whether the parametric specification is correct, it is necessary to do the following hypothesis test
H 0 : m ( z i t ) = γ 0 + j = 1 q γ j z i t j H 1 : m ( z i t ) γ 0 + j = 1 q γ j z i t j ,
where γ 0 , γ q are unknown parameters. Following Fan, Zhang, and Zhang [27], we obtain m ^ ( z i t ) by PQML method, and m ˜ ( z i t ) = γ ˜ 0 + j = 1 q γ ˜ j z i t j is OLS estimator of the intercept and slopes in the linear regression of y i t x i t β ^ , β ^ is also obtained by PQML method. The resulting residual sums of squares under the null and alternative hypotheses, respectively, are as follows
R S S ( H 0 ) = i N t = 1 T [ y i t m ˜ ( z i t ) x i t β ^ ] 2 , R S S ( H 1 ) = i N t = 1 T [ y i t m ^ ( z i t ) x i t β ^ ] 2 .
To test the null hypothesis, we define generalized F-test statistic as follows:
F N T = N T 2 R S S ( H 0 ) R S S ( H 1 ) R S S ( H 1 ) .
The asymptotic distribution of F N T is given in Section 3.

3. Asymptotic Property

To prove the asymptotic properties of PQMLE and asymptotic distribution of generalized F-test statistic proposed in Section 2, we need to establish some useful assumptions and lemmas.

3.1. Assumptions

In order to provide a rigorous theoretical analysis, we make the following assumptions.
Assumption 1.
(i) { x i , z i } i = 1 , t = 1 N , T are nonstochastic variables and uniformly bounded (UB) in X × Z .
(ii) 
There is a positive density function f t ( · ) such that l i m N 1 N i = 1 N υ ( z i ) = υ ( z ) f t ( z ) d z for any bounded continuous function υ ( · ) , and f t ( · ) is bounded away from zero on Z .
(iii) 
e i t are iid and E ( | e i t | 4 ) < .
Assumption 2.
(i)W is a row normalized and prespecified spatial weights matrix.
(ii) 
B is nonsingular.
(iii) 
B and B 1 are UB in both row and column sums in absolute value.
Assumption 3.
As N , H 0 , N | H | 2 , H 4 | H | 1 0 and N | H | H 4 c [ 0 , ) , where H = t r ( H H ) 1 2 = { i j | h i j | 2 } 1 2 is the Euclidean norm of matrix H, and | H | is the determinant of matrix H.
Assumption 4.
The kernel function K ( · ) is a continuous density with compact support on R q , all odd order moments of K vanish. Further, K ( · ) is bounded, u 3 K ( u ) and u 3 K ( u ) are bounded and u 4 K ( u ) d u < .
Assumption 5.
For any ( ρ , λ ) Λ , we require that
l i m N , T [ 1 N T l n | Ω 0 | + 1 1 N T l n | Ω | t r ( Ω 1 Ω 0 ) ] 0 ,
where
| Ω 0 | = | V λ 0 | N | σ e 0 2 B 0 1 B 0 1 | T 1 | Z 00 1 | .
V λ 0 = 1 1 λ 0 2 1 λ 0 λ 0 T 1 λ 0 1 λ 0 T 2 λ 0 T 1 λ 0 T 2 1 ,
Z 00 = [ d 2 ( 1 λ 0 2 ) σ b 0 2 I N + σ e 0 2 B 1 B 0 1 ] 1 , d 0 2 = l T λ 0 l T λ 0 = 1 + λ 0 1 λ 0 + ( T 1 ) and
l T λ 0 = ( 1 + λ 0 1 λ 0 , 1 , , 1 ) .
Assumption 6.
L 11 ( L 44 L 22 L 24 2 ) L 14 ( L 14 L 22 + 2 L 12 L 24 ) > 0 , where L 11 = 2 t r [ ( B 1 ( ρ 0 ) W ) 2 ] N , L 44 = 1 2 σ e 0 4 , L 22 = t r ( C ¨ 0 C 0 1 ) + t r ( C 0 1 C ˙ 0 C 0 ˙ C 0 1 ) T + 1 3 λ 0 2 T ( 1 λ 0 2 ) , L 14 = t r ( W B 0 1 ) N σ e 0 2 , L 24 = t r ( C 0 1 C ˙ 0 ) T σ e 0 2 , L 12 = 2 N T t r ( C 0 1 C ˙ W B 0 1 ) .
Remark: Assumption 1 concerns the features of covariates, random error term, and density function for the model (1). Assumption 2 provides the basic features of spatial weight matrix, it parallels Assumption 3 of Su and Jin [28] and Assumption 5 of Lee [29]. The assumption that B and B 1 are UB limits the spatial correlation to a manageable degree. Assumption 3 concerns the bandwidth sequence, it parallels Assumption 4 of Hamilton and Truong [30], and Assumption 7 of Ullah and Su [31]. Assumption 4 concerns the kernel function. Assumptions 5 and 6 are necessary for the consistency and asymptotic properties of the estimators.

3.2. Asymptotic Properties

From the model (4), we obtain the reduced form equation of Y as
Y = X β 0 + M 0 + η 0 .
Let S = ( s 11 , , s N T ) , where s i t = s ( z i t ) . Denote a typical entry of s ( z ) by
s ( z i t , z ) = e [ Z ( z ) K H ( z ) Z ( z ) ] 1 Z i t ( z ) K H ( z i t z ) , where Z i t ( z ) is a typical column of Z ( z ) .
Lemma 1.
Suppose that Assumptions 1–5 hold, then we have
S = O p ( N 1 / 2 | H | 1 / 2 ) .
The proof is given by Hamilton and Truong [30].
Lemma 2.
Suppose that Assumptions 1–5 hold, then we have
(i) 
s ( z i t , z ) = N 1 K H ( z i t z ) f ¯ ( z ) { 1 + o p ( 1 ) } , where f ¯ ( z ) = t = 1 T f t ( z ) .
(ii) 
There is a positive constant c such that l i m N P { N 1 [ Z ( z ) K H ( z ) Z ( z ) ] i j c for z c f , i , j = 1 , , q + 1 } = 1 .
(iii) 
There exists a positive constant c such that
l i m N P { s u p z m a x 1 i N , 1 t T | s ( z i t , z ) | c ( N T ) 1 | H | 1 } = 1 .
The proofs is given by Ullah and Su [31].
Lemma 3.
Suppose that Assumptions 1–5 hold, then we have
( I S ) M = O p ( H 2 ) .
The proof is given by Hamilton and Truong [30].
With the above lemmas, we then state main results as follows. Their detail proofs can be found in the Appendix A.
Theorem 1.
Suppose that Assumptions 1–5 hold, then we have θ ^ θ 0 = o p ( 1 ) .
Theorem 2.
Suppose that Assumptions 1–6 hold, then we have
N T ( θ ^ θ 0 ) D N ( 0 , Σ θ 0 1 + Σ θ 0 1 Ω θ Σ θ 0 1 ) ,
where “ D ” means convergence in distribution, Σ θ 0 = E [ 1 N T 2 l n L ( θ 0 ) θ θ ] is the average Hessian matrix (information matrix when e and b obey normal distributions), Ω θ = E ( Ω θ N T ) and Ω θ N T = 1 N T l n L ( θ 0 ) θ · 1 N T l n L ( θ 0 ) θ + 1 N T 2 l n L ( θ 0 ) θ θ .
Theorem 3.
Suppose that Assumptions 1–4 hold, then we have
N | H | δ ^ δ Q 1 f ¯ ( z ) 2 ( u u K ( u ) d u H m ¨ ( z ) H ) 0 D N ( 0 , σ 2 ( δ ) ) ,
where m ¨ ( z ) is the second derivative of m ( z ) , σ 2 ( δ ) = σ e 2 { S ( z ) D I ( C 0 C 0 B 0 B 0 ) 1 D I S ( z ) , D I = I D U ˜ ^ 1 U ˜ ^ ( C ^ B ^ B ^ ) ( I S ) and D U ˜ ^ = U ˜ ^ ( I T B ^ B ^ ) U ˜ ^ , Q = f ¯ ( z ) 1 0 0 u u K ( u ) d u , f ¯ ( z ) = t = 1 T f t ( z ) .
Theorem 4.
Suppose that Assumptions 1–4 hold, then we have
r k F N T D χ d f 2 ,
where r k = K ( 0 ) 1 2 K 2 ( z ) d u ( K ( z ) 1 2 K K ( z ) ) 2 d u , c k = K ( 0 ) 1 2 K 2 ( z ) d u , K ( z ) = d i a g ( H 1 k ( H 1 ( z 11 z ) ) , , H 1 k ( H 1 ( z N T z ) ) ) , d f = r k c k | Z | / | H | , Z is the support of z i t and H is a bandwidth sequence, besides, note that when K ( 0 ) = m a x K ( z ) , we get K ( 0 ) > K 2 2 , c k > 1 2 K ( 0 ) and hence r k > 0 .

4. Monte Carlo Simulation

4.1. Performance of the Estimates

In this section, we report the results of a small-scale Monte Carlo study which is aimed to evaluate the finite sample performance of PQMLEs. We use mean square error ( M S E ), standard deviation ( S D ), and mean absolute deviation error ( M A D E ) to measure the parametric and nonparametric estimation performance, respectively, where
MADE j = Q 1 q = 1 Q | m ^ ( z q ) m ( z q ) | , j = 1 , 2 , , m c n ,
m c n = 2000 , { z q } q = 1 Q are Q fixed grid points in support set of z. Furthermore, we apply the rule of thumb method of Mack and Silverman [32] to choose the optimal width and let K ( u ) = ( 3 4 5 ) ( 1 1 5 u 2 ) 1 ( u 2 5 ) (see Su [33]).
We use data generated from the following model
y i t = x i t β + m ( z i t ) + b i + ε i t , ε i t = ρ j = 1 N w i j ε j t + ν i t , ν i t = λ ν i , t 1 + e i t ,
where x i t U [ 2 , 2 ] , z i t U [ 3 , 3 ] , e i t i . i . d . N ( 0 , 1 ) , b i i . i . d . N ( 0 , 1 ) , m ( z i t ) = 0.3 z i t 2 , β = 1 , ρ = 0.2 , 0.4 , and λ = 0.2 , 0.4 respectively. In addition, we defined weights matrix as Rook matrix (see Anselin [34]). The simulation results of cases N = 50 , 100 , 200 , and T = 10 , 20 are presented in Table 1, Table 2, Table 3 and Table 4.
It is obvious from Table 1, Table 2, Table 3 and Table 4 that: (1) the M S E s of all parametric estimators are fairly small, which indicates the parametric estimators are approximately unbiased; (2) for fixed T, when N increases, the M S E of ρ ^ , λ ^ , β ^ , σ ^ b 2 and σ ^ e 2 decrease rapidly. For fixed N, when T increases, the behavior of parametric estimates is similar to the case where N changes with the fixed T. (3) The S D s and M A D E s of m ^ ( z ) decrease as T or N increases. It presents that the nonparametric estimates are convergent.
Figure 2 shows the fitting image and 95% confidence intervals of m ^ ( z ) under ρ = 0.2 and λ = 0.2 , respectively, where the black short dashed line is the average fit over 2000 simulations m ^ ( z ) by PQML method, the red solid line is the true values of m ( z ) , and the two black long dashed lines are the corresponding 95% confidence bands. By observing subgraphs of Figure 2, we know that shorter dotted line is fairly close to solid line and confidence band is narrow. These mean that nonparametric estimation procedure performances well under finite sample. To save space, we do not show the others cases under ρ = 0.2 with λ = 0.4 , ρ = 0.4 with λ = 0.2 and ρ = 0.4 with λ = 0.4 of different N and T because they are similar with the case ρ = 0.2 with λ = 0.2 .

4.2. Comparison of Results of Different Models

In this section, we will investigate the necessity of including nonparametric components, spatial correlation, and serial correlation if the real model is model (12). By deleting spatial correlation, serial correlation, and spatiotemporal correlations, respectively, in the model (12), we get the following three deleted models:
y i t = x i t β + m ( z i t ) + b i + ν i t , ν i t = λ ν i , t 1 + e i t ,
y i t = x i t β + m ( z i t ) + b i + ε i t , ε i t = ρ j = 1 N w i j ε j t + e i t ,
y i t = x i t β + m ( z i t ) + b i + e i t ,
where all variables in above models are the same as the model (12). To save space, we only study the case that ρ = 0.4 and λ = 0.4 . In addition, we set N = 50 , 100 , T = 5 and m c n = 2000 . By using PQML method in Section 2, the experimental results are presented in Table 5.
Table 5 reports Means, MSEs and G R 1 s of parametric estimates in the models (12)–(15), where G R 1 is growth rate of MSE on the basis of that in model (12). From Table 5, we get the following conclusions: (1) the means of parametric estimates in the real model get closer to the true values as the sample size increases. However, we find that the means of estimates of σ ^ e 2 and σ ^ b 2 in model (13), σ ^ e 2 in the models (14) and (15) are very unstable, they do not converge to true values, respectively. Although the means of other parametric estimates get close to true values as the sample size increases, they still far away from true values and do not converge to true values. (2) In comparison with the real model, the MSEs of each parameter estimate in the deleted models increase. For most parametric estimates, they decrease with the increase in sample size except for λ ^ and σ ^ b 2 in the model (13), ρ ^ , σ ^ e 2 and σ ^ b 2 in the model (14), σ ^ e 2 in the model (15). (3) The G R 1 s of all parametric estimates are large, they do not decrease with the increase in sample size for some parametric estimates except for β ^ in the model (13)–(15) and σ ^ e 2 in the model (13).
Table 6 reports the medians and SDs of MADEs, G R 2 s, and G R 3 s for nonparametric estimates of the models (12)–(15), where G R 2 and G R 3 are growth rates of median and SD, respectively, on the basis of that in the model (12). It can be seen from Table 6 that, in comparison with the real model, the values of medians and SDs of MADEs for nonparametric estimates of the models (13)–(15) increase rapidly, and do not decrease with the increase in sample size.
The above facts reveal that the efficiency of both parametric and nonparametric estimates depends on the correct specification of errors in (1)–(3), and violation of which can lead to inconsistency.

4.3. Performance of the Test

In this section, we aim to evaluate the power of hypothesis test proposed in Section 2.3. Consider the model (12), and the settings of x i t , b i , w i j and e i t are the same as those in model (12). In addition, we only study the case that ρ = 0.2 and λ = 0.2 . Then, the corresponding hypothesis is obtained as follows:
H 0 : m ( z i t ) = γ 0 + γ 1 z i t v e r s u s H 1 : m ( z i t ) γ 0 + γ 1 z i t ,
where m ( z i t ) = η s i n ( π ( z i t 0.5 ) ) + 0.8 z i t + 0.6 , η are values consisting of ( 0 , 0.05 , 0.1 , 0.2 , 0.5 ) and z i t U [ 0.2 , 1.6 ] . In particular, when η = 0 , the above null hypothesis is exactly the true nonparametric function. As the value of η increases, the true nonparametric function is increasingly farther from the null hypothesis. Figure 3 presents the true nonparametric functions with different η -values. In this section, we conduct simulations that N = 100, T = 8, 16 under significant levels α = 0.01 and 0.05, respectively. For each case, there are 1000 repetitions. Then, the powers of the hypothesis test are shown in Figure 4.
We get from the Figure 4 and Table 7 that: (1) while η = 0, the power of the test is very close to the significance level. As the value of η increases, the power of the test statistic increases considerably, which indicates that our test statistic is sensitive to the alternative hypothesis of the proposed test problem. (2) While the number of spatial units is the same, the power of F-test with T = 16 is higher than that with T = 8. This shows that the F-test statistic performance get better as sample size increases.

5. Real Data Analysis

In this section, we will illustrate the prescribed estimation and testing methods by Indonesian rice farming dataset with N = 171 and T = 6 which is a quintessential example of large N and small T in the stochastic frontier literature(see detail for Feng and Horrace [35]). This dataset has five variables, one response variable and four covariates. The response variable is natural logarithm of output of Indonesian rice farming, and the covariates include high, mixed, seed, and land, which are defined in Table 8. The dataset is from the agricultural economic research center of the Ministry of agriculture of Indonesia and compiled by the Agro Economic Survey. Based on the panel data of related variables of 171 farms over 6 growing seasons (three wet and three dry seasons), we will explore the influencing factors of output of Indonesian rice farming by RESPRM with separable space-time filters.
In order to take the proposed RESPRM with separable space-time filters to Indonesian rice farming dataset, we apply the F-test statistic proposed in Section 2.3 to verify whether or not a nonlinear relationship between covariates and response variable exists. The test results are given in Table 9. From Table 9, we find that land (other covariates) has (have) significant nonlinear (linear) relationship with natural logarithm of output of Indonesian rice farming at significant level α = 0.01 .
Therefore, we consider the model (1)–(3), and set 1 i 171 , 1 t 6 , y i t is the ith observation of l n ( r i c e ) at tth growing season, x i t are the ith observation of high, mixed, and seed at tth growing season, respectively, z i t is land for the ith observation at tth growing season. w i j equals 1 if farms i and j are in the same village, equals 0 otherwise (see detail for Druska and Horrace [8]).
The parametric estimates results of the model (1)–(3) for fitting Indonesia rice farming data are described in Table 10. Via Table 10, we find that: (1) β i ( i = 1 , 2 , 3 ) > 0 show expectation of dummy variables for high, mixed, and seed have promotional effects on output of rice. (2) σ e 2 = 2.4049 presents that the output of rice in different farm is relatively unstable and is affected by external fluctuations.
Figure 5 shows the results of estimation and corresponding 95% confidence intervals of m ( z ) , where the blue solid line is the average fit over 500 simulations, the red and green dotted lines are the corresponding 95% confidence bands. From Figure 5, we know that land has obvious nonlinear effects on output of rice, and output of rice increases with the increase in land.

6. Conclusions

In this paper, we study PQMLE and hypothesis test for RESPRM with separable space-time filters. The proposed model can simultaneously capture linear and nonlinear effects of covariates, spatial correlation of error structure, serial correlation of remainder error structure, and individual random effects. With the given conditional assumptions, asymptotic properties of PQMLE and asymptotic distribution of nonparametric components are proved. Monte Carlo simulations are applied to investigate the performance of estimators and test statistic under finite samples, and the results show that proposed estimators and test statistic are well-behaved under finite samples, and consistency of parametric estimates is hard to guarantee if spatial correlation and series correlation in the real model are ignored in some cases. In addition, the practicability of the proposed methods are also assessed by a real dataset.
This paper focuses only on RESPRM with separable space-time filters but does not account for spatially lagged response variables. How to extend our proposed estimation and testing methods to a random effects spatial lag semiparametric regression model with separable space-time filters is an open problem. In addition, we may combine generalized moment estimation method, quantile regression method and local linear method to estimate parametric and nonparametric components of the random effects spatial lag semiparametric regression model with separable space-time filters and RESPRM with separable space-time filters.

Author Contributions

Formal analysis, S.L. and B.L.; Methodology, S.L. and J.C.; Software and Writing-original draft, S.L.; Supervision, Writing—review & editing and Funding acquisition, J.C.; Data curation, B.L. All authors have read and agreed to the published version of manuscript.

Funding

This work is supported by National Social Science Fund of China (22BTJ024) and Natural Science Foundation of Fujian Province (2020J01170, 2022J01193).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

This section aims to prove some lemmas and theorems given by Section 3, before that, we provide two frequently used evident facts.
Fact 1. If the row and column sums of the n × n matrices B 1 n and B 2 n are UB in absolute value, then the row and column sums of B 1 n B 2 n are also UB in absolute value.
Fact 2. If the row (resp. column) sums of B 1 n are UB in absolute value and B 2 n is a conformable matrix whose elements are uniformly O ( o n ) , then so are the elements of B 1 n B 2 n (resp. B 2 n B 1 n ).
Proof of Theorem 1. 
Define Q N T = E [ l n L ( θ ) ] . By direct calculation, we have
Q N T ( θ ) = 1 2 N T l n [ ( 2 π ) ] 1 2 l n | Ω | 1 2 E [ ξ ( θ ) Ω 1 ξ ( θ ) ] ,
and
l n L ( θ ) = 1 2 N T l n [ ( 2 π ) ] 1 2 l n | Ω | 1 2 ξ ( θ ) Ω 1 ξ ( θ ) .
To show the consistency of θ ^ , we follow Lee [29] by showing the uniform convergence of 1 N T [ l n L ( θ ) Q N T ( θ ) ] to zero on a compact convex parameter space Λ . By comparison with Lee [29], the difference is that our model setting is more complex. By White [36] (Theorem 3.4), it needs only to show
1 N T [ l n L ( θ ) Q N T ( θ ) ] = o p ( 1 ) .
and
l i m N 1 N T [ Q N T ( θ ) Q N T ( θ 0 ) ] < 0 .
We first prove (A3). By direct calculation, we know that
ξ ( θ ) = X β 0 + M 0 ( X β + M ) + ξ ,
and
1 N T [ l n L ( θ ) Q N T ( θ ) ] = 1 2 N T { ξ ( θ ) Ω 1 ξ ( θ ) E [ ξ ( θ ) Ω 1 ξ ( θ ) ] } = 1 2 N T ξ Ω 1 ξ p ( ζ ) + 1 N T ξ Ω 1 ( X β 0 + M 0 ) ,
where p ( ζ ) = 1 N T t r ( Ω 1 Ω 0 ) . As Ω 1 is UB, we know 1 N T ξ Ω 1 ξ p ( ζ ) = o p ( 1 ) , and as 1 N T ξ Ω 1 ( X β 0 + M 0 ) P 0 , that is uniform convergence is proved.
We then prove (A4). Recall (A1) and Q N T ( θ 0 ) = 1 2 N T l n [ ( 2 π ) ] 1 2 l n | Ω 0 | 1 2 . By direct calculation, we know
E [ 1 N T ξ ( θ ) Ω 1 ξ ( θ ) ] = 1 N T [ X β 0 + M 0 ( X β + M ) ] Ω 1 [ X β 0 + M 0 ( X β + M ) ] + 1 N T t r ( Ω 1 Ω 0 ) ,
then
Q N T ( θ ) Q N T ( θ 0 ) = 1 2 N T l n | Ω | 1 2 N T t r ( Ω 1 Ω 0 ) 1 2 N T H N T ( 1 2 N T l n | Ω 0 | 1 2 ) = T 1 N T T 2 N T ,
where T 1 N T = 1 2 N T l n | Ω | 1 2 N T t r ( Ω 1 Ω 0 ) [ 1 2 N T l n | Ω 0 | 1 2 ] , T 2 N T = 1 2 N T H N T . Conduct a random effects model Y = X β + U b + ε , we know its log-likelihood function is as follows
l n L N T p ( θ ˜ ) = 1 2 N T l n [ ( 2 π ) ] 1 2 l n | Ω | 1 2 N T ξ ( θ ˜ ) Ω 1 ξ ( θ ˜ ) ,
where ξ ( θ ˜ ) = Y X β . Using the information inequality of above model, T 1 N T 0 . In addition, T 2 N T is a quadratic function of [ X β 0 + M 0 ( X β + M ) ] and T 2 N T > 0 for given any ζ ζ 0 . Next, we prove that T 1 N T < 0 strictly, i.e.,
1 N T l n | Ω 0 | + 1 1 N T l n | Ω | t r ( Ω 1 Ω 0 ) 0 .
Combined Assumption 5 and uniform convergence, θ ^ θ 0 = o p ( 1 ) is proved. □
Proof of Theorem 2. 
By Taylor expansion of (5) at point θ 0 , we obtain
l n L ( θ ) θ | θ = θ 0 + 2 l n L ( θ ) θ θ | θ = θ ˜ ( θ ^ θ 0 ) = 0
where θ ˜ = ( ζ ˜ , β ˜ ) , ζ = ( ρ ˜ , λ ˜ , σ ˜ b 2 , σ ˜ e 2 ) and θ ˜ lies in between θ ^ and θ 0 . According to Theorem 1, we have θ ˜ P θ 0 . Denote
l n L ( θ 0 ) θ = ^ l n L ( θ ) θ | θ = θ 0 , 2 l n L ( θ 0 ) θ θ = ^ 2 l n L ( θ ) θ θ | θ = θ 0 , 2 l n L ( θ ˜ ) θ θ = ^ 2 l n L ( θ ) θ θ | θ = θ ˜ .
Therefore,
N T ( θ ^ θ 0 ) = ( 1 N T 2 l n L ( θ ˜ ) θ θ ) 1 1 N T l n L ( θ 0 ) θ .
Next, we aim to prove
1 N T 2 l n L ( θ ˜ ) θ θ 1 N T 2 l n L ( θ 0 ) θ θ = o p ( 1 ) ,
and
1 N T l n L ( θ 0 ) θ D N ( 0 , Σ θ 0 + Φ θ ) .
In order to prove (A5), we will show that all elements of 1 N T 2 l n L ( θ ˜ ) θ θ 1 N T 2 l n L ( θ 0 ) θ θ converge to 0 in probability. It can be calculated that
1 N T l n L ( θ 0 ) β = 1 N T X Ω 0 1 ξ ( θ ) , 1 N T l n L ( θ 0 ) δ i = 1 2 [ ξ ( θ ) Ω 0 1 δ i ξ ( θ ) t r ( Ω 0 δ i Ω 0 1 ) ] ( i = 1 , 2 , 3 , 4 ) .
where
Ω ρ = σ e 2 { V λ B 1 [ ( W B 1 ) + W B 1 ] B 1 } , Ω λ = σ e 2 { [ 2 λ 1 λ 2 V λ + 1 1 λ 2 F λ ] B 1 B 1 } , Ω σ e 2 = V λ ( B 1 B 1 ) , Ω σ b 2 = ( l T l T ) I N ,
Ω 0 is the true value of Ω , F λ is the first derivative of V λ with respect to λ . Consequently, we get
1 N T 2 l n L ( θ 0 ) β β = 1 N T X Ω 0 1 X , 1 N T 2 l n L ( θ 0 ) β δ i = 0 , 1 N T 2 l n L ( θ 0 ) δ i 2 = 1 2 N T t r ( Ω 0 δ i Ω 0 1 δ j ) ( i , j = 1 , 2 , 3 , 4 ) .
Then, by mean value theorem, Assumption 2 ( i v ) and Fact 2, we easily get
1 N T 2 l n L 1 ( θ 0 ) β β 1 N T 2 l n L 1 ( θ ˜ ) β β = X ( Ω 0 1 Ω ˜ 1 ) X = X [ V λ 1 l T l T V λ 1 d 2 ( 1 λ 2 ) Z 0 V λ ˜ 1 l T l T V λ ˜ 1 d ˜ 2 ( 1 λ ˜ 2 ) Z ˜ 0 ] X + X ( V λ 1 B B σ b 2 V λ ˜ 1 B ˜ B ˜ σ ˜ b 2 ) X X [ V λ 1 l T l T V λ 1 d 2 ( 1 λ 2 ) B B σ b 2 V λ ˜ 1 l T l T V λ ˜ 1 d ˜ 2 ( 1 λ ˜ 2 ) B ˜ B ˜ σ ˜ b 2 ] X = o p ( 1 ) .
Then we can similarly prove that (A5) holds.
In order to prove (A6), we follow the idea of Lee [29]. It is easy to obtain that the components of 1 N T l n L ˜ ( θ 0 ) θ are linear or quadratic functions of e and their means are all o p ( 1 ) . With Assumption 1, we have that 1 N T l n L ( θ 0 ) θ is asymptotically normal distributed with 0 means by using the CLT for linear-quadratic forms of Theorem 1 in Kelejian and Prucha [37]. In the next step, we calculate the variance. Denote μ η 03 and μ η 04 as the third and forth moments of η , respectively, A i i be the ii-th element of A. By using the facts that E ( U n A n U n ) = σ 0 2 t r ( A n ) , E ( U n A n U n U n B n U n ) = ( μ 04 3 σ 0 4 ) i = 1 n a n , i i b n , i i + σ 0 4 [ t r ( A n B n ) + t r ( A n B n ) + t r ( A n B n ) ] , it follows by direct calculation that
V a r ( 1 N T l n L ( θ 0 ) θ ) = E ( 1 N T l n L ( θ 0 ) θ · 1 N T l n L ( θ 0 ) θ ) = E ( 1 N T 2 l n L ( θ 0 ) θ θ ) + E ( Φ θ N T ) ,
where
1 N T 2 l n L ( θ 0 ) θ θ = 1 N T ( X Ω 0 X ) p × p 0 p × 4 0 4 × p 1 2 N T t r ( Ω 0 δ i Ω 0 1 δ j ) + o p ( 1 ) Σ θ 0 + o p ( 1 ) ,
Φ θ = 0 p × p 1 2 N T μ η 03 Ω 0 δ i d i a g ( X Ω 0 1 ) p × 4 ( ( μ η 04 3 σ η 0 2 ) i = 1 N T ( Ω 0 1 δ i ) i i 4 N T ) 4 × 4 .
In particular, it is not difficult to know that Φ θ = o ( 1 ) when e and b obey normal distribution. This completes the proof. □
Proof of Theorem 3. 
We follow the idea of Ullah and Su [31] to prove the theorem. The difference is that our error structure setting is more complex. Recall
δ ^ = S ( z ) ( Y X β ^ ) ,
then
δ ^ = S ( z ) ( Y X β ^ ) = S ( z ) [ m ( z ) + η X ( β ^ β 0 ) ] .
Write δ = m ( z ) H m ˙ ( z ) , where m ˙ ( z ) is the first derivative of m ( z ) . By the second order Taylor expression, we obtain m ( z i t ) = Z i t ( z ) δ + 1 2 ( z i t z ) m ¨ ( z ) ( z i t z ) + o p ( H 2 ) . Thus, we have
S ( z ) m ( z i t ) = [ Z ( z ) K H ( z ) Z ( z ) ] 1 Z ( z ) K H ( z ) Z ( z ) δ + 1 2 S ( z i t ) ( z i t z ) m ¨ ( z ) ( z i t z ) = δ + 1 2 S ( z i t ) ( z i t z ) m ¨ ( z ) ( z i t z ) .
Thus, we get
δ ^ δ = 1 2 i = 1 N t = 1 T S ( z i t ) ( z i t z ) m ¨ ( z ) ( z i t z ) + S ( z ) η S ( z ) X ( β ^ β )
and
N | H | ( δ ^ δ ) = B 11 + B 12 B 13 ,
where
B 11 = N | H | 2 i = 1 N t = 1 T S ( z i t ) ( z i t z ) m ¨ ( z ) ( z i t z ) , B 12 = N | H | S ( z ) η , B 13 = N | H | S ( z ) X ( β ^ β ) .
For B 11 , write
N | H | 2 i = 1 N t = 1 T S ( z i t ) ( z i t z ) m ¨ ( z ) ( z i t z ) = N | H | 2 Q 1 1 N T i = 1 N t = 1 T Z i t ( z ) K H ( z i t z ) ( z i t z ) m ¨ ( z ) ( z i t z ) + o p ( 1 ) = Q 1 N | H | 2 1 N T i = 1 N t = 1 T [ 1 ( z i t z ) H 1 ] K H ( z i t z ) ( z i t z ) m ¨ ( z ) ( z i t z ) + o p ( 1 ) = Q 1 N | H | 2 1 N T i = 1 N t = 1 T K H ( z i t z ) ( z i t z ) m ¨ ( z ) ( z i t z ) i = 1 N t = 1 T ( z i t z ) H 1 K H ( z i t z ) ( z i t z ) m ¨ ( z ) ( z i t z ) + o p ( 1 ) = Q 1 N | H | 2 f ¯ ( z ) u u K ( u ) d u H m ¨ ( z ) H 0 + o p ( 1 ) .
Thus, we obtain
B 11 = f ¯ ( z ) N | H | 2 Q 1 u u K ( u ) d u H m ¨ ( z ) H 0 + o p ( 1 ) ,
where Q = f ¯ ( z ) 1 0 0 u u K ( u ) d u , f ¯ ( z ) = t = 1 T f t ( z ) .
For B 12 , write B 12 = N | H | S ( z ) η , we can show that E [ B 12 ] = 0 and
V a r ( B 12 ) = E [ B 12 2 ] σ η 2 = O ( 1 ) .
Combined (A9), Slutsky’s theorem and central limit theorem, we get
N | H | δ ^ δ Q 1 f ¯ ( z ) 2 ( u u K ( u ) d u H m ¨ ( z ) H ) 0 D N ( 0 , σ 2 ( η ) ) .
where σ 2 ( δ ) = Ω , Q = f ¯ ( z ) 1 0 0 u u K ( u ) d u , and f ¯ ( z ) = t = 1 T f t ( z ) . □
Proof of Theorem 4. 
It can be seen that
R S S ( H 1 ) = i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ^ ] 2 = i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ] 2 + { i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ] 2 } = R S S * ( H 1 ) + T 1 ,
where R S S * ( H 1 ) = i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ] 2 and T 1 = i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ] 2 . For T 1 , it is obtained by Theorems 1–3 and direct calculation that
T 1 = i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ y i t m ^ ( z i t ) x i t β ] 2 = i = 1 N t = 1 T [ x i t β + m ( z i t ) + ε i t m ^ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ x i t β + m ( z i t ) + ε i t m ^ ( z i t ) x i t β ] 2 = i = 1 N t = 1 T [ O p ( 1 N T ) + O ( | H | 2 ) + | H | 2 O p ( 1 N | H | ) + η ] 2 i = 1 N t = 1 T [ O ( | H | 2 ) + | H | 2 O p ( 1 N | H | ) + η ] 2 = O ( 1 ) .
Similarly, we know that
R S S ( H 0 ) = i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ^ ] 2 = i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ] 2 + { i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ] 2 } = R S S * ( H 0 ) + T 0 ,
where R S S * ( H 0 ) = i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ] 2 , and T 0 = i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ] 2 . For T 0 , it is obtained by Theorems 1–3 and calculation that
T 0 = i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ y i t m ˜ ( z i t ) x i t β ] 2 = i = 1 N t = 1 T [ x i t β + m ( z i t ) + ε i t m ˜ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ x i t β + m ( z i t ) + ε i t m ˜ ( z i t ) x i t β ] 2 = i = 1 N t = 1 T [ O p ( 1 N T ) + m ˜ ( z i t ) x i t β ^ ] 2 i = 1 N t = 1 T [ m ˜ ( z i t ) x i t β ^ ] 2 = O p ( 1 ) .
In addition, we obtain that
F N T = N T 2 R S S * ( H 0 ) R S S * ( H 1 ) R S S * ( H 1 ) ( 1 + o p ( 1 ) ) + O p ( 1 ) .
By Remark 3.4 in [27], under H 0 , we obtain
r k F N T D χ r k c k | Z | / | H | 2 ,
where r k = K ( 0 ) 1 2 K 2 ( z ) d u ( K ( z ) 1 2 K K ( z ) ) 2 d u , K ( z ) = d i a g ( H 1 k ( H 1 ( z 11 z ) ) , , H 1 k ( H 1 ( z N T z ) ) ) , c k = K ( 0 ) 1 2 K 2 ( z ) d u , Z is the support of z i t and H is a bandwidth sequence. □

References

  1. Bates, G.E.; Neyman, J. Contributions to the theory of accident proneness II: True of false contagion. Univ. Calif. Publ. Statist. 1952, 1, 255–276. [Google Scholar]
  2. Chamberlain, G. Multivariate regression models for panel data. J. Econom. 1982, 18, 5–46. [Google Scholar] [CrossRef]
  3. Baltagi, B. Econometrics Analysis of Panel Data; Wiley Press: New York, NY, USA, 2001. [Google Scholar]
  4. Arellano, M. Panel Data Econometrics; Oxford University Press: New York, NY, USA, 2003. [Google Scholar]
  5. Baldev, R.; Baltagi, B. Panel Data Analysis; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  6. Hsiao, C. Analysis of Panel Data; Cambridge University Press: Cambridge, UK, 2014. [Google Scholar]
  7. Elhorst, J. Spatial Econometrics: From Cross-Sectional Data to Spatial Panels; Springer Press: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
  8. Druska, V.; Horrace, W. Generalized moments estimation for spatial panel data: Indonesian rice farming. Am. J. Agric. Econ. 2004, 86, 185–198. [Google Scholar] [CrossRef]
  9. Egger, P.; Pfaffermayr, M.; Winner, H. An unbalanced spatial panel data approach to US state tax competition. Econ. Lett. 2005, 88, 329–335. [Google Scholar] [CrossRef]
  10. Kapoor, M.; Kelejian, H.; Prucha, I. Panel data models with spatially correlated error components. J. Econ. 2007, 140, 97–130. [Google Scholar] [CrossRef] [Green Version]
  11. Lee, L.; Yu, J. Estimation of spatial autoregressive panel data models with fixed effects. J. Econ. 2010, 154, 165–185. [Google Scholar] [CrossRef]
  12. Elhorst, J. Serial and spatial autocorrelation. Econ. Lett. 2008, 100, 422–424. [Google Scholar] [CrossRef]
  13. Parent, O.; LeSage, J. A space-Ctime filter for panel data models containing random effects. Comput. Stat. Data Anal. 2011, 55, 475–490. [Google Scholar] [CrossRef]
  14. Lee, L.; Yu, J. Spatial penels: Random components versus fixed effects. Int. Econ. Rev. 2012, 53, 1369–1412. [Google Scholar] [CrossRef]
  15. Lee, L.; Yu, J. Estimation of fixed effects panel regression models with separable and nonseparable space-time filters. J. Econ. 2015, 184, 174–192. [Google Scholar] [CrossRef]
  16. Baltagi, B.; Song, S.; Jung, B.; Koh, W. Testing for serial correlation, spatial autocorrelation and random effects using panel data. J. Econ. 2007, 140, 5–51. [Google Scholar] [CrossRef]
  17. Cohen, J.; Morrison Paul, C. Public infrastructure investment, interstate spatial spillovers and manufacturing costs. Rev. Econ. Stat. 2004, 86, 551–560. [Google Scholar] [CrossRef]
  18. Zhao, J.; Zhao, Y.; Lin, J.; Miao, Z.-X.; Khaled, W. Estimation and testing for panel data partially linear single-index models with errors correlated in space and time. Random Matrices 2020, 9, 2150005. [Google Scholar] [CrossRef] [Green Version]
  19. Bai, Y.; Hu, J.; You, J. Panel data partially linear varying-coefficient model with both spatially and time-wise correlated errors. Stat. Sinica. 2015, 35, 275–294. [Google Scholar] [CrossRef]
  20. Shi, W.; Lee, L. Spatial dynamic panel data models with interactive fixed effects. J. Econ. 2017, 197, 323–347. [Google Scholar] [CrossRef]
  21. Baltagi, B.; Fingleton, B.; Pirotte, A. Estimating and forecasting with a dynamic spatial panel data model. Oxford. B Econ. Stat. 2014, 76, 112–138. [Google Scholar] [CrossRef]
  22. Cai, Z. Trending time-varying coefficient time series models with serially correlated errors. J. Econ. 2007, 136, 163–188. [Google Scholar] [CrossRef]
  23. Lin, X.; Carroll, R. Nonparametric function estimation for clustered data when the predictor is measured without/with error. J. Am. Stat. Assoc. 2000, 95, 520–534. [Google Scholar] [CrossRef]
  24. Fan, J.; Huang, T.; Li, R.Z. Analysis of longitudinal data with semi-parametric estimation of covariance function. J. Am. Stat. Assoc. 2007, 102, 632–641. [Google Scholar] [CrossRef] [Green Version]
  25. Fan, J.; Zhang, W. Statistical methods with varying coefficient models. Stat. Inference 2008, 1, 179–195. [Google Scholar] [CrossRef] [Green Version]
  26. Tang, L.; Liu, Y. Estimation of generalized spatial lag semi-parametric varying-coefficient panel model with random effects. Stat. Res. 2018, 35, 119–128. [Google Scholar]
  27. Fan, J.; Zhang, C.; Zhang, J. Generalized likelihood ratio statistics and Wilks phenomenon. Ann. Stat. 2001, 29, 153–193. [Google Scholar] [CrossRef]
  28. Su, L.; Jin, S. Profile quasi-maximum likelihood estimation of partially linear spatial autoregressive models. J. Econ. 2010, 157, 18–33. [Google Scholar] [CrossRef]
  29. Lee, L. Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models. Econometrica 2004, 72, 1899–1925. [Google Scholar] [CrossRef]
  30. Hamilton, S.; Truong, Y. Local linear estimation in partly linear models. J. Multivar. Anal. 1997, 60, 1–19. [Google Scholar] [CrossRef] [Green Version]
  31. Ullah, A.; Su, L. Profile likelihood estimation of partially linear panel data models with fixed effects. Econ. Lett. 2006, 92, 75–81. [Google Scholar]
  32. Mack, Y.; Silverman, B. Weak and strong uniform consistency of kernel regression estimates. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 1982, 61, 405–415. [Google Scholar] [CrossRef]
  33. Su, L. Semiparametric GMM estimation of spatial autoregressive models. J. Econ. 2012, 167, 543–560. [Google Scholar] [CrossRef]
  34. Anselin, L. Spatial Econometrics: Methods and Models; Kluwer Academic Press: Dordrecht, The Netherlands, 1988. [Google Scholar]
  35. Feng, Q.; Horrace, W. Alternative technical efficiency measures: Skew, bias and scale. J. Appl. Economet. 2012, 27, 253–268. [Google Scholar] [CrossRef] [Green Version]
  36. White, H. Estimation, Inference and Specification Analysis; Cambridge University Press: New York, NY, USA, 1994. [Google Scholar]
  37. Kelejian, H.; Prucha, I. On the asymptotic distribution of the Moran I test statistic with applications. J. Econ. 2001, 104, 219–257. [Google Scholar] [CrossRef]
Figure 1. The parameter space Λ of λ and ρ .
Figure 1. The parameter space Λ of λ and ρ .
Fractalfract 06 00735 g001
Figure 2. The fitting images and 95% confidence intervals of m ^ ( z ) with ρ = 0.2 and λ = 0.2 .
Figure 2. The fitting images and 95% confidence intervals of m ^ ( z ) with ρ = 0.2 and λ = 0.2 .
Fractalfract 06 00735 g002
Figure 3. Trajectories of m ( z i t ) with η = 0, 0.05, 0.1, 0.2, and 0.5, respectively.
Figure 3. Trajectories of m ( z i t ) with η = 0, 0.05, 0.1, 0.2, and 0.5, respectively.
Fractalfract 06 00735 g003
Figure 4. Powers of the test with α = 0.01 and 0.05, respectively, with sample size N T .
Figure 4. Powers of the test with α = 0.01 and 0.05, respectively, with sample size N T .
Fractalfract 06 00735 g004
Figure 5. m ^ ( z ) and its 95% confidence intervals for the dataset of Indonesian rice farming in China.
Figure 5. m ^ ( z ) and its 95% confidence intervals for the dataset of Indonesian rice farming in China.
Fractalfract 06 00735 g005
Table 1. Results of parametric estimates under the model (12) with T = 10.
Table 1. Results of parametric estimates under the model (12) with T = 10.
N = 50N = 100N = 200
TrueMeanMSETrueMeanMSETrueMeanMSE
β ^ 1.00000.98540.00701.00001.00960.00321.00001.00600.0018
ρ ^ 0.20000.20340.01080.20000.20270.00890.20000.20170.0044
ρ = 0.2 λ ^ 0.20000.20790.00360.20000.20740.00320.20000.20270.0027
λ = 0.2 σ e 2 ^ 1.00001.03350.00131.00001.02450.00111.00001.00640.0008
σ b 2 ^ 1.00001.03560.00451.00001.02970.00251.00001.01600.0013
β ^ 1.00001.01300.00491.00001.00410.00411.00001.00040.0010
ρ ^ 0.20000.20590.01130.20000.20180.01030.20000.20410.0010
ρ = 0.2 λ ^ 0.40000.41200.00560.40000.40460.00380.40000.40390.0013
λ = 0.4 σ e 2 ^ 1.00001.05670.00881.00001.03020.00451.00001.02900.0043
σ b 2 ^ 1.00001.05360.00571.00001.01450.00331.00001.00220.0025
β ^ 1.00001.03490.07821.00001.03110.06011.00000.97740.0506
ρ ^ 0.40000.38460.03440.40000.40260.04090.40000.39290.0311
ρ = 0.4 λ ^ 0.20000.20690.00180.20000.20550.00150.20000.20440.0013
λ = 0.2 σ e 2 ^ 1.00001.04290.00271.00001.01590.00231.00001.00830.0017
σ b 2 ^ 1.00001.03780.00251.00001.00830.00181.00001.00550.0013
β ^ 1.00001.03560.06371.00001.01160.04411.00001.00710.0063
ρ ^ 0.40000.40830.00110.40000.40140.00100.40000.40080.0005
ρ = 0.4 λ ^ 0.40000.41090.00080.40000.41070.00080.40000.40090.0006
λ = 0.4 σ e 2 ^ 1.00001.04110.00191.00001.02120.00051.00001.00410.0004
σ b 2 ^ 1.00001.03990.00211.00001.01380.00201.00001.00220.0008
Table 2. Results of parametric estimates under the model (12) with T = 20.
Table 2. Results of parametric estimates under the model (12) with T = 20.
N = 50N = 100N = 200
TrueMeanMSETrueMeanMSETrueMeanMSE
β ^ 1.00001.00680.00611.00000.99320.00251.00000.99450.0010
ρ ^ 0.20000.19760.00120.20000.19900.00100.20000.20030.0005
λ = 0.2 λ ^ 0.20000.20510.00220.20000.20190.00150.20000.20080.0008
ρ = 0.2 σ e 2 ^ 1.00001.01020.00091.00001.00980.00071.00001.00050.0005
σ b 2 ^ 1.00001.02420.00341.00001.01450.00131.00001.00080.0007
β ^ 1.00000.98790.00251.00001.00060.00081.00001.00030.0003
ρ ^ 0.20000.20280.00970.20000.20110.00590.20000.20010.0001
ρ = 0.2 λ ^ 0.40000.40500.00430.40000.39720.00190.40000.40080.0011
λ = 0.4 σ e 2 ^ 1.00001.00820.00371.00001.00560.00211.00001.00140.0014
σ b 2 ^ 1.00001.00620.00261.00001.00470.00241.00001.00160.0015
β ^ 1.00001.01570.03691.00000.99150.02821.00000.99410.0008
ρ ^ 0.40000.39710.03120.40000.40100.00330.40000.39900.0005
ρ = 0.4 λ ^ 0.20000.20250.00110.20000.20130.00070.20000.20010.0002
λ = 0.2 σ e 2 ^ 1.00001.00960.00131.00001.00680.00111.00001.00410.0008
σ b 2 ^ 1.00001.00710.00201.00001.00480.00151.00001.00050.0007
β ^ 1.00001.02760.00631.00001.00410.00431.00001.00080.0003
ρ ^ 0.40000.40050.00040.40000.40040.00010.40000.40000.0000
ρ = 0.4 λ ^ 0.40000.40040.00040.40000.40010.00010.40000.40000.0000
λ = 0.4 σ e 2 ^ 1.00001.00250.00121.00001.00120.00041.00001.00040.0002
σ b 2 ^ 1.00001.00090.00161.00001.00080.00091.00001.00020.0005
Table 3. The medians and SDs of MADEs for nonparametric estimates under the model (12) with T = 10.
Table 3. The medians and SDs of MADEs for nonparametric estimates under the model (12) with T = 10.
N = 50N = 100N = 200
ρ = 0.2 λ = 0.2 Median0.33640.32480.3179
SD0.27500.22030.2145
λ = 0.4 Median0.31080.31000.2927
SD0.27090.24670.2247
ρ = 0.4 λ = 0.2 Median0.30600.30460.2933
SD0.24640.21560.2077
λ = 0.4 Median0.32750.32110.3031
SD0.25400.19240.1756
Table 4. The medians and SDs of MADEs for nonparametric estimates under the model (12) with T = 20.
Table 4. The medians and SDs of MADEs for nonparametric estimates under the model (12) with T = 20.
N = 50N = 100N = 200
ρ = 0.2 λ = 0.2 Median0.33550.31740.2687
SD0.24290.19480.1530
λ = 0.4 Median0.30420.29020.2883
SD0.21130.18830.1858
ρ = 0.4 λ = 0.2 Median0.30200.30070.2903
SD0.22250.20690.1908
λ = 0.4 Median0.32440.30010.2873
SD0.20400.18240.1722
Table 5. Results of parametric estimates under the models (12)–(15).
Table 5. Results of parametric estimates under the models (12)–(15).
N = 50, T = 5N = 100, T = 5
β ^ ρ ^ λ ^ σ ^ e 2 σ ^ b 2 β ^ ρ ^ λ ^ σ ^ e 2 σ ^ b 2
model (12)Mean0.98480.41350.40921.02111.05761.00670.40630.40141.01281.0266
MSE0.00480.00140.00130.00150.01650.00230.00110.00090.00130.0125
Mean0.4711-0.53721.02211.10630.8424-0.50431.03311.1221
model (13)MS5.7078-0.01680.10570.12440.9135-0.02830.00950.1365
G R 1 (%)118,812.5-1192.36946.7653.939,617.3-3044.4630.7992
Mean0.83720.6462-1.02341.13220.90100.5968-1.09281.1012
model (14)MSE6.21350.1281-0.01710.12110.88450.1330-0.10130.1403
G R 1 (%)129,347.99050-1040633.938,356.511,990.9-7692.31022.4
Mean0.0366--1.02251.11230.9828--1.09481.1044
model (15)MSE5.8645--0.01600.17810.7981--0.10360.1474
G R 1 (%)122,077.1--966.7979.434,600--7869.21079.2
Table 6. The medians and SDs of MADEs for nonparametric estimates under the models (12)–(15).
Table 6. The medians and SDs of MADEs for nonparametric estimates under the models (12)–(15).
Model (12)Model (13)Model (14)Model (15)
N = 50, T = 5Median0.22320.76320.72590.7983
G R 2 (%)(-)(241.9)(225.2)(257.6)
SD0.14610.28570.30780.4305
G R 3 (%)(-)(95.5)(1106.7)(194.7)
N = 100, T = 5Median0.20421.23971.33231.2387
G R 2 (%)(-)(507.1)(552.4)(506.6)
SD0.14530.55950.47040.4810
G R 3 (%)(-)(285.1)(223.7)(231.0)
Table 7. Powers of test with significance level α .
Table 7. Powers of test with significance level α .
NT α η = 0 η = 0.05 η = 0.1 η = 0.2 η = 0.5
10080.050.060.440.9911
0.010.020.380.8911
100160.050.050.63111
0.010.010.55111
Table 8. Variables and their definitions.
Table 8. Variables and their definitions.
VariableDefinition
highDummy variable for high yield varieties
( h i g h = 1 if the ith form at tth growing reason used high yield varieties,
otherwise high = 0)
mixedDummy variable for mixed yield varieties
(mixed = 1 if the ith form at tth growing reason used mixed varieties,
otherwise mixed = 0)
seedSeed weight used in tth season on ith farm(measured in kilogram)
landTotal area that farmers cultivated with rice(measured in hectares)
Table 9. Results of F-test under significant level α = 0.01 .
Table 9. Results of F-test under significant level α = 0.01 .
VariableHighMixedSeedLand
F-test value2.333263.42983.7706185.0233 ***
Note: *** means that it is significant under test level 1%.
Table 10. Results of parametric estimates of model (1)–(3) for fitting Indonesia rice farming data.
Table 10. Results of parametric estimates of model (1)–(3) for fitting Indonesia rice farming data.
Parameter ρ ^ λ ^ β ^ 1 β ^ 2 β ^ 3 σ ^ e 2 σ ^ b 2
Estimate−0.7227 ***0.3891 ***3.2123 ***2.6885 ***0.0172 **2.4049 ***0.0023 ***
Note: *** and ** mean that they are significant under test level 1% and 5% respectively.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, S.; Chen, J.; Li, B. Estimation and Testing of Random Effects Semiparametric Regression Model with Separable Space-Time Filters. Fractal Fract. 2022, 6, 735. https://doi.org/10.3390/fractalfract6120735

AMA Style

Li S, Chen J, Li B. Estimation and Testing of Random Effects Semiparametric Regression Model with Separable Space-Time Filters. Fractal and Fractional. 2022; 6(12):735. https://doi.org/10.3390/fractalfract6120735

Chicago/Turabian Style

Li, Shuangshuang, Jianbao Chen, and Bogui Li. 2022. "Estimation and Testing of Random Effects Semiparametric Regression Model with Separable Space-Time Filters" Fractal and Fractional 6, no. 12: 735. https://doi.org/10.3390/fractalfract6120735

Article Metrics

Back to TopTop