Indirect Inference Estimation of Spatial Autoregressions

The ordinary least squares (OLS) estimator for spatial autoregressions may be consistent as pointed out by Lee (2002), provided that each spatial unit is influenced aggregately by a significant portion of the total units. This paper presents a unified asymptotic distribution result of the properly recentered OLS estimator and proposes a new estimator that is based on the indirect inference (II) procedure. The resulting estimator can always be used regardless of the degree of aggregate influence on each spatial unit from other units and is consistent and asymptotically normal. The new estimator does not rely on distributional assumptions and is robust to unknown heteroscedasticity. Its good finite-sample performance, in comparison with existing estimators that are also robust to heteroscedasticity, is demonstrated by a Monte-Carlo study.


Introduction
Spatial autoregressions (SAR) have attracted lots of attention from both practical and theoretical sides in economics and other disciplines of social sciences since the classical work of Cliff and Ord (1981).Its popularity is mainly due to its parsimonious representation of the cross-sectional correlation by a weight matrix.Correlation in spatial data arises naturally due to competition, copycatting, spillover, aggregation, to name just a few.In these contexts, space embodied in the weight matrix can be defined in terms of not only geographical distance but also economic distance.
The spatial autoregression model extends autocorrelation in time series to the spatial dimension in the sense that in the structural equation a spatially "lagged" dependent variable is included as a regressor.In time series, the autoregression model can be estimated consistently by the ordinary least squares (OLS), but for the spatial autoregression, the OLS is usually regarded as an inconsistent method.Robinson (2008) provided an excellent discussion of the intuition behind this.Estimation strategies like the maximum likelihood (ML), quasi maximum likelihood (QML), instrument variables (IV), and generalized method of moments (GMM) have been proposed in the literature.The ML is the most efficient, but it imposes stringent distributional assumptions on the data generating process, whereas both the ML and QML rule out heteroscedasticity in the error term.Further, the (Q)ML involves calculating the determinant or eigenvalues of a matrix that is of the same size as the sample, and thus many researchers dismiss its use in moderately large samples and advocate the more flexible IV and GMM estimators that may incur less computational burden and are also robust to heteroscedasticity.Lee (2002) overturned the traditional wisdom regarding the OLS estimator in spatial autoregressions when there are exogenous regressors included.He showed that while the OLS estimator is inconsistent for spatial autoregressions with a sparse weight matrix, it can be consistent when spatial units may have small spatial impacts on other units but each unit may be influenced aggregately by a significant portion of the total units.For the special case of the so-called pure SAR model, namely, when there is no other exogenous regressor, Lee (2002) demonstrated that regardless of the structure of the weight matrix, the OLS estimator is always inconsistent.
In practice, one may have limited knowledge to judge whether a spatial unit is influenced by a significant portion of the total units.When a researcher is constructing the weight matrix, she may know the number of neighboring units for each unit, but she may be unable to tell whether the number of neighbors is significant in finite samples.Thus, this poses a challenge for practitioners regarding the usefulness of the OLS estimator: it may or may not be consistent, depending on the degree of aggregate influence on each unit from other units, which may hardly manifest itself in finite samples.This paper carefully analyzes the asymptotic distribution theory for the OLS estimator.A unified asymptotic distribution result for the recentered OLS estimator is presented under different regimes for the spatial weight matrix.Given the asymptotic result for the recentered OLS estimator, a new estimator based on the indirect inference (II) procedure is proposed.Kyriacou et al. (2017) novelly used the II procedure to correct the inconsistency of the OLS estimator in the pure SAR model under homoscedasticity.Even though they provided promising simulation results under some mild heteroscedasticity, they have yet to show rigorously how to construct a consistent estimator with no restrictions on the form of heteroscedasticity.In contrast, this paper considers the SAR model with exogenous regressors and it adds to the existing spatial literature with unknown heteroscedasticity. 1 1 Recent literature on dealing with heteroscedasticity in the spatial framework includes Kelejian and Prucha (2010), Badinger and Egger (2011), Liu and Yang (2015), Jin and Lee (2019), among others.An essential idea in this strand of literature is to use some moment conditions that are robust to unknown heteroscedasticity.
Note that the problem of inconsistency (of the OLS estimator) is solely due to the presence of the endogenous spatially lagged variable.Once the spatial autoregression parameter is estimated consistently, the OLS procedure can be used to estimate the remaining parameters, though the asymptotic variance needs to be modified accordingly to take into account the uncertainty in the estimated spatial autoregression parameter.
The plan of this paper is as follows.In the next section, the asymptotic behavior of the OLS estimator is discussed under different spatial scenarios.The II estimator that aims to correct the possible inconsistency of the OLS estimator is defined and its asymptotic distribution is derived.A very important message from this section is that regardless of the degree of aggregate influence on each spatial unit from other units, the II procedure can always be used and the resulting estimator is consistent and asymptotically normal.Section 3 discusses the special case of pure SAR.Section 4 provides Monte Carlo evidence of the effectiveness of the II estimation strategy.It shows that the II estimator possesses good finite-sample performance relative to other consistent estimators that are also robust to unknown heteroscedasticity and that the II method may be favored for the purpose of hypothesis testing, especially for testing the spatial autoregression parameter.Section 5 concludes.Technical details, including some useful lemmas and proofs of the results presented in Sections 2 and 3, are collected in the appendix.
Throughout, K is used to denote a positive constant on different occasions, arbitrarily large but bounded, that does not depend on the sample size n and whose value may vary in different contexts.I n is the identity matrix of dimension n and 1 n is an n × 1 vector of ones.For an n × 1 vector a n , a i,n denotes its i-th element and for an n are the maximum row sum norm and maximum column sum norm, respectively.A sequence of matrices Dg(a n ) denotes a square diagonal matrix with the vector a n spanning the main diagonal, dg(A n ) is an n × 1 column vector that collects in order the diagonal elements of the square matrix A n , and Dg(A n ) = Dg(dg(A n )).The subscript 0 is used to signify the true parameter value.

Main Results
Consider the SAR model where n is the total number of cross-sectional units, an n × 1 vector, collecting observations on the dependent variable, X n is an n × k matrix of observations on k exogenous nonstochastic regressors with coefficient vector β, W n is an n × n matrix of spatial weights with zero diagonals, λ is the spatial autoregression coefficient, and u n is an n-dimensional vector of error terms.
For the ease of presentation, the following matrix notation is introduced in this paper: When a matrix is presented without its argument λ, it means that it is evaluated at the parameter value λ 0 .Namely, , and If λ is known (equal to its true value), the model becomes a standard linear model with S n y n being the dependent variable; otherwise, W n y n appears on the right-hand side of (1) as a spatially lagged or weighted variable.The OLS estimator of θ 0 is θn = (Z ′ n Z n ) −1 Z ′ n y n , which may be inconsistent.Since the inconsistency of θn is solely due to the endogenous W n y n , the properties of λn (the first element of θn ) are discussed first in this paper.Once a consistent estimator of λ 0 is available, one can show that a consistent estimator of β 0 follows immediately (see Theorem 4 to be introduced).
By using the partitioned regression formula and substituting 2 It should be pointed out when lim n→∞ √ n/hn = 0 (and lim n→∞ hn/n = 0, where h −1 n is the order of magnitude of elements of W n), θn is consistent, as shown in Lee (2002), and thus one may not need to seek a consistent estimator of λ 0 separately and then use it to construct a consistent estimator of β 0 .In practice, one may not know a priori the rate of hn, but the II estimator to be introduced is always consistent regardless of the rate of hn.
The following assumptions are made throughout this paper.
) < ∞ for some positive constant δ; (iii) u i,n and u j,n are independent for any i ̸ = j.
Assumption 4. λ 0 is contained in a compact parameter space Λ.For any admissible λ ∈ Λ, } is is uniformly bounded in row and column sums.
Assumption 5. (i) The elements of X n are uniformly bounded constants for all n; (ii) The probability limit Γ of n −1 Z ′ n Z n exists and is nonsingular.
exists and is nonzero.
Intuitions and related discussions for Assumptions 1-5 are provided in Kelejian and Prucha (1999) and Lee (2001Lee ( , 2002Lee ( , 2004)).Assumption 1.(i) follows naturally when W n is row-or column-normalized, as is typically the case.Assumption 1.(ii) and Assumption 2 limit the degree of dependency among the spatial units and is originated by Kelejian and Prucha (1999).Given Assumption 2, the equilibrium solution of y n is Lee (2002) emphasized that Assumption 5.(ii) is related to an identification condition for estimation in the least squares and IV frameworks.It rules out possible multicollinearities among X n and G n X n β 0 for large n and implies that the limit of d 3n /n is bounded away from zero. 3 Assumption 6 ensures that the asymptotic variances of the (properly recentered) OLS estimator and the resulting II estimator are positive. 4 3 Multicollinearities can happen, for example, when Xn = 1n, namely, it consists of only a constant term.Lee (2004) showed that under homoscedasticity, however, the QML works in violation of this condition.Since the II estimator to be discussed in this paper is to correct the possible inconsistency of the OLS estimator, Assumption 5.(ii) is maintained. 4The asymptotic variances are given by limn→∞ indicating that λn is consistent as long as h n → ∞.
More interestingly, the behavior of λn depends on how fast h n diverges to infinity.If h n tends to infinity at a rate slower than √ n, then r 1n dominates r 2n in the numerator, and which implies that λn converges at the slower rate h n , but it does not converge to λ 0 at rate h n , as plim , r 1n and r 2n are of the same order in the numerator, so indicating that λn converges at rate √ n, but at this rate it does not converge to λ 0 , as in general , respectively, for the (properly recentered) OLS estimator and the resulting II estimator.Their explicit expressions are given respectively in Theorems 1 and 2 to be introduced.Assumption 5.(ii) implies that plim n→∞ (dn where the covariance term disappears under normality, and Var(u Then the usual condition that plim n→∞ n −1 Z ′ n ΣnZn exists and is nonsingular is sufficient for Assumption 6 to hold.When hn is bounded, a more precise characterization of a sufficient condition is not immediately obvious.Essentially, it requires, in addition to the existence and nonsingularity of plim If h n tends to infinity at a rate faster than √ n (and yet smaller than n), namely, lim n→∞ √ n/h n = 0 (and lim n→∞ h n /n = 0), then in the numerator r 2n dominates r 1n , indicating that λn converges to λ 0 at rate √ n and is asymptotically normal if one applies a central limit theorem to n −1/2 r 2n .
One sees that the asymptotic behavior of λn depends on the magnitude of h n , which may be unknown in practice.The following theorem shows that if one can properly recenter λn , then a unified asymptotic distribution result follows.
Theorem 1.Under Assumptions 1-6, the OLS estimator λn of λ 0 in the model (1) has the following asymptotic distribution, where (This is the approach taken by Kyriacou et al. (2017) when dealing with the special case of pure SAR model.)But then by following a similar expansion as in the proof of Theorem 1, one can find that the asymptotic variance of the resulting recentered estimator is much more complicated, involving the variances of r n and d n as well as their covariance.
Remark 2. When lim ) and the asymptotic distribution of √ n( λn − λ 0 ) centers at zero, so one does not need to recenter λn −λ 0 by tr(  Remark 5. Theorem 1 gives a unified representation of the asymptotic distribution of the properly recentered OLS estimator, regardless of the possibly unknown magnitude of h n .Further, it facilitates the construction of the indirect inference estimator to be introduced that corrects the inconsistency, when present, of the OLS estimator.

The Indirect Inference Estimator
One can see from Theorem 1 in the previous subsection that the OLS estimator λn may have an asymptotic bias.Yet a direct feasible bias-correction procedure is not possible, since the bias itself depends on unknown parameters, including λ 0 , which may not be consistently estimated by the OLS estimator λn .Following Phillips (2012) and Kyriacou et al. (2017), one may define the binding function that involves the bias of λn .Unfortunately, the resulting binding function then involves the unknown Σ n , which appears in the recentering quantity for the OLS estimator in (7).The strategy in this paper is to replace tr( that is of the same order of u ′ n M n G n u n and at the same time does not directly involve Σ n . Recall that D n = Dg(M n G n ).Then under a general form of unknown heteroscedas- the recentering quantity.
Theorem 2. Under Assumptions 1-6, the OLS estimator λn of λ 0 in the model (1) has the following asymptotic distribution: where Remark 6.When one replaces tr(Σ n M n G n ) that involves the unknown Σ n and appears in the recentering term of the OLS estimator, the asymptotic variance η of the newly recentered estimator no longer involves Σ (3) n and Σ (4) n .This stands in contrast to the asymptotic variance v facilitates not only the construction of the indirect inference estimator to be introduced but also the inference procedure.
Given Theorem (2) and the observed sample data y n and X n , one can always define the sample binding function.Recall that S n (λ ) are functions of the parameter λ (as well as X n ).So the binding function can be defined as and the II estimator inverts this binding function: Intuitively, the II estimator defined as such tries to match λn from the observed data to its expec-tation, at least approximately.Typically, the expectation may be approximated, to an arbitrary degree of accuracy, via the method of simulations, as in the original spirit of Gouriéroux et al. (1993) and Smith (1993).But in simulations one needs to make some distributional assumption to generate the pseudo error term.So instead one may use some analytical approximation as in Phillips (2012), Kyriacou et al. (2017), and this paper.
In the definition of the II estimator as in (10), it is implicitly assumed that the binding function b n (λ) is invertible.Note that b n (λ) is a function of λ and the sample data y n and X n and thus it is random.5 Assumption 7.For all λ ∈ Λ, the binding function ( 9) is monotonic in λ with probability 1 and when One can see from the expression (A.10) in the appendix that the derivative of the binding function with respect to λ is For divergent h n , one can show that the O P (h −1 n ) term converges almost surely to zero for all λ ∈ Λ.So in large samples, Assumption 7 is more likely to hold.Assumption 7 lists the conditions under which the II estimator exists and is consistent.It would be desirable if one could lay down some primitive conditions on the data matrix X n , the weight matrix W n , and the parameter space Λ so that Assumption 7 would be satisfied.Given the sample data, one may plot the binding function against λ to verify numerically the validity of this assumption.Simulations as in Gospodinov et al. (2017) may also help to establish this assumption's credibility.
Theorem 3.For the model (1), under Assumptions 1-7, the II estimator λII n of λ 0 , defined as in ( 10) that is based on the binding function ( 9), has the following asymptotic distribution: Remark 7. Since when h n diverges, b ′ n (λ 0 ) converges almost surely to 1, the asymptotic distribution of the II estimator is identical to that of the properly recentered OLS estimator.Further, In light of Remark 4, this means that under homoscedasticity, the II estimator is as efficient as the optimal IV estimator and can be as efficient as the ML estimator if the spatial data is normally distributed.
Remark 8. Theorem 3 shows that regardless of the magnitude of h n , which researchers may not know in practice, one can always apply the II procedure after the OLS estimation is done.At worst, when lim n→∞ √ n/h n = 0, this procedure is redundant (see Remark 2), but still the resulting II estimator has exactly the same asymptotic distribution as the consistent OLS estimator (since η = v +o(1), see Remark 7).Otherwise, the II procedure provides a correction to the inconsistent OLS estimator.
Once the spatial autoregression parameter λ is consistently estimated by λII n , one can estimate the parameter vector β by where Ŝn = S n ( λII Theorem 4. For the model (1), under Assumptions 1-7, the OLS estimator of β 0 defined as in (13) has the asymptotic distribution and jointly, where V , assumed to exist and be positive definite, is given by (A.14) and γ is given by (A.15), respectively, in the appendix.
Remark 9.One can see (from the appendix) that the expression of V contains the traditional OLS variance term under heteroscedasticity, lim where ûi,n 's are the sample residuals from the II estimation.6 3 The Special Case of Pure SAR It is worthwhile to discuss the case when there is no X n , namely, the so-called pure SAR model This case is of special interest since there is no IV available.On the other hand, the QML estimator is not consistent under heteroscedasticity.Kyriacou et al. (2017) were the first to explore the possibility of using the II procedure to correct the inconsistency of the OLS estimator under some mild form of heteroscedasticity and their results were quite promising.In this paper, no restrictions are imposed on the form of the unknown heteroscedasticity.
Given the expansion (3), one can see obviously that Assumption 6 needs to be modified accordingly to ensure the asymptotic variance of the properly recentered λn exists and is positive.Now let exists and is positive; (ii) exists and is positive.
Corollary 1.Under Assumptions 1-4 and 6 ′ , the OLS estimator λn of λ in the model ( 14) has the following asymptotic distribution, where v is defined as in Assumption 6 ′ (i).
Corollary 2. Under Assumptions 1-4 and 6 ′ , the OLS estimator λn of λ in the model ( 14) has the following asymptotic distribution: where η is defined as in Assumption 6 ′ (ii).
Let the sample binding function be Accordingly, Assumption 7 is modified as follows.
Assumption 7 ′ .For all λ in Λ, the binding function ( 18) is monotonic in λ with probability 1 Corollary 3.For the model ( 14), under Assumptions 1-4, 6 ′ , and 7 ′ , the II estimator λII n of λ that is based on the binding function ( 18) has the following asymptotic distribution: where η is defined as in Assumption 6 ′ (ii).
Remark 10.If Σ n = σ 2 0 I n (namely, under homoscedasticity) and ones uses E(r 1n )/E(d 1n ) = tr(G n )/tr(G ′ n G n ) as the recentering term, which fortunately does not involve the unknown variance parameter σ 2 0 , then the binding function, its derivative, and the asymptotic distribution of the resulting II estimator can be modified accordingly as in Kyriacou et al. (2017).In contrast to the SAR model with X n , the recentered OLS estimator and the II estimator in the pure SAR model have convergence rate n/h n .
Remark 11.One can see that b ′ n (λ 0 ) does not converge almost surely to 1 when h n is divergent as in the case when X is present.This is because y when X is present, whereas both the numerators in the second terms on the right-hand sides of ( 19) and ( 11

Monte Carlo Evidence
In this section, Monte Carlo simulations are provided to demonstrate the performance of λII n (as well as βII n in ( 13)) in finite samples, in comparison with consistent estimators that are also robust under heteroscedasticity: the optimal robust GMM estimator of Lin and Lee (2010) and the modified QML (MQML) estimator of Liu and Yang (2015).For the optimal robust GMM estimator of Lin and Lee (2010), used as the optimal moment conditions, see Debarsy et al. (2015).They involve λ (appearing in G n (λ)) and β as well as the covariance matrix Σ n .For λ and β, an initial estimation is constructed from the simple 2SLS with W n X n and X n as IV's.One may assume a model for Σ n and then estimate the assumed model so as to construct the moment conditions.In this section, two choices are made regarding this: one is to use u the moment conditions and the other is to use u with the true Σ n (known in simulations) plugged in.The two resulting estimators are denoted by GMM and GMM(Σ n ), respectively, in Tables 1-4.One would expect that in practice the performance of the optimal robust GMM estimator with an estimated Σ n appearing in the moment conditions would stand between the two.
In each experiment, for each estimator, reported are the bias and root mean squared error (RMSE) from 1,000 Monte Carlo simulations.The empirical rejection probabilities of the relevant t tests for testing each parameter equal to its true value at 5% are also reported, denoted by P (5%), where the asymptotic variances for λ II n and βII n , as discussed in the last paragraph of Section 2, are estimated with the unknowns replaced by their estimates based on the II procedure. 7 For the purpose of comparison, the experimental design in Lin and Lee ( 2010) is followed closely.The spatial scenario under a group interaction weight matrix of Case (1991) is considered.
The exogenous variables include a constant term and two independently distributed random variables following N(3, 1) and U(−1, 2), respectively.The size of each group is determined by a U(3, 20) random variable.The error terms follow a zero-mean normal distribution with variances varying across groups.Two variance structures (V1 and V2) are considered.V1: for each group, if the group size is greater than 10, then the error variance is the same as the group size; otherwise, the variance is the inverse of the square of the group size.V2: for each group, the error variance is the inverse of the group size.Two sets of parameter configurations are used: θ 0 = (λ 0 , 0.8, 0.2, 1.5) ′ and θ 0 = (λ 0 , 0.2, 0.2, 0.1) ′ , named P1 and P2, respectively.Different degrees of spatial autocorrelation are considered: λ 0 = 0.2, 0.6, 0.9.Results are reported in Tables 1-4.
7 Neither Lin and Lee (2010) nor Liu and Yang (2015) reported how the inference procedures based on their estimators would perform in finite samples.

3.8%
Notes: The weight matrix is a social interaction matrix Reported for each estimator are the bias, RMSE, and the empirical size of the t test for each parameter equal to its true value at 5% from 1, 000 simulations.The exogenous regressors are (1, The group size m j follows IID U(3, 20).The error terms follow a zero-mean independent normal distribution.The variance structure is such that if m j > 10, the error variance in the j-th group is m Reported for each estimator are the bias, RMSE, and the empirical size of the t test for each parameter equal to its true value at 5% from 1, 000 simulations.The exogenous regressors are (1, Reported for each estimator are the bias, RMSE, and the empirical size of the t test for each parameter equal to its true value at 5% from 1, 000 simulations.The exogenous regressors are Reported for each estimator are the bias, RMSE, and the empirical size of the t test for each parameter equal to its true value at 5% from 1, 000 simulations.The exogenous regressors are The group size m j follows IID U(3, 20).The error terms follow a zero-mean independent normal distribution.The variance structure is such that the error variance in the j-th group is 1/m Some interesting observations arise.Firstly, all the consistent estimators deliver almost unbiased results across all the experimental configurations, though the estimated intercept term associated with X n is relatively more biased.Secondly, among the consistent estimators, the optimal robust GMM using the true Σ n usually achieves the smallest RMSE.The other three estimators have very similar performance in terms of RMSE.Thirdly, for the purpose of hypothesis testing, it appears that the II-based procedure is as good as the one based on (the infeasible) GMM(Σ n ), with the empirical rejection rates matching very closely the nominal size.
The MQML of Liu and Yang (2015) tends to deliver under-sized t-test regarding the spatial autoregression parameter λ when its value is relatively high.For example, in Table 4, one sees the rejection rates of 0.4% and 0.1%, under R = 100 and R = 200, respectively, for testing λ equal to its true value when λ 0 = 0.9 from a 5% t-test based on Liu and Yang (2015).This under-size problem, when the degree of spatial correlation is high, also carries over to the t test associated with the intercept parameter.The GMM estimator, when one is unsure of the error variance structure and uses u ′ n (G n − Dg(G n )u n and u ′ n (G n X n β, X n ) as the moment conditions, delivers very disappointing size performance in Table 4 when testing either the spatial autoregression parameter λ or the intercept parameter: the rejection rates approach around 20% at the 5% nominal size.
Given the simulated data, it is worthwhile to look at a plot of the binding function to check whether the binding function is monotonic, as required by Assumption 7. Figure 1 is drawn for 1,000 simulated data sets under variance structure V1 and parameter configuration P1 with R = 100.8Recall that for a given θ 0 = (λ 0 , β ′ 0 ) ′ and the exogenous X n , the data generating process generates the observable data y n and the binding function λ + One may wonder about the performance of the proposed II estimator under homoscedasticity, relative to the QML estimator of Lee (2004) and the best GMM estimator of Lee (2007).9Tables 5-6 report the Monte Carlo results under parameter configurations P1 and P2, but now the error term is simulated as a standard normal random variable.The exogenous variables are simulated the same as before.From Tables 5-6, one observes that the II estimator is slightly better than the best GMM estimator of Lee (2007), usually delivering smaller finite-sample bias and lower RMSE.Both methods have good finite-sample size performance in terms of the 5% t test.The finite-sample performance of the QML, on the other hand, is quite different from what the asymptotic theory predicts.Its bias is more severe than the other two and it also gives higher RMSE.Moreover, its size performance is very poor.
The simulation results suggest that the II estimator could be used at least as a complement to other consistent estimators proposed in the literature that are robust to unknown heteroscedasticity.The II method may be favored for the purpose of hypothesis testing, especially for testing the spatial autoregression parameter.It also has very good finite-sample performance under homoscedasticity.
5 Concluding Remarks Lee (2002) challenged the traditional wisdom that the OLS estimator is biased and inconsistent in spatial autoregressions and showed that it may be consistent under some special circumstances if there are exogenous regressors included.This paper examines thoroughly the asymptotic behavior of the OLS estimator under different specifications of the degree of aggregate influence on each unit from other units and provides a unified asymptotic distribution result of the recentered OLS estimator.Based on this, an indirect inference estimator, which is consistent and asymptotically normal, is introduced.The new estimator is relatively easy to calculate, does not rely on distributional assumptions on the data, and is robust to heteroscedasticity.Monte Carlo experiments in this paper show the good finite-sample performance of the II estimator in comparison with other consistent estimators that are robust to unknown heteroscedasticity.
tr and ⊙ are matrix trace and Hadamard product operators, respectively.

TABLE 1 :
Estimation of SAR with variance structure V1 and parameter configuration P1 y ′ n S ′ n (λ)M n D n (λ)M n S n (λ)y n /y ′ n W ′ n M n W n y n is a function of λ.Figure 1 clearly illustrates that the binding function is a monotonic function of λ and thus the monotonicity condition in Assumption 7 is numerically valid.Figures drawn for the simulated data under other configurations display similar patterns and are omitted.

FIGURE 1 :
FIGURE 1: b n (λ) under variance structure V1 and parameter configuration P1, R = 100 where the rate sequence {h n } is uniformly bounded 1 The Asymptotic Behavior of the OLS estimator From Lemma 6 in the appendix, r 1n and d 1n are both O P (n/h n ), r 2n and d 2n are both O P ( as in Theorem 2.) Remark 3. When h n diverges, one may single out dominating terms in Var(r n ) and E(d n ) so that a finer expression of v = lim n→∞ [Var(r n )/n]/[E(d n )/n] 2 can be presented.For example, under divergent h n , one can replace Var(r n ) with Var(r 2n When h n is bounded, however, such replacements are not available and one needs to keep all the term in Var(r n ) and E(d n ).
Lee (2002)In view of Remark 3, under divergent h n , if further Σ n = σ 2 0 I n , namely, under homoscedasticity with σ 2 0 = E(u 2 i,n ), then v corresponds to the top left element of σ 2 0 Γ −1 .This, together with Remark 2, is in line with the observation inLee (2002)that the (consistent, uncentered) OLS estimator (when lim n→∞ √ n/h n = 0) has the same limiting distribution of the optimal IV estimator and under normality it has the same limiting distribution of the ML estimator.It also implies that under other cases of divergent h n (slower than or equal to rate √ n), as long as the OLS estimator is properly recentered, it achieves the same limiting distribution.
to estimate η/b 2 0 , V , and γ in Theorems 3 and 4. As η/b 2 0 , V , and γ involve the unknown G n , S n , β 0 , and Σ n , one may replace G n by Ĝn = G n ( λII n ), D n by Dn = Dg(M n Ĝn ), E n by Ên = M n Ĝn − Dn , S n by Ŝn = S n ( λII n ), β 0 by βII n , and Σ n by Dg(û 2 1,n as well as terms that signal the additional uncertainty introduced by λII n in the definition of βII n as in (13).In practice, in order to make asymptotically valid inference from the II estimation strategy, one needs and x > 10, the error variance in the j-th group is m j

TABLE 3 :
Estimation of SAR with variance structure V2 and parameter configuration P1 n

TABLE 4 :
Estimation of SAR with variance structure V2 and parameter configuration P2 Notes:The weight matrix is a social interaction matrix W n