Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method

Cai, Song; Rao, J.N.K.

doi:10.3390/stats5010009

Open AccessArticle

Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method

by

Song Cai

^*

and

J.N.K. Rao

School of Mathematics and Statistics, Carleton University, Ottawa, ON K1S 5B6, Canada

^*

Author to whom correspondence should be addressed.

Stats 2022, 5(1), 128-138; https://doi.org/10.3390/stats5010009

Submission received: 9 January 2022 / Revised: 31 January 2022 / Accepted: 3 February 2022 / Published: 5 February 2022

(This article belongs to the Special Issue Small Area Estimation: Theories, Methods and Applications)

Download Versions Notes

Abstract

:

Model-based estimation of small area means can lead to reliable estimates when the area sample sizes are small. This is accomplished by borrowing strength across related areas using models linking area means to related covariates and random area effects. The effective selection of variables to be included in the linking model is important in small area estimation. The main purpose of this paper is to extend the earlier work on variable selection for area level and two-fold subarea level models to three-fold sub-subarea models linking sub-subarea means to related covariates and random effects at the area, sub-area, and sub-subarea levels. The proposed variable selection method transforms the sub-subarea means to reduce the linking model to a standard regression model and applies commonly used criteria for variable selection, such as AIC and BIC, to the reduced model. The resulting criteria depend on the unknown sub-subarea means, which are then estimated using the sample sub-subarea means. Then, the estimated selection criteria are used for variable selection. Simulation results on the performance of the proposed variable selection method relative to methods based on area level and two-fold subarea level models are also presented.

Keywords:

Fay–Herriot model; information criterion; transformation; two-fold subarea model; variable selection

1. Introduction

Sample surveys are designed to provide reliable estimates of the overall means of a finite population and means for large domains or sub-populations (areas). For areas with small sample sizes (called small areas), direct area-specific estimators from the survey data are unreliable, and it is necessary to use model-based methods based on models linking area means to related covariates and random area effects. Resulting model-based estimators can lead to a significant increase in precision relative to direct estimators. Rao and Molina [1], in Chapter 6, provide a detailed account of model-based estimation under area level models. The effective selection of auxiliary variables to be included in the linking model is important for the success of model-based small area estimation (SAE).

A basic area level model, due to Fay and Herriot [2], is widely used for SAE in practice. Suppose that we have m areas with direct estimators

y_{i}

of the area means

θ_{i}

(

i = 1, \dots, m

) and associated candidate covariate vectors

x_{i}

. The area level model consists of two components: a sampling model given by

\begin{matrix} y_{i} & = θ_{i} + e_{i}, i = 1, \dots, m \end{matrix}

(1)

and a linking model given by

\begin{matrix} θ_{i} & = x_{i}^{T} β + u_{i}, i = 1, \dots, m, \end{matrix}

(2)

where

e_{i}

denotes sampling errors assumed to be independent

N (0, Ψ_{i})

with known sampling variance

Ψ_{i}

, and

u_{i}

denotes random area effects independent of

e_{i}

that are assumed to be independent and identically distributed (iid) as

N (0, σ_{u}^{2})

with unknown variance

σ_{u}^{2}

. In practice, the sampling variances are obtained by smoothing the estimators of sampling variances using the Generalized Variance Function (GVF) method [3] and treating the smoothed estimators as the sampling variances

Ψ_{i}

. It is clear from (2) that it has a standard linear regression model form, and standard variable selection methods, such as Akaike Information Criterion (AIC) or Bayesian Information Criterions (BIC), can be applied to select variables, provided the area means

θ_{i}

are known. Lahiri and Suntornchost [4] estimated the resulting selection criteria using the sampling model (1) and proposed to use them for variable selection (see Section 2.1 for details). We refer the reader to Rao and Molina [1], Chapter 6, for details of empirical best linear unbiased prediction (EBLUP) estimators of area means from the models (1) and (2) for specified covariate vectors

x_{i}

. The EBLUP estimator of

θ_{i}

is a weighted average of the direct estimator

y_{i}

and a synthetic regression estimator

x_{i}^{T} \hat{β}

, where

\hat{β}

denotes an estimator of the regression parameter vector

β

. For a non-sampled area, direct estimator is not available. Hence, the synthetic estimator

x_{i}^{T} \hat{β}

is used to estimate small area mean, provided the associated

x_{i}

is known. Fay and Herriot [2] obtained EBLUP estimates of per-capita income for small places in the USA, using the basic area level model given by (1) and (2).

Estimation of means for subareas nested within areas is of considerable interest. Mohadjer et al. [5] studied adult literacy for counties (subareas) sampled from states (areas), using data from the 2003 U.S. National Assessment of Adult Literacy. A two-fold subarea model is used to estimate subarea means

θ_{i j}

from

n_{i}

subareas j sampled from m areas i. A two-fold linking model on the subarea means

θ_{i j}

is given by

\begin{matrix} θ_{i j} & = x_{i j}^{T} β + v_{i} + u_{i j}, j = 1, \dots, n_{i}; i = 1, \dots, m, \end{matrix}

(3)

where

x_{i j}

is the vector of covariates associated with

θ_{i j}

, and

v_{i}

is random area effect independent of random subarea effect

u_{i j}

. Furthermore,

v_{i} \overset{i i d}{\sim} N (0, σ_{v}^{2})

and

u_{i j} \overset{i i d}{\sim} N (0, σ_{u}^{2})

. The linking model (3) is combined with the sampling model for the direct estimators

y_{i j}

, and it is given by

\begin{matrix} y_{i j} & = θ_{i j} + e_{i j}, j = 1, \dots, n_{i}; i = 1, \dots, m, \end{matrix}

(4)

where

e_{i j}

are sampling errors independently distributed as

N (0, Ψ_{i j})

with known sampling variances

Ψ_{i j}

, and assumed to be independent of

v_{i}

and

u_{i j}

. Torabi and Rao [6] obtained EBLUP estimators of subarea means for sampled subareas as well as non-sampled subareas. An advantage of the two-fold model is that the EBLUP estimator of a non-sampled subarea involves both the synthetic estimator of

θ_{i j}

and the direct estimators for the sampled subareas within the same area. For a non-sampled subarea within a non-sampled area, a synthetic estimator is used under the two-fold model. For variable selection under the two-fold model, Cai et al. [7] transformed the linking model to a standard regression model and applied variable selection criteria to the reduced model; see Section 2.2 for details.

Three-fold linking models involving sub-subareas (level 3) nested within subareas (level 2) which in turn are nested within areas (level 1) are also of practical interest. For example, such models were used in the Program for the International Assessment of Adult Competencies (PIAAC) in the context of estimating means for sub-subareas (counties) nested within subareas (states), which in turn are nested within areas (census divisions). Details of this application are reported in Krenzke et al. [8] and Ren et al. [9]. A three-fold linking model on the sub-subarea means

θ_{i j k}

is given by

\begin{matrix} θ_{i j k} = x_{i j k}^{T} β + w_{i} + v_{i j} + u_{i j k}, k = 1, \dots, n_{i j}; j = 1, \dots, m_{i}; i = 1, \dots, L, \end{matrix}

(5)

where k denotes sub-subarea nested within subarea j nested within area i,

x_{i j k}

is the vector of covariates associated with

θ_{i j k}

,

w_{i}

is the random area effect,

v_{i j}

is the random subarea effect, and

u_{i j k}

is the random sub-sub area effect. We assume that all the L areas in the population are included in the sample, but not all the subareas within an area are covered by the sample. Furthermore, not all the sub-subareas within a subarea covered by the sample are included in the sample. We assume that the three random effects in the model (5) are independent,

w_{i} \overset{i i d}{\sim} N (0, σ_{w}^{2})

,

v_{i j} \overset{i i d}{\sim} N (0, σ_{v}^{2})

and

u_{i j k} \overset{i i d}{\sim} N (0, σ_{u}^{2})

. The linking model (5) is combined with the sampling model for the direct estimators

y_{i j k}

of the means

θ_{i j k}

for the sub-subareas in the sample. It is given by

\begin{matrix} y_{i j k} = θ_{i j k} + e_{i j k}, k = 1, \dots, n_{i j}; j = 1, \dots, m_{i}; i = 1, \dots, L, \end{matrix}

(6)

where the

e_{i j k}

are sampling errors assumed to be independently distributed as

N (0, Ψ_{i j k})

with known sampling variances

Ψ_{i j k}

, and they are assumed to be independent of the random effects

w_{i}

,

v_{i j}

, and

u_{i j k}

. In practice, the sampling variances are ascertained through smoothing of the estimated sampling variances, as done in the PIAAC project.

The survey design may not have the same hierarchical structure as the linking model (5). For example, in the PIAAC project, data from a stratified multistage sample with a different hierarchical structure are used. Given the vector of covariates

x_{i j k}

after variable selection, EBLUP estimators of the sub-subarea means can be obtained. It should be noted that the EBLUP estimators for non-sampled sub-subareas within a sampled subarea as well as those within non-sampled subareas avoid pure synthetic estimation by virtue of the area effects

w_{i}

included in the linking model (5), noting that all the areas in the population are included in the sample. In the PIAAC study, a hierarchical Bayes (HB) approach was used to estimate the population sub-subarea means. We will report EBLUP estimation for the three-fold model, which is given by (5) and (6), in a separate paper.

The main purpose of this paper is to extend the transformation method of Cai et al. [7] for variable selection to three-fold models given by (5) and (6). We propose two transformation-based methods—one is parameter free and the other is parameter-dependent—for variable selection. Section 2 is a review of some relevant variable selection methods for the area level model and the two-fold subarea model. Variable selection methods for the three-fold model are presented in Section 3. Results of a simulation study on the performance of the proposed methods relative to some naive alternatives, based on one-fold and two-fold models, are presented in Section 4. Some concluding remarks are presented in Section 5.

2. Area Level and Subarea Level Linking Models: Methods for Variable Selection

We now provide a brief review of earlier work on variable selection for area level and subarea level linking models related to the method for sub-subarea linking models presented in Section 3.

2.1. Area Level Model

The area level linking model (2) has the standard linear regression model form with unknown

θ_{i}

as the dependent variable. Lahiri and Suntornchost [4] noted that standard variable selection criteria applied to (2), such as AIC, BIC, and Mallow’s

C_{p}

, are continuous functions of the unknown error mean sum of squares

{MSE}_{θ} = {(m - p)}^{- 1} θ^{T} (I_{m} - P_{X}) θ

, where

θ = {(θ_{1} \dots θ_{m})}^{T}

,

P_{X} = X {(X^{T} X)}^{- 1} X^{T}

is the projection matrix with

X = {(x_{1} \dots x_{m})}^{T}

,

I_{m}

is the identity matrix of order m, and p is the dimension of

β

. Then, the unknown

{MSE}_{θ}

is replaced by a consistent estimator obtained as

\begin{matrix} {\hat{MSE}}_{θ} = {MSE}_{y} - \bar{Ψ}_{w}, \end{matrix}

(7)

where

{MSE}_{y}

is obtained by substituting

y = {(y_{1} \dots y_{m})}^{T}

for

θ

in the above expression for

{MSE}_{θ}

and

\bar{Ψ}_{w} = {(m - p)}^{- 1} \sum_{i = 1}^{m} (1 - h_{i i}) Ψ_{i}

is a weighted mean of the sampling variance

Ψ_{i}

with

h_{i i} = x_{i}^{T} {(X^{T} X)}^{- 1} x_{i}

. The estimator (7) can take negative values, and modifications to (7) leading to positive values were proposed by Lahiri and Suntornchost [4].

As noted earlier, standard variable selection criteria applied to the linking model (2) are simple functions of

{MSE}_{θ}

and can be estimated by simply substituting

{MSE}_{y}

for

{MSE}_{θ}

. For example, BIC applied to linking model (2) can be estimated as

\begin{matrix} \hat{BIC} & = m log \{(m - p) {\hat{MSE}}_{θ} / m\} + p log m . \end{matrix}

Some other variable selection criteria applicable to the area level model include the conditional AIC (cAIC) proposed by Han [10] and mixed generalized AIC (xGAIC) proposed by Lombardía et al. [11].

2.2. Subarea Level Model

Cai et al. [7] extended the method of Lahiri and Suntornchost [4] to the subarea model given by (3) and (4). In this case, the linking model (3) does not have a standard linear regression model form, because the error terms

v_{i} + u_{i j}

within areas are correlated. As a result, we need to first transform the linking model (3) to a standard linear regression model form with iid errors. Specifically, we rewrite the subarea model in matrix form as

y_{i} = θ_{i} + e_{i}

and

θ_{i} = X_{i} β + τ_{i}

,

i = 1, \dots, m

, where

y_{i} = {(y_{i 1} \dots y_{i n_{i}})}^{T}

,

X_{i} = {(x_{i 1} \dots x_{i n_{i}})}^{T}

,

θ_{i} = {(θ_{i 1} \dots θ_{i n_{i}})}^{T}

,

e_{i} = {(e_{i 1} \dots e_{i n_{i}})}^{T}

, and

τ_{i} = v_{i} 𝟙_{n_{i}} + u_{i}

with

𝟙_{n_{i}}

being a vector of 1s of length

n_{i}

and

u_{i} = {(u_{i 1} \dots u_{i n_{i}})}^{T}

. Cai et al. [7] proposed to find a matrix

A_{i}

for each

i = 1, \dots, m

such that the covariance matrix of

τ_{i}^{*} : = A_{i} τ_{i}

is a diagonal matrix with equal diagonal elements across

i = 1, \dots, m

. Then, a transformed two-fold model is obtained:

\begin{matrix} y_{i}^{*} & = θ_{i}^{*} + e_{i}^{*} and θ_{i}^{*} = X_{i}^{*} β + τ_{i}^{*}, i = 1, \dots, m, \end{matrix}

(8)

where

y_{i}^{*} = A_{i} y_{i}

,

e_{i}^{*} = A_{i} e_{i}

,

θ_{i}^{*} = A_{i} θ_{i}

, and

X_{i}^{*} = A_{i} X_{i}

. Noting that (8) has the standard regression model form on the transformed variables

y_{i}^{*}

and

θ_{i}^{*}

, we can apply the method used for the area level model to obtain variable selection criteria.

Cai et al. [7] gave two methods for finding the transformation matrix

A_{i}

, one being parameter-free and the other relying on estimated model-parameter values. The parameter-free transformation follows the parameter-free transformation method proposed by Li and Lahiri [12] for selecting auxiliary variables under the unit-level nested error regression (NER) model. Observing that the covariance matrix of

τ_{i}^{*}

is given by

\begin{matrix} Cov (τ_{i}^{*}) = A_{i} Σ_{i} A_{i}^{T} = σ_{v}^{2} (A_{i} 𝟙_{n_{i}}) {(A_{i} 𝟙_{n_{i}})}^{T} + σ_{u}^{2} A_{i} A_{i}^{T}, \end{matrix}

one can choose an matrix

A_{i}

such that (a)

A_{i} 𝟙_{n_{i}} = 0

, and (b)

A_{i} A_{i}^{T}

is a diagonal matrix whose diagonal elements are equal for all

i = 1, \dots, m

. Cai et al. [7] proposed a numerical procedure to find the

A_{i}

matrix satisfying the above conditions. As a result of the linear constraint (a), the rank of

A_{i}

is

n_{i} - 1

at most, and as a result, the transformed two-fold model loses one data point for each sampled area. The parameter-dependent method used by Cai et al. [7] is the well-known Fuller–Battese transformation [13]. In practice, the parameter-free transformation is more likely to be used because of its simplicity and not requiring the estimates of variance parameters.

3. Sub-Subarea Linking Models: Variable Selection

In this section, we present the proposed method for variable selection under the sub-subarea linking model. We extend the method of Cai et al. [7] for the two-fold model to the three-fold case.

We first express the sub-subarea linking and sampling models given by (5) and (6) as

\begin{matrix} θ_{i} = X_{i} β + η_{i}, i = 1, \dots, L, \end{matrix}

(9)

and

\begin{matrix} y_{i} = θ_{i} + e_{i}, i = 1, \dots, L, \end{matrix}

(10)

respectively, where

y_{i} = {(y_{i j 1} y_{i j 2} \dots y_{i m_{i} n_{i j}})}^{T}

,

X_{i} = {(x_{i j 1} x_{i j 2} \dots x_{i m_{i} n_{i j}})}^{T}

,

θ_{i} = {(θ_{i j 1} θ_{i j 2} \dots θ_{i m_{i} n_{i j}})}^{T}

,

e_{i} = {(e_{i j 1} e_{i j 2} \dots e_{i m_{i} n_{i j}})}^{T}

and

\begin{matrix} η_{i} = w_{i} 𝟙_{n_{i}} + Ω_{i} v_{i} + u_{i} \end{matrix}

with

n_{i} = \sum_{j = 1}^{m_{i}} n_{i j}

,

Ω_{i} = diag (𝟙_{n_{i 1}}, 𝟙_{n_{i 2}}, \dots, 𝟙_{n_{i m_{i}}})

,

v_{i} = {(v_{i 1} \dots v_{i m_{i}})}^{T}

and

u_{i} = {(u_{i 11} u_{i 12} \dots u_{i m_{i} n_{i m_{i}}})}^{T}

. Note that

η_{i} \sim N (0, Σ_{i})

, where

\begin{matrix} Σ_{i} & = Cov (η_{i}) = σ_{w}^{2} 𝟙_{n_{i}} 𝟙_{n_{i}}^{T} + σ_{v}^{2} Ω_{i} Ω_{i}^{T} + σ_{u}^{2} I_{n_{i}} . \end{matrix}

(11)

As in the case of subarea linking model (3), the covariance matrix

Σ_{i}

of

η_{i}

in the linking model (9) does not have a diagonal structure. Following the idea of Cai et al. [7], we first transform the three-fold linking model (9) using a linear transformation so that the covariance matrix of the transformed

η_{i}

has a diagonal structure. For each area i, we obtain a matrix

T_{i}

that makes the transformed vector

η_{i}^{*} : = T_{i} η_{i}

have a diagonal covariance matrix with diagonal elements being a positive constant c for all

i = 1, \dots, L

. Using the

T_{i}

, we transform the three-fold sampling model (10) and linking model (9) into

\begin{matrix} y_{i}^{*} & = θ_{i}^{*} + e_{i}^{*}, \end{matrix}

(12)

\begin{matrix} θ_{i}^{*} & = X_{i}^{*} β + η_{i}^{*}, \end{matrix}

(13)

where

y_{i}^{*} = T_{i} y_{i}

,

θ_{i}^{*} = T_{i} θ_{i}

,

e_{i}^{*} = T_{i} e_{i}

and

X_{i}^{*} = T_{i} X_{i}

. The transformed linking model (13) is a standard linear regression model with unknown dependent variable

θ_{i}^{*}

, and it shares the same

β

parameter with the original linking model (9). Then, we can use a bias-correction method similar to that of Lahiri and Suntornchost [4] to estimate an information criterion for (13) so as to select auxiliary variables. The details of the proposed transformation and bias-correction methods are given in the following subsections.

3.1. Transformation Methods

3.1.1. Parameter-Free Transformation

It is desirable that the transformation matrices

T_{i}

,

i = 1, \dots, L

do not rely on unknown parameter values. To find parameter-free

T_{i}

, we follow the idea used by Cai et al. [7] for the two-fold subarea model. By (11),

\begin{matrix} Cov (η_{i}^{*}) = T_{i} Cov (η_{i}) T_{i}^{T} = σ_{w}^{2} (T_{i} 𝟙_{n_{i}}) {(T_{i} 𝟙_{n_{i}})}^{T} + σ_{v}^{2} (T_{i} Ω_{i}) {(T_{i} Ω_{i})}^{T} + σ_{u}^{2} T_{i} T_{i}^{T} . \end{matrix}

If

T_{i} 𝟙_{n_{i}} = 0

,

T_{i} Ω_{i} = 0

and

T_{i} T_{i}^{T}

is a diagonal matrix with equal diagonal elements, then

Cov (η_{i}^{*})

will have the desired diagonal structure. Furthermore,

T_{i} Ω_{i} = 0

implies

T_{i} 𝟙_{n_{i}} = 0

because

Ω_{i} 𝟙_{m_{i}} = 𝟙_{n_{i}}

. Therefore, it suffices to find

T_{i}

such that (i)

T_{i} Ω_{i} = 0

, and (ii)

T_{i} T_{i}^{T}

is a diagonal matrix with equal diagonal elements across

i = 1, \dots, L

. Since the above two conditions do not involve any parameter, a matrix

T_{i}

that satisfies them will be parameter free.

Recall that

Ω_{i} = diag (𝟙_{n_{i 1}}, 𝟙_{n_{i 2}}, \dots, 𝟙_{n_{i m_{i}}})

, which is a matrix with full column rank

m_{i}

. Therefore, imposing

T_{i} Ω_{i} = 0

on

T_{i}

introduces

m_{i}

independent linear constraints on

T_{i}

, with one constraint for each sub-area j,

j = 1, \dots, m_{i}

. To be specific, the constraint for subarea j is

T_{i} {\underset{˜}{ω}}_{j} = 0

, where

{\underset{˜}{ω}}_{j}

is the jth,

j = 1, \dots, m_{i}

, column of

Ω_{i}

. Consequently, the rank of

T_{i}

is at most

n_{i} - m_{i}

, and hence, each area i will lose

m_{i}

data points (or equivalently, each subarea will lose one data point) in the transformation. This is different from the parameter-free transformation for the two-fold subarea model discussed in Section 2.2, where each area loses a single data point in the transformation.

In the following, we provide a numerical algorithm to find

T_{i}

for area i,

i = 1, \dots, L

, that satisfies the above requirements (i) and (ii).

Step 1:: For each subarea j, $j = 1, \dots, m_{i}$ , of area i, fix a set of $n_{i j} - 1$ linearly independent vectors of length $n_{i j}$ , denoted $b_{i 1}, b_{i 2}, \dots, b_{i (n_{i j} - 1)}$ , that satisfies $b_{i k}^{T} 𝟙_{n_{i j}} = 0$ for $k = 1, \dots, n_{i j} - 1$ . For a given k, a valid choice for $b_{i k}$ is the vector of length $n_{i j}$ whose kth element is 1, last element is $- 1$ , and all other elements are 0. For example, if $n_{i j} = 5$ , then we can take $b_{i k}$ , $k = 1, 2, 3, 4$ , as $b_{i 1} = {(1 0 0 0 - 1)}^{T}$ , $b_{i 2} = {(0 1 0 0 - 1)}^{T}$ , $b_{i 3} = {(0 0 1 0 - 1)}^{T}$ and $b_{i 4} = {(0 0 0 1 - 1)}^{T}$ . Another possibility is to take $b_{i k}$ to be the vector whose kth element is 1 and all the other elements equal $\frac{- 1}{n_{i j} - 1}$ if $n_{i j} > 1$ .
Step 2:: Apply the Gram–Schmidt process to $b_{i 1}, \dots, b_{i (n_{i j} - 1)}$ to acquire a set of orthogonal vectors $a_{i 1}, a_{i 2}, \dots, a_{i (n_{i j} - 1)}$ with $a_{i 1} = b_{i 1}$ and $a_{i k} = b_{i k} - \sum_{l = 1}^{k - 1} {Proj}_{a_{i l}} (b_{i k})$ for $k = 2, \dots, n_{i j} - 1$ , where ${Proj}_{y} (x) : = \frac{x^{T} y}{y^{T} y} y$ is the projection of vector x onto the line spanned by vector y. Construct a $(n_{i j} - 1)$ by $n_{i j}$ matrix, denoted $T_{i j}$ , from $a_{i 1}, \dots, a_{i (n_{i j} - 1)}$ as $T_{i j} = {(\frac{a_{i 1}}{∥ a_{i 1} ∥} \dots \frac{a_{i (n_{i j} - 1)}}{∥ a_{i (n_{i j} - 1)} ∥})}^{T}$ , where $∥ \cdot ∥$ is the Euclidean norm.
Step 3:: Repeat Step 1 and Step 2 for all subareas $j = 1, \dots, m_{i}$ of area i to obtain matrices $T_{i 1}, \dots, T_{i m_{i}}$ . Take $T_{i} = diag (T_{i 1}, \dots, T_{i m_{i}})$ .

The

T_{i}

constructed using the above steps is parameter free. Step 1 generates a set of linear independent vectors

b_{i k}

,

k = 1, \dots, m_{i}

, satisfying

b_{i k}^{T} 𝟙_{n_{i j}} = 0

. The Gram–Schmidt process in Step 2 produces a set of orthonormal vectors

a_{i k}

,

k = 1, \dots, m_{i}

, based on

b_{i k}

, while carrying over the property

a_{i k}^{T} 𝟙_{n_{i j}} = 0

. Thus, the matrix

T_{i j}

constructed in Step 2 satisfies

T_{i j} 𝟙_{n_{i j}} = 0

and

T_{i j} T_{i j}^{T} = I_{n_{i j} - 1}

, which in turn guarantee that the matrix

T_{i}

defined in Step 3 satisfies the requirements (i) and (ii) with

T_{i} T_{i}^{T} = I_{n_{i} - m_{i}}

. Under this transformation, we get

Cov (η_{i}^{*}) = σ_{u}^{2} I_{n_{i} - m_{i}}

.

A parameter-free transformation matrix

T_{i}

that satisfies the constraints (i) and (ii) is not unique. However, we found that different choices of

T_{i}

yield similar results.

3.1.2. Parameter-Dependent Transformation

It is straightforward to obtain a transformation matrix

T_{i}

that depends on the model parameter values. Since

Cov (τ_{i}^{*}) = T_{i} Σ_{i} T_{i}^{T}

, where

Σ_{i}

is given by (11), we can take

T_{i} = c Σ_{i}^{- 1 / 2}

, where

Σ_{i}^{- 1 / 2}

is the positive definite square-root matrix of

Σ_{i}^{- 1}

and c is a non-zero constant. Then,

Cov (τ_{i}^{*})

is a diagonal matrix whose diagonal elements are all equal to c. Note that

Σ_{i}

is determined by the variance parameters

σ_{w}^{2}

,

σ_{v}^{2}

and

σ_{u}^{2}

, so this transformation matrix is parameter-dependent. Under the two-fold subarea model, applying this idea and choosing

c = σ_{u}

yields the Fuller–Battese Transformation [7].

Under the three-fold sub-subarea model, we found that

\begin{matrix} Σ_{i}^{- 1} = & σ_{u}^{- 2} (I_{n_{i}} - Λ_{i} - ξ_{i} 𝟙_{n_{i}} 𝟙_{n_{i}}^{T} + ξ_{i} 𝟙_{n_{i}} 𝟙_{n_{i}}^{T} Λ_{i} + ξ_{i} Λ_{i} 𝟙_{n_{i}} 𝟙_{n_{i}}^{T} - ξ_{i} Λ_{i} 𝟙_{n_{i}} 𝟙_{n_{i}}^{T} Λ_{i}), \end{matrix}

where

Λ_{i} = diag (ρ_{i 1} 𝟙_{n_{i 1}} 𝟙_{n_{i 1}}^{T}, \dots, ρ_{i m_{i}} 𝟙_{n_{i m_{i}}} 𝟙_{n_{i m_{i}}}^{T})

,

ρ_{i j} = σ_{v}^{2} / (σ_{u}^{2} + n_{i j} σ_{v}^{2})

and

ξ_{i} = σ_{w}^{2} / \{σ_{u}^{2} + σ_{w}^{2} (n_{i} - \sum_{k = 1}^{m_{i}} ρ_{i k} n_{i k}^{2})\}

. The square-root matrix

Σ_{i}^{- 1 / 2}

has a complicated expression but can be found easily with a numerical procedure, for example, by applying the spectral decomposition or polar decomposition on

Σ_{i}^{- 1}

. Taking

T_{i} = σ_{u} Σ_{i}^{- 1 / 2}

, we get

Cov (τ_{i}^{*}) = σ_{u}^{2} I_{n_{i}}

.

In practice, we need to estimate the variance parameters

σ_{w}^{2}

,

σ_{v}^{2}

, and

σ_{u}^{2}

to construct the transformation matrices

T_{i}

, as in the case of the subarea model given by (8).

3.2. Estimating Variable Selection Criteria: Sub-Subarea Model

After transformation, the linking model (13) takes the matrix form of a regular regression model with unobserved response variable values

θ_{i}^{*}

. We now use a method similar to that of Cai et al. [7] to estimate information criteria, including AIC, BIC, and Mallows’

C_{p}

, for the transformed linking model (13). Then, these information criteria can be used for selecting auxiliary variables under the three-fold sub-subarea model.

The error mean sum of squares of the transformed linking model (13) is given by

\begin{matrix} {MSE}_{θ^{*}} & = \frac{1}{n^{*} - p} {θ^{*}}^{T} (I_{n^{*}} - P_{X^{*}}) θ^{*}, \end{matrix}

where

θ^{*} = {({θ_{1}^{*}}^{T} \dots {θ_{L}^{*}}^{T})}^{T}

,

P_{X^{*}} = X^{*} {({X^{*}}^{T} X^{*})}^{- 1} {X^{*}}^{T}

with

X^{*} = {({X_{1}^{*}}^{T} \dots {X_{L}^{*}}^{T})}^{T}

,

n^{*}

is the dimension of

θ^{*}

, and p is the dimension of

β

. Since

θ^{*}

are unobserved,

{MSE}_{θ^{*}}

cannot be calculated. Instead, we estimate

{MSE}_{θ^{*}}

based on the transformed sampling model (12). Let

y^{*} = {({y_{1}^{*}}^{T} \dots {y_{L}^{*}}^{T})}^{T}

and

e^{*} = {({e_{1}^{*}}^{T} \dots {e_{L}^{*}}^{T})}^{T}

. Put

\begin{matrix} {MSE}_{y^{*}} = \frac{1}{n^{*} - p} {y^{*}}^{T} (I_{n^{*}} - P_{X^{*}}) y^{*} . \end{matrix}

We propose to estimate

{MSE}_{θ^{*}}

by

\begin{matrix} {\hat{MSE}}_{θ^{*}} = {MSE}_{y^{*}} - \frac{1}{n^{*} - p} tr \{(I_{n^{*}} - P_{X^{*}}) V_{e^{*}}\}, \end{matrix}

(14)

where

V_{e^{*}}

is the covariance matrix of

e^{*}

, given by

V_{e^{*}} = Cov (e^{*}) = T V_{e} T^{T}

with

T = diag (T_{1}, \dots, T_{L})

and

V_{e} = diag (Ψ_{111}, Ψ_{112}, \dots, Ψ_{L m_{L} n_{L m_{L}}})

. It can be shown, using the same argument as used in the proof of Theorem 1 of Cai et al. [7], that if the sampling variances

Ψ_{i j k}

are bounded for all i, j and k, and

n_{i j} \geq 2

for all i and j, then

\begin{matrix} {\hat{MSE}}_{θ^{*}} = {MSE}_{θ^{*}} + o_{p} (1) \end{matrix}

(15)

as the number of areas

L \to \infty

.

The term

{(n^{*} - p)}^{- 1} tr \{(I_{n^{*}} - P_{X^{*}}) V_{e^{*}}\}

in (14) can be considered as a bias-correction term, and because of its presence,

{\hat{MSE}}_{θ^{*}}

may take a negative value. A simple truncation or a continuous transformation of

{\hat{MSE}}_{θ^{*}}

as suggested by Lahiri and Suntornchost [4] may be used to obtain a strictly positive estimate of

{MSE}_{θ^{*}}

.

Given the above estimator of

{MSE}_{θ^{*}}

, estimators of the AIC, BIC and Mallows’

C_{p}

for the transformed linking model (13) are readily constructed. The AIC, BIC and Mallows’

C_{p}

of a submodel of (13) with

p_{s}

covariates are given by

\begin{matrix} {AIC}^{(s)} & = n^{*} log \{(n^{*} - p_{s}) {MSE}_{θ^{*}}^{(s)} / n^{*}\} + 2 p_{s}, \end{matrix}

(16a)

\begin{matrix} {BIC}^{(s)} & = n^{*} log \{(n^{*} - p_{s}) {MSE}_{θ^{*}}^{(s)} / n^{*}\} + p_{s} log (n^{*}), \end{matrix}

(16b)

\begin{matrix} C_{p}^{(s)} & = (n^{*} - p_{s}) {MSE}_{θ^{*}}^{(s)} / {MSE}_{θ^{*}} + 2 p_{s} - n^{*}, \end{matrix}

(16c)

respectively, where

{MSE}_{θ^{*}}^{(s)}

is the MSE from the submodel. Their estimators, denoted

{\hat{AIC}}^{(s)}

,

{\hat{BIC}}^{(s)}

and

{\hat{C_{p}}}^{(s)}

, respectively, are obtained by substituting

{\hat{MSE}}_{θ^{*}}

into the corresponding expressions in (16a) to (16c). Then, variable selection is carried out by choosing one of the above criteria and estimating its values for a set of specified sub-models. The sub-model with the smallest estimated criterion value is chosen as the selected linking model. Noting that the criteria (16a)–(16c) are continuous functions of

{MSE}_{θ^{*}}

and the error of the estimator

{\hat{MSE}}_{θ^{*}}

is of

o_{p} (1)

, it follows from the continuous mapping theorem [14] (Theorem 2.3) that the error in the estimated variable selection criteria is also

o_{p} (1)

and hence negligible when the number of areas L is large.

4. Results of a Simulation Study

This section provides results of a limited simulation study on the performance of the proposed method for variable selection for sub-subarea linking models. The simulation data are generated from the three-fold sub-subarea model given by (5) and (6). The number of areas is set to

L = 10

and the number of subareas sampled from each area i,

i = 1, \dots, L

, is set to

m_{i} = 5

. The number of sampled sub-subareas is taken as

n_{i j} = 8

for every subarea j in areas

i = 1, \dots, 5

,

n_{i j} = 5

for every subarea in areas

i = 6, 7, 8

and

n_{i j} = 10

for each subarea in areas

i = 9, 10

. The sampling standard deviation

\sqrt{Ψ_{i j k}}

in the sampling model (6) is generated from

Unif (0.5, 1.5)

. The standard deviation of the sub-subarea random effect in the linking model (5) is set to

σ_{u} = 2

. A few settings for the standard deviations of the area-level and subarea-level random effects,

(σ_{w}, σ_{v})

, are used:

(2, 2)

,

(4, 3)

,

(6, 3)

,

(8, 4)

,

(6, 6)

,

(3, 6)

and

(4, 8)

. We consider a linking model that has an intercept term with corresponding covariate

x_{i j k, 1} = 1

and eight other covariates

x_{i j k, l}

(

l = 2, \dots, 9

) generated as follows:

log x_{i j k, 2} \sim N (0.3, 0.5)

with mean

0.3

and variance

0.5

,

x_{i j k, 3} \sim Gamma (1.5, 2)

with shape parameter

1.5

and rate parameter 2,

x_{i j k, 4} \sim N (0, 0.8)

,

x_{i j k, 5} \sim N (1, 1.5)

,

x_{i j k, 6} \sim Gamma (0.6, 10)

,

x_{i j k, 7} \sim Beta (0.5, 0.5)

with shape parameters

0.5

and

0.5

,

x_{i j k, 8} \sim Unif (1, 3)

on the interval

(0, 3)

, and

x_{i j, 9} \sim Poisson (1.5)

with mean parameter

1.5

. The value of the regression parameter vector

β

is set to

{(2, 3, 0, 4, 0, 8, 0, 1, 0)}^{T}

. It corresponds to a true model consisting of the intercept term of value 2 and covariates

x_{i j, 2}

,

x_{i j, 4}

,

x_{i j, 6}

and

x_{i j, 8}

. For variable selection, we always include the intercept term when we compare all possible sub-models defined by the inclusion/exclusion of the eight variables

x_{i j, 2}, \dots, x_{i j, 9}

.

We generated 5000 simulation runs, and the covariates are generated first and kept fixed throughout all simulation runs. Then, we generated the response vectors

y_{i}

,

j = 1, \dots, L

, from the sub-subarea model given by (9) and (10) for each simulation run, using the specified settings.

We report the performance of the proposed method with parameter-free transformation (

3 F_{pfree}

) and parameter-dependent transformation (

3 F_{pdep}

). For

{3F}_{pdep}

, the true parameter values are used here, for simplicity. Under estimated parameter values, the performance of

{3F}_{pdep}

is likely to be inferior. The parameter-free and parameter-dependent methods of Cai et al. [7] for the two-fold subarea model are used for comparison. To fit a two-fold subarea model to the data with a three-fold structure, the actual sub-subareas are treated as the subareas in the two-fold model. We can treat either (i) the actual subareas or (ii) the actual areas as the areas in the two-fold model. Treatment (i) is a natural choice when there is substantial subarea-level variability. Under treatment (i), where the actual subareas are treated as areas, the parameter-free transformation under the two-fold model is algebraically identical to the parameter-free transformation under the three-fold model. As a result, variable selection based on the parameter-free transformation under treatment (i) leads to the same set of variables as that under the three-fold model. However, it leads to pure synthetic estimates for non-sampled areas (actual subareas). Moreover, computationally, there is no advantage of treatment (i) over the three-fold model because the same transformation is used. On the other hand, the parameter-dependent method applied to treatment (i) may lead to a different set of variables. Therefore, we report the simulation results only for the parameter-dependent method under treatment (i), which is denoted as

{2F-S-SS}_{pdep}

. The two-fold parameter-free and parameter-dependent methods under treatment (ii) are denoted as

{2F-A-SS}_{pfree}

and

{2F-A-SS}_{pdep}

, respectively. Under treatment (ii), pure synthetic estimation is avoided because all areas are sampled. For comparison, we further consider three naive methods designed for the one-fold FH model and the regular linear regression model, including the Lahiri–Suntornchost [4] method (Naive-LS) and Han’s [10] cAIC method (Naive-cAIC) for the FH model, as well as an information criterion-based method for the regular linear regression model fitted naively to the data (Naive-LM). For Naive-LS and Naive-cAIC, the actual sub-subareas are treated as the areas in the FH model. For Naive-LM, the sub-subarea level direct estimator

y_{i j k}

is treated as the response variable of the regular linear regression model.

Table 1 summarizes the simulation results for variable selection using BIC.

The proposed

{3F}_{pfree}

and

{3F}_{pdep}

perform equally well with a stable rate between

87 %

and

89 %

in selecting the true model under all settings for

(σ_{w}, σ_{v})

. The two-fold method

{2F-S-SS}_{pdep}

, which treats the actual subareas as areas in the two-fold model, exhibits similar performance to that of the proposed methods. All the other methods have inferior performance and display a dramatic decay in rate of selecting the true model when

σ_{w}

and

σ_{u}

increase. This indicates that in the presence of strong area-level effect or subarea-level effect, which often happens in practice,

{3F}_{pfree}

,

{3F}_{pdep}

and

{2F-S-SS}_{pdep}

are preferred over the other alternative methods.

The simulation results based on AIC and Naive cAIC are given in Table 2.

Compared with BIC, AIC gives a significantly lower true-model selection rate under all the methods. As the case for BIC, methods

{3F}_{pfree}

,

{3F}_{pdep}

and

{2F-S-SS}_{pdep}

perform equally well and yield stable results for different

(σ_{w}, σ_{v})

values, and they have better performance than the other methods. Methods

{2F-A-SS}_{pfree}

and

{2F-A-SS}_{pdep}

have slightly better performance than

{3F}_{pfree}

,

{3F}_{pdep}

and

{2F-S-SS}_{pdep}

when

(σ_{w}, σ_{v}) = (2, 2)

,

(4, 3)

, and

(6, 3)

but notably inferior performance under the other settings for

(σ_{w}, σ_{v})

. Methods Naive-LS, Naive-LM and Naive-cAIC have significantly lower rates of selecting the true model than the other methods.

Table 3 reports simulation results under Mallows’

C_{p}

criterion for variable selection. The results in Table 3 are similar to those reported in Table 2 under AIC, and the same conclusions hold.

5. Concluding Remarks

A transformation-based method is proposed for selecting covariates under the three-fold sub-subarea model for small area estimation. Two transformations, one being parameter-free and the other being parameter-dependent, are proposed to accompany the variable selection method. Compared to the parameter-free transformation, the parameter-dependent transformation does not induce loss of data points but requires estimated variance parameters in practice. We prefer the parameter-free transformation for its simplicity and not requiring the estimates of variance parameters. The performance of parameter-free and parameter-dependent transformation methods is similar under various simulation settings for variances of the area-level and subarea-level random effects. EBLUP estimation of sub-subarea means for sampled sub-subareas, non-sampled sub-subareas within sampled subareas, and sub-subareas within non-sampled subareas will be studied in detail in a separate paper. Measures of uncertainty of the EBLUP estimators will also be studied.

Author Contributions

Formal analysis, S.C.; Methodology, S.C., J.N.K.R.; Writing-original draft, S.C.; Writing-review & editing, S.C., J.N.K.R.; Conceptualization, J.N.K.R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by research grants to Song Cai and J.N.K. Rao from the Natural Sciences and Engineering Research Council of Canada.

Acknowledgments

We thank the reviewers for their useful comments and constructive suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rao, J.N.K.; Molina, I. Small Area Estimation, 2nd ed.; Wiley: Hoboken, NJ, USA, 2015. [Google Scholar]
Fay, R.E.; Herriot, R.A. Estimates of income for small places: An application of james-stein procedures to census data. J. Am. Stat. Assoc. 1979, 74, 269–277. [Google Scholar] [CrossRef]
Wolter, K.M. Introduction to Variance Estimation, 2nd ed.; Springer: New York, NY, USA, 2007. [Google Scholar]
Lahiri, P.; Suntornchost, J. Variable selection for linear mixed models with applications in small area estimation. Sankhyā B 2015, 77, 312–320. [Google Scholar] [CrossRef]
Mohadjer, L.; Rao, J.N.K.; Liu, B.; Krenzke, T.; Van de Kerckhove, W. Hierarchical Bayes small area estimates of adult literacy using unmatched sampling and linking models. J. Indian Soc. Agric. 2012, 66, 55–63. [Google Scholar]
Torabi, M.; Rao, J.N.K. On small area estimation under a sub-area level model. J. Multivar. Anal. 2014, 127, 36–55. [Google Scholar] [CrossRef]
Cai, S.; Rao, J.N.K.; Dumitrescu, L.; Chatrchi, G. Effective transformation-based variable selection under two-fold subarea models in small area estimation. Stat. Transit. New Ser. 2020, 21, 68–83. [Google Scholar] [CrossRef]
Krenzke, T.; Mohadjer, L.; Li, J.; Erciulescu, A.; Fay, R.E.; Ren, W.; VanDeKerckhove, W.; Li, L.; Rao, J.N.K. Program for the International Assessment of Adult Competencies (PIAAC): State and County Estimation Methodology Report; Technical Report; Institute of Education Sciences, National Center for Education Statistics: Washington, DC, USA, 2020. [Google Scholar]
Ren, W.; Li, J.; Erciulescu, A.; Krenzke, T.; Mohadjer, L. A variable selection method for small area estimation modeling of the proficiency of adult competency. In Proceedings of the Survey Research Methods Section, Joint Statistical Meetings of the American Statistical Association, Alexandria, VA, USA, 2–6 August 2020; pp. 924–956. [Google Scholar]
Han, B. Conditional Akaike information criterion in the Fay-Herriot model. Stat. Methodol. 2013, 11, 53–67. [Google Scholar] [CrossRef]
Lombardía, M.J.; López-Vizcaíno, E.; Rueda, C. Mixed generalized akaike information criterion for small area models. J. R. Stat. Soc. Ser. A 2017, 180, 1229–1252. [Google Scholar] [CrossRef]
Li, Y.; Lahiri, P. A simple adaptation of variable selection software for regression models to select variables in nested error regression models. Sankhyā B 2019, 81, 302–317. [Google Scholar] [CrossRef]
Fuller, W.A.; Battese, G.E. Transformations for estimation of linear models with nested-error structure. J. Am. Stat. Assoc. 1973, 68, 626–632. [Google Scholar] [CrossRef]
van der Vaart, A.W. Asymptotic Statistics; Cambridge University Press: Cambridge, MA, USA, 1998. [Google Scholar]

Table 1. True model selection rate (%): BIC.

Method	$(σ_{w}, σ_{v})$
Method	$(2, 2)$	$(4, 3)$	$(6, 3)$	$(8, 4)$	$(6, 6)$	$(3, 6)$	$(4, 8)$
${3F}_{pfree}$	$87.12$	$87.62$	$87.50$	$88.18$	$87.26$	$87.32$	$87.02$
${3F}_{pdep}$	$87.94$	$88.20$	$88.46$	$88.52$	$88.00$	$87.96$	$87.60$
${2F-S-SS}_{pdep}$	$87.64$	$87.82$	$88.16$	$88.48$	$87.90$	$87.86$	$87.56$
${2F-A-SS}_{pfree}$	$83.28$	$63.14$	$62.62$	$36.70$	$8.38$	$9.22$	$2.24$
${2F-A-SS}_{pdep}$	$82.60$	$60.84$	$60.66$	$34.58$	$7.24$	$8.48$	$1.80$
Naive-LS	$63.62$	$19.68$	$8.80$	$2.56$	$1.94$	$4.96$	$0.78$
Naive-LM	$60.94$	$18.32$	$8.26$	$2.44$	$1.84$	$4.70$	$0.76$

Table 2. True model selection rate (%): AIC and Naive-cAIC.

Method	$(σ_{w}, σ_{v})$
Method	$(2, 2)$	$(4, 3)$	$(6, 3)$	$(8, 4)$	$(6, 6)$	$(3, 6)$	$(4, 8)$
${3F}_{pfree}$	$43.88$	$42.84$	$43.76$	$43.92$	$43.38$	$44.02$	$43.42$
${3F}_{pdep}$	$44.22$	$43.18$	$43.66$	$44.08$	$43.36$	$44.04$	$43.48$
${2F-S-SS}_{pdep}$	$44.32$	$43.14$	$43.66$	$43.68$	$43.22$	$44.02$	$43.32$
${2F-A-SS}_{pfree}$	$47.00$	$46.60$	$47.10$	$42.86$	$26.46$	$27.16$	$14.60$
${2F-A-SS}_{pdep}$	$47.78$	$48.48$	$48.36$	$43.96$	$26.36$	$26.52$	$14.54$
Naive-LS	$45.00$	$31.74$	$22.48$	$14.62$	$14.02$	$19.84$	$10.52$
Naive-LM	$47.02$	$32.18$	$22.54$	$14.56$	$13.94$	$19.76$	$10.40$
Naive-cAIC	$42.86$	$26.64$	$17.28$	$12.12$	$11.16$	$15.62$	$8.84$

Table 3. True model selection rate (%): Mallows’

C_{p}

.

Table 3. True model selection rate (%): Mallows’

C_{p}

.

Method	$(σ_{w}, σ_{v})$
Method	$(2, 2)$	$(4, 3)$	$(6, 3)$	$(8, 4)$	$(6, 6)$	$(3, 6)$	$(4, 8)$
${3F}_{pfree}$	$44.78$	$43.66$	$44.84$	$44.60$	$44.00$	$44.84$	$44.04$
${3F}_{pdep}$	$45.02$	$43.90$	$44.40$	$44.90$	$44.16$	$44.80$	$44.30$
${2F-S-SS}_{pdep}$	$44.96$	$43.74$	$44.32$	$44.50$	$43.76$	$44.74$	$44.10$
${2F-A-SS}_{pfree}$	$47.60$	$47.28$	$47.84$	$43.32$	$26.34$	$27.08$	$14.54$
${2F-A-SS}_{pdep}$	$48.82$	$49.08$	$49.38$	$44.52$	$26.54$	$26.64$	$14.22$
Naive-LS	$45.60$	$32.16$	$22.60$	$14.52$	$13.98$	$19.80$	$10.32$
Naive-LM	$47.66$	$32.52$	$22.64$	$14.54$	$14.04$	$19.80$	$10.24$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cai, S.; Rao, J.N.K. Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method. Stats 2022, 5, 128-138. https://doi.org/10.3390/stats5010009

AMA Style

Cai S, Rao JNK. Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method. Stats. 2022; 5(1):128-138. https://doi.org/10.3390/stats5010009

Chicago/Turabian Style

Cai, Song, and J.N.K. Rao. 2022. "Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method" Stats 5, no. 1: 128-138. https://doi.org/10.3390/stats5010009

APA Style

Cai, S., & Rao, J. N. K. (2022). Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method. Stats, 5(1), 128-138. https://doi.org/10.3390/stats5010009

Article Menu

Selection of Auxiliary Variables for Three-Fold Linking Models in Small Area Estimation: A Simple and Effective Method

Abstract

1. Introduction

2. Area Level and Subarea Level Linking Models: Methods for Variable Selection

2.1. Area Level Model

2.2. Subarea Level Model

3. Sub-Subarea Linking Models: Variable Selection

3.1. Transformation Methods

3.1.1. Parameter-Free Transformation

3.1.2. Parameter-Dependent Transformation

3.2. Estimating Variable Selection Criteria: Sub-Subarea Model

4. Results of a Simulation Study

5. Concluding Remarks

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI