Sparse Semi-Functional Partial Linear Single-Index Regression

Silvia Novo; Germán Aneiros; Philippe Vieu

doi:10.3390/proceedings2181190

,

and

¹

MODES Research Group, CITIC, Universidade da Coruña, 15071 A Coruña, Spain

²

Institut de Mathématiques, Université Paul Sabatier, 31062 Toulouse, France

^*

Author to whom correspondence should be addressed.

^†

Presented at the XoveTIC Congress. A Coruña, Spain, 27--28 September 2018.

Proceedings2018, 2(18), 1190;https://doi.org/10.3390/proceedings2181190

This article belongs to the Proceedings XoveTIC Congress 2018

Version Notes

Order Reprints

Abstract

The variable selection problem is studied in the sparse semi-functional partial linear model, with single-index type influence of the functional covariate in the response. The penalized least squares procedure is employed for this task. Some properties of the resultant estimators are derived: the existence (and rate of convergence) of a consistent estimator for the parameters in the linear part and an oracle property for the variable selection method. Finally, a real data application illustrates the good performance of our procedure.

Keywords:

functional data analysis; variable selection; sparse model; dimension reduction; functional single-index model; semiparametric model

1. Introduction

In many real problems, to predict the value of a random variable, observations of many other variables are available. However, in many cases, it is unknown which of them (very few) have a real influence in the response. In this practical framework, we need procedures able to select the relevant variables to avoid high-dimensionality problems. Reducing the complexity of the model becomes even more crucial when regression involves a functional variable too (data are functions, images...). Therefore, the main goal is the simplification of the model, which makes easier both its estimation and interpretation, without losing its predictive efficiency.

These practical problems have motived the peak of semiparametric models in the functional regression, together with the variable selection procedures. In [1] the penalized least squares method for estimation and variable selection is studied for the partial linear model with functional covariate. In this model, the real variables have a linear effect (involving interpretable coefficients that are the parameters) in the response, while the infinite-dimensional covariate has a nonlinear (nonparametric) influence. However, in real data applications, it would be interesting having parameters related to the functional variable to derive practical interpretations. This is one of the advantages of the semi-functional partial linear single-index model (SFPLSIM): the real covariates also affect in a linear way to the response, but the infinite-dimensional covariate influences it trough a projection in an unknown direction, after applying a nonlinear link function. This direction of projection behaves like a function-parameter that could have interesting interpretations. Some theoretical properties related to the nonparametric estimation of the functional single-index model are given in [2]. In this paper, we will study the sparse SFPLSIM, focusing in the variable selection problem. For this purpose, we will use the penalized least squares procedure for estimating the parameters of the lineal components and, simultaneously, selecting the relevant covariates. The properties of the estimators will be analysed from a theoretical point of view: we will set its convergence rates and the consistency for selecting the model. These results will be illustrated through a real data application.

2. The Model

The SFPLSIM is defined by the relationship

Y_{i} = X_{i 1} β_{01} + \dots + X_{i p_{n}} β_{0 p_{n}} + m (⟨θ_{0}, X_{i}⟩) + ε_{i}, \forall i = 1, \dots, n,

(1)

where

Y_{i}

denotes a scalar response,

X_{i 1}, \dots, X_{i p_{n}}

are random covariates taking values in

R

and

X_{i}

is a functional random covariate valued in a separable Hilbert space

H

with inner product

⟨\cdot, \cdot⟩

.

β_{0} = {(β_{01}, \dots, β_{0 p_{n}})}^{⊤} \in R^{p_{n}}

,

θ_{0} \in H

and

m (\cdot)

are a vector of unknown real parameters, an unknown functional direction and an unknown smooth real-valued function, respectively. Finally,

ε_{i}

is the random error, which verifies

E (ε_{i} | X_{i 1}, \dots, X_{i p_{n}}, X_{i}) = 0 .

3. The Penalized Least-Squares Estimators

For the purpose of simultaneously estimating

β

-parameters and selecting relevant X-covariates in the SFPLSIM (1), we will apply the penalized least-squares approach. For that, in a first step we transform the SFPLSIM in a linear model by extracting from

Y_{i}

and

X_{i j}

(

j = 1, \dots, p_{n}

) the effect of the functional covariate

X_{i}

when is projected on the direction

θ_{0}

. Specifically, denoting by

X_{i} = {(X_{i 1}, X_{i 2}, \dots, X_{i p_{n}})}^{⊤}, X = {(X_{1}, \dots, X_{n})}^{⊤} and Y = {(Y_{1}, \dots, Y_{n})}^{⊤}

, the fact that

Y_{i} - E (Y_{i} | ⟨θ_{0}, X_{i}⟩) = {(X_{i} - E (X_{i} | ⟨θ_{0}, X_{i}⟩))}^{⊤} β_{0} + ε_{i}, \forall i = 1, \dots, n,

(2)

allows to consider the following approximate linear model (see Appendix A for understanding the notation):

{\tilde{Y}}_{θ_{0}} \approx {\tilde{X}}_{θ_{0}} β_{0} + ε,

(3)

where

ε = {(ε_{1}, \dots, ε_{n})}^{⊤}

. Then, in a second step, the penalized least-squares approach is applied to model (3). Specifically,

β_{0}

and

θ_{0}

are estimated by considering a minimizer,

({\hat{β}}_{0}, {\hat{θ}}_{0})

, of the penalized profile least-squares function

Q (β, θ) = \frac{1}{2} {({\tilde{Y}}_{θ} - {\tilde{X}}_{θ} β)}^{⊤} ({\tilde{Y}}_{θ} - {\tilde{X}}_{θ} β) + n \sum_{j = 1}^{p_{n}} P_{λ_{j_{n}}} (| β_{j} |),

where

β = {(β_{1}, \dots, β_{p_{n}})}^{⊤}

,

P_{λ_{j_{n}}} (\cdot)

is a penalty function and

λ_{j_{n}} > 0

is a tuning parameter. Note that, simultaneously to the parameter estimation, the previous procedure can be considered as a variable selection method: if

{\hat{β}}_{0 j}

is a non-null component of

{\hat{β}}_{0}

, then

X_{j}

is selected as an influential variable.

From now on, we will denote

J_{n} = {1, \dots, p_{n}}

and

S_{n} \subset J_{n}

such that

β_{0 j} \neq 0

for

j \in S_{n}

and

β_{0 j} = 0

for

j \in S_{n}^{c} = J_{n} / S_{n}

. In addition

s_{n}

will mean card

(S_{n})

and we will assume that

S_{n} = {1, \dots, s_{n}}

.

4. Asymptotic Theory

In this paper, the existence of the penalized estimator is established as well as the corresponding rates of convergence. In particular, under some assumptions, we proved that there exists a local minimizer

({\hat{β}}_{0}, {\hat{θ}}_{0})

of

Q (β, θ)

such that

\begin{matrix} ∥{\hat{β}}_{0} - β_{0}∥ = O_{p} (\sqrt{s_{n}} (n^{- 1 / 2} + δ_{n})) where δ_{n} = max_{j \in S_{n}} \{|P_{λ_{j_{n}}}^{'} (| β_{0 j} |)|\} . \end{matrix}

(4)

Furthermore, the selected set of variables,

{\hat{S}}_{n} = {j \in J_{n}; {\hat{β}}_{0 j} \neq 0}

, works as well (at least asymptotically) as it would do if the true set of relevant variables

S_{n}

was known. Specifically,

P ({\hat{S}}_{n} = S_{n}) \to 1 as n \to \infty

.

An application to real data is included, which shows the good performance of the presented method in terms of error of prediction.

Funding

The authors acknowledge partial support by MINECO grants MTM2014-52876-R and MTM2017-82724-R (EU ERDF support included). Additionally, financial support from the Xunta de Galicia (Centro Singular de Investigación de Galicia accreditation ED431G/01 2016-2019 and Grupos de Referencia Competitiva ED431C2016-015) and the European Union (European Regional Development Fund - ERDF), is gratefully acknowledged. The first author also thanks the financial support from the Xunta de Galicia and the European Union (European Social Fund - ESF), the reference of which is ED481A-2018/191.

Conflicts of Interest

The authors declare no conflict of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

SFPLSIM

Semi-functional partial linear single index model

Appendix A. Notation

For any

(n \times q)

-matrix

A

(q \geq 1)

, if

I

is the

(n \times n)

-identity-matrix, we denote

{\tilde{A}}_{θ} = (I - W_{h, θ}) A, w h e r e W_{h, θ} = {(w_{n, h, θ} (X_{i}, X_{j}))}_{i, j},

with

w_{n, h, θ} (\cdot, \cdot)

being the weight function

w_{n, h, θ} (χ, X_{i}) = \frac{K (d_{θ} (χ, X_{i}) / h)}{\sum_{j = 1}^{n} K (d_{θ} (χ, X_{j}) / h)},

where

K : R^{+} \to R^{+}

is a kernel function,

h > 0

is a smoothing parameter and, for

θ \in H

,

d_{θ} (\cdot, \cdot)

is the semimetric defined as

d_{θ} (χ, χ^{'}) = |⟨θ, χ - χ^{'}⟩|, \forall χ, χ^{'} \in H .

References

Aneiros, G.; Ferraty, F.; Vieu, P. Variable selection in partial linear regression with functional covariate. Statistics 2015, 49, 1322–1347. [Google Scholar] [CrossRef]
Novo, S.; Aneiros, G.; Vieu, P. Automatic and location-adaptive estimation in functional single-index regression. 2018; in press. [Google Scholar]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Sparse Semi-Functional Partial Linear Single-Index Regression^†

Abstract

1. Introduction

2. The Model

3. The Penalized Least-Squares Estimators

4. Asymptotic Theory

Funding

Conflicts of Interest

Abbreviations

Appendix A. Notation

References

Article Metrics

Citations

Article Access Statistics

Sparse Semi-Functional Partial Linear Single-Index Regression †

Abstract

1. Introduction

2. The Model

3. The Penalized Least-Squares Estimators

4. Asymptotic Theory

Funding

Conflicts of Interest

Abbreviations

Appendix A. Notation

References

Article Metrics

Citations

Article Access Statistics

Sparse Semi-Functional Partial Linear Single-Index Regression^†