Next Article in Journal
Generic Riemannian Maps from Nearly Kaehler Manifolds
Previous Article in Journal
Numerical Solution of The Effects of Variable Fluid Properties on Biomagnetic Fluid over an Unsteady Stretching Sheet
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Estimation of the Conditional Hazard Function with a Recursive Kernel from Censored Functional Ergodic Data †

Laboratory of Statistic and Processus Stochastic, University of Djillali Liabes, L.P 89, Sidi Bel Abbes 22000, Algeria
*
Author to whom correspondence should be addressed.
Presented at the 1st International Online Conference on Mathematics and Applications, 1–15 May 2023; Available online: https://iocma2023.sciforum.net/.
Comput. Sci. Math. Forum 2023, 7(1), 16; https://doi.org/10.3390/IOCMA2023-14412
Published: 28 April 2023

Abstract

:
In this paper, we propose a non-parametric estimator of the conditional hazard function weighted on the recursive kernel method given an explanatory variable taking values in a semi-metric space when the scalar response is censored. Under the ergodicity condition, we establish the convergence rate of this estimator.

1. Introduction

The functional estimate has been a topic of great interest in the statistical literature. To obtain a summary of the current state of non-parametric functional data, we refer to the work in [1,2]. The hazard function, also known as the risk function, is a concept commonly used in survival analysis and reliability theory. It plays an important role in statistics and arises in a variety of fields, including econometrics, epidemiology, environmental science, and many others. The work in [3] is an important contribution to the conditional hazard rate for functional covariates in an infinite-dimensional space. Censored data are a type of data in which the values are incomplete or partially known. We consider a type of right-censored data, where the observation is known to be above a certain threshold, but the exact value is unknown. For example, if we are studying the time until a light bulb fails, we might know that the bulb lasted at least 500 h, but we do not know exactly how long it lasted beyond that. Ergodic data have represented a rising interest in this domain over the past few years. It is an essential postulate in statistical physics for analyzing the thermodynamic characteristics of gases, atoms, electrons, or plasmas. Ergodic theory enables us to circumvent intricate probabilistic computations related to the mixing condition. In our setting, we study the almost sure convergence of the kernel estimator of the conditional hazard function; we consider a recursive estimate with strictly stationary, censored, and ergodic observations. It is worth noting that the recursive estimate has a benefit in that the smoothing parameter is tied to the observation ( X i , Y i ) , which enables us to continuously update our estimator as we receive new observations. To combine censored data and ergodic theory, we refer to the work in [4]. They estimated the conditional quantile using censored and ergodic data.

2. Materials and Methods

In practice, it is possible to coincide with censored variables, that is, instead of observing the lifetimes, we observe the censored lifetimes. This problem is usually modeled by considering T 1 , . . . , T n , a sequence of lifetimes which satisfy some kind of dependency, and C 1 , . . . C n , which is a sequence of i.i.d censored random variables with a common unknown continuous distribution function G. We observe only the n pairs ( Y i , δ i ) , where Y i = m i n { T i , C i } and δ i = I { T i C i } , 1 i n , where I A denotes the indicator function of the set A. To ensure the identifiability of the model, we assume that C i and ( T i , X i ) ( 1 i n ) are independent. Let ( X i , T i ) i = 1 , . . . , n be a sequence of strictly stationary ergodic processes with the same distribution. We also assume X i takes values in a semi-metric space ( F , d ) , whereas T i are real-valued random variables. In addition, for insuring good mathematical properties of the functional nonparametric methods, we establish our asymptotic results on the concentration properties of small balls of the probability measure of the functional variable.
We define the function hazard h x for y R and F x ( y ) < 1 by
h x ( y ) = f x ( y ) 1 F x ( y ) .
To this aim, we first introduce the recursive double-kernel-type pseudo-estimator F ˜ x of the conditional distribution function F x defined by
F ˜ x ( t ) = i = 1 n δ i G ¯ ( Y i ) K ( a i 1 d ( x , X i ) ) H ( b i 1 ( t Y i ) ) i = 1 n K ( a i 1 d ( x , X i ) ) , t R .
where K is the kernel, H is a strictly increasing distribution function, a i , b i are sequences of positive real numbers such that lim n + a n = lim n + b n = 0 , and G ¯ ( . ) = 1 G ( . ) .
From this pseudo-estimator, we deduce a pseudo-estimator f ˜ x of the conditional density f x by
f ˜ x ( t ) = i = 1 n δ i G ¯ ( Y i ) b i 1 K ( a i 1 d ( x , X i ) ) H ( b i 1 ( t Y i ) ) i = 1 n K ( a i 1 d ( x , X i ) ) , t R .
where H is the derivative of H.
In practice, G is unknown, and one can estimate it using the Kaplan and Meier (1958) estimate G ¯ n ( . ) , defined as:
G ¯ n ( t ) = i = 1 n 1 1 δ ( i ) n i + 1 I { Y ( i ) t } if t < Y ( n ) , 0 Otherwise .
where Y ( 1 ) < Y ( 2 ) < . . . < Y ( n ) are the order statistics of ( Y i ) 1 i n and δ ( i ) is concomitant with Y ( i ) .
Thus, the feasible estimator of the conditional distribution function F x ( t ) is given by
F ^ x ( t ) = i = 1 n δ i G ¯ n ( Y i ) K ( a i 1 d ( x , X i ) ) H ( b i 1 ( t Y i ) ) i = 1 n K ( a i 1 d ( x , X i ) ) , t R .
We deduce an estimator for a conditional density f x ( t ) , defined as
f ^ x ( t ) = i = 1 n δ i G ¯ n ( Y i ) b i 1 K ( a i 1 d ( x , X i ) ) H ( b i 1 ( t Y i ) ) i = 1 n K ( a i 1 d ( x , X i ) ) , t R .
We estimate the conditional hazard function h ^ x by
h ^ x ( t ) = f ^ x ( t ) 1 F ^ x ( t ) . t R .
Remark 1.
The Kaplan–Meier estimator is not recursive and the use of such an estimator can slightly penalize the efficiency of our estimator in terms of computational time.

3. Results

To establish the almost sure convergence of h ^ x , we need to include the following assumptions:
Assumptions 1.
(H1) 
( i )   T h e   f u n c t i o n   ϕ ( x , h ) : = P ( X B ( x , h ) ) > 0 , h > 0 . ( i i )   F o r   a l l   i = 1 , . . . , n t h e r e   e x i s t s   a   d e t e r m i n i s t i c   f u n c t i o n   ϕ i ( x , . ) s u c h   t h a t   a l m o s t   s u r e l y   0 < P ( X i B ( x , h ) | F i 1 ) ϕ i ( x , h ) , h > 0 . a n d ϕ i ( x , h ) 0 a s h 0 . ( i i i ) For all sequences ( h i ) i = 1 , . . . , n > 0 , i = 1 n P ( X i B ( x , h i ) | F i 1 ) i = 1 n ϕ ( x , h i ) 1
where  B ( x , h ) : = { x F / d ( x , x ) < h } .
(H2) 
(i) The conditional distribution function  F x  is such that  t S , β > 0 , inf t S ( 1 F x ( t ) ) > β , ∀ ( t 1 , t 2 ) S × S , ∀ ( x 1 , x 2 ) N x × N x ,
| F x 1 ( y 1 ) F x 2 ( y 2 ) | C 1 ( d ( x 1 , x 2 ) β 1 + | t 1 t 2 | β 2 ) , β 1 > 0 , β 2 > 0 .
(ii) The density  f x  is such that  t S , α > 0   , f x ( t ) < α , ∀ ( t 1 , t 2 ) S × S , ∀ ( x 1 , x 2 ) N x × N x ,
| f x 1 ( t 1 ) f x 2 ( t 2 ) | C 1 ( d ( x 1 , x 2 ) β 1 + | t 1 t 2 | β 2 ) ,   β 1 > 0 , β 2 > 0 .
(H3) 
( t 1 , t 2 ) R 2 , | H ( j ) ( t 1 ) H ( j ) ( t 2 ) | C | t 1 t 2 | for j = 0 , 1 .
| w | β 2 H ( 1 ) ( w ) d w < , H 2 ( t ) d w < .
(H4) 
K is a function with support (0,1) such that  0 < C 1 I [ 0 , 1 ] < K ( t ) < C 2 I [ 0 , 1 ] < , where  I A  is the indicator function.
(H5) 
(i) lim n + n 1 i = 1 n a i β 1 ϕ i ( x , a i ) ϕ ( x , a i ) = 0 ,  (ii) lim n + n 1 i = 1 n b i β 2 ϕ i ( x , a i ) ϕ ( x , a i ) = 0 .
(H6) 
lim n + φ n , j ( x ) log n n 2 = 0 , where  φ n , j ( x ) = i = 1 n b i j ϕ i ( x , a i ) ϕ 2 ( x , a i )  for  j = 0 , 1 .
(H7) 
( C n ) n 1  and  ( X n , T n ) n 1  are independent.
(H8) 
G has a bounded first derivative  G ( 1 ) .
Theorem 1.
Under hypotheses (H1)–(H7), we have:
sup t S | h ^ x ( t ) h x ( t ) | =
O n 1 i = 1 n a i β 1 ϕ i ( x , a i ) ϕ ( x , a i ) + O n 1 i = 1 n b i β 2 ϕ i ( x , a i ) ϕ ( x , a i ) + O φ n , 1 ( x ) log n n 2 a . s .
Proof of Theorem 1.
The proof of this theorem is based on the following decomposition and lemmas below:
h ^ x ( t ) h x ( t ) = 1 1 F ^ x ( t ) f ^ x ( t ) f x ( t ) + h x ( t ) 1 F ^ x ( t ) F ^ x ( t ) F x ( t ) .
Lemma 1.
Under hypotheses (H1), (H2)(i) and (H3)–(H7), we have:
sup t S | F ^ x ( t ) F x ( t ) | =
O n 1 i = 1 n a i β 1 ϕ i ( x , a i ) ϕ ( x , a i ) + O n 1 i = 1 n b i β 2 ϕ i ( x , a i ) ϕ ( x , a i ) + O φ n , 0 ( x ) log n n 2 a . s .
Lemma 2.
Under hypotheses (H1), (H2)(ii) and (H3)–(H7), we have:
sup t S | f ^ x ( t ) f x ( t ) | =
O n 1 i = 1 n a i β 1 ϕ i ( x , a i ) ϕ ( x , a i ) + O n 1 i = 1 n b i β 2 ϕ i ( x , a i ) ϕ ( x , a i ) + O φ n , 1 ( x ) log n n 2 a . s .
Lemma 3.
Under the hypotheses of Lemma 1, we have:
δ > 0 s u c h t h a t n = 1 P inf t S | 1 F ^ x ( t ) | δ < .

4. Discussion

This contribution concerns a recursive nonparametric estimation of the conditional hazard function in the presence of a functional explanatory variable when the scalar response is right censored in the ergodic case. As asymptotic results, we have established the almost sure convergence. Concerning the assumptions, they can be divided into three categories: structural assumptions, assumptions of the explanatory variable, and technical assumptions.

Author Contributions

Both authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

The work received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ferraty, F.; Vieu, P. Nonparametric Functional Data Analysis; Springer Science+Business Media, Inc.: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  2. Ramsay, J.O.; Silverman, B.W. Applied Functional Data Analysis; Springer Series in Statistics; Springer: New York, NY, USA, 2002; Volume 1. [Google Scholar]
  3. Ferraty, F.; Rabhi, A.; Vieu, P. Estimation non-paramétrique de la fonction de hasard avec variable explicative fonctionnelle. Rev. Math. Pures Appl. 2008, 53, 1–18. [Google Scholar]
  4. Chaouch, M.; Khardani, S. Randomly censored quantile regression estimation unsing functional stationary ergodic data. J. Nonparametr. Stat. 2015, 27, 65–87. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kebir, H.; Mechab, B. Estimation of the Conditional Hazard Function with a Recursive Kernel from Censored Functional Ergodic Data. Comput. Sci. Math. Forum 2023, 7, 16. https://doi.org/10.3390/IOCMA2023-14412

AMA Style

Kebir H, Mechab B. Estimation of the Conditional Hazard Function with a Recursive Kernel from Censored Functional Ergodic Data. Computer Sciences & Mathematics Forum. 2023; 7(1):16. https://doi.org/10.3390/IOCMA2023-14412

Chicago/Turabian Style

Kebir, Hadjer, and Boubaker Mechab. 2023. "Estimation of the Conditional Hazard Function with a Recursive Kernel from Censored Functional Ergodic Data" Computer Sciences & Mathematics Forum 7, no. 1: 16. https://doi.org/10.3390/IOCMA2023-14412

APA Style

Kebir, H., & Mechab, B. (2023). Estimation of the Conditional Hazard Function with a Recursive Kernel from Censored Functional Ergodic Data. Computer Sciences & Mathematics Forum, 7(1), 16. https://doi.org/10.3390/IOCMA2023-14412

Article Metrics

Back to TopTop