1. Introduction
The functional estimate has been a topic of great interest in the statistical literature. To obtain a summary of the current state of non-parametric functional data, we refer to the work in [
1,
2]. The hazard function, also known as the risk function, is a concept commonly used in survival analysis and reliability theory. It plays an important role in statistics and arises in a variety of fields, including econometrics, epidemiology, environmental science, and many others. The work in [
3] is an important contribution to the conditional hazard rate for functional covariates in an infinite-dimensional space. Censored data are a type of data in which the values are incomplete or partially known. We consider a type of right-censored data, where the observation is known to be above a certain threshold, but the exact value is unknown. For example, if we are studying the time until a light bulb fails, we might know that the bulb lasted at least 500 h, but we do not know exactly how long it lasted beyond that. Ergodic data have represented a rising interest in this domain over the past few years. It is an essential postulate in statistical physics for analyzing the thermodynamic characteristics of gases, atoms, electrons, or plasmas. Ergodic theory enables us to circumvent intricate probabilistic computations related to the mixing condition. In our setting, we study the almost sure convergence of the kernel estimator of the conditional hazard function; we consider a recursive estimate with strictly stationary, censored, and ergodic observations. It is worth noting that the recursive estimate has a benefit in that the smoothing parameter is tied to the observation
, which enables us to continuously update our estimator as we receive new observations. To combine censored data and ergodic theory, we refer to the work in [
4]. They estimated the conditional quantile using censored and ergodic data.
2. Materials and Methods
In practice, it is possible to coincide with censored variables, that is, instead of observing the lifetimes, we observe the censored lifetimes. This problem is usually modeled by considering , a sequence of lifetimes which satisfy some kind of dependency, and , which is a sequence of i.i.d censored random variables with a common unknown continuous distribution function G. We observe only the n pairs , where and , , where denotes the indicator function of the set A. To ensure the identifiability of the model, we assume that and are independent. Let be a sequence of strictly stationary ergodic processes with the same distribution. We also assume takes values in a semi-metric space , whereas are real-valued random variables. In addition, for insuring good mathematical properties of the functional nonparametric methods, we establish our asymptotic results on the concentration properties of small balls of the probability measure of the functional variable.
We define the function hazard
for
and
by
To this aim, we first introduce the recursive double-kernel-type pseudo-estimator
of the conditional distribution function
defined by
where
K is the kernel,
H is a strictly increasing distribution function,
are sequences of positive real numbers such that
, and
.
From this pseudo-estimator, we deduce a pseudo-estimator
of the conditional density
by
where
is the derivative of
H.
In practice, G is unknown, and one can estimate it using the Kaplan and Meier (1958) estimate
, defined as:
where
are the order statistics of
and
is concomitant with
.
Thus, the feasible estimator of the conditional distribution function
is given by
We deduce an estimator for a conditional density
, defined as
We estimate the conditional hazard function
by
Remark 1. The Kaplan–Meier estimator is not recursive and the use of such an estimator can slightly penalize the efficiency of our estimator in terms of computational time.
3. Results
To establish the almost sure convergence of , we need to include the following assumptions:
Assumptions 1. - (H1)
where - (H2)
(i) The conditional distribution function is such that , ∀ (,), ∀ (,),
.
(ii) The density is such that , ∀ (,), ∀ (,),
.
- (H3)
,for.
, .
- (H4)
K is a function with support (0,1) such that , where is the indicator function.
- (H5)
(i) (ii)
- (H6)
, where for
- (H7)
and are independent.
- (H8)
G has a bounded first derivative .
Theorem 1. Under hypotheses (H1)–(H7), we have:
Proof of Theorem 1. The proof of this theorem is based on the following decomposition and lemmas below:
□
Lemma 1. Under hypotheses (H1), (H2)(i) and (H3)–(H7), we have:
Lemma 2. Under hypotheses (H1), (H2)(ii) and (H3)–(H7), we have:
Lemma 3. Under the hypotheses of Lemma 1, we have: 4. Discussion
This contribution concerns a recursive nonparametric estimation of the conditional hazard function in the presence of a functional explanatory variable when the scalar response is right censored in the ergodic case. As asymptotic results, we have established the almost sure convergence. Concerning the assumptions, they can be divided into three categories: structural assumptions, assumptions of the explanatory variable, and technical assumptions.