Abstract
In this paper, we propose a non-parametric estimator of the conditional hazard function weighted on the recursive kernel method given an explanatory variable taking values in a semi-metric space when the scalar response is censored. Under the ergodicity condition, we establish the convergence rate of this estimator.
1. Introduction
The functional estimate has been a topic of great interest in the statistical literature. To obtain a summary of the current state of non-parametric functional data, we refer to the work in [1,2]. The hazard function, also known as the risk function, is a concept commonly used in survival analysis and reliability theory. It plays an important role in statistics and arises in a variety of fields, including econometrics, epidemiology, environmental science, and many others. The work in [3] is an important contribution to the conditional hazard rate for functional covariates in an infinite-dimensional space. Censored data are a type of data in which the values are incomplete or partially known. We consider a type of right-censored data, where the observation is known to be above a certain threshold, but the exact value is unknown. For example, if we are studying the time until a light bulb fails, we might know that the bulb lasted at least 500 h, but we do not know exactly how long it lasted beyond that. Ergodic data have represented a rising interest in this domain over the past few years. It is an essential postulate in statistical physics for analyzing the thermodynamic characteristics of gases, atoms, electrons, or plasmas. Ergodic theory enables us to circumvent intricate probabilistic computations related to the mixing condition. In our setting, we study the almost sure convergence of the kernel estimator of the conditional hazard function; we consider a recursive estimate with strictly stationary, censored, and ergodic observations. It is worth noting that the recursive estimate has a benefit in that the smoothing parameter is tied to the observation , which enables us to continuously update our estimator as we receive new observations. To combine censored data and ergodic theory, we refer to the work in [4]. They estimated the conditional quantile using censored and ergodic data.
2. Materials and Methods
In practice, it is possible to coincide with censored variables, that is, instead of observing the lifetimes, we observe the censored lifetimes. This problem is usually modeled by considering , a sequence of lifetimes which satisfy some kind of dependency, and , which is a sequence of i.i.d censored random variables with a common unknown continuous distribution function G. We observe only the n pairs , where and , , where denotes the indicator function of the set A. To ensure the identifiability of the model, we assume that and are independent. Let be a sequence of strictly stationary ergodic processes with the same distribution. We also assume takes values in a semi-metric space , whereas are real-valued random variables. In addition, for insuring good mathematical properties of the functional nonparametric methods, we establish our asymptotic results on the concentration properties of small balls of the probability measure of the functional variable.
We define the function hazard for and by
To this aim, we first introduce the recursive double-kernel-type pseudo-estimator of the conditional distribution function defined by
where K is the kernel, H is a strictly increasing distribution function, are sequences of positive real numbers such that , and .
From this pseudo-estimator, we deduce a pseudo-estimator of the conditional density by
where is the derivative of H.
In practice, G is unknown, and one can estimate it using the Kaplan and Meier (1958) estimate , defined as:
where are the order statistics of and is concomitant with .
Thus, the feasible estimator of the conditional distribution function is given by
We deduce an estimator for a conditional density , defined as
We estimate the conditional hazard function by
Remark 1.
The Kaplan–Meier estimator is not recursive and the use of such an estimator can slightly penalize the efficiency of our estimator in terms of computational time.
3. Results
To establish the almost sure convergence of , we need to include the following assumptions:
Assumptions 1.
- (H1)
- where
- (H2)
- (i) The conditional distribution function is such that , ∀ (,), ∀ (,),.(ii) The density is such that , ∀ (,), ∀ (,),.
- (H3)
- ,for., .
- (H4)
- K is a function with support (0,1) such that , where is the indicator function.
- (H5)
- (i) (ii)
- (H6)
- , where for
- (H7)
- and are independent.
- (H8)
- G has a bounded first derivative .
Theorem 1.
Under hypotheses (H1)–(H7), we have:
Proof of Theorem 1.
The proof of this theorem is based on the following decomposition and lemmas below:
□
Lemma 1.
Under hypotheses (H1), (H2)(i) and (H3)–(H7), we have:
Lemma 2.
Under hypotheses (H1), (H2)(ii) and (H3)–(H7), we have:
Lemma 3.
Under the hypotheses of Lemma 1, we have:
4. Discussion
This contribution concerns a recursive nonparametric estimation of the conditional hazard function in the presence of a functional explanatory variable when the scalar response is right censored in the ergodic case. As asymptotic results, we have established the almost sure convergence. Concerning the assumptions, they can be divided into three categories: structural assumptions, assumptions of the explanatory variable, and technical assumptions.
Author Contributions
Both authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
The work received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Ferraty, F.; Vieu, P. Nonparametric Functional Data Analysis; Springer Science+Business Media, Inc.: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
- Ramsay, J.O.; Silverman, B.W. Applied Functional Data Analysis; Springer Series in Statistics; Springer: New York, NY, USA, 2002; Volume 1. [Google Scholar]
- Ferraty, F.; Rabhi, A.; Vieu, P. Estimation non-paramétrique de la fonction de hasard avec variable explicative fonctionnelle. Rev. Math. Pures Appl. 2008, 53, 1–18. [Google Scholar]
- Chaouch, M.; Khardani, S. Randomly censored quantile regression estimation unsing functional stationary ergodic data. J. Nonparametr. Stat. 2015, 27, 65–87. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).