Abstract
In applications, missing data may occur randomly and some relevant datum are often used to replace the missing ones. This article mainly explores the influence of the degree of dependence of stationary Gaussian sequences on the joint asymptotic distribution of the maximum of the Gaussian sequence and its maximum when the sequence is subject to random replacing.
Keywords:
extreme value theory; random replacing; asymptotic distribution; stationary gaussian sequences MSC:
60G70; 60G15
1. Introduction
Data missing is a common phenomenon in the field of applications. When it occurs, the most common approach is to treat the available sample as a non complete sample with a random sample size. Furthermore, it is necessary to study the properties of incomplete samples with random sample sizes. In the field of extreme value theory, refs. [1,2] first studied the effect of the missing data on extremes of original sequences. Let be a sequence of stationary random variables with the marginal distribution function , and suppose that some of the random variables in the sequence are missing randomly. Let be the indicator of the event that random variable is observed. For the random sequence , define its random missing sequence as:
where . Suppose that the indicator sequence is independent of , and let be the numbers of the observed variables satisfying
where is a random or nonrandom variable.
When is a constant, under a global dependent condition (see [2]) and a well-known local dependent condition (see e.g., [3]), ref. [2] derived the joint asymptotic distribution of the maximum from a stationary sequence and the maximum from its random missing sequence and proved, for any
with and , where G is one of the three types of extreme value distributions (see, e.g., [3]) and .
The result in (3) has been extended to many other cases; we refer to [4,5] for Gaussian cases; ref. [6,7] for the almost sure limit theorem; ref. [8,9] for autoregressive process; ref. [10] for non-stationary random fields; ref. [11] for linear process; and refs. [12,13] for point process.
When is a random variable, ref. [14] proved a similar result: for any
Ref. [15] extended the results of (4) to weakly and strongly dependent Gaussian sequences. Let be a sequence of stationary Gaussian variables with correlation function . If satisfies
for any , ref. [15] proved that
where denotes the distribution function of a standard (mean 0 and variance 1) normal random variable, , and the normalizing constants and are defined as
If satisfies:
- (A1) is convex with ;
- (A2) is monotone with ,
- for any , ref. [15] proved that
For more related studies of this situation, we refer to [16,17,18,19].
In application, in addition to treating the available samples as incomplete samples with a random sample size, we often use another set of samples to replace the randomly missing samples, to obtain a relatively complete sample. However, this raises the question of to what extent can the random missing samples replace the original samples. To answer this question, we must study the relationship between the original samples and samples subject to random replacement. In the field of extreme value theory, we need to study the asymptotic relationship between the maximum of the original samples and their maximum when the samples are subject to random replacement.
For the random sequence , define the sequence subject to random replacement as
where the sequence is an independent copy of . When the sequence is strongly mixed, ref. [20] proved that the maximum sequences and the maximum when the sequence is subject to random replacement are asymptotically dependent.Under the dependent conditions and , ref. [21] studied the asymptotic distribution of the maximum from a stationary sequence and its maximum subject to random replacement and proved, for ,
where .
It is worth noting that, in the study of [21], the random replacement sequence was an independent copy of the original sequence, so they had the same dependent structure. However, in practical applications, we may not know the dependent structure of the original sequence, so it is necessary to explore the impact of the dependent structure of the original sequence itself and the dependent structure of the random replacement sequence itself on their maxima.
The main purpose of this article is to explore the influence of the self-dependent structure of an original sequence and the random replacement sequence on the joint asymptotic distribution between their maxima under a Gaussian scenario. The advantages of choosing a Gaussian sequence scenario are as follows: The dependent structure of Gaussian sequences can be characterized by their correlation coefficient functions; in the field of extreme value theory, the dependence of Gaussian sequences can be characterized by the speed at which their correlation coefficient function converges to 0; the relevant conclusions in the case of Gaussian sequences can be easily generalized, such as in the case of chi square sequences, Gaussian ordered sequences, and so on.
2. Main Results
In the following part of this paper, let and be stationary standard Gaussian sequences, with correlation functions and , respectively. Let be a sequence of indicators and . Suppose that (2) holds for some random variable a.s. In addition, suppose that , , are independent of each other, and suppose that U and V are independent standard Gaussian random variables, which are independent of . Let and be defined as in (7).
Theorem 1.
Suppose that and satisfy and , respectively. For any , we have
Corollary 1.
(i). Suppose that and satisfy and , respectively. For any , we have
(ii). Suppose that and satisfy and , respectively. For any , we have
(iii). Suppose that and satisfy and , respectively. For any , we have
Remark 1.
The first assertion of Corollary 1 indicates that, when both the original sequence and the random replacement sequence are weakly dependent, the result is consistent with that of [21]. The second and third assertions of Corollary 1 indicate that, when the dependent strength between the original sequence and the random replacement sequence are different, the joint asymptotic distribution of the maximum of the original sequence and the maximum of the sequence subject to random replacement is highly dependent on the strength of dependence.
Corollary 2.
Under the conditions of Theorem 1, for any , we have
and
Theorem 2.
Suppose both and satisfy the conditions A1 and A2. For any , we have
Corollary 3.
Under the conditions of Theorem 2, for any , we have
and
Remark 2.
Corollaries 2 and 3 indicate that, when both the original sequence and the random replacing sequence are weakly dependent, the limit distribution of the maximum of the original sequence and the limit distribution of the maximum of the sequence subject to random replacing are consistent. At this point, the sequence subject to random replacement can be used to replace the original sequence. When both the original sequence and the sequence subject to random replacement are strongly dependent, the limit distribution of the maximum of the original sequence and the sequence subject to random replacement is inconsistent. In this case, the sequence subject to random replacement cannot be directly used to replace the original sequence.
Theorem 3.
(i). Suppose that and satisfy and the conditions A1 and A2, respectively. For any , we have
(ii). Suppose that and satisfy the conditions A1 and A2 and , respectively. For any , we have
Remark 3.
Note that, for Gaussian random sequences with correlation functions satisfying the conditions A1 and A2, their maxima have a non-degenerate limit under the normalizing level and have a degenerate limit 1 under the normalizing level ; for Gaussian random sequences with correlation functions satisfying the condition , their maxima have a non-degenerate limit under the normalizing level and have a degenerate limit 0 under the normalizing level . Thus, in order to obtain the non-degenerate limit, we choose the normalizing level in Theorem 3.
3. Proofs
Let be a sequence of 0 and 1 (). For the arbitrary random or nonrandom sequence of 0 and 1 and subset , put
For any , put and . Set . For simplicity, in the following part, denote and , .
Lemma 1.
Let be a standard Gaussian sequence with mutually independent elements, which is independent of . Let be the independent copy. Define . Under the conditions of Theorem 1, we have ,
where , .
Proof.
Note that and . Let , , where are independent standard Gaussian random variables and are independent of and . Let . It is easy to see that both , and are standard Gaussian sequences. Using the normal comparison lemma (see, e.g., [3]),
where , and C is a constant. Using Lemma 6.4.1 of [3], we know that the above sums tend to 0, as . With the definition of , we have
The proof of Lemma 1 is complete. □
For some fixed k, define , , where , and denotes the integral part of x.
Lemma 2.
Under the conditions of Theorem 1, for any ,
where .
Proof.
This proof is the same as that of Lemma 3.2 of [21], so we omit the details. □
Lemma 3.
Under the conditions of Theorem 1, for any ,
and
Proof.
Recall that and . Noting that is a Gaussian random sequence with mutually independent elements, we have
where denotes the cardinality of the set A. Similarly,
which completes the proof of the first result. The proof of the second result is similar, so we omit it. □
Now, for the random variable a.s., define
and
Put
Proof of Theorem 1.
Note that
We will split the proof into six steps. The first step, using Lemma 1, we have
In the second step, we will prove and
Using Lemma 2, we have
It follows from the proof of Theorem 6.5.1 of [3] that
Then, as
Thus, as and , tends to 0.
In the third step, we prove that and
By the following basic inequality
we obtain
where, using Lemma 3, we have
and
Hence, we obtain
Thus, letting , tends to 0.
In the fourth step, we prove and
Thus, letting , we have tends to 0.
In the fifth step, using (10) again, it is easy to show that and
In the last step, letting , we have
The proof of Theorem is complete. □
Proof of Theorem 2.
First, note that
where
It follows from (3.5) of [15] that
Since and have the same distribution function, using a similar proof, we have
Now, we can finish the proof of Theorem 2 by plugging the last equality into (13) and dominated convergence theorem. □
Proof of Theorem 3.
We only give the proof of case (i), since the proof of case (ii) is similar. First, note that
where
Obviously,
Since the correlation function of satisfies the conditions A1 and A2, we have
Furthermore, using (16), as
Hence, for any and sufficiently large n
Thus, for a sufficiently large n
Now, using the dominated convergence theorem, in order to finishing the proof, we only need to show and
Noting that the correlation function of satisfies , repeating the proof of Theorem 1, we can prove that (20) holds. □
4. Conclusions
The joint asymptotic distribution of the maximum of stationary Gaussian sequence and the maximum of the sequence subject to random replacing is highly dependent on the dependent structure of the original sequence and the replacing sequence.
Author Contributions
Y.L. was a major contributor in writing the manuscript; Z.T. provided some helpful discussions in writing the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by Innovation of Jiaxing City: a program to support the talented persons and Project of new economy research center of Jiaxing City (No. WYZB202254).
Data Availability Statement
Not Applicable.
Acknowledgments
The authors would like to thank the referees and the Editor for the thorough reading and valuable suggestions.
Conflicts of Interest
The authors declare no competing interests.
References
- Hall, A.; Hüsler, J. Extremes of stationary sequences with failures. Stoch. Model. 2006, 22, 537–557. [Google Scholar] [CrossRef]
- Mladenović, P.; Piterbarg, V. On asymptotic distribution of maxima of complete and incomplete samples from stationary sequences. Stoch. Process. Their Appl. 2006, 116, 1977–1991. [Google Scholar] [CrossRef]
- Leadbetter, M.R.; Lindgren, G.; Rootzén, H. Extremes and Related Properties of Random Sequences and Processes; Springer: New York, NY, USA, 1983. [Google Scholar]
- Peng, Z.; Cao, L.; Nadarajah, S. Asymptotic distributions of maxima of complete and incomplete samples from multivariate stationary Gaussian sequences. J. Multivar. Anal. 2010, 101, 2641–2647. [Google Scholar] [CrossRef]
- Cao, L.; Peng, Z. Asymptotic distributions of maxima of complete and incomplete samples from strongly dependent stationary Gaussian sequences. Appl. Math. Lett. 2011, 24, 243–247. [Google Scholar] [CrossRef]
- Tong, B.; Peng, Z. On almost sure max-limit theorems of complete and incomplete samples from stationary sequences. Acta Math. Sin. 2011, 27, 1323–1332. [Google Scholar] [CrossRef]
- Tan, Z.; Wang, Y. Some asymptotic results on extremes of incomplete samples. Extremes 2012, 15, 319–332. [Google Scholar] [CrossRef]
- Mladenović, P.; Zivadinovic, L. Uniform AR(1) Processes and maxima on partial samples. Commun. Stat.-Theory Methods 2015, 44, 2546–2563. [Google Scholar] [CrossRef]
- Glavaš, L.; Mladenović, P.; Samorodnitsky, G. Extreme values of the uniform order 1 autoregressive processes and missing observations. Extremes 2017, 20, 671–690. [Google Scholar] [CrossRef]
- Panga, Z.; Pereira, L. On the maxima and minima of complete and incomplete samples from nonstationary random fields. Stat. Probab. Lett. 2018, 137, 124–134. [Google Scholar] [CrossRef]
- Glavaš, L.; Mladenović, P. Extreme values of linear processes with heavy-tailed innovations and missing observations. Extremes 2020, 23, 547–567. [Google Scholar] [CrossRef]
- Tong, J.; Peng, Z. Asymptotic distributions of the exceedances point processes for a class of dependent normal sequence. Acta Math. Appl. Sin. 2010, 33, 1–9. [Google Scholar]
- Peng, Z.; Tong, J.; Weng, Z. Exceedances point processes in the plane of stationary Gaussian sequences with data missing. Stat. Probab. Lett. 2019, 149, 73–79. [Google Scholar] [CrossRef]
- Krajka, T. The asymptotic behaviour of maxima of complete and incomplete samples from statioanry sequences. Stoch. Process. Their Appl. 2011, 121, 1705–1719. [Google Scholar] [CrossRef]
- Hashorva, E.; Peng, Z.; Weng, Z. On Piterbarg theorem for maxima of stationary Gaussian sequences. Lith. Math. J. 2013, 53, 280–292. [Google Scholar] [CrossRef]
- Hashorva, E.; Weng, Z. Maxima and minima of complete and incomplete stationary sequences. Stochastics 2014, 86, 707–720. [Google Scholar] [CrossRef]
- Krajka, T.; Rychlik, Z. The limiting behaviour of sums and maximums of iid random variables from the viewpoint of different observers. Probab. Math. Stat. 2014, 34, 237–252. [Google Scholar]
- Zheng, S.; Tan, Z. On the maxima of nonstationary random fields subject to missing observations. arXiv 2023, arXiv:2306.13857. [Google Scholar]
- Fang, Y.; Tan, Z. On the extreme order statistics for stationary Gaussian sequences subject to random missing observations. arXiv 2023, arXiv:2306.13861. [Google Scholar]
- Robert, C.Y. On asymptotic distribution of maxima of stationary sequences subject to random failure or censoring. Statist. Probab. Lett. 2010, 134, 134–142. [Google Scholar] [CrossRef]
- Li, Y.; Tan, Z. The asymptotic distribution of maxima of stationary random sequences under random replacing. arXiv 2023. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).