1. Introduction
The exploration of dependence among random variables is fundamental in probability theory and spans various scientific disciplines. Particularly, Extended Negative Dependence (END) has emerged as a significant area of study, shedding light on systems where variables inversely influence each other. This paper builds upon the rich theoretical background established by seminal works to advance the understanding of END random variables, their convergence properties, and implications in statistical modeling and analysis.
Liu’s [
1] pioneering work on precise large deviations for dependent variables with heavy tails marked a significant advancement in the field, providing a methodological foundation that has influenced subsequent studies. Understanding the behavior of dependent variables, especially those with heavy tails, is crucial as it impacts the reliability of statistical inferences in real-world applications, ranging from financial risk management to environmental statistics.
Building on this foundation, Shen et al. [
2,
3] introduced probability inequalities for END sequences and elucidated their convergence behaviors. These contributions are pivotal as they not only enhance the theoretical understanding of END variables but also offer practical tools for analyzing their collective dynamics. The study of convergence properties, as explored by Yan [
4] and Wang et al. [
5], further expands this knowledge base, offering nuanced insights into the statistical properties of END variables through the lens of complete convergence and moment convergence.
The theoretical underpinnings of END variable analysis also draw heavily from classic results in probability theory, such as those by Baum and Katz [
6] on convergence rates in the law of large numbers. This theory provides a mathematical framework for assessing the convergence behavior of random variables, a critical aspect when dealing with END variables. Additionally, the work by Chen et al. [
7] and Chow [
8] on uniform asymptotics and moment complete convergence, respectively, extends the application of these theories to more complex and realistic models.
At the intersection of dependence structures and statistical modeling, the contributions of Ebrahimi and Ghosh [
9] on multivariate negative dependence, and Wu and Min [
10] on the asymptotic theory of unbounded negatively dependent variables, lay the groundwork for a nuanced understanding of END variables [
11,
12,
13,
14]. These studies highlight the complexity of dependence structures and their impact on the asymptotic behavior of statistical estimators and models. Exponential type inequality for unbounded Widely Orthant Dependent (WOD) random variables can be found in [
15].
This paper not only synthesizes the aforementioned theoretical advancements but also aims to extend them by exploring new dimensions of END variables and their applications. Boucheron, Lugosi, and Massart’s [
16] work on concentration inequalities and Vershynin’s [
17] introduction to high-dimensional probability theory provide a modern context for our study. These perspectives are crucial for addressing the challenges posed by high-dimensional data in understanding the behavior of END random variables.
The concentration inequality plays a pivotal role in many limit theorem proofs by providing a measure of partial sum convergence crucial for complete convergence analysis. It was Liu [
1] who initially proposed the concept of Extended Negative Dependence (END). Before delving further, let us review the definitions of Lower Negatively Extended Dependent and Upper Negatively Extended Dependent. As outlined by Liu, the Extended Negative Dependence structure surpasses the Negatively Orthant-Dependent (NOD) structure as it can reflect both negative and positive dependence structures to some extent. Liu also provided intriguing examples demonstrating that Extended Negative Dependence supports a wide range of dependency models. According to Joag-Dev and Proschan [
18], random variables that are negatively associated (NA) must satisfy the property of negative orthant dependence (NOD). However, not all random variables satisfying NOD are necessarily negatively associated. Therefore, random variables that exhibit negative association are considered endogenous (END).
The potential scenarios where END errors might be relevant include situations with volatility clustering and asymmetric dependence observed in financial time series in financial data, negative dependence between observations due to resource constraints or competition in environmental data, and dependence structures arising from interactions between interconnected components in networked systems.
Therefore, it is crucial to examine the asymptotic characteristics of this expanded END category. Following the release of Ebrahimi and Ghosh’s work [
9], researchers have extensively investigated the convergence properties of NOD random sequences using various approaches. A sequence of random variables
is said to converge completely to a constant if, for each
, the sum of the probabilities
. As a consequence of the Borel–Cantelli lemma,
almost surely. Therefore, the idea of complete convergence is more powerful than that of almost surely convergence. The first explanation of complete convergence came from Robbins and Hsu [
11]. In general, Baum and Katz [
6], Chow [
8], Wang and others [
3,
5,
7] can all be cited.
This paper aims to enhance the understanding of END variables. We begin by reviewing the literature on END variables, highlighting their defining characteristics and the theoretical advancements. We then introduce a novel concentration inequality tailored to END variables, detailing its derivation and potential applications. Following this, we explore the convergence properties of sequences of END variables, employing theoretical analysis and simulated data to illustrate the practical implications of our findings.
We apply our theoretical findings to linear models, with a particular focus on the first-order auto-regressive (AR(1)) model. This application not only demonstrates the utility of our contributions in a practical setting but also offers new insights into the behavior of END variables in statistical modeling. Through rigorous analysis, we aim to show how our research advances both the theoretical and practical understanding of END variables.
In this comprehensive exploration, our paper unfolds across seven distinctive sections.
Section 1 introduces the theoretical foundation and motivation, providing a context for our study.
Section 2, Preliminary Lemmas, establishes essential mathematical lemmas that underpin our subsequent analysis. The main theorems and their proofs are presented in
Section 3, Main Results and Proofs, offering a deep dive into the theoretical framework, introducing the novel concentration inequality tailored to END random variables and establishing its foundational properties. In
Section 4, we present the principal finding of almost complete convergence for sequences of Extended Negative Dependence random variables, leveraging the developed concentration inequality.
Section 5 showcases the application of these results in the context of the AR(1) model.
Section 6, Numerical Illustration, breathes life into the theoretical framework, showcasing its practical implications through a detailed numerical example. Finally,
Section 7 concludes the paper, summarizing its contributions and outlining potential avenues for future research, and also reflecting on the broader significance of our study in the realms of extreme value theory and statistical analysis.
6. Numerical Illustration
In this section, we provide a comprehensive numerical demonstration of concentration inequalities applied to a dataset generated from an autoregressive process with transformed residuals. Our main objective is to validate and visualize the concentration of the sum of transformed residuals around its expected value.
Concentration inequalities are vital tools in probability theory and statistical analysis, offering limits on the deviation of random variables from their expected values. This section explores the application of concentration inequalities to a dataset derived from an autoregressive process with transformed residuals, examining the implications of extreme value theory (EVT) on the distribution’s tails.
The methodology of this study involved generating a dataset that represents an autoregressive process AR(1) with residuals transformed into Extended Negative Dependence random variables (END) using the sine function, in
Figure 1 and
Figure 2 below.
When we proceed to verify END properties for the transformed residuals, we find that the condition, for all
k,
, is not satisfied. Therefore, we apply an additional transformation using a sine function,
Figure 3 below.
Before verifying the condition above for this final transformation, we ensure that the sine-transformed data remain Extended Negative Dependence random variables (ENDs) through moment conditions and the
Q-
Q plot in the
Figure 4 below.
The points roughly follow a straight line from around −2.0 to 1.5 on the x-axis. This indicates that the data approximates a normal distribution.
Prior to delving into the concentration inequality, we rigorously confirm that the transformed residuals meet the conditions outlined by Theorem 1. Specifically, we validate that for all .
Ensuring the meticulous verification of these conditions is crucial for laying the foundation for the subsequent analysis of concentration inequalities.
With these conditions confirmed, we proceed to compute the theoretical bound for the concentration inequality. The bound is expressed as
where
M,
, and
are constants determined based on the characteristics of the transformed residuals.
Subsequently, we compute the actual probability of the sum of transformed residuals deviating from its expected value for varying sample sizes. This involves calculating the cumulative sum of the transformed data (Sn) and comparing it with the expected cumulative sum (ESn).
The culmination of our numerical illustration is visualized through a line plot. This plot showcases the theoretical bound alongside the actual probability for different sample sizes, offering a visual comparison of the concentration behavior in
Figure 5 below.
The results obtained from the numerical illustration shed light on the concentration behavior of the sum of transformed residuals. The visual comparison of the theoretical bound and the actual probability reinforces the validity of the concentration inequality for the given dataset.
Furthermore, the verification of conditions ensures the applicability of Theorem 1 to the transformed residuals, affirming the foundational assumptions crucial for the subsequent analysis.
In conclusion, this section provides a comprehensive numerical illustration of concentration inequalities applied to transformed residuals from an autoregressive process. Through meticulous verification of conditions, calculation of theoretical bounds, and visual comparison with actual probabilities, we gain insights into the concentration behavior of the transformed data.
This analysis contributes to a deeper understanding of statistical properties in the context of EVT, particularly for non-random variables exhibiting extreme value behavior. The verified concentration inequalities provide a solid foundation for subsequent analyses and form a basis for further exploration into the tails of the distribution.
Through this illustration, we not only showcase the application of concentration inequalities but also emphasize the importance of thorough verification and visual representation in establishing the reliability of statistical results.