Abstract
In this paper, we study self-normalized moderate deviations for degenerate U-statistics of order 2. Let be i.i.d. random variables and consider symmetric and degenerate kernel functions in the form , where , , and is in the domain of attraction of a normal law for all . Under the condition and some truncated conditions for , we show that for and , where . As application, a law of the iterated logarithm is also obtained.
Keywords:
moderate deviation; degenerate U-statistics; law of the iterated logarithm; self-normalization MSC:
60F15; 60F10; 62E20
1. Introduction and Main Results
The recent three decades have witnessed significant developments on self-normalized limit theory, especially on large deviations, Cramér-type moderate deviations, and the law of the iterated logarithm. Compared with the classical limit theorems, these self-normalized limit theorems usually require much less moment assumptions.
Let be independent identically distributed (i.i.d.) random variables. Set
Griffin and Kuelbs [1] obtained a law of the iterated logarithm (LIL) for the self-normalized sum of i.i.d. random variables with distributions in the domain of attraction of a normal or stable law. They proved that
- If and X is in the domain of attraction of a normal law, then
- If X is symmetric and is in the domain of attraction of a stable law, then there exists a positive constant C such that
Shao [2] obtained the following self-normalized moderate deviations and specified the constant C in (2). Let be a sequence of positive numbers such that and as .
- If and X is in the domain of attraction of a normal law, then
- If X is in the domain of attraction of a stable law such that with index , or is symmetric with index , then
where is a constant depending on the tail distribution; see [2] for an explicit definition.
Shao [3] refined (3) and obtained the following Cramér-type moderate deviation theorem under a finite third moment: if and , then
for any , where Z is the standard normal random variable.
Jing, Shao and Wang [4] further extended (4) to general independent random variables under a Lindeberg-type condition, while Shao and Zhou [5] established the result for self-normalized non-linear statistics, which include U-statistics as a special case.
The U-statistics were introduced by Halmos [6] and Hoeffding [7]. The LIL for nondegenerate U-statistics was obtained by Serfling [8]. The LIL for degenerate U-statistics was studied by Dehling, Denker and Philipp ([9,10]), Dehling [11], Arcones and Giné [12], Teicher [13], Giné and Zhang [14], and others. Giné, Kwapień, Latała and Zinn [15] provided necessary and sufficient conditions for the LIL of degenerate U-statistics of order 2, which was extended to any order by Adamczak and Latała [16].
The main purpose of this paper is to study the self-normalized moderate deviations and the LIL for degenerate U-statistics of order 2. Let
where
A motivation example for the LIL is the one with the kernel . Obviously, . Then, via (1) and (2), we have
- If and X is in the domain of attraction of a normal law, then
- If X is symmetric and is in the domain of attraction of a stable law, then there exists a positive constant C such that
For the general degenerate kernel h defined in (5), we have
Suppose that is in the domain of attraction of a normal law for every . Then, is a slowly varying function for all as . Let be a sequence of positive numbers such that and as . For each , set
Write
We have the following self-normalized moderate deviation:
Theorem 1.
Let and for every .
and
for any . Then, for and ,
As an application, we have the following self-normalized LIL:
Theorem 2.
Under the assumptions in Theorem 1, and instead of (8), we assume that for each , there is a constant such that
Then,
2. Proofs
In the proofs of theorems, we will use the following properties for the slowly varying functions (e.g., Bingham et al. [17]). As ,
2.1. The Upper Bound of Theorem 1
2.2. Estimation of and
Proposition 1.
For that is sufficiently large,
and for any constants and ,
In particular,
Proof.
We shall apply the following exponential inequality (see, e.g., Theorem 2.19 of de la Peña, Lai and Shao [18]). If are independent random variables with , and , then for ,
By (8), as . Then, by (17),
Hence,
Then,
By Minkowski’s integral inequality, (14) and (15),
Therefore, (23) follows from (27) and (28). To show (24), notice that by (17),
Then,
Similar to the proof of as in (27) and (28), we have (24).
2.3. Estimation of
Proposition 2.
Proof.
Via the Cauchy–Schwarz inequality,
By (17), the sum of the diagonal terms is as follows:
Let
Then, for any constant ,
Let be the expectation of for . Then,
Since for any and for some sufficiently large m value, then
By (12) and (15), we have . Then, together with (17),
Applying (32) to (31), we have
Similarly,
Continue this process from to . We conclude that
Applying (33) to (30) and letting , we have
By the same argument,
Combining (29), (34) and (35), we obtain the proposition. □
2.4. Estimation of
Let be an independent copy of . We will use the following lemma which is a Bernstein-type exponential inequality for degenerate U-statistics.
Lemma 1
((3.5) of Giné, Latała and Zinn [19]). For bounded degenerate kernel , let
Then, there is a universal constant K such that
Recall (16). Hence, by (19) and the definition of in (22),
Let
In addition to the estimate of , we include (40) in the following proposition which will be used in the proof of Theorem 2, where
Proposition 3.
For a sufficiently large constant ,
Then, by (36) and the decoupling inequalities of de la Peña and Montgomery-Smith [20], for a sufficiently large ,
Suppose that for all and is a constant. For constants that are sufficiently small,
Proof.
We will prove (39) and (40) simultaneously. By (15) and (37),
By (15) and (38),
Let
Since , then by Cauchy–Schwarz inequality, (15) and (37),
The same result can be obtained for . Therefore,
Similarly,
Since , then . Hence,
By (37) and the Cauchy–Schwarz inequality,
Similarly,
Now let
By (41) and (46),
By (43) and (46),
By (44) and (46),
Then, (39) follows from Lemma 1 for a sufficiently large value. Similarly, let
By (42) and (47),
By (44) and (47),
By (45) and (47),
Therefore, (40) follows from Lemma 1 for and values that are sufficiently small. □
2.5. Estimation of
Lemma 2 below follows Corollary 1(b) of Einmahl [21] and Lemma 4.2 of Lin and Liu [22]. However, we add the condition , and our result is in a form of an exponential inequality for independent random vectors. We use the same positive constants and (depending only on the vector dimension d) in Einmahl [21].
Lemma 2.
Let be independent random vectors with a mean of zero and values in such that and , where denotes the Euclidean norm. Let . Suppose that
where , is a identity matrix, and is a positive sequence such that and
Let
Then, for any , there exists such that for all ,
uniformly for , where can be any sequence with and .
Proof.
Let , , be independent random vectors, which are independent of the s, where
By (49) and (50), we have as . Hence, as . Let be the probability density of and be the density of . By Corollary 1(b) in Einmahl [21] (together with the Remark on page 32), for ,
For any and ,
Let N denote a centered normal random vector with covariance matrix . Then, by (52),
Observe that has a distribution. It is well known that for a random variable Y, for . Hence,
by . Similarly,
by (51). Then, by (54)–(56), we have
Since the distribution of is , the distribution of is . Since the s are independent, by (48). Hence, the distribution of is . Then, similar to (56),
By (57) and (58), we have
Next, we estimate . For each , let , where denote the transpose of a vector . Then
Hence,
Since and , then and for each . By Bernstein’s inequality (e.g., (2.17) of de la Peña, Lai and Shao [18]),
Then, the lemma follows by applying (59) and (60) to (53). □
Now, we estimate in the following proposition, which uses some ideas in Liu and Shao [23]:
Proposition 4.
Proof.
For each , let
Let and be the covariance matrix of . For , let
Then,
Since the s are independent, then
Hence, Condition (48) in Lemma 2 is satisfied. Let
where is a finite constant depending only on m. We shall verify Condition (49). By (61),
Observe that is positive definite by Assumption (9). Then, by the identity
for any positive definite matrix , we have
Let such that and . Then, for any , by the Cauchy–Schwarz inequality,
Since for all , then by (13) and (15). By Assumption (9),
By the Cauchy–Schwarz inequality,
Hence,
Applying (65) and (66) to (64), we have
Since by (15), and (16), we have
By (67),
and
By Hölder’s inequality,
Combining (70) and (71), we have
by (14) and (15). Since and , then by (68) and (72), we have
Hence, Condition (49) in Lemma 2 is satisfied. Similarly,
Then, . By (68), we have . By (69), we have . Then, by Lemma 2 and (73) with and for a sufficiently large value, we have
Letting , we have
Similar to (62),
We will use Identity (63) to estimate (75). Let be such that and
Observe that
By (66),
because . By Identity (63) and by (76)–(78),
By (74), (75) and (79), with the application of (16) and (19),
□
Since is arbitrary, then the upper bound of Theorem 1 follows from (22) and the estimates of , , and .
3. The Lower Bound of Theorem 1
Let be the random variable with distribution which is of the distribution of conditioned on . Define and . By the definition of and (12),
Notice that (13) implies . Then, we have
and
Then, for ,
Without the loss of generality, assume that . Then,
Recall (15) and (81). Take ; thus, we have . Therefore, by Theorem 5.2.2 in Stout [24], for any , we have
On the other hand,
We apply the following exponential inequality (see Lemma 2.1, Csörgő, Lin and Shao [25]; see also Pruitt [26] and Griffin and Kuelbs [1]) for the rest of the proof.
4. The Upper Bound of Theorem 2
Lemma 4
(Lemma 2.3 of Giné, Kwapień, Latała, and Zinn [15]). There exists a universal constant such that for any kernel h and any two sequences of i.i.d. random variables, we have
for all and all .
Proposition 5.
Under the assumptions of Theorem 1,
Consequently,
Proof.
Let as . Let with be sufficiently small. For any positive integer , via a similar idea as in (19) with ,
Notice that (10) implies (8). By (17) and the upper bound of Theorem 1,
Let be a sufficiently small constant. By (19),
By (24) in Proposition 1,
By the Cauchy–Schwarz inequality, for each k,
By (17), for some m value that is sufficiently large, the sum of the diagonal terms is as follows:
By (91) and (92),
Let
Then, for any constant ,
Let be the expectation of for . Then,
Since for any and for some m value that is sufficiently large, then
By (12) and (15), we have . Then, together with (17),
Applying (96) to (95), we have
Similarly,
Continue this process from to . Thus, we conclude that
Applying (97) to (94) and letting , we have
To estimate , let
Then, for any constant ,
Let be the expectation of for . Note . Then,
Observe that
Then, by (17), for some m value that is sufficiently large. Since for any ,
Under Assumption (10), for each ,
Recall that (17); then, for each ,
Hence, by (101),
Then,
By (12) and (15), we have . Then, together with (10),
Then, by (102) and (103),
Continue this process from to and by (100),
Applying (104) to (99) and letting , we have
By (93), (98) and (105),
By the definition of in (89), and by Lemma 4, there is a constant such that
Similar to (36),
By the decoupling version of (40) in Proposition 3,
Combining (89), (90), (106) and (107), we have
By (87), (88) and (108),
Let for some . We have
Let . Then,
By the Borel–Cantelli lemma,
□
5. The Lower Bound of Theorem 2
Author Contributions
Conceptualization, Q.-M.S.; methodology, L.G., H.S. and Q.-M.S.; formal analysis, L.G., H.S. and Q.-M.S.; investigation, L.G., H.S. and Q.-M.S.; writing—original draft preparation, L.G. and H.S.; writing—review and editing, Q.-M.S. All authors have read and agreed to the published version of the manuscript.
Funding
This research was partially supported by the Simons Foundation Grant 586789, USA, as well as by the National Nature Science Foundation of China NSFC 12031005 and Shenzhen Outstanding Talents Training Fund, China.
Institutional Review Board Statement
Not applicable.
Data Availability Statement
Data sharing is not applicable (only appropriate if no new data is generated or the article describes entirely theoretical research).
Conflicts of Interest
The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
References
- Griffin, P.; Kuelbs, J. Self-normalized laws of the iterated logarithm. Ann. Probab. 1989, 17, 1571–1601. [Google Scholar] [CrossRef]
- Shao, Q.M. Self-normalized large deviations. Ann. Probab. 1997, 25, 285–328. [Google Scholar] [CrossRef]
- Shao, Q.M. A Cramér type large deviation result for Student’s t-statistic. J. Theoret. Probab. 1999, 12, 385–398. [Google Scholar] [CrossRef]
- Jing, B.Y.; Shao, Q.M.; Wang, Q. Self-normalized Cramér type large deviations for independent random variables. Ann. Probab. 2003, 31, 2167–2215. [Google Scholar] [CrossRef]
- Shao, Q.M.; Zhou, W.X. Cramér type moderate deviation theorems for self-normalized processes. Bernoulli 2016, 22, 2029–2079. [Google Scholar] [CrossRef]
- Halmos, P.R. The theory of unbiased estimation. Ann. Math. Statst. 1946, 17, 34–43. [Google Scholar] [CrossRef]
- Hoeffding, W. A class of statistics with asymptotically normal distribution. Ann. Math. Statst. 1948, 19, 293–325. [Google Scholar] [CrossRef]
- Serfling, R.J. The law of the iterated logarithm for U-statistics and related von Mises statistics. Ann. Math. Statist. 1971, 42, 1794. [Google Scholar]
- Dehling, H.; Denker, M.; Philipp, W. Invariance principles for von Mises and U-statistics. Z. Wahrscheinlichkeitstheorie Verwandte Geb. 1984, 67, 139–167. [Google Scholar] [CrossRef]
- Dehling, H.; Denker, M.; Philipp, W. A bounded law of the iterated logarithm for Hilbert space valued martingales and its application to U-statistics. Probab. Theory Relat. Fields 1986, 72, 111–131. [Google Scholar] [CrossRef]
- Dehling, H. Complete convergence of triangular arrays and the law of the iterated logarithm for degenerate U-statistics. Statist. Probab. Lett. 1989, 7, 319–321. [Google Scholar] [CrossRef]
- Arcones, M.; Giné, E. On the law of the iterated logarithm for canonical U-statistics and processes. Stoch. Process. Appl. 1995, 58, 217–245. [Google Scholar] [CrossRef]
- Teicher, H. Moments of randomly stopped sums revisited. J. Theoret. Probab. 1995, 8, 779–794. [Google Scholar] [CrossRef]
- Giné, E.; Zhang, C.-H. On Integrability in the LIL for Degenerate U-statistics. J. Theoret. Probab. 1996, 9, 385–412. [Google Scholar] [CrossRef]
- Giné, E.; Kwapień, S.; Latała, R.; Zinn, J. The LIL for canonical U-statistics of order 2. Ann. Probab. 2001, 29, 520–557. [Google Scholar] [CrossRef]
- Adamczak, R.; Latała, R. The LIL for canonical U-statistics. Ann. Probab. 2008, 36, 1023–1068. [Google Scholar] [CrossRef]
- Bingham, N.H.; Goldie, C.M.; Teugels, J.L. Regular Variation; Cambridge University Press: Cambridge, UK, 1987. [Google Scholar]
- de la Peña, V.H.; Lai, T.L.; Shao, Q.M. Self-Normalized Processes: Limit Theory and Statistical Applications. Probability and Its Applications (New York); Springer: Berlin, Germany, 2009. [Google Scholar]
- Giné, E.; Latała, R.; Zinn, J. Exponential and moment inequalities for U-statistics. In High Dimensional Probability II; Birkhäuser: Boston, MA, USA, 2000; pp. 13–38. [Google Scholar]
- de la Peña, V.H.; Montgomery-Smith, S.J. Decoupling inequalities for the tail probabilities of multivariate U-statistics. Ann. Probab. 1995, 23, 806–816. [Google Scholar] [CrossRef]
- Einmahl, U. Extensions of results of Komlós, Major, and Tusnády to the multivariate case. J. Mult. Anal. 1989, 28, 20–68. [Google Scholar] [CrossRef]
- Lin, Z.; Liu, W. On maxima of periodograms of stationary processes. Ann. Statist. 2009, 37, 2676–2695. [Google Scholar] [CrossRef]
- Liu, W.; Shao, Q.M. A Cramér moderate deviation theorem for Hotelling’s T2-statistic with applications to global tests. Ann. Statist. 2013, 41, 296–322. [Google Scholar] [CrossRef]
- Stout, W.F. Almost Sure Convergence; Academic Press: New York, NY, USA, 1974. [Google Scholar]
- Csörgő, M.; Lin, Z.Y.; Shao, Q.M. Studentized increments of partial sums. Sci. China Ser. A 1994, 37, 265–276. [Google Scholar]
- Pruitt, W.E. General one-sided laws of the iterated logarithm. Ann. Probab. 1981, 9, 1–48. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).