Hypercontractive Inequalities for the Second Norm of Highly Concentrated Functions, and Mrs. Gerber’s-Type Inequalities for the Second Rényi Entropy
Abstract
:1. Introduction
- –
- The functions are somewhat cumbersome to describe, and hence we relegate their precise definition to Definition 1 below.
- –
- –
- Recall that for a point and , the Hamming sphere of radius r around x is the set . As will be seen from the proof of Proposition 1, (4) is essentially tight for a certain convex combination of the uniform distribution on and the characteristic function of a Hamming sphere of an appropriate radius (depending on q, , and the required value of ).
- –
- In information theory one typically considers a slightly different notion of Rényi entropies: For a probability distribution P on , the Renyi entropy of P is given by . To connect notions, if f is a nonnegative (non-zero) function on with expectation 1, then is a probability distribution, and . Furthermore, , where X is a random variable on distributed accordinng to P and Z is an independent noise vector corresponding to a binary symmetric channel with crossover probability . Hence, (2) can be restated asHere is an explicitly given function on , which is increasing and convex in its first argument (), and similarly for .Next, we describe our main result, a hypercontractive inequality for the norm of which takes into account the ratio between and norms of f, and more specifically .
- –
- The precise definition of the functions will be given in Definition 1 below. At this point let us just observe that since the sequence increases with q, we would expect the fact that is large to become less significant as q increases. This is expressed in the properties of the functions in the following manner: If then for any the function starts as a constant- function up to some , and becomes strictly decreasing after that. In other words is the largest possible value of for which Theorem 1 provides no new information compared to (1). For there is a value , such that for all the function is strictly decreasing (in which case we say that ). However, for all . The function decreases with q (in particular, for ). The function increases both in q and in .
- –
- Notably, taking in Theorem 1 gives (see Corollary 1)
- –
- As will be seen from the proof of Theorem 1, (5) is essentially tight for a certain convex combination of the uniform distribution on and characteristic functions of one or two Hamming spheres of appropriate radii (the number of the spheres and their radii depend on q, , and the required value of ).
- –
- Let f be a non-constant function and let be fixed. Consider the function . It will be seen that there is a unique value of q for which . Furthermore, . Hence it provides the best possible value for in Theorem 1. With that, determining might in principle require knowledge of all the Renyi entropies , for , while typically we are in possession of one of the “easier” Rényi entropies, such as or .
1.1. Full Statements of Proposition 1 and Theorem 1
- If , let . Define
- Let . Let . If , let be determined by . If , define . Otherwise, define
- 1.
- Taking in Theorem 1 gives:
- 2.
- Taking in Theorem 1 gives, for andIn the second case α is determined by .
1.2. Applications
- 1.
- 2.
- Let express the fact that ϵ is a sufficiently small absolute constant. Then for holds , where the asymptotic notation hides absolute constants which may depend on x.
- Let D be a subset of of cardinality , for some . ThenThis is almost tight if D is a Hamming ball of exponentially small cardinality.
- For any holds
- –
- As discussed above, the first of the these claims answers the question of [19] and shows that a certain approach to bound binary codes does not lead to an improvement of the first JPL bound. The second claim shows that the best possible bound obtainable via the linear programming approach of [15] is not better than the arithmetic average of the Gilbert-Varshamov bound and the first JPL bound. Observe that the first claim is a consequence of the logarithmic Sobolev inequality in Corollary 2, and hence of the behavior of the norm in Theorem 1 as . The second claim is a consequence of the uncertainty principle in Corollary 3, and hence of the behavior of the norm in Theorem 1 as . We find these connections between notions to be rather intriguing.
- –
- As we have mentioned, the first of the claims recovers a result of [20], where it was also derived from the appropriate logarithmic Sobolev inequality. (Apart from this claim being a simple corollary of Theorem 1, an additional reason for stating it here is that it has only appeared in the unpublished arXiv preprint [20].) The second claim of recovers a result of [17].
1.3. Related Work
2. Proof of Proposition 1
- 1.
- is strictly concave and increasing from to 0 on .
- 2.
- , .
- 3.
- is strictly increasing in α, going up to , as .
- 4.
- The function is strictly decreasing on . Moreover, and .
- . In this case h is increasing on I and we getThe last equality follows from the definition of in this case.
- . Note that, by Lemma 3, . Hence, in this case the maximum of h on I is located at the unique zero of its derivative, that is at the point such that . Using the definition of in this case, we get
- If , then by the discussion above, the point lies on the line , but not on the line . Observe that (the first estimate follows from the Stirling formula, for the second estimate see, e.g., Theorem 1.4.5. in [28]). As the first attempt, let . Then . That is, . However, is exponentially small. To correct that, we define f to be on S, and on the complement of S. Then . We choose to be as small as possible, while ensuring that . Since the contribution of the constant-1 function to is exponentially small w.r.t. , we can choose . We now have , , andHere the second equality follows from the semigroup property of the noise operator: . The first inequality follows from the tightness part of Theorem 2 and the definition of . The second inequality follows from Lemma 2.The tightness of (4) in this case now follows, taking into account the fact that is strictly increasing.
- If , the point lies on the intersection of the lines , and . Hence the function has both , and . It is easy to see that g can be corrected as in the preceding case, by decreasing it slightly on S and adding a constant component, to obtain a function f with expectation 1 and , and with , proving the tightness of (4) in this case as well. We omit the details.
3. Proof of Theorem 1
- 1.
- 2.
- If , then for any choice of holds
- 3.
- If , then for any choice of holds
3.1. Proof of Lemma 4
- 1.
- and .
- 2.
- .
- 1.
- For any holds .
- 2.
- on Ω.
- 3.
- For any holds . (If we replace the RHS of this inequality with .)
3.2. Proof of Proposition 3
- .
- .
- . Let be a maximum point of of the second type in this case. Then .
- . In this case f has a unique maximum point . This point is of the second type. Furthermore, , and it is uniquely determined by the following identity:
3.3. Proofs of the Remaining Lemmas
- .
- and .
- . In this case, by the third claim of Lemma 3, is never larger than q, and henceHere , , and is determined by . Note that is well-defined, by the fourth claim of Lemma 3. The fact that is decreasing in x follows from combining the third and the fourth claims of Lemma 3. In fact, is a constant- function for , and it is strictly decreasing for larger x.
- . In this case y is always greater than and we have thatIt suffices to show that is decreasing on both relevant subintervals of , and this again follows from the third and the fourth claims of Lemma 3. In this case is strictly decreasing on .
4. Remaining Proofs
- 1.
- 2.
- 3.
- f is symmetric, that is depends only on .
- for .
- and .
- .
Proofs of Comments to Theorem 1
- If then for any the function starts as a constant- function up to some , and becomes strictly decreasing after that. For there is a value , such that for all the function is strictly decreasing (in which case we say that ). However, for all . The function decreases with q (in particular, for ). The function increases both in q and in ϵ.
- The function is strictly decreasing in its first argument for any . It satisfies , for all .
- Let f be a non-constant function on . Let . Let . There is a unique value of q for which . Moreover, . Furthermore, for any f.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Beckner, W. Inequalities in Fourier Analysis. Ann. Math. 1975, 102, 159–182. [Google Scholar] [CrossRef]
- Bonami, A. Etude des coefficients Fourier des fonctions de Lp(G). Annales de l’Institut Fourier 1970, 20, 335–402. [Google Scholar] [CrossRef]
- Gross, L. Logarithmic Sobolev inequalities. Am. J. Math. 1975, 97, 1061–1083. [Google Scholar] [CrossRef]
- O’Donnell, R. Analysis of Boolean Functions; Cambridge University Press: New York, NY, USA, 2014. [Google Scholar]
- Samorodnitsky, A. An upper bound on ℓq norms of noisy functions. IEEE Trans. Inf. Theory 2020, 66, 742–748. [Google Scholar] [CrossRef]
- Wyner, A.D.; Ziv, J. A theorem on the entropy of certain binary sequences and applications: Part I. IEEE Trans. Inf. Theory 1973, 19, 769–772. [Google Scholar] [CrossRef]
- Cover, T.; Thomas, J. Elements of Information Theory; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]
- Rényi, A. On the measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 20–30 June 1961; Volume 1, pp. 547–561. [Google Scholar]
- Kirshner, N.; Samorodnitsky, A. A moment ratio bound for polynomials and some extremal properties of Krawchouk polynomials and Hamming spheres. IEEE Trans. Inf. Theory 2021, 67, 3509–3541. [Google Scholar] [CrossRef]
- Polyanskiy, Y.; Samorodnitsky, A. Improved log-Sobolev inequalities, hypercontractivity and uncertainty principle on the hypercube. J. Funct. Anal. 2019, 277, 108280. [Google Scholar] [CrossRef]
- Yu, L.; Anantharam, V.; Chen, J. Graphs of joint types. In Proceedings of the ISIT, Melbourne, Australia, 12–20 July 2021. [Google Scholar]
- Chandar, V.; Tchamkerten, A. Most informative quantization functions. In Proceedings of the ITA Workshop, San Diego, CA, USA, 9–14 February 2014. [Google Scholar]
- MacWilliams, J.; Sloane, N.J.A. The Theory of Error Correcting Codes; Elsevier: Amsterdam, The Netherlands, 1977. [Google Scholar]
- McEliece, R.J.; Rodemich, E.R.; Rumsey, H., Jr.; Welch, L.R. New upper bounds on the rate of a code via the Delsarte-MacWilliams inequalities. IEEE Trans. Inf. Theory 1977, 23, 157–166. [Google Scholar] [CrossRef]
- Delsarte, P. An algebraic approach to the association schemes of coding theory. Philips Res. Rep. Suppl. 1973, 10, vi+–97. [Google Scholar]
- Barg, A.; Jaffe, D.B. Numerical results on the asymptotic rate of binary codes. In Codes and Association Schemes; Barg, A., Litsyn, S., Eds.; American Mathematical Society: Providence, RI, USA, 2001. [Google Scholar]
- Samorodnitsky, A. On the Optimum of Delsarte’s Linear Program. J. Comb. Theory Ser. A 2001, 96, 261–287. [Google Scholar] [CrossRef]
- Navon, M.; Samorodnitsky, A. On Delsarte’s Linear Programming Bounds for Binary Codes. In Proceedings of the FOCS 2005, Pittsburgh, PA, USA, 23–25 October 2005; pp. 327–338. [Google Scholar]
- Friedman, J.; Tillich, J.-P. Generalized Alon-Boppana theorems and error-correcting codes. SIAM J. Discret. Math. 2005, 19, 700–718. [Google Scholar] [CrossRef] [Green Version]
- Samorodnitsky, A. A modified logarithmic Sobolev inequality for the hamming cube and some applications. arXiv 2008, arXiv:0807.1679. [Google Scholar]
- Boucheron, S.; Lugosi, G.; Massart, P. Concentration Inequalities; Oxford University Press: Oxford, UK, 2013. [Google Scholar]
- Donoho, D.L.; Stark, P.B. Uncertainty principles and signal recovery. SIAM J. Appl. Math. 1989, 49, 906–931. [Google Scholar] [CrossRef]
- Kalai, G.; Linial, N. On the distance distribution of codes. IEEE Trans. Inf. Theory 1995, 41, 1467–1472. [Google Scholar] [CrossRef]
- Ashikhmin, A.; Cohen, G.; Krivelevich, M.; Litsyn, S. Bounds on distance distributions in codes of known size. IEEE Trans. Inf. Theory 2005, 51, 250–258. [Google Scholar] [CrossRef]
- Eskenazis, A.; Ivanisvili, P. Polynomial inequalities on the Hamming cube. Prob. Theory Relat. Fields 2020, 178, 235–287. [Google Scholar] [CrossRef]
- Ivanisvili, P.; Tkocz, T. Comparison of moments of Rademacher Chaoses. Ark. Mat. 2019, 57, 121–128. [Google Scholar] [CrossRef]
- Yu, L. Strong Brascamp-Lieb Inequalities. arXiv 2021, arXiv:2102.06935. [Google Scholar]
- van Lint, J.H. Introduction to Coding Theory, 3rd ed.; Graduate Texts in Mathematics; Springer: Berlin, Germany, 1999; Volume 86. [Google Scholar]
- Kalai, G.; (The Hebrew University of Jerusalem, Jerusalem, Israel); Linial, N.; (The Hebrew University of Jerusalem, Jerusalem, Israel). Personal communication, 1994.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Levhari, N.; Samorodnitsky, A. Hypercontractive Inequalities for the Second Norm of Highly Concentrated Functions, and Mrs. Gerber’s-Type Inequalities for the Second Rényi Entropy. Entropy 2022, 24, 1376. https://doi.org/10.3390/e24101376
Levhari N, Samorodnitsky A. Hypercontractive Inequalities for the Second Norm of Highly Concentrated Functions, and Mrs. Gerber’s-Type Inequalities for the Second Rényi Entropy. Entropy. 2022; 24(10):1376. https://doi.org/10.3390/e24101376
Chicago/Turabian StyleLevhari, Niv, and Alex Samorodnitsky. 2022. "Hypercontractive Inequalities for the Second Norm of Highly Concentrated Functions, and Mrs. Gerber’s-Type Inequalities for the Second Rényi Entropy" Entropy 24, no. 10: 1376. https://doi.org/10.3390/e24101376