Large Deviation Results and Applications to the Generalized Cramér Model

Rita Giuliano; Claudio Macci

doi:10.3390/math6040049

and

¹

Dipartimento di Matematica, Università di Pisa, Largo Bruno Pontecorvo 5, I-56127 Pisa, Italy

²

Dipartimento di Matematica, Università di Roma Tor Vergata, Via della Ricerca Scientifica, I-00133 Rome, Italy

^*

Author to whom correspondence should be addressed.

^†

The support of INdAM (Fondi GNAMPA) and Università di Pisa (Fondi di Ateneo) is acknowledged. The first version of this paper was written during the staying of the first author at the University Jean Monnet (St. Etienne).

Mathematics2018, 6(4), 49;https://doi.org/10.3390/math6040049

This article belongs to the Special Issue Stochastic Processes with Applications

Version Notes

Order Reprints

Abstract

In this paper, we prove large deviation results for some sequences of weighted sums of random variables. These sequences have applications to the probabilistic generalized Cramér model for products of primes in arithmetic progressions; they could lead to new conjectures concerning the (non-random) set of products of primes in arithmetic progressions, a relevant topic in number theory.

Keywords:

arithmetic progressions; first Chebyshev function; products of primes; regularly varying functions; slowly varying functions

1. Introduction

The aim of this paper is to prove asymptotic results for a class of sequences of random variables, i.e.,

\{\frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}} : n \geq 1\}

(1)

for suitable sequences of real numbers

{b_{n} : n \geq 1}

and

{L_{n} : n \geq 1}

(see Condition 1 in Section 3) and suitable random independent variables

{X_{n} : n \geq 1}

defined on the same probability space

(Ω, F, P)

. We also present analogue results for the slightly different sequence

\{\frac{L_{n} \sum_{k = 1}^{n} X_{k}}{b_{n}} : n \geq 1\} .

(2)

More precisely we refer to the theory of large deviations, which gives an asymptotic computation of small probabilities on an exponential scale (see, e.g., [1] as a reference on this topic). We recall [2] as a recent reference on large deviations for models of interest in number theory.

The origin and the motivation of our research rely on the study of some random models similar in nature to the celebrated Cramér model for prime numbers: i.e., what we have called the generalized model (for products of prime numbers in arithmetic progressions). We are not aware of any work where these probabilistic models are studied. Details on these structures will be given in Section 2. Here we only point out that, as the classical probabilistic model invented by Cramér has been used to formulate conjectures on the (non-random) set of primes (see [3] for details), in a similar way we can draw out conjectures also for the non-random sets of products of primes or products of primes in arithmetic progressions. The large deviation results for the sequences concerning these structures will be given in Corollary 1.

We also remark that the particular form of the sequence (1) is motivated by analogy with the first Chebyshev function, as will be explained in Section 2.

It is worth noting that also some moderate deviation properties can be proved (in terms of suitable bounds on cumulants and central moments) for the centered sequences

\{\frac{\sum_{k = 1}^{n} L_{k} (X_{k} - E [X_{k}])}{b_{n}} : n \geq 1\} and \{\frac{L_{n} \sum_{k = 1}^{n} (X_{k} - E [X_{k}])}{b_{n}} : n \geq 1\} .

Such propositions will not be dealt with in the sequel since, though some specific assumptions must be made in the present setting, these results are in the same direction as those of the paper [4], where moderate deviations from the point of view of cumulants and central moments are fully investigated.

It should be noted that our results are a contribution to the recent literature on limit theorems of interest in probability and number theory; here, we recall [5], where the results are formulated in terms of the mod-

φ

convergence (see also [6] where the simpler mod-Gaussian convergence is studied).

We here introduce some terminology and notation. We always set

0 log 0 = 0

,

\frac{c}{\infty} = 0

for

c \neq 0

, and

⌊ x ⌋ : = max {k \in Z : k \leq x < k + 1}

for all

x \in R

. Moreover, we write

$a_{n} \sim b_{n}$ to mean that ${lim}_{n \to \infty} \frac{a_{n}}{b_{n}} = 1$ ;
$Z \overset{law}{\sim} B (p)$ , for $p \in [0, 1]$ , to mean that $P (Z = 1) = p = 1 - P (Z = 0)$ ;
$Z \overset{law}{\sim} P (λ)$ , for $λ > 0$ , to mean that $P (Z = k) = \frac{λ^{k}}{k!} e^{- λ}$ for all integers $k \geq 0$ .

The outline of this paper is as follows: We start with some preliminaries in Section 2, and we present the results in Section 3. The results for the generalized Cramér model (for products of primes in arithmetic progressions) are presented in Corollary 1.

2. Preliminaries

On large deviations.

We refer to [1] (pages 4–5). Let

Z

be a topological space equipped with its completed Borel

σ

-field. A sequence of

Z

-valued random variables

{Z_{n} : n \geq 1}

satisfies the large deviation principle (LDP) with speed function

v_{n}

and rate function I if the following is true:

{lim}_{n \to \infty} v_{n} = \infty

, and the function

I : Z \to [0, \infty]

is lower semi-continuous.

\underset{n \to \infty}{lim sup} \frac{1}{v_{n}} log P (Z_{n} \in F) \leq - inf_{z \in F} I (z) for all closed sets F

\underset{n \to \infty}{lim inf} \frac{1}{v_{n}} log P (Z_{n} \in G) \geq - inf_{z \in G} I (z) for all open sets G .

A rate function I is said to be good if its level sets

{{z \in Z : I (z) \leq η} : η \geq 0}

are compact.

Throughout this paper, we prove LDPs with

Z = R

. We recall the following known result for future use.

Theorem 1 (Gärtner–Ellis Theorem).

Let

{Z_{n} : n \geq 1}

be a sequence of real valued random variables. Assume that the function

Λ : R \to (- \infty, \infty]

defined by

Λ (θ) : = lim_{n \to \infty} \frac{1}{v_{n}} log E [e^{v_{n} θ Z_{n}}] (f o r a l l θ \in R)

(3)

exists; assume, moreover, that Λ is essentially smooth (see e.g., Definition 2.3.5 in [1]) and lower semi-continuous. Then

{Z_{n} : n \geq 1}

satisfies the LDP with speed function

v_{n}

and good rate function

Λ^{*} : R \to [0, \infty]

defined by

Λ^{*} (z) : = sup_{θ \in R} {θ z - Λ (θ)} .

Proof.

See, e.g., Theorem 2.3.6 in [1]. ☐

The main application of Theorem 1 in this paper concerns Theorem 2, where we have

Λ (θ) = e^{θ} - 1, which yields Λ^{*} (x) = \{\begin{matrix} x log x - x + 1 & if x \geq 0 \\ \infty & if x < 0 . \end{matrix}

(4)

The LDP in Theorem 3 will instead be proved by combining Theorem 4.2.13 in [1] with Theorem 2, i.e., by checking the exponential equivalence (see, e.g., Definition 4.2.10 in [1]) of the involved sequences.

On the generalized Cramér model (for products of primes in arithmetic progressions).

The Cramér model for prime numbers consists in a sequence of independent random variables

{X_{n} : n \geq 1}

such that, for every

n \geq 2

,

X_{n} \overset{law}{\sim} B (1 / log n) .

(5)

This model can be justified by the prime numbers theorem (PNT), which roughly asserts that the expected density of primes around x is

\frac{1}{log x}

: the cardinality of prime numbers

\leq n

is

π (n) : = \sum_{p \leq n} 1 \sim li (n) : = \int_{2}^{n} \frac{1}{log t} d t,

and, with the words of [7] (see footnote on p. 6), “the quantity

\frac{1}{log n}

appears here naturally as the derivative of

li (x)

evaluated at

x = n

”. Since

\int_{2}^{n} \frac{1}{log t} d t \sim \frac{n}{log n}

, another way of stating the PNT is

\frac{π (n)}{n} \sim \frac{1}{log n} .

(6)

A first extension of this formula concerns the case of integers n which are products of exactly r prime factors (

r \geq 2

). More precisely, we consider the sets

A_{r} (n) : = {k \leq n : Ω (k) = r} and B_{r} (n) : = {k \leq n : ω (k) = r}

where

ω (n)

is the number of distinct prime factors of n, and

Ω (n)

counts the number of prime factors of n (with multiplicity); this means that, letting (by the canonical prime factorization of n)

n = \prod_{i = 1}^{ω (n)} p_{i}^{α_{i}}

, where

p_{1}, \dots, p_{n}

are the distinct prime factors of n, we have

Ω (n) : = \sum_{i = 1}^{ω (n)} α_{i} .

A result proved by Landau in 1909 (see, e.g., [8]) states that the cardinalities

τ_{r} (n)

and

π_{r} (n)

of

A_{r} (n)

and

B_{r} (n)

respectively verify

τ_{r} (n) : = \sum_{k \in A_{r} (n)} 1 \sim \frac{n {(log log n)}^{r - 1}}{(r - 1)! log n} and π_{r} (n) : = \sum_{k \in B_{r} (n)} 1 \sim \frac{n {(log log n)}^{r - 1}}{(r - 1)! log n};

see also, e.g., Theorem 437 in [9] (Section 22.18, page 368) or [10] (II.6, Theorems 4 and 5). Note that this formula for

π_{r} (n)

reduces to Equation (6) when

r = 1

.

Going a little further, for fixed integers a and q, we can consider the sets of products of primes in arithmetic progressions

A_{r}^{(q)} (n) = : {k \leq n : Ω (k) = r, k \equiv a \mod q} and B_{r}^{(q)} (n) = : {k \leq n : ω (k) = r, k \equiv a \mod q} .

One can prove (by similar methods as in [10,11]) that, for any a and q with

(a, q) = 1

, the cardinalities

τ_{r}^{(q)} (n)

and

π_{r}^{(q)} (n)

of

A_{r}^{(q)} (n)

and

B_{r}^{(q)} (n)

respectively verify

τ_{r}^{(q)} (n) : = \sum_{k \in A_{r}^{(q)} (n)} 1 \sim \frac{1}{ϕ (q)} \cdot \frac{n {(log log n)}^{r - 1}}{(r - 1)! log n} and π_{r}^{(q)} (n) : = \sum_{k \in B_{r}^{(q)} (n)} 1 \sim \frac{1}{ϕ (q)} \cdot \frac{n {(log log n)}^{r - 1}}{(r - 1)! log n},

where

ϕ

is Euler’s totient function. Notice that, for

r = 1

, we recover the sets of primes in arithmetic progressions, considered for instance in [8,10] II.8, or [11]; the case

r = 2

is studied in [12]; the general case

r \geq 1

is considered in the recent preprint [13]; for

q = 1

, we recover the sets and the formulas for the model described above.

Therefore, following Cramér’s heuristic, Equation (5), we can define the generalized Cramér model for products of r prime numbers (or products of r prime numbers in arithmetic progression) as a sequence of independent random variables

{X_{n} : n \geq 1}

such that

X_{n} \overset{law}{\sim} B (λ_{n}), where λ_{n} : = \frac{ℓ_{n}}{log n} and ℓ_{n} : = \frac{1}{ϕ (q)} \cdot \frac{{(log log n)}^{r - 1}}{(r - 1)!} .

(7)

Obviously in Equation (7) we take

n \geq n_{0}

, where

n_{0}

is an integer, such that

λ_{n} \in (0, 1]

for

n \geq n_{0}

; the definition of

λ_{n}

for

n < n_{0}

is arbitrary.

Large deviation results for this model will be presented in Corollary 1 as a consequence of Theorem 3 and Remark 2, with

L_{n} : = log n and b_{n} : = n ℓ_{n};

(8)

thus, the sequences in Equations (1) and (2) become

\frac{\sum_{k = 1}^{n} (log k) X_{k}}{n ℓ_{n}} and \frac{(log n) \sum_{k = 1}^{n} X_{k}}{n ℓ_{n}}

(9)

respectively. Moreover, by taking into account Remark 3 presented below, the sequences in Equation (9) converge almost surely to 1 (as

n \to \infty

).

On the first Chebyshev function.

The first Chebyshev function is defined by

θ (x) : = \sum_{p \leq x} log p,

where the sum is extended over all prime numbers

p \leq x

.

Therefore, when considering the classical Cramér model, this function is naturally modeled with

\sum_{k = 1}^{n} (log k) X_{k}

(and we obtain the numerator of the first fraction in Equation (9)).

It must be noted that T. Tao, in his blog (see [14]), considers the same random variable

\sum_{k \leq x} (log k) X_{k}

and proves that almost surely one has

\sum_{k \leq x} (log k) X_{k} = x + O_{ε} (x^{1 / 2 + ε})

for all

ε > 0

(where the implied constant in the

O_{ε} (\cdot)

notation is allowed to be random). In particular, almost surely one has

lim_{n \to \infty} \frac{\sum_{k \leq n} (log k) X_{k}}{n} = 1 .

It appears clearly that in this setting we have a sequence of the form of Equation (1), with the particular choices

L_{n} = log n

and

b_{n} = n

. What we are going to investigate in the sequel is how the sequence of random variables

{X_{n} : n \geq 1}

and the two sequences of numbers

{L_{n} : n \geq 1}

and

{b_{n} : n \geq 1}

must be connected in order to obtain large deviations and convergence results (see also Equations (8) and (9) above).

On slowly and regularly varying functions (at infinity).

Here we recall the following basic definitions. A positive measurable function H defined on some neighborhood of

[x_{0}, \infty)

of infinity is said to be slowly varying at infinity (see, e.g., [15], page 6) if

lim_{t \to \infty} \frac{H (t x)}{H (t)} = 1 for all x > 0 .

Similarly, a positive measurable function M defined on some neighborhood of

[x_{0}, \infty)

of infinity is said to be regularly varying at infinity of index

ρ

(see, e.g., [15], page 18) if

lim_{t \to \infty} \frac{M (t x)}{M (t)} = x^{ρ} for all x > 0 .

Obviously, we recover the slowly varying case if

ρ = 0

. Recall the following well-known result for slowly varying functions.

Lemma 1 (Karamata’s representation of slowly varying functions).

A function H is slowly varying at infinity if and only if

H (x) = c (x) exp (\int_{x_{0}}^{x} \frac{ϕ (t)}{t} d t)

where

ϕ (x) \to 0

and

c (x) \to c_{\infty}

for some

c_{\infty} > 0

(as

x \to \infty

).

Proof.

See, e.g., Theorem 1.3.1 in [15]. ☐

In view of what follows we also present the following results. They are more or less known; but we prefer to give detailed proofs in order to ensure that the paper is self-contained.

Lemma 2.

Let M be a regularly varying function (at infinity) of index

ρ \geq 0

. Then,

lim_{t \to \infty} \frac{M (⌊ t x ⌋)}{M (t)} = x^{ρ} f o r a l l x > 0 .

Proof.

It is well-known (see, e.g., Theorem 1.4.1 in [15]) that we have

M (x) = x^{ρ} H (x)

for a suitable slowly varying function H. Thus, it is easy to check that it suffices to prove the result for the case

ρ = 0

(namely for a slowly varying function H), i.e.,

lim_{t \to \infty} \frac{H (⌊ t x ⌋)}{H (t)} = 1 for all x > 0 .

(10)

By Lemma 1, for all

x > 0

, we have

\frac{H (⌊ t x ⌋)}{H (t)} = \frac{c (⌊ t x ⌋)}{c (t)} exp (\int_{t}^{⌊ t x ⌋} \frac{ϕ (v)}{v} d v)

for

t > 0

. Obviously,

\frac{c (⌊ t x ⌋)}{c (t)} \to 1

(as

t \to \infty

). Moreover, for all

ε > 0

, we have

|\int_{t}^{⌊ t x ⌋} \frac{ϕ (v)}{v} d v| \leq ε | log (⌊ t x ⌋ / t) |

for

t > 0

, and

log (⌊ t x ⌋ / t) \to log x

(as

t \to \infty

); thus,

\int_{t}^{⌊ t x ⌋} \frac{ϕ (v)}{v} d v \to 0 (as t \to \infty)

by the arbitrariness of

ε > 0

. Thus, Equation (10) holds, and the proof is complete. ☐

Lemma 3.

Let H be a slowly varying function (at infinity). Then,

lim_{x \to \infty} \frac{x H (x)}{\sum_{k = 1}^{⌊ x ⌋} H (k)} = 1 .

Proof.

By the representation of H in Lemma 1, for all

ε > 0

there is an integer

n_{0} \geq 1

such that, for all

x > n_{0}

, we have

c_{\infty} - ε < c (x) < c_{\infty} + ε

and

- ε < ϕ (x) < ε

. Then, we take

x \geq n_{0} + 1

, and

\frac{\sum_{k = 1}^{⌊ x ⌋} H (k)}{x H (x)} = \frac{\sum_{k = 1}^{n_{0}} H (k)}{x H (x)} + \frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} H (k)}{x H (x)} .

The first summand in the right hand side can be ignored since, if we take

ε \in (0, 1)

, for a sufficient high x, we have

H (x) > \frac{c_{\infty}}{2} exp (- ε \int_{x_{0}}^{x} \frac{1}{t} d t) = \frac{c_{\infty}}{2} {(\frac{x}{x_{0}})}^{- ε},

which yields

x H (x) > c_{1} x^{1 - ε}

for a suitable constant

c_{1} > 0

(and

x^{1 - ε} \to \infty

as

x \to \infty

). Therefore, we concentrate our attention on the second summand and, by taking into account again the representation of H in Lemma 1, for a sufficiently high x, we have

\frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} H (k)}{x H (x)} = \frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} c (k) exp (\int_{x_{0}}^{k} \frac{ϕ (t)}{t} d t)}{x c (x) exp (\int_{x_{0}}^{x} \frac{ϕ (t)}{t} d t)} = \frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} \frac{c (k)}{c (x)} exp (- \int_{k}^{x} \frac{ϕ (t)}{t} d t)}{x} .

Moreover,

\frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} \frac{c (k)}{c (x)} exp (- \int_{k}^{x} \frac{ϕ (t)}{t} d t)}{x} \leq \frac{c_{\infty} + ε}{c_{\infty} - ε} \frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} k^{- ε}}{x^{1 - ε}} \to \frac{c_{\infty} + ε}{c_{\infty} - ε} \frac{1}{1 - ε} (as x \to \infty)

and

\frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} \frac{c (k)}{c (x)} exp (- \int_{k}^{x} \frac{ϕ (t)}{t} d t)}{x} \geq \frac{c_{\infty} - ε}{c_{\infty} + ε} \frac{\sum_{k = n_{0} + 1}^{⌊ x ⌋} k^{ε}}{x^{1 + ε}} \to \frac{c_{\infty} - ε}{c_{\infty} + ε} \frac{1}{1 + ε} (as x \to \infty),

and the proof is complete by the arbitrariness of

ε

. ☐

3. Results

In this section we present large deviation results for Equations (1) and (2). We start with the case of Poisson distributed random variables (see Theorem 2 and Remark 1), and later we consider the case of Bernoulli distributed random variables (see Theorem 3 and Remark 2). Our large deviation results yield the almost sure convergence to 1 (as

n \to \infty

) of the involved random variables (see Remark 3 for details). In particular, the results for Bernoulli distributed random variables can be applied to the sequences of the generalized Cramér model in Equation (9) (see Corollary 1).

In all our results, we assume the following condition.

Condition 1.

The sequence

{b_{n} : n \geq 1}

is eventually positive;

{L_{n} : n \geq 1}

is eventually positive and non-decreasing.

In general, we can ignore the definition of

{b_{n} : n \geq 1}

and

{L_{n} : n \geq 1}

for a finite number of indices; therefore, in order to simplify the proofs, we assume that

{b_{n} : n \geq 1}

and

{L_{n} : n \geq 1}

are positive sequences and that

{L_{n} : n \geq 1}

is non-decreasing.

We start with the case where

{X_{n} : n \geq 1}

are (independent) Poisson distributed random variables.

Theorem 2 (the Poisson case; the sequence in Equation (1)).

Let

{b_{n} : n \geq 1}

and

{L_{n} : n \geq 1}

be two sequences as in Condition 1. Assume that

{L_{n} : n \geq 1} i s t h e r e s t r i c t i o n (o n N) o f a s l o w l y v a r y i n g f u n c t i o n (a t i n f i n i t y) .

(11)

F o r a l l c \in (0, 1), α (c) : = lim_{n \to \infty} \frac{b_{⌊ c n ⌋}}{b_{n}} e x i s t s, a n d lim_{c ↓ 0} α (c) = 0 .

(12)

lim_{n \to \infty} \frac{L_{n}}{b_{n}} = 0 .

(13)

Moreover, assume that

{X_{n} : n \geq 1}

are independent and

X_{n} \overset{law}{\sim} P (λ_{n})

for all

n \geq 1

, where

{λ_{n} : n \geq 1}

are positive numbers such that

\sum_{k = 1}^{n} λ_{k} \sim \frac{b_{n}}{L_{n}} .

(14)

The sequence in Equation (1) then satisfies the LDP with speed function

v_{n} = \frac{b_{n}}{L_{n}}

and good rate function

Λ^{*}

defined by Equation (4).

We point out that Equation (12) is satisfied if the sequence

{b_{n} : n \geq 1}

is nondecreasing and is the restriction (on

N

) of a regularly varying function with positive index (at infinity); this is a consequence of Lemma 2.

Proof.

We apply Theorem 1, i.e., we check that Equation (3) holds with

Z_{n} = \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}

,

v_{n} = \frac{b_{n}}{L_{n}}

, and

Λ

as in Equation (4) (in fact, Equation (3) holds even without assuming (13); however, Equation (13) must be required in order that

v_{n} = \frac{b_{n}}{L_{n}}

be a speed function). We remark that

\begin{matrix} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] = \frac{L_{n}}{b_{n}} log E [e^{θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{L_{n}}}] = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log E [e^{(θ L_{k} / L_{n}) X_{k}}] \\ = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log (e^{λ_{k} (e^{θ L_{k} / L_{n}} - 1)}) = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ L_{k} / L_{n}} - 1) for all θ \in R . \end{matrix}

Equation (3) trivially holds for

θ = 0

. The proof is divided in two parts: the proof of the upper bound,

\underset{n \to \infty}{lim sup} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] \leq e^{θ} - 1 for all θ \in R,

(15)

and that of the lower bound,

\underset{n \to \infty}{lim inf} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] \geq e^{θ} - 1 for all θ \in R .

(16)

We start with the proof of Equation (15). For

θ > 0

, we have

\frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ L_{k} / L_{n}} - 1) \leq \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ} - 1)

since

{L_{n} : n \geq 1}

is nondecreasing, and we obtain Equation (15) by letting n go to infinity and by taking into account Equation (14). For

θ < 0

, we take

c \in (0, 1)

and

γ : = sup {L_{n} : n \geq 1}

(possibly infinite). Recalling that

{L_{n} : n \geq 1}

is nondecreasing and that

\frac{L_{⌊ c n ⌋}}{L_{n}} \to 1

(it is a consequence of Lemma 2), we have

\begin{matrix} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ L_{k} / L_{n}} - 1) \\ \leq \frac{L_{n}}{b_{n}} \sum_{k = 1}^{⌊ c n ⌋} λ_{k} (e^{θ L_{1} / γ} - 1) + \frac{L_{n}}{b_{n}} \sum_{k = ⌊ c n ⌋ + 1}^{n} λ_{k} (e^{θ L_{⌊ c n ⌋} / L_{n}} - 1) \\ = \frac{L_{n}}{L_{⌊ c n ⌋}} \frac{b_{⌊ c n ⌋}}{b_{n}} \{\frac{L_{⌊ c n ⌋}}{b_{⌊ c n ⌋}} \sum_{k = 1}^{⌊ c n ⌋} λ_{k}\} (e^{θ L_{1} / γ} - 1) \\ + (\{\frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k}\} - \frac{L_{n}}{L_{⌊ c n ⌋}} \frac{b_{⌊ c n ⌋}}{b_{n}} \{\frac{L_{⌊ c n ⌋}}{b_{⌊ c n ⌋}} \sum_{k = 1}^{⌊ c n ⌋} λ_{k}\}) (e^{θ L_{⌊ c n ⌋} / L_{n}} - 1) . \end{matrix}

Then, by Equation (11) (and Lemma 2 with

ρ = 0

), (12) and (14), we obtain

\underset{n \to \infty}{lim sup} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] \leq α (c) (e^{θ L_{1} / γ} - 1) + (1 - α (c)) (e^{θ} - 1) .

Using Equation (12), we conclude by letting

c ↓ 0

.

The proof of Equation (16) is similar with reversed inequalities; hence, we only sketch it here. For

θ < 0

, we have

\frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ L_{k} / L_{n}} - 1) \geq \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ} - 1),

and we obtain Equation (16) by letting n go to infinity and by taking into account (14). For

θ \geq 0

, we take

c \in (0, 1)

and, for

γ

defined as above, after some manipulations, we obtain

\underset{n \to \infty}{lim inf} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{\sum_{k = 1}^{n} L_{k} X_{k}}{b_{n}}}] \geq α (c) (e^{θ L_{1} / γ} - 1) + (1 - α (c)) (e^{θ} - 1) .

We conclude by letting

c ↓ 0

(by Equation (12)). ☐

Remark 1 (The Poisson case; the sequence in Equation (2)).

The LDP in Theorem 2 holds also for the sequence in Equation (2) in place of the sequence in Equation (1). In this case we only need to use Condition 1 and to assume Equations (13) and (14), whereas Equations (11) and (12) (which were required in the proof of Theorem 2) can be ignored. For the proof, we still apply Theorem 1, so we have to check that Equation (3) holds with

Z_{n} = \frac{L_{n} \sum_{k = 1}^{n} X_{k}}{b_{n}}

,

v_{n} = \frac{b_{n}}{L_{n}}

, and Λ as in Equation (4). This can be easily checked noting that

\begin{matrix} \frac{L_{n}}{b_{n}} log E [e^{\frac{b_{n}}{L_{n}} θ \frac{L_{n} \sum_{k = 1}^{n} X_{k}}{b_{n}}}] = \frac{L_{n}}{b_{n}} log E [e^{θ \sum_{k = 1}^{n} X_{k}}] = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log E [e^{θ X_{k}}] \\ = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log (e^{λ_{k} (e^{θ} - 1)}) = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} (e^{θ} - 1) \to e^{θ} - 1 f o r a l l θ \in R \end{matrix}

where the limit relation holds by Equation (14).

The next result is for Bernoulli distributed random variables

{X_{n} : n \geq 1}

. Here we shall use the concept of exponential equivalence (see, e.g., Definition 4.2.10 in [1]). The proof is similar to the one of Proposition 3.5 in [16] (see also Remark 3.6 in the same reference). We point out that it is not unusual to prove a convergence result for Bernoulli random variables

{X_{n} : n \geq 1}

starting from a similar one for Poisson random variables

{Y_{n} : n \geq 1}

and by setting

X_{n} : = Y_{n} \land 1

for all

n \geq 1

; see, for instance, Lemmas 1 and 2 in [17].

Theorem 3 (The Bernoulli case; the sequence in Equation (1)).

Let

{b_{n} : n \geq 1}

and

{L_{n} : n \geq 1}

be as in Theorem 2 (thus, Condition 1 together with Equations (11)–(13) hold). Moreover, assume that

{X_{n} : n \geq 1}

are independent and

X_{n} \overset{law}{\sim} B (λ_{n})

for all

n \geq 1

and that Equation (14) and

{lim}_{n \to \infty} λ_{n} = 0

hold. The sequence in Equation (1) satisfies the LDP with speed function

v_{n} = \frac{b_{n}}{L_{n}}

and the good rate function

Λ^{*}

defined by Equation (4).

Proof.

Let

n_{0}

such that

λ_{n} \in [0, 1)

for all

n \geq n_{0}

(recall that

λ_{n} \to 0

as

n \to \infty

), and let

{X_{n}^{*} : n \geq 1}

be independent random variables such that

X_{n}^{*} \overset{law}{\sim} P ({\hat{λ}}_{n})

(for all

n \geq 1

), where

{\hat{λ}}_{n} : = log \frac{1}{1 - λ_{n}}

for

n \geq n_{0}

(the definition of

{\hat{λ}}_{n}

for

n < n_{0}

is arbitrary). Notice that

\sum_{k = 1}^{n} {\hat{λ}}_{k} \sim \sum_{k = 1}^{n} λ_{k}

because

\sum_{k = 1}^{n} λ_{k} \to \infty

(as

n \to \infty

) by Equations (13) and (14) and, by the Cesaro theorem,

lim_{n \to \infty} \frac{\sum_{k = 1}^{n} {\hat{λ}}_{k}}{\sum_{k = 1}^{n} λ_{k}} = lim_{n \to \infty} \frac{{\hat{λ}}_{n}}{λ_{n}} = lim_{n \to \infty} \frac{log \frac{1}{1 - λ_{n}}}{λ_{n}} = 1 .

Hence, the assumption of Equation (14) and Theorem 2 are in force for the sequence

{X_{n}^{*} : n \geq 1}

(in fact, we have Equation (14) with

{{\hat{λ}}_{n} : n \geq 1}

in place of

{λ_{n} : n \geq 1}

) and, if we set

X_{n} : = X_{n}^{*} \land 1

(for all

n \geq 1

), the sequence

{X_{n} : n \geq 1}

is indeed an instance of the sequence appearing in the statement of the present theorem since, by construction,

X_{n} \overset{law}{\sim} B (1 - e^{- {\hat{λ}}_{n}})

and

1 - e^{- {\hat{λ}}_{n}} = λ_{n}

.

The statement will be proved by combining Theorem 4.2.13 in [1] and Theorem 2 (for the sequence

{X_{n}^{*} : n \geq 1}

). This means that we have to check the exponential equivalence condition

\underset{n \to \infty}{lim sup} \frac{L_{n}}{b_{n}} log P (Δ_{n} > δ) = - \infty (for all δ > 0)

(17)

where

Δ_{n} : = |\frac{1}{b_{n}} \sum_{k = 1}^{n} L_{k} X_{k} - \frac{1}{b_{n}} \sum_{k = 1}^{n} L_{k} X_{k}^{*}| .

(18)

We remark that

Δ_{n} \leq \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} | X_{k} - X_{k}^{*} |

(19)

by the monotonicity and the nonnegativeness of

{L_{n} : n \geq 1}

; therefore, if we combine Equation (19) and the Chernoff bound, for each arbitrarily fixed

θ \geq 0

, we obtain

P (Δ_{n} > δ) \leq P (\frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} | X_{k} - X_{k}^{*} | > δ) \leq \frac{E [e^{θ \sum_{k = 1}^{n} | X_{j} - X_{j}^{*} |}]}{e^{θ δ b_{n} / L_{n}}} .

Therefore,

\frac{L_{n}}{b_{n}} log P (Δ_{n} > δ) \leq \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log E [e^{θ | X_{k} - X_{k}^{*} |}] - θ δ .

Moreover, if we set

ρ_{k}^{(θ)} : = \frac{e^{λ_{k} e^{θ}} - 1}{λ_{k} e^{θ}},

we have

\begin{matrix} E [e^{θ | X_{k} - X_{k}^{*} |}] = P (X_{k}^{*} = 0) + P (X_{k}^{*} = 1) + \sum_{h = 2}^{\infty} e^{θ | 1 - h |} P (X_{k}^{*} = h) \\ = e^{- λ_{k}} + λ_{k} e^{- λ_{k}} + \sum_{h = 2}^{\infty} e^{θ (h - 1)} \frac{λ_{k}^{h}}{h!} e^{- λ_{k}} = e^{- λ_{k}} + λ_{k} e^{- λ_{k}} + e^{- θ} e^{- λ_{k}} (e^{λ_{k} e^{θ}} - 1 - λ_{k} e^{θ}) \\ = e^{- λ_{k}} + e^{- θ} e^{- λ_{k}} (e^{λ_{k} e^{θ}} - 1) = e^{- λ_{k}} (1 + e^{- θ} (e^{λ_{k} e^{θ}} - 1)) = e^{- λ_{k}} (1 + λ_{k} ρ_{k}^{(θ)}) . \end{matrix}

Therefore,

\frac{L_{n}}{b_{n}} log P (Δ_{n} > δ) \leq - \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} + \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log (1 + λ_{k} ρ_{k}^{(θ)}) - θ δ .

(20)

The proof will be complete if we show that, for all

θ > 0

,

\underset{n \to \infty}{lim sup} \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log (1 + λ_{k} ρ_{k}^{(θ)}) \leq 1 .

(21)

In fact, by Equations (14) and (21), we deduce from Equation (20) that

\underset{n \to \infty}{lim sup} \frac{L_{n}}{b_{n}} log P (Δ_{n} > δ) \leq - θ δ,

and we obtain Equation (17) by letting

θ

go to infinity.

Thus, we prove Equation (21). We remark that

ρ_{n}^{(θ)} \to 1

because

λ_{n} \to 0

(as

n \to \infty

). Hence, for all

ε \in (0, 1)

, there exists

n_{0}

such that, for all

n > n_{0}

, we have

ρ_{n}^{(θ)} < 1 + ε

and

\begin{matrix} \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} log (1 + λ_{k} ρ_{k}^{(θ)}) = \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n_{0}} log (1 + λ_{k} ρ_{k}^{(θ)}) + \frac{L_{n}}{b_{n}} \sum_{k = n_{0} + 1}^{n} log (1 + λ_{k} ρ_{k}^{(θ)}) \\ \leq \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n_{0}} log (1 + λ_{k} ρ_{k}^{(θ)}) + \frac{L_{n}}{b_{n}} \sum_{k = n_{0} + 1}^{n} log (1 + λ_{k} (1 + ε)) . \end{matrix}

Moreover,

\frac{L_{n}}{b_{n}} \sum_{k = 1}^{n_{0}} log (1 + λ_{k} ρ_{k}^{(θ)}) \to 0

(as

n \to \infty

) by Equation (13) and

\frac{L_{n}}{b_{n}} \sum_{k = n_{0} + 1}^{n} log (1 + λ_{k} (1 + ε)) \leq (1 + ε) \frac{L_{n}}{b_{n}} \sum_{k = n_{0} + 1}^{n} λ_{k} = (1 + ε) (\frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} λ_{k} - \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n_{0}} λ_{k}) .

Hence, Equation (21) follows from Equations (13) and (14), and the arbitrariness of

ε

. ☐

Remark 2 (The Bernoulli case; the sequence in Equation (2)).

The LDP in Theorem 3 holds also for the sequence in Equation (2) in place of the sequence in Equation (1). The proof is almost identical to the one of Theorem 3: in this case, we have

Δ_{n} : = |\frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} X_{k} - \frac{L_{n}}{b_{n}} \sum_{k = 1}^{n} X_{k}^{*}|

in place of Equation (18), and Inequality (19) still holds (even without the monotonicity of

{L_{n} : n \geq 1}

).

Remark 3 (Almost sure convergence to 1 of the sequences in Theorems 2 and 3).

Let

{Z_{n} : n \geq 1}

be either the sequence in Equation (1) or the sequence in Equation (2), where

{X_{n} : n \geq 1}

is as in Theorem 2 or as in Theorem 3 (so we also consider Remarks 1 and 2). Then, by a straightforward consequence of the Borel–Cantelli lemma, the sequence

{Z_{n} : n \geq 1}

converges to 1 almost surely (as

n \to \infty

) if

\sum_{n \geq 1} P (Z_{n} \in C) < \infty f o r c l o s e d s e t C s u c h t h a t 1 \notin C .

Obviously this condition holds if

C \subset (- \infty, 0)

because

{Z_{n} : n \geq 1}

are nonnegative random variables. On the other hand, if

C \cap [0, \infty)

is not empty,

Λ^{*} (C) : = {inf}_{x \in C} Λ^{*} (x)

is finite; moreover,

Λ^{*} (C) \in (0, \infty)

because

1 \notin C

. Then, by the upper bound of the closed set, for all

δ > 0

, there exists

n_{δ}

such that, for all

n > n_{δ}

, we have

P (Z_{n} \in C) \leq e^{- (Λ^{*} (C) - δ) b_{n} / L_{n}} .

Thus, again by the Borel–Cantelli lemma,

{Z_{n} : n \geq 1}

converges almost surely to 1 (as

n \to \infty

) if, for all

κ > 0

, we have

\sum_{n \geq 1} e^{- κ b_{n} / L_{n}} < \infty .

(22)

Then, by the Cauchy condensation test, Equation (22) holds if and only if

\sum_{n \geq 1} 2^{n} e^{- κ b_{2^{n}} / L_{2^{n}}} < \infty

and, as we see below, the convergence of the condensed series is a consequence of the ratio test and of some hypotheses of Theorems 2 and 3. In fact,

\frac{2^{n + 1} e^{- κ b_{2^{n + 1}} / L_{2^{n + 1}}}}{2^{n} e^{- κ b_{2^{n}} / L_{2^{n}}}} = 2 exp (- κ \frac{b_{2^{n + 1}}}{L_{2^{n + 1}}} (1 - \frac{b_{2^{n}}}{b_{2^{n + 1}}} \cdot \frac{L_{2^{n + 1}}}{L_{2^{n}}})) \to 0 (a s n \to \infty)

because

\frac{b_{2^{n}}}{b_{2^{n + 1}}} \to α (1 / 2)

by Equation (12),

\frac{L_{2^{n + 1}}}{L_{2^{n}}} \to 1

by Equation (11) and

\frac{b_{2^{n + 1}}}{L_{2^{n + 1}}} \to + \infty

by Equation (13).

We conclude with the results for the generalized Cramér model (the sequences in Equation (9)).

Corollary 1 (Application to the sequences in Equation (9)).

Let

{X_{n} : n \geq 1}

be the random variables in Equation (7), and let

{b_{n} : n \geq 1}

and

{L_{n} : n \geq 1}

be defined by Equation (8). Then, the sequences

\{\frac{\sum_{k = 1}^{n} (log k) X_{k}}{n ℓ_{n}} : n \geq 1\}

and

\{\frac{(log n) \sum_{k = 1}^{n} X_{k}}{n ℓ_{n}} : n \geq 1\}

in Equation (9) satisfy the LDP with speed function

v_{n} = \frac{b_{n}}{L_{n}} = \frac{n ℓ_{n}}{log n}

and the good rate function

Λ^{*}

defined by Equation (4).

Proof.

In this proof, the sequences in Equation (9) play the roles of the sequences in Equations (1) and (2) in Theorem 3 and Remark 2, respectively. Therefore, we have to check that the hypotheses of Theorem 3 are satisfied. Condition 1 and Equations (11) and (13) and

{lim}_{n \to \infty} λ_{n} = 0

can be easily checked. Moreover, one can also check Equation (12) with

α (c) = c

; note that in this case, we have a regularly varying function with index

ρ = 1

(as

n \to \infty

), and

{b_{n} : n \geq 1}

is eventually nondecreasing. Finally, Equation (14), which is

lim_{n \to \infty} \frac{(log n) \sum_{k = 1}^{n} \frac{ℓ_{k}}{log k}}{n ℓ_{n}} = 1,

can be obtained as a consequence of Lemma 3; in fact,

{ℓ_{n} : n \geq 1}

and

{ℓ_{n} / (log n) : n \geq 1}

are restrictions (on

N

) of slowly varying functions at infinity. ☐

In conclusion, we can say that, roughly speaking, for any Borel set A such that

1 \notin \bar{A}

(where

\bar{A}

is the closure of A), the probabilities

P (\frac{\sum_{k = 1}^{n} (log k) X_{k}}{n ℓ_{n}})

and

P (\frac{(log n) \sum_{k = 1}^{n} X_{k}}{n ℓ_{n}})

decay exponentially as

e^{- \frac{n ℓ_{n}}{log n} {inf}_{x \in A} Λ^{*} (x)}

(as

n \to \infty

). Thus, in the spirit of Tao’s remark, we are able to suggest estimations concerning a sort of “generalized” Chebychev function defined by

\frac{\sum_{p_{1} \dots p_{r} \leq x} log (p_{1} \dots p_{r})}{x ℓ_{x}}

or by

\frac{(log x) \sum_{p_{1} \dots p_{r} \leq x} 1}{x ℓ_{x}}

. To our knowledge, such estimations are not available for

r > 1

.

Author Contributions

Rita Giuliano and Claudio Macci equally contributed to the proofs of the results. The paper was also written and reviewed cooperatively.

Conflicts of Interest

The authors declare no conflict of interest.

References

Dembo, A.; Zeitouni, O. Large Deviations Techniques and Applications, 2nd ed.; Springer: New York, NY, USA, 1998. [Google Scholar]
Fang, L. Large and moderate deviation principles for alternating Engel expansions. J. Number. Theory 2015, 156, 263–276. [Google Scholar] [CrossRef]
Granville, A. Harald Cramér and the distribution of prime numbers. Scand. Actuar. J. 1995, 1995, 12–28. [Google Scholar] [CrossRef]
Döring, H.; Eichelsbacher, P. Moderate deviations via cumulants. J. Theor. Probab. 2013, 26, 360–385. [Google Scholar] [CrossRef]
Féray, V.; Méliot, P.L.; Nikeghbali, A. Mod-φ Convergence, I: Normality Zones and Precise Deviations. Unpublished Manuscript. 2015. Available online: http://arxiv.org/pdf/1304.2934.pdf (accessed on 23 November 2015).
Jacod, J.; Kowalski, E.; Nikeghbali, A. Mod-Gaussian convergence: new limit theorems in probability and number theory. Forum Math. 2011, 23, 835–873. [Google Scholar] [CrossRef][Green Version]
Tenenbaum, G.; Mendès France, M. The Prime Numbers and Their Distribution; (Translated from the 1997 French original by P.G. Spain); American Mathematical Society: Providence, RI, USA, 2000. [Google Scholar]
Landau, E. Handbuch der Lehre von der Verteilung der Primzahlen (2 Volumes), 3rd ed.; Chelsea Publishing: New York, NY, USA, 1974. [Google Scholar]
Hardy, G.H.; Wright, E.M. An Introduction to the Theory of Numbers, 4th ed.; Oxford University Press: London, UK, 1975. [Google Scholar]
Tenenbaum, G. Introduction to Analytic and Probabilistic Number Theory, 3rd ed.; (Translated from the 2008 French Edition by P.D.F. Ion); American Mathematical Society: Providence, RI, USA, 2015. [Google Scholar]
Davenport, H. Multiplicative Number Theory, 3rd ed.; Springer: New York, NY, USA; Berlin, Germany, 2000. [Google Scholar]
Ford, K.; Sneed, J. Chebyshev’s bias for products of two primes. Exp. Math. 2010, 19, 385–398. [Google Scholar] [CrossRef]
Meng, X. Chebyshev’s Bias for Products of k Primes. Unpublished Manuscript. 2016. Available online: http://arxiv.org/pdf/1606.04877v2.pdf (accessed on 16 August 2016).
Tao, T. Probabilistic Models and Heuristics for the Primes (Optional). In Terence Tao Blog. 2015. Available online: https://terrytao.wordpress.com/2015/01/04/254a-supplement-4-probabilistic-models-and-heuristics-for-the-primes-optional/ (accessed on 4 January 2015).
Bingham, N.H.; Goldie, C.M.; Teugels, J.L. Regular variation. In Encyclopedia of Mathematics and its Applications; Cambridge University Press: Cambridge, UK, 1987; Volume 27. [Google Scholar]
Giuliano, R.; Macci, C. Asymptotic results for weighted means of random variables which converge to a Dickman distribution, and some number theoretical applications. ESAIM Probab. Stat. 2015, 19, 395–413. [Google Scholar] [CrossRef]
Arratia, R.; Tavaré, S. Independent processes approximations for random combinatorial structures. Adv. Math. 1994, 104, 90–154. [Google Scholar] [CrossRef][Green Version]

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Large Deviation Results and Applications to the Generalized Cramér Model
^†

Abstract

1. Introduction

2. Preliminaries

3. Results

Author Contributions

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

Large Deviation Results and Applications to the Generalized Cramér Model †

Abstract

1. Introduction

2. Preliminaries

3. Results

Author Contributions

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

Large Deviation Results and Applications to the Generalized Cramér Model
^†