The Emergence of the Normal Distribution in Deterministic Chaotic Maps

Zanette, Damián H.; Samengo, Inés

doi:10.3390/e26010051

Open AccessFeature PaperArticle

The Emergence of the Normal Distribution in Deterministic Chaotic Maps

by

Damián H. Zanette

^1,2,*,†

and

Inés Samengo

^1,2,*,†

¹

Instituto Balseiro, Centro Atómico Bariloche Comisión Nacional de Energía Atómica, Universidad Nacional de Cuyo, Av. Ezequiel Bustillo 9500, San Carlos de Bariloche 8400, Argentina

²

Consejo Nacional de Investigaciones Científicas y Técnicas, Argentina

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Entropy 2024, 26(1), 51; https://doi.org/10.3390/e26010051

Submission received: 16 December 2023 / Revised: 2 January 2024 / Accepted: 3 January 2024 / Published: 5 January 2024

(This article belongs to the Section Information Theory, Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

The central limit theorem states that, in the limits of a large number of terms, an appropriately scaled sum of independent random variables yields another random variable whose probability distribution tends to attain a stable distribution. The condition of independence, however, only holds in real systems as an approximation. To extend the theorem to more general situations, previous studies have derived a version of the central limit theorem that also holds for variables that are not independent. Here, we present numerical results that characterize how convergence is attained when the variables being summed are deterministically related to one another through the recurrent application of an ergodic mapping. In all the explored cases, the convergence to the limit distribution is slower than for random sampling. Yet, the speed at which convergence is attained varies substantially from system to system, and these variations imply differences in the way information about the deterministic nature of the dynamics is progressively lost as the number of summands increases. Some of the identified factors in shaping the convergence process are the strength of mixing induced by the mapping and the shape of the marginal distribution of each variable, most particularly, the presence of divergences or fat tails.

Keywords:

stable distributions; deterministic systems; central limit theorem

1. Introduction

According to the central limit theorem (CLT), the sum of independent variables with finite first and second moments is governed by a Gaussian distribution when the number of summands is asymptotically large. The mean value and the variance of the Gaussian equal the sum of the individual mean values and variances, respectively. The Gaussian distribution has maximal entropy for a given variance and is reached independently of the distributions from which the summands are sampled. The convergence to the Gaussian limit, therefore, can be viewed as a loss of information about the original data. Extension to sums of variables with diverging first and second moments have been derived in [1,2,3,4]; the asymptotic distributions of there are no longer Gaussian, but are still members of a family of so-called stable distributions.

Experience shows that many systems are successfully modeled by stable distributions, for example, in the theory of errors and propagation of uncertainty. This is often justified by the fact that errors, as well as many other quantities of interest, can be conceived as the sum of a large number of variables representing disparate magnitudes that appear to be unrelated. Yet, Physics, for instance, dictates that all the variables describing a system of interacting particles (as opposed to an ensemble of free particles) are correlated to one another. Therefore, the independence condition is no more than an approximation.

To improve this approximation, extensions of the CLT have also been developed for variables that bear different degrees of statistical dependencies, including those obtained through the subsequent application of a deterministic rule that produces ergodicity and aperiodicity [5,6,7,8,9,10,11]. Here, we analyze several systems of this type. As discussed in the next section, a conveniently modified version of the CLT exists for appropriately scaled sums of variables deterministically related to one another. Notably, the family of stable distributions for these cases coincides with the ones obtained for independent variables. These extensions provide mathematical certainty that sums of strongly correlated variables, if produced through a chaotic dynamical system, lose all memory of their original distribution, and asymptotically approach a distribution that also happens to be the limit of sums of independent variables sampled from a certain distribution. The strong statistical dependencies governing the physical world, therefore, may be legitimately ignored when describing the probability distributions of macroscopic variables, and it is legitimate to conceive the latter as a sum of a large number of microscopic, independent variables. This property greatly simplifies the description of macroscopic systems and has probably played a crucial role in the development of the theory of probability.

In practical situations, however, it is important to know how many terms a sum needs to include for its distribution to be well described by the asymptotic result. To shed light on this question, we study in this paper the convergence of the probability distribution of a sum of perfectly correlated variables, generated through the iteration of a chaotic, deterministic map, towards the asymptotic distribution predicted using the extensions of the CLT. The aim is to characterize how the loss of information about the deterministic nature of the map depends on the number of variables that are summed together. Since previous theoretical results do not predict the rate of convergence towards asymptotic distributions in deterministic systems, our analysis is based on numerical simulations of several paradigmatic examples, and on a comparison with the behavior of randomly sampled systems with the same distributions.

The paper is organized as follows. In Section 2, we present the main theoretical tools to be employed later; these include the extension of the CLT to variables that are strongly correlated, the information–theoretical measures that quantify the differences between probability distributions, and the behavior of the variance of a sum of variables that are correlated. The following three sections apply these tools to the analysis of a chaotic dynamical system with a uniform marginal distribution and varying Lyapunov exponent (the Bernoulli map, Section 3), a chaotic dynamical system with a highly nonuniform marginal distribution and several types of orbits (the logistic map, Section 4), and an example of a process with fat-tailed distribution (Section 5). Our main conclusions are summarized in Section 6.

2. Central Limit Theorem for Deterministic Maps

We consider a generic one-dimensional map,

x (t + 1) = f [x (t)]

, with a well-defined invariant measure

ρ_{x} (x)

, determined using the identity

ρ_{x} (x) d x = ρ_{x} [f (x)] d f (x) = [ρ_{x} \circ f] (x) f^{'} (x) d x,

(1)

where

[ρ_{x} \circ f] (x)

is the composition of functions

ρ_{x} (x)

and

f (x)

, and the prime indicates differentiation with respect to x. We assume that the mean value

\bar{x}

is finite over the distribution

ρ_{x} (x)

, and—for now—we assume the variance

σ_{x}^{2}

of x is also finite:

\bar{x} = \int x ρ_{x} (x) d x < \infty, σ_{x}^{2} = \int {(x - \bar{x})}^{2} ρ_{x} (x) d x < \infty,

(2)

where the integrals run over the whole domain of variation of x. In Section 5, we study a case where we relax the condition that

σ_{x}^{2}

is finite. A central limit theorem (CLT) for this kind of system applies [6,7,8,9,10,11] when the map under study is ergodic and aperiodic. We recall that a map is ergodic if all its invariant sets are null or co-null, and it is aperiodic if its periodic orbits form a null set [11]. The combination of ergodicity and aperiodicity is typically equivalent to the dynamics being chaotic [12]. In this case, the CLT states that the distribution of the (centered, suitably normalized) sums of N successive values of

x (t)

,

s_{N} (t) = \frac{1}{\sqrt{N}} \sum_{k = 1}^{N} [x (t + k - 1) - \bar{x}],

(3)

becomes normal for

N \to \infty

:

ρ_{s} (s) = \frac{1}{\sqrt{2 π σ_{s}^{2}}} exp (- \frac{s^{2}}{2 σ_{s}^{2}}) \equiv G_{σ_{s}} (s),

(4)

for some value of the variance

σ_{s}^{2}

. Here,

G_{σ_{s}}

denotes the Gaussian centered at zero, with standard deviation

σ_{s}

.

For each value of N, the variables

x (t)

and

s_{N} (t)

can be integrated into a single two-dimensional map:

\{\begin{matrix} x (t + 1) = f [x (t)], \\ s_{N} (t + 1) = s_{N} (t) - \frac{1}{\sqrt{N}} x (t) + \frac{1}{\sqrt{N}} f^{(N)} [x (t)], \end{matrix}

(5)

where

f^{(N)} (x) = \overset{N}{\overset{︷}{f \circ f \circ \dots \circ f}} (x)

is the N-th self-composition of

f (x)

. Thus, for

N \to \infty

, the marginal invariant measures of the variables x and

s_{N}

in map (5) are, respectively,

ρ_{x} (x)

and the Gaussian

ρ_{s} (s) = G_{σ_{s}} (s)

of Equation (4).

In contrast with the sums of statistically independent random variables drawn from a given distribution, in the limit

N \to \infty

, the variance of the sums

s_{N} (t)

does not necessarily coincide with that of the summands,

σ_{x}^{2}

. The difference arises from the correlations between successive values of

x (t)

, induced by the map

x (t + 1) = f [x (t)]

, with the ensuing mutual correlations between the values of

s_{N} (t)

. For a finite number of summands N, the variance of

s_{N} (t)

is given by the Green–Kubo formula [13]:

σ_{s_{N}}^{2} = σ_{x}^{2} + 2 \sum_{k = 1}^{N - 1} (1 - \frac{k}{N}) \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]},

(6)

where the overline indicates the average with respect to the distribution

ρ_{x} (x)

. The value of

σ_{s_{N}}^{2}

becomes independent of t when the process

x (t)

has reached a stationary regime. For

N \to \infty

, the variance is

σ_{s}^{2} = lim_{N \to \infty} σ_{s_{N}}^{2} = σ_{x}^{2} + 2 \sum_{k = 1}^{\infty} \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]} .

(7)

Provided that the sum converges, this formula gives the variance of the asymptotic normal distribution

G_{σ_{s}} (s)

of increasingly long sums

s_{N} (t)

.

In the following, we study the process of convergence towards the asymptotic distribution predicted by the above CLT for some selected deterministic maps, as the number of terms in the sums

s_{N}

grows. For each N, we numerically iterate Equation (5) and estimate the distribution of the sums

s_{N}

,

ρ_{s_{N}} (s_{N})

, as a suitably normalized

10^{3}

-column histogram built from, typically,

10^{7}

values of

s_{N}

. To quantify the difference between

ρ_{s_{N}}

and the expected asymptotic Gaussian distribution

G_{σ_{s}} (s)

, we use the Kullback–Leibler divergence (KLD). Recall that the KLD between two distributions

ρ_{1} (s)

and

ρ_{2} (s)

is defined as

D (ρ_{1} | | ρ_{2}) = \int ρ_{1} (s) {log}_{2} [\frac{ρ_{1} (s)}{ρ_{2} (s)}] d s .

(8)

This quantity measures the inefficiency with which the data s is represented by a code optimized to be maximally compact under the assumption that the distribution is

ρ_{2}

when, in reality, the data are generated from

ρ_{1}

. The inefficiency equals the mean number of extra bits per sample [14]. The divergence only vanishes when the two distributions coincide, and is otherwise positive. For brevity, we hereafter denote as

D_{G}

the KLD between the distribution

ρ_{s_{N}}

and the asymptotic normal distribution

G_{σ_{s}}

:

D_{G} \equiv D (ρ_{s_{N}} | | G_{σ_{s}})

.

Additionally, for each N, it is interesting to compare

ρ_{s_{N}}

with a normal distribution with the variance

σ_{s_{N}}^{2}

given by Equation (6), namely, the same variance as the sums

s_{N}

. Since

σ_{s_{N}}^{2} \to σ_{s}^{2}

as

N \to \infty

, this is an alternative way of characterizing the convergence to the asymptotic Gaussian

G_{σ_{s}}

. For this comparison, we introduce the KLD

D_{G_{N}} \equiv D (ρ_{s_{N}} | | G_{σ_{s_{N}}})

.

Finally, in order to contrast the deterministic dynamics of the chaotic map under study with a genuinely aleatory process, we calculate the KLD for the distribution of sums of the same form as in Equation (3), but with the N values of the variable x drawn at random from the invariant measure

ρ_{x} (x)

. According to the standard CLT for statistically independent variables, as N grows, the distribution

ρ_{s_{N}}^{random}

of these random sampling sums is expected to asymptotically converge to a Gaussian with variance

σ_{x}^{2}

. To quantify this convergence, we compute

D_{random} \equiv D (ρ_{s_{N}}^{random} | | G_{σ_{x}})

.

The measures

D_{random}

,

D_{G}

and

D_{G_{N}}

reflect three different aspects of the convergence of

ρ_{s_{N}}

to

G_{σ_{s}}

. The process by which

D_{random}

tends to zero describes how independent variables, when summed together, lose the memory of the distribution from which they are sampled and approach a Gaussian. The Gaussian distribution is the one with maximal entropy among those with fixed variance. By acquiring a Gaussian shape, therefore, the distribution of the sum maximizes uncertainty. In Appendix B, we show that, for large N, the divergence

D_{random}

decays as

N^{- 1}

if

ρ_{x}

is not symmetric around its mean value, and at least as fast as

N^{- 2}

if there is symmetry.

A steep decay of

D_{G_{N}}

with N, at a faster rate than

D_{G}

, implies a rapid evolution of

ρ_{s_{N}}

towards a bell-shaped distribution, whose variance may still have to evolve to its asymptotic value

σ_{s}^{2}

. The convergence process can therefore be conceived as a sequence of two stages, the first one consisting of shedding all the structure in

ρ (x)

and becoming Gaussian-like, and the second, adjusting the variance. Once

ρ_{s_{N}}

is approximately Gaussian, its KLD with the asymptotic distribution

G_{σ_{s}}

can be analytically calculated in terms of their respective variances:

D_{G} \approx {log}_{2} (\frac{σ_{s_{N}}}{σ_{s}}) + \frac{1}{2 ln 2} \frac{σ_{s}^{2} - σ_{s_{N}}^{2}}{σ_{s_{N}}^{2}} .

(9)

3. The Bernoulli Map

As a first case of study, we take the generalized Bernoulli map

x (t + 1) = f [x (t)] = {m x (t)},

(10)

where

{\cdot}

indicates a fractional part, and

m > 1

is an integer factor. This map has been extensively studied since long ago as a paradigm of deterministic chaotic systems, due to its combination of complex behavior and analytical traceability. Its Lyapunov exponent equals

ln m

. The invariant measure of

x (t)

is particularly simple:

ρ_{x} (x) = \{\begin{matrix} 1 & for x \in [0, 1), \\ 0 & otherwise, \end{matrix}

(11)

for all m, with

\bar{x} = 1 / 2

and

σ_{x}^{2} = 1 / 12

. We show in Appendix A that the variances of the sums

s_{N}

can be explicitly calculated:

σ_{s_{N}}^{2} = \frac{1}{12} + \frac{1}{6 (m - 1)} (1 - \frac{m}{m - 1} \frac{1 - m^{- N}}{N}) .

(12)

Note that for

N ≫ {(ln m)}^{- 1}

, this variance takes the approximate form

σ_{s_{N}}^{2} \approx \frac{1}{12} \frac{m + 1}{m - 1} (1 - \frac{N_{0}}{N}),

(13)

with

N_{0} = 2 m / (m^{2} - 1)

. For

N \to \infty

, in turn,

σ_{s_{N}}^{2} \to σ_{s}^{2} = \frac{1}{12} \frac{m + 1}{m - 1} .

(14)

We first consider the Bernoulli map for

m = 2

. Dark full lines on the left column of Figure 1 show numerical results for the distributions of the sums

s_{N}

,

ρ_{s_{N}}

, for three small values of N. Light-colored curves stand for the asymptotic Gaussian

ρ_{s} = G_{σ_{s}}

, and dashed curves are the Gaussians

G_{σ_{s_{N}}}

for each N. Their respective variances,

σ_{s}^{2}

and

σ_{s_{N}}^{2}

, are given by Equation (12). On the right column, dark- and light-colored curves, respectively, show the distributions of the sums of randomly sampled values of x,

ρ_{s_{N}}^{random}

, calculated analytically as N-th order self-convolutions of

ρ_{x} (x)

, and the expected asymptotic Gaussian

G_{σ_{x}}

. A comparison of the two columns illustrates the difference between the distributions of the sums generated by map iteration on one side and by random sampling on the other. It also shows that convergence to the asymptotic distribution is faster in the latter case.

The main panel of Figure 2 shows, with different symbols, the KLDs

D_{G}

,

D_{G_{N}}

, and

D_{random}

, defined in the preceding section. For

N = 1

, by definition,

D_{random} = D_{G_{N}}

. For large N (Appendix B),

D_{random}

decays as

N^{- 2}

. The straight lines in the log–log plot of the figure have slope

- 2

, suggesting that the decay of the divergence

D_{G}

approximately follows the same asymptotic dependence on N. The inset in Figure 2 shows, as dots, the numerical estimation of the variance of

s_{N}

over the distribution

ρ_{s_{N}}

as a function of N. The dashed curve corresponds to the analytical expression of Equation (12).

In the range shown in the figure, for

N ≳ 10

,

D_{G}

is larger than

D_{random}

by a factor of around 14. Meanwhile, in the same range,

D_{G_{N}}

decays faster, approximately as

N^{- 2.3}

. As discussed at the end of Section 2, this faster decay of

D_{G_{N}}

suggests that

ρ_{s_{N}}

is rapidly approaching a Gaussian distribution, with a KLD with the asymptotic distribution

ρ_{S}

as given by Equation (9). Replacing Equation (13) into Equation (9) and expanding up to second order in

N_{0} / N

yields

D_{G} \approx \frac{1}{4 ln 2} \frac{N_{0}^{2}}{N^{2}} .

(15)

For

m = 2

we have

N_{0} = 4 / 3

so that, according to the above equation,

D_{G} \approx 0.64 N^{- 2}

. A power–law fitting of the data for

D_{G}

for

N \leq 20 \leq 50

gives

D_{G} \approx 0.69 N^{- 1.9}

, which fits the prediction of Equation (15) remarkably well. This agreement provides strong evidence in favor of the hypothesis that

ρ_{s_{N}}

converges to

ρ_{s}

in two stages, acquiring a Gaussian shape in the first, and adjusting the variance in the second. The transition from the first stage to the second, however, does not imply that

ρ_{s_{N}}

is strictly speaking a Gaussian distribution.

What are the implications of the fact that after the initial transient

D_{G}

and

D_{random}

both decay with the same power law, approximately proportional to

N^{- 2}

? In this regime,

D_{G} \approx 14 D_{random}

which means that, for each N,

D_{random} (N)

is approximately equal to

D_{G} (\sqrt{14} N)

. By increasing the number of random samples drawn from the invariant measure (11),

D_{random}

diminishes by the same amount as

D_{G}

diminishes when running the Bernoulli deterministic mapping a re-scaled, larger number of samples, with a scaling factor of

α \approx \sqrt{14} \approx 3.7

. In other words,

α

samples of the deterministic map are as informative about the asymptotic distribution as a single sample in the random drawing. The presence of correlations makes each new sample from the deterministic dynamics less informative (by a factor of

α

) than from purely independent draws.

The factor

α

may also be semi-quantitatively associated with the relation between the asymptotic variance

σ_{s}^{2}

and the original variance

σ_{x}^{2}

. In Equation (3), the normalization factor

1 / \sqrt{N}

compensates for the fact that the variance of a sum of N independent samples is proportional to N. Yet, when the summands bear statistical interdependence, the intended compensation need not be attained. The higher the correlations in the deterministic map, the less informative each new datum is, the more unsuccessful the compensation, and the larger the increase in the asymptotic variance. In the present case, the variance increases threefold, from

1 / 12

to

1 / 4

, which is similar to the factor relating

D_{G}

and

D_{random}

, namely,

α

.

Considering now the other values of m in the Bernoulli map (10), the numerical results presented in Figure 3 show that the dependence of

D_{G}

on N is similar to that obtained for

m = 2

, with the only difference that

D_{G}

becomes progressively smaller as m grows. As before, the convergence may be conceived as consisting of two stages, with Equation (9) approximately holding for the second stage. According to the results of Figure 3, the second state is reached faster for larger values of m. As expected from the large-N asymptotic behavior of

D_{G}

predicted by Equation (15) with

N_{0} = 2 m / (m^{2} - 1)

[cf. Equation (13)], it approaches

D_{random}

for large m. This implies that the effect of the statistical dependencies induced by the deterministic nature of the map decreases as m grows. The KLD

D_{G_{N}}

is not shown in Figure 3, but its behavior is similar to that of the case of

m = 2

(Figure 2).

In summary, in the Bernoulli map,

D_{G_{N}}

decreases faster than

D_{G}

during the first stage of the convergence process, where

ρ_{s_{N}}

acquires a Gaussian-like shape. Only later is the variance adjusted towards its final value. The second stage can be modeled analytically, providing a good qualitative description of the asymptotic behavior inferred from numerical results.

4. The Logistic Map: Full Chaos and Intermittency

We now turn our attention to the logistic map [17,18]

x (t + 1) = f [x (t)] = λ x (t) [1 - x (t)],

(16)

with

0 < λ \leq 4

. Much like Bernoulli’s, the logistic map hardly needs any presentation. We first consider the case

λ = 4

, which we call the regime of “full chaos”. For this value of

λ

, the dynamics are chaotic and therefore comply with the hypotheses of the CLT for deterministic systems discussed in Section 2. Moreover, due to the existence of a nonlinear change of variables that transforms the logistic map with

λ = 4

into the Bernoulli map of Equation (10) with

m = 2

, several analytical results for the latter can be extended to the former. In spite of this connection, as we show below, the statistics of the sums

s_{N}

are qualitatively different between the two maps.

For

λ = 4

, the invariant measure of the logistic map can be written explicitly as [19]

ρ_{x} (x) = \frac{1}{π \sqrt{x (1 - x)}}

(17)

for

0 \leq x \leq 1

, and 0 otherwise. The mean value is

\bar{x} = 1 / 2

and the variance is

σ_{x}^{2} = 1 / 8

. As we show in Appendix A, the correlations between iterations of the map,

c_{k} = \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]}

, vanish for all k. From Equations (6) and (7), this implies that the variances of the sums

s_{N}

are the same for all N, and therefore coincide with both the variance of x and with the limit for

N \to \infty

:

σ_{s_{N}}^{2} = σ_{s}^{2} = σ_{x}^{2}

. Therefore—in contrast with the Bernoulli map studied in the preceding section—it is not possible to discern between a first stage of convergence to a Gaussian profile and a second stage of adjustment of the variance.

In Figure 4, the left column shows numerical estimations of the distributions

ρ_{s_{N}} (s_{N})

of the sums of N consecutive iteration of the logistic map with

λ = 4

, for three values of N. The light-colored curve corresponds to the expected asymptotic Gaussian.

In addition to the sharp peaks in the profile of

ρ_{s_{N}}

for small N, an important difference with the Bernoulli map (Figure 1) is that

ρ_{s_{N}}

is no longer symmetric with respect to zero. This asymmetry may come as a surprise, taking into account that both

f (x)

and

ρ_{x} (x)

are symmetric around the mean value

\bar{x}

. The asymmetry, however, originates from the fact that the functions

x + f (x)

,

x + f (x) + f^{(2)} (x)

,

x + f (x) + f^{(2)} (x) + f^{(3)} (x)

, …, which ultimately determine the distributions of the sums

s_{N}

, are not symmetric around

\bar{x}

.

On the right column of Figure 4 we show, for the same values of N, the distributions

ρ_{s_{N}}^{random}

of sums of N random values of x sampled from

ρ_{x}

. In contrast with the case of the Bernoulli map,

ρ_{s_{N}}^{random}

is here estimated numerically. As expected, the distributions of random sampling sums are now symmetric with respect to zero, and exhibit a fast convergence to the asymptotic Gaussian.

Figure 5 shows

D_{G}

and

D_{random}

for the fully chaotic logistic map, as functions of N. Since, as explained above,

σ_{s_{N}}^{2}

equals

σ_{s}^{2}

for all N, now

D_{G_{N}}

coincides with

D_{G}

. Due to the symmetry of

ρ_{x}

with respect to its mean value, the arguments given in Appendix B apply to this case, and

D_{random}

decays as

N^{- 2}

for large N. The full straight line in the log–log plot of the figure has slope

- 2

, confirming this prediction in the plotted range. Yet, the behavior of

D_{G}

is considerably different. It starts with a small increment between

N = 1

and 2, where it attains a maximum, and thereafter decays rapidly up to

N \approx 20

. This decay corresponds to the interval of N for which the distribution

ρ_{s_{N}}

displays identifiable singularities. For

N ≳ 20

, the singularities start to overlap, and the distribution

ρ_{s_{N}}

varies more smoothly and displays a well-defined asymmetric bell-shaped profile. In this zone, the decay of

D_{G}

is slower and approximately behaves as

N^{- 1}

, as illustrated by the dashed straight segment of slope

- 1

. As shown in Appendix B, a decay as

N^{- 1}

is expected for the KLD of the distribution of random sampling sums when the distribution of the individual summands is not symmetric with respect to the mean value. If the disparate dependence on N between

D_{G}

and

D_{random}

persists as N grows beyond the range considered here, their relative difference would increase indefinitely for

N \to \infty

.

Although still chaotic, other values of

λ

in Equation (16) give rise to qualitatively different dynamical features in the logistic map. For

λ = 3.828

, which is our next case of study, the dynamics are intermittent. Just above this value of

λ

, at

λ_{3} = 1 + 2 \sqrt{2} \approx 3.8284

, the logistic map enters the largest stability window within its chaotic regime, where

x (t)

becomes asymptotically locked in a period-3 orbit. For

λ ≲ λ_{3}

, the vicinity of the critical point manifests itself in the form of intermittent behavior for

x (t)

. Namely, the dynamics alternate intermittently between intervals of “turbulent” evolution, where its behavior is conspicuously chaotic, and “laminar” evolution, where

x (t)

remains temporarily close to the period-3 orbit, but eventually departs away from it. The left panel of Figure 6 shows 900 successive iterations of

x (t)

for the above value of

λ

, illustrating both kinds of behavior.

For

λ = 3.828

, no analytical description of the logistic map exists, and we must resort to numerical techniques. As inferred from the left panel of Figure 6, in this case,

ρ_{x} (x)

covers only a portion of the interval

[0, 1]

, between

x \approx 0.157

and

0.957

, and displays three peaks near the values of x in the period-3 orbit. Our numerical estimations for the mean value and the variance are

\bar{x} \approx 0.593

and

σ_{x}^{2} \approx 0.0864

. In principle, the variance of the sums

s_{N}

could be obtained from Equations (6) and (7) by numerically computing the correlations

c_{k} = \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]}

. These quantities, however, exhibit sharp oscillations and slow convergence as k grows, as well as persistent fluctuations for large k. The right panel in Figure 6 shows

c_{k}

up to

k = 90

. In practice, such features make impossible the evaluation of the variances

σ_{s_{N}}^{2}

and

σ_{s}^{2}

using the sums in Equations (6) and (7). We therefore resort to their direct numerical calculation using the values of

s_{N} (t)

obtained from successive map iterations. In particular, our estimation for the variance of the sums in the limit

N \to \infty

is

σ_{s}^{2} \approx 0.0403

.

Colored symbols in the main panel of Figure 7 stand for

D_{G}

in the case of the logistic map with

λ = 3.828

, as a function of N. As with full chaos (cf. Figure 5), two distinct decay regimes are identifiable. Moreover, the behavior for

N ≲ 50

now contains signatures of the pseudo-periodic nature of the mapping in the “laminar” intervals, namely, the relatively large values of

D_{G}

when N is a multiple of 3 (triangles). In fact, for those values of N, the distributions

ρ_{s_{N}}

are narrower and sharper than for the remaining values, giving rise to higher KLDs. This is clearly illustrated by the dependence of the variance

σ_{s_{N}}^{2}

on N, shown in the inset of the figure. After an abrupt initial decay,

σ_{s_{N}}^{2}

displays oscillations of period 3, which progressively damp out as N grows. For

N ≳ 50

, the difference in

D_{G}

for multiples of 3 rapidly smooths out, as the KLD enters a regime where it decays approximately as

N^{- 1}

, as indicated by the dashed segment of slope

- 1

.

For this case of intermittent dynamics, we have also calculated

D_{G_{N}}

, finding qualitatively the same behavior as for

D_{G}

. As a matter of fact,

D_{G}

and

D_{G_{N}}

typically differ from each other in just about a 10%. Thus, for the sake of clarity, the numerical estimations of

D_{G_{N}}

were not included in Figure 7. As for the KLD of the distribution of random sampling sum,

D_{ramdom}

, the results of Appendix B indicate that it should decay as

N^{- 1}

for large N. This behavior, however, has not yet been reached in the range of values displayed in Figure 7. Assuming nevertheless that this is the asymptotic dependence of

D_{random}

, our results suggest that the KLD for random sampling sums is no less than three orders of magnitude smaller than

D_{G}

for large N.

In summary, both for

λ = 4

and

3.828

, the main difference between the statistics of the sums

s_{N}

obtained from the iteration of the logistic map and from a random sampling of the corresponding invariant measures, as N grows, resides in their disparate rates of approach towards the asymptotic distribution. Within the range of N considered in our numerical calculations, the decay of

D_{G}

as

N^{- 1}

can be qualitatively understood by the lack of symmetry in the invariant measures although, strictly speaking, the corresponding result in Appendix B holds for random sampling only.

Both when

λ = 4

and

3.828

, for

N ≳ 20

, the difference between

D_{G}

and

D_{random}

is well above two orders of magnitude. In the intermittent case, moreover, the pseudo-periodic character of the “laminar” dynamics reveals itself in the form of oscillations in

D_{G}

for small N, which are naturally absent in

D_{random}

. Plausibly, pseudo-periodicity is also responsible for the slow decrease in

D_{G}

during the oscillatory regime. Intermittency degrades the mixing properties of the mapping since, during the pseudo-periodic intervals, the dynamics only explore a reduced portion of the available range in x.

5. A Fat-Tailed Invariant Distribution

Much like the standard CLT, the CLT for deterministic systems can be generalized to the situation where the variance of the relevant variable x diverges [11]. In particular, this is the case of invariant distributions with a sufficiently slow algebraic decay for large

| x |

:

ρ_{x} (x) \sim {| x |}^{- α - 1}

with

0 < α < 2

. Under the same hypotheses of ergodicity and aperiodicity stated in Section 2, and assuming for simplicity that

\bar{x} = 0

—for instance, due to the symmetry of

ρ_{x} (x)

around zero—the distribution of the sums

s_{N} (t) = \frac{1}{N^{1 / α}} \sum_{k = 1}^{N} x (t + k - 1)

(18)

[cf. Equation (3)] converges to a stable distribution given by the Fourier anti-transform of

Q_{γ_{s}} (k) = exp (- γ_{s}^{α} {| k |}^{α})

, for some value of the dispersion parameter

γ_{s}

. The result for distributions with finite variance is re-obtained in the limit

α = 2

, with

γ_{s} \equiv σ_{s}

as defined in Equation (7).

In this section, we give an example of convergence toward a stable distribution different from a Gaussian in the case of a map with a fat-tailed invariant distribution decaying as

{| x |}^{- 2}

for large

| x |

(i.e.,

α = 1

). This specific case has the analytical advantage that the stable distribution predicted by the CLT can be explicitly written out, namely,

C_{γ_{s}} (s) = \frac{1}{π} \frac{γ_{s}}{γ_{s}^{2} + s^{2}},

(19)

which is nothing but the Cauchy (or Lorentzian) distribution. Like the Gaussian, the Cauchy distribution is a maximum entropy distribution, but with a different constraint.

To obtain a deterministic chaotic map with a variable distributed following a fat-tailed function, we use the ad hoc procedure of applying a suitable transformation to a map whose invariant distribution is known in advance. Specifically, we take the Bernoulli map of Equation (10) with

m = 2

, for which we know that the invariant distribution is the function given by Equation (11), and introduce a change of variables that transforms this function into the desired fat-tailed profile. This is formally achieved by defining the two-variable map

\{\begin{matrix} u (t + 1) = {2 u (t)}, \\ x (t + 1) = τ [{2 u (t)}], \end{matrix}

(20)

where

τ (u) = \{\begin{matrix} (2 u - 1) / 2 u & for 0 < u \leq 1 / 2, \\ (2 u - 1) / 2 (1 - u) & for 1 / 2 \leq u < 1 \end{matrix}

(21)

transforms a variable u with uniform distribution in

(0, 1)

into a variable

τ \in (- \infty, \infty)

with distribution

ρ_{τ} (τ) = 1 / 2 {(1 + | τ |)}^{2}

. By construction, thus, the invariant measure of variable x in map (20) is

ρ_{x} (x) = \frac{1}{2 {(1 + | x |)}^{2}},

(22)

with x varying from

- \infty

to ∞.

By analyzing the behavior of the Fourier transform of

ρ_{x} (x)

near the origin, it is possible to obtain the dispersion parameter for the Cauchy distribution of sums of independently chosen values of x, which turns out to be

γ_{s} = π / 2

. Unfortunately, the value of

γ_{s}

when the summands are successive iterations of x in map (20) cannot be found analytically in an explicit way. However, we have numerically found that, for

N \to \infty

, the dispersion parameter again coincides with

γ_{s} = π / 2

to a high precision. This is the value of

γ_{s}

that we use to compute the KLD

D_{C} = D (ρ_{s_{N}} | | C_{γ_{s}})

between the distribution of the sums

s_{N}

of Equation (18) and the Cauchy distribution (19). In addition, we do not have a practical procedure to assign a value to the dispersion parameter when the number of summands N is finite. Therefore, in the present case, we do not calculate a quantity analogous to the KLD

D_{G_{n}}

of Section 3 and Section 4. Regarding

D_{random}

, due to the non-analytic behavior of the Fourier transform of

ρ_{x} (x)

at the origin, it is now not possible to use the procedure of Appendix B to predict how this KLD decreases as N grows. Our analysis must thus rely on numerical results.

In Figure 8, we show the distributions

ρ_{s_{N}} (s_{N})

(left column) and

ρ_{s_{N}}^{random} (s_{N})

(right column) for three small values of N. Light-colored curves correspond to the expected asymptotic Cauchy distribution, given by Equation (19) with

γ_{s} = π / 2

. Note that for

N = 2

, due to the peak at

s_{N} = 0

, the difference between

ρ_{s_{N}}^{random}

and the asymptotic distribution seems to be larger than that of

ρ_{s_{N}}

. The KLDs, however, reveal that

ρ_{s_{N}}^{random}

is slightly closer to the Cauchy distribution (see Figure 9). For

N = 10

, it is already clear that the approach to the Cauchy distribution is faster for the random sampling sums. Comparison with the results for the Bernoulli and the logistic maps (cf. Figure 1 and Figure 4) suggest however that, in the present situation, the convergence to the corresponding asymptotic distribution is considerably slower than for those cases.

Figure 9 presents numerical results for the KLDs

D_{C}

and

D_{random}

. In order to have significant statistics in the construction of the 1000-column histogram that represents

ρ_{s_{N}} (s)

from

10^{7}

samples of the sums

s_{N}

, we have cut off the interval of variation of

s_{N}

to (

- 10, 10

), disregarding samples outside that interval. Otherwise, for the fat-tailed distributions involved in the present case, the calculation of the KLDs would be dominated by sampling fluctuations for large values of

| s_{N} |

. Along most of the range of N spanned by the figure, both

D_{C}

and

D_{random}

exhibit rather well defined power–law decays. Their different exponents, however, make that they progressively diverge from each other as N grows, while

D_{random}

approximately decays as

N^{- 1}

; as illustrated by the full straight line of slope

- 1

, a linear fitting of

D_{C}

for

N \geq 2

, shown as a dashed line, points to a slower decay with a nontrivial exponent:

N^{- 0.68}

. This result suggests that the convergence to an asymptotic distribution for the sums

s_{N}

in the case of fat-tailed invariant measures may generally be characterized by unusual exponents in the decay of the KLD. This conjecture will be thoroughly explored in future work, through both analytical and numerical means.

6. Conclusions

We here analyzed the convergence to the asymptotic probability density distribution

ρ_{s}

of a succession of distributions

ρ_{s_{N}}

for a conveniently scaled sum of N samples obtained from iterations of a deterministic map. Previous analytical studies had established that a modified version of the central limit theorem (CLT) exists for these cases. Yet, as far as we know, the convergence to the limit had not yet been characterized. Here, we studied several archetypal examples that expose a variety of ways the limiting distributions are approached.

Our characterization was based on the behavior of the Kullback–Leibler divergence (KLD)

D_{G}

between

ρ_{s}

and

ρ_{s_{N}}

, in that specific order. With this choice, the KLD equals the number of extra bits required to encode a sample from

ρ_{s}

if the code has been optimized for

ρ_{s_{N}}

. The CLT for sums of random samples with finite variance predicts a KLD

D_{random}

that decreases as

N^{- 2}

if each sample is drawn from a distribution that is symmetric around its mean value, and as

N^{- 1}

if it is not. This is a bold statement, since an infinitesimal modification may suffice to turn a symmetric distribution into an asymmetric one, so even a minute modification would suffice to change the entire asymptotic behavior of the KLD—the change, however, would only become relevant at increasingly larger values of N, as the asymmetry tended to disappear. We are not aware of an analogous theoretical prediction for the case of correlated samples, but the results presented have revealed similar behaviors:

D_{G}

decreased as

N^{- 2}

for the Bernoulli map, for which the sums are distributed symmetrically around their mean value, and as

N^{- 1}

for the logistic map, where the distributions are asymmetric.

In both the Bernoulli and the logistic map, the rates at which

ρ_{s_{N}}

approached the asymptotic distribution increased with the strength of mixing. Moreover, for the intermittent logistic map, where mixing is virtually absent during pseudo-periodic intervals, convergence to the asymptotic distribution was particularly slow. Therefore, even though all the explored examples were equally deterministic, their behavior differed considerably. Details in the chaotic dynamics are crucial to the behavior of

D_{G}

for large N.

The convergence of

ρ_{s_{N}}

in the Bernoulli map could be divided into two stages, one in which the distribution acquired an approximately Gaussian profile, and a subsequent one, in which the variance was adjusted to approach its asymptotic value. Remarkably, in the second stage and for sufficiently large N, the divergence

D_{G} (N)

was equal to

D_{random} (α N)

with

α \approx 3.74

, implying that each sample of the deterministic map was as informative about the asymptotic distribution as

α

random samples. This equivalence could not be established in the other explored examples, since in all of them,

D_{random}

and

D_{G}

decreased with N with different power laws. No re-scaling procedure, hence, could transform one into the other.

The last example involved variables with divergent variance. In this case, the derivation of Appendix B is no longer valid, and no theoretical formulation describing how

D_{random}

tends to zero is known to us. Our numerical explorations revealed a behavior proportional to

N^{- 1}

for

D_{random}

, even for samples drawn from distributions that are symmetric around their mean values. The deterministic counterpart

D_{G}

exhibited an even slower evolution, at a rate that is also slower than the one observed in the cases of finite variance.

In conclusion, in all the examples explored here, the asymptotic trend of the KLD behaved as a power law. Different deterministic maps yielded different exponents, displaying a variety of behaviors. The factors that influenced the exponents were (a) the strength of mixing in the chaotic map, (b) the tendency of the system to evolve near periodic orbits, and (c) the tails of the distribution of individual variables. We stress that the open question of establishing a quantitative connection between the rate of mixing, on one hand, and of KLD decay, on the other, remains an interesting problem for future work. Remarkably, except for the logistic map in the intermittent regime, all the maps explored here are related to each other by simple, nonlinear transformations. Despite these deterministic functional relations, their nonlinear nature determines differences in the statistical behavior of the sums of samples drawn from each map, with a large impact on the convergence towards their asymptotic distributions.

Author Contributions

Conceptualization, D.H.Z. and I.S.; Formal analysis, D.H.Z. and I.S.; Investigation, D.H.Z. and I.S.; Writing—review & editing, D.H.Z. and I.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Calculation of the Variances of N-Term Sums, Equation (6)

Appendix A.1. The Bernoulli Map

To calculate the variances

σ_{s^{(N)}}^{2}

of the sums

s_{t}^{(N)}

of Equation (3), as given by Equation (6), it is necessary to compute the correlations

c_{k} = \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]} = \int (x - \bar{x}) [f^{(k)} (x) - \bar{x}] ρ_{x} (x) d x,

(A1)

where

f^{(k)} (x)

is the k-th self-iteration of

f (x)

. For the Bernoulli map, Equation (10),

f^{(k)} (x)

can be explicitly given as a piece-wise linear function over the interval

[0, 1)

:

f^{(k)} (x) = m^{k} [x - (j - 1) m^{- k}], for (j - 1) m^{- k} \leq x < j m^{- k},

(A2)

and

j = 1, 2, \dots, m^{k}

. Taking into account that

ρ_{x} (x) = 1

over the same interval, with

\bar{x} = 1 / 2

, the correlation in Equation (A1) turns out to be

c_{k} = \frac{1}{12} m^{- k} .

(A3)

Inserting this result in Equation (6), the variances of Equation (12) are straightforwardly obtained. Equation (A3) shows that, for the Bernoulli map, correlations between successive values of

x (t)

decay exponentially with the span k, decreasing the faster the larger m is.

Appendix A.2. The Logistic Map in the Fully Chaotic Regime

The correlations

c_{k} = \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]}

for the fully chaotic logistic map,

x (t + 1) = 4 x (t) [1 - x (t)]

, can be conveniently computed by exploiting the exact solution of the map [20,21],

x (t) = {sin}^{2} (2^{t} ξ_{0}),

(A4)

where

ξ_{0}

is determined using the initial condition

x (0)

through the relation

x (0) = {sin}^{2} ξ_{0}

. Recalling that

ρ_{x} (x) = {[π \sqrt{x (1 - x)}]}^{- 1}

and

\bar{x} = 1 / 2

, Equation (A1) implies

c_{k} = - \frac{1}{4} + \frac{1}{π} \int_{0}^{1} \sqrt{\frac{x}{1 - x}} {sin}^{2} (2^{k} arcsin \sqrt{x}) d x .

(A5)

The integral in this equation may look somehow intimidating but, using the change of variables

x = {sin}^{2} ξ

, it becomes the much simpler form

2 \int_{0}^{π / 2} {sin}^{2} ξ {sin}^{2} (K ξ) d ξ

, with

K = 2^{k}

. Now, it can be easily shown—for instance, by induction over K—that the integral equals

π / 8

for all integers

K > 1

. From this result, it follows that

c_{k} = 0

for all k. Remarkably, therefore, successive iterations of the logistic map in the fully chaotic regime are linearly uncorrelated with each other, although their functional correlation is obviously very large. Thus, the variance of the sums

s_{t}^{(N)}

is

σ_{s_{N}}^{2} = σ_{x}^{2} = \frac{1}{8}

(A6)

for all N.

Appendix B. Kullback–Leibler Divergence for the Distribution of Random Sampling Sums

According to the Berry–Esséen theorem [22,23], the difference between the distribution for the sum of N independent random variables and the Gaussian predicted by the standard central limit theorem decays as

1 / \sqrt{N}

or faster as N grows. We show in this Appendix that, when the distribution of the individual random variables

ρ_{x} (x)

admits a cumulant expansion—i.e., when the logarithm of its Fourier transform can be expanded in powers of its variable—that difference decays as

1 / \sqrt{N}

if

ρ_{x} (x)

is asymmetric with respect to the mean value

\bar{x}

, and as

1 / N

if it is symmetric. This implies that the Kullback–Leibler divergence

D_{random}

defined in the main text decays as

1 / N

in the former case, and as

1 / N^{2}

in the latter. In the distributions considered in the main text, the symmetry with respect to the mean value is verified for the Bernoulli map and for the logistic map in the fully chaotic regime.

Without generality loss, we assume that the mean value over the distribution

ρ_{x} (x)

of the individual random variables is zero. For the sums

s = \sum_{i = 1}^{N} x_{i} / \sqrt{N}

, where

x_{i}

are independent samples of

ρ_{x}

, the distribution

ρ_{s} (s)

results from the N-th order self-convolution of

ρ_{x} (x)

. This operation is most conveniently expressed in terms of the characteristic functions (Fourier transforms)

G_{x} (k)

and

G_{s} (k)

of, respectively,

ρ_{s} (s)

and

ρ_{x} (x)

. Namely,

G_{s} (k) = {[G_{x} (k / \sqrt{N})]}^{N}

, or

ln G_{s} (k) = N ln G_{x} (\frac{k}{\sqrt{N}}) = N \sum_{j = 1}^{\infty} {(- \frac{i k}{\sqrt{N}})}^{j} \frac{κ_{j}}{j!},

(A7)

where the sum in the right-hand side is the power expansion of

ln G_{x} (k)

around

k = 0

, which we assume to exist, and

κ_{j}

is the j-th order cumulant of

ρ_{x} (x)

[24]. We recall that

κ_{1} = \bar{x} = 0

,

κ_{2} = σ_{x}^{2}

is the variance of x over

ρ_{x}

, and

κ_{3} = \bar{{(x - \bar{x})}^{3}}

.

Using this information, the anti-transform of

G_{s} (k)

can be written as

\begin{matrix} ρ_{s} (s) & = & \frac{1}{2 π} \int_{- \infty}^{\infty} exp (i k s - \frac{σ_{x}^{2}}{2} k^{2}) exp [\sum_{j = 3}^{\infty} \frac{{(- i k)}^{j}}{N^{- 1 + j / 2}} \frac{κ_{j}}{j!}] d k \\ \equiv & \frac{exp (- s^{2} / 2 σ_{x}^{2})}{\sqrt{2 π σ_{x}^{2}}} + Δ ρ_{s} (s), \end{matrix}

(A8)

with

Δ ρ_{s} (s) = \frac{1}{2 π} \int_{- \infty}^{\infty} exp (i k s - \frac{σ_{x}^{2}}{2} k^{2}) [\sum_{j = 3}^{\infty} \frac{{(- i k)}^{j}}{N^{- 1 + j / 2}} \frac{κ_{j}}{j!} + \dots] d k,

(A9)

where the ellipsis stands for higher-order terms in the power expansion of the second exponential in the integrand of Equation (A8). Note that

Δ ρ_{s} (s)

is nothing but the difference between

ρ_{s} (s)

and the asymptotic Gaussian distribution

G_{σ_{x}} (s)

and that, due to normalization, it must verify

\int Δ ρ_{s} (s) d s = 0

. If

ρ_{x} (x)

is asymmetric around zero, the third-order cumulant

κ_{3}

is different from zero, and the leading term in powers of N in

Δ ρ_{s} (s)

is given by the summand with

j = 3

in Equation (A9), which implies

Δ ρ_{s} \sim 1 / \sqrt{N}

. If, on the other hand,

ρ_{x} (x)

is symmetric around zero, we have

κ_{3} = 0

. In this case, the sum effectively starts at

j \geq 4

, and

Δ ρ_{s}

decreases as

1 / N

or faster. Of course, if

ρ_{x}

is Gaussian from the start, all the higher-order cumulants vanish, and

Δ ρ_{s} (s)

is trivially equal to 0 for all N.

For the Kullback–Leibler divergence we have, from Equation (8),

\begin{matrix} D_{random} & = & D (ρ_{s} | | G_{σ_{x}}) = \int [G_{σ_{x}} (s) + Δ ρ_{s} (s)] {log}_{2} [1 + \frac{Δ ρ_{s} (s)}{G_{σ_{x}} (s)}] d s \\ \approx & \int [G_{σ_{x}} (s) + Δ ρ_{s} (s)] \frac{Δ ρ_{s} (s)}{G_{σ_{x}} (s)} d s = \int \frac{{[Δ ρ_{s} (s)]}^{2}}{G_{σ_{x}} (s)} d s, \end{matrix}

(A10)

where the approximation holds if

Δ ρ_{s} (s)

is sufficiently small. If the distribution

ρ_{x} (x)

is asymmetric around zero, since

Δ ρ_{s} (s)

decays asymptotically as

1 / \sqrt{N}

, then the decay of

D_{random}

turns out to be as

1 / N

. If it is symmetric,

D_{random}

decays as

1 / N^{2}

or faster, depending on whether the subsequent cumulants vanish or not.

References

Lévy, P. Theorie de L’addition des Variables Aleatoires; Gauthier-Villars: Paris, France, 1937. [Google Scholar]
Gnedenko, B.V.; Kolmogorov, A.N. Limit Distributions for Sums of Independent Random Variables; Addison-Wesley: Cambridge, MA, USA, 1954. [Google Scholar]
Uchaikin, V.V.; Zolotarev, V.M. Chance and Stability: Stable Distributions and Their Applications; Walter de Gruyter: Boston, MA, USA, 2011. [Google Scholar]
Araujo, A.; Giné, E. The Central Limit Theorem for Real and Banach Valued Random Variables; Wiley: New York, NY, USA, 1980. [Google Scholar]
Billingsley, P. Probability and Measure; Wiley: New York, NY, USA, 1995. [Google Scholar]
Burton, R.; Denker, M. On the central limit theorem for dynamical systems. Trans. Am. Math. Soc. 1987, 302, 715. [Google Scholar] [CrossRef]
Denker, M. The central limit theorem for dynamical systems. Banach Cent. Publ. 1989, 1, 33. [Google Scholar] [CrossRef]
Liverani, C. Central limit theorem for deterministic systems. Pitman Res. Notes Math. Ser. 1996, 362, 56. [Google Scholar]
Nicol, M.; Török, A.; Vaienti, S. Central limit theorems for sequential and random intermittent dynamical systems. Ergod. Theor. Dyn. Syst. 2018, 38, 1127. [Google Scholar] [CrossRef]
Kosloff, Z.; Volný, D. Local limit theorem in deterministic systems. Ann. Inst. Henri Poincaré Probab. Stat. 2022, 58, 548. [Google Scholar] [CrossRef]
Kosloff, Z.; Volný, D. Stable CLT for deterministic systems. arXiv 2023, arXiv:2211.03448. [Google Scholar]
Buzzi, J. Chaos and Ergodic Theory. In Encyclopedia of Complexity and Systems Science; Meyers, R., Ed.; Springer: New York, NY, USA, 2009. [Google Scholar]
Wouters, J. Deviations from Gaussianity in deterministic discrete time dynamical systems. Chaos 2020, 30, 023117. [Google Scholar] [CrossRef] [PubMed]
Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley-Interscience: Hoboken, NJ, USA, 2006. [Google Scholar]
Irwin, J.O. On the frequency distribution of the means of samples from a population having any law of frequency with finite moments, with special reference to Pearson’s type II. Biometrika 1927, 19, 225. [Google Scholar] [CrossRef]
Hall, P. The distribution of means for samples of size N drawn from a population in which the variate takes values between 0 and 1, all such values being equally probable. Biometrika 1927, 19, 240. [Google Scholar] [CrossRef]
May, R.M. Simple mathematical models with very complicated dynamics. Nature 1976, 261, 459. [Google Scholar] [CrossRef] [PubMed]
Tsuchiya, T.; Yamagishi, D. The complete bifurcation diagram for the logistic map. Z. Naturforsch. 1997, 52, 513. [Google Scholar] [CrossRef]
Jakobson, M. Absolutely continuous invariant measures for one-parameter families of one-dimensional maps. Comm. Math. Phys. 1981, 81, 39. [Google Scholar] [CrossRef]
Schroder, E. Über iterirte Functionen. Math. Ann. 1870, 3, 296. [Google Scholar] [CrossRef]
Maritz, M.F. A note on exact solutions of the logistic map. Chaos 2020, 30, 033136. [Google Scholar] [CrossRef] [PubMed]
Hazewinkel, M. (Ed.) Encyclopaedia of Mathematics; Reidel: Dordrecht, The Netherlands, 1988. [Google Scholar]
Korolev, V.Y.; Shevtsova, I.G. On the upper bound for the absolute constant in the Berry–Esseen inequality. Theor. Prob. Appl. 2010, 54, 638. [Google Scholar] [CrossRef]
Gardiner, C.W. Handbook of Stochastic Methods; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]

Figure 1. Left, dark line: Numerical results for the distribution

ρ_{s_{N}}

of the sums

s_{N}

defined in Equation (3), in the case the Bernoulli (10) map with

m = 2

, for three small values of N. The light curve is the Gaussian expected for

N \to \infty

, and the dashed curve is a Gaussian with the same variance as predicted for

ρ_{s_{N}}

. Right, dark curve: The distribution

ρ_{s_{N}}^{random}

for sums of N values of x randomly sampled from

ρ_{x} (x)

is a normalized version of the Irwin–Hall distribution [15,16], which can be obtained analytically through the successive self-convolution of

ρ_{x}

. The light curve is the Gaussian expected for

N \to \infty

. Note the different scales on the left and right columns.

Figure 1. Left, dark line: Numerical results for the distribution

ρ_{s_{N}}

of the sums

s_{N}

defined in Equation (3), in the case the Bernoulli (10) map with

m = 2

, for three small values of N. The light curve is the Gaussian expected for

N \to \infty

, and the dashed curve is a Gaussian with the same variance as predicted for

ρ_{s_{N}}

. Right, dark curve: The distribution

ρ_{s_{N}}^{random}

for sums of N values of x randomly sampled from

ρ_{x} (x)

is a normalized version of the Irwin–Hall distribution [15,16], which can be obtained analytically through the successive self-convolution of

ρ_{x}

. The light curve is the Gaussian expected for

N \to \infty

. Note the different scales on the left and right columns.

Figure 2. Main panel: The Kullback–Leibler divergences

D_{G}

,

D_{G_{N}}

, and

D_{random}

, defined in the text, as functions of the number of terms in the sums

s_{N}

of Equation (3), for the Bernoulli map (10) with

m = 2

. The straight lines in this log–log plot have a slope

- 2

. The inset shows, as dots, numerical results for the variance

σ_{s_{N}}^{2}

over the distribution

ρ_{s_{N}} (s_{N})

. The dashed line joins the analytical values predicted from Equation (12).

Figure 2. Main panel: The Kullback–Leibler divergences

D_{G}

,

D_{G_{N}}

, and

D_{random}

, defined in the text, as functions of the number of terms in the sums

s_{N}

of Equation (3), for the Bernoulli map (10) with

m = 2

. The straight lines in this log–log plot have a slope

- 2

. The inset shows, as dots, numerical results for the variance

σ_{s_{N}}^{2}

over the distribution

ρ_{s_{N}} (s_{N})

. The dashed line joins the analytical values predicted from Equation (12).

Figure 3. The Kullback–Leibler divergence

D_{G}

for the Bernoulli map (10) with various values of m, and

D_{random}

(which is the same for all m). The straight lines have slope

- 2

.

Figure 3. The Kullback–Leibler divergence

D_{G}

for the Bernoulli map (10) with various values of m, and

D_{random}

(which is the same for all m). The straight lines have slope

- 2

.

Figure 4. As in Figure 1 for the logistic map of Equation (16) in the regime of full chaos,

λ = 4

. The distributions

ρ_{s_{N}}^{random}

, dark lines on the right column, have now been obtained numerically. Note the different scales in different panels.

Figure 4. As in Figure 1 for the logistic map of Equation (16) in the regime of full chaos,

λ = 4

. The distributions

ρ_{s_{N}}^{random}

, dark lines on the right column, have now been obtained numerically. Note the different scales in different panels.

Figure 5. The Kullback–Leibler divergence

D_{G}

for the logistic map of Equation (16) in the regime of full chaos,

λ = 4

, and

D_{random}

, as a functions of N. In this case,

D_{G_{N}}

coincides with

D_{G}

. The full and dashed straight lines have slopes

- 2

and

- 1

, respectively.

Figure 5. The Kullback–Leibler divergence

D_{G}

for the logistic map of Equation (16) in the regime of full chaos,

λ = 4

, and

D_{random}

, as a functions of N. In this case,

D_{G_{N}}

coincides with

D_{G}

. The full and dashed straight lines have slopes

- 2

and

- 1

, respectively.

Figure 6. Left: 900 successive iterations of the logistic map, Equation (16), in the intermittent regime,

λ = 3.828

. The arrows at

t = 300

and 500 point at “turbulent” and period-3 “laminar” intervals, respectively. Right: The correlation

c_{k} = \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]}

as a function of k in the same intermittent regime, calculated numerically from sequences of

10^{7}

iterations of

x (t)

. Symbols are connected by lines to facilitate visualization.

Figure 6. Left: 900 successive iterations of the logistic map, Equation (16), in the intermittent regime,

λ = 3.828

. The arrows at

t = 300

and 500 point at “turbulent” and period-3 “laminar” intervals, respectively. Right: The correlation

c_{k} = \bar{[x (t) - \bar{x}] [x (t + k) - \bar{x}]}

as a function of k in the same intermittent regime, calculated numerically from sequences of

10^{7}

iterations of

x (t)

. Symbols are connected by lines to facilitate visualization.

Figure 7. The Kullback–Leibler divergences

D_{G}

for the logistic map (16) in the intermittent regime,

λ = 3.828

, and

D_{random}

, as functions of N. For the former, triangles correspond to values of N which are multiples of 3. The slope of the dashed straight line is

- 1

. Inset: Numerical results for the variance

σ_{s_{N}}^{2}

of the sums

s_{N}

, as a function of N. The arrow to the right indicates the variance obtained for large N. Symbols are connected by dashed lines to facilitate visualization.

Figure 7. The Kullback–Leibler divergences

D_{G}

for the logistic map (16) in the intermittent regime,

λ = 3.828

, and

D_{random}

, as functions of N. For the former, triangles correspond to values of N which are multiples of 3. The slope of the dashed straight line is

- 1

. Inset: Numerical results for the variance

σ_{s_{N}}^{2}

of the sums

s_{N}

, as a function of N. The arrow to the right indicates the variance obtained for large N. Symbols are connected by dashed lines to facilitate visualization.

Figure 8. As in Figure 4, for the sums of Equation (18) with

x (t)

obtained from map (20). Note that the scales are the same in all plots.

Figure 8. As in Figure 4, for the sums of Equation (18) with

x (t)

obtained from map (20). Note that the scales are the same in all plots.

Figure 9. The Kullback–Leibler divergences

D_{C}

and

D_{random}

for the distributions of the sums of Equation (18), with the values of x obtained from the map (20) and the distribution of Equation (22), respectively. The full straight line has slope

- 1

, and the dashed line, with slope

- 0.68

, is a linear fitting of

D_{C}

for

N \geq 2

.

Figure 9. The Kullback–Leibler divergences

D_{C}

and

D_{random}

for the distributions of the sums of Equation (18), with the values of x obtained from the map (20) and the distribution of Equation (22), respectively. The full straight line has slope

- 1

, and the dashed line, with slope

- 0.68

, is a linear fitting of

D_{C}

for

N \geq 2

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zanette, D.H.; Samengo, I. The Emergence of the Normal Distribution in Deterministic Chaotic Maps. Entropy 2024, 26, 51. https://doi.org/10.3390/e26010051

AMA Style

Zanette DH, Samengo I. The Emergence of the Normal Distribution in Deterministic Chaotic Maps. Entropy. 2024; 26(1):51. https://doi.org/10.3390/e26010051

Chicago/Turabian Style

Zanette, Damián H., and Inés Samengo. 2024. "The Emergence of the Normal Distribution in Deterministic Chaotic Maps" Entropy 26, no. 1: 51. https://doi.org/10.3390/e26010051

APA Style

Zanette, D. H., & Samengo, I. (2024). The Emergence of the Normal Distribution in Deterministic Chaotic Maps. Entropy, 26(1), 51. https://doi.org/10.3390/e26010051

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Emergence of the Normal Distribution in Deterministic Chaotic Maps

Abstract

1. Introduction

2. Central Limit Theorem for Deterministic Maps

3. The Bernoulli Map

4. The Logistic Map: Full Chaos and Intermittency

5. A Fat-Tailed Invariant Distribution

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Calculation of the Variances of N-Term Sums, Equation (6)

Appendix A.1. The Bernoulli Map

Appendix A.2. The Logistic Map in the Fully Chaotic Regime

Appendix B. Kullback–Leibler Divergence for the Distribution of Random Sampling Sums

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI