Next Article in Journal
Collective Sensitivity and Collective Accessibility of Non-Autonomous Discrete Dynamical Systems
Previous Article in Journal
Fractional COVID-19 Modeling and Analysis on Successive Optimal Control Policies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Benford’s Law and the Coefficients of the Riemann Mapping Function for the Exterior of the Mandelbrot Set

1
Department of Mathematics, University of Milan, 20133 Milan, Italy
2
Department of Mathematics, CUNY Graduate Center, New York, NY 10016, USA
3
Department of Mathematics, University of Notre Dame, South Bend, IN 46556, USA
4
Department of Mathematics, Harvey Mudd College, Claremont, CA 90701, USA
5
Department of Mathematics and Statistics, Williams College, Williamstown, MA 01267, USA
6
Department of Mathematics, University of Michigan, Ann Arbor, MI 48109, USA
*
Authors to whom correspondence should be addressed.
Fractal Fract. 2022, 6(10), 534; https://doi.org/10.3390/fractalfract6100534
Submission received: 3 July 2022 / Revised: 28 August 2022 / Accepted: 13 September 2022 / Published: 20 September 2022
(This article belongs to the Section Probability and Statistics)

Abstract

:
We investigate Benford’s law in relation to fractal geometry. Basic fractals, such as the Cantor set and Sierpinski triangle are obtained as the limit of iterative sets, and the unique measures of their components follow a geometric distribution, which is Benford in most bases. Building on this intuition, we aim to study this distribution in more complicated fractals. We examine the Laurent coefficients of a Riemann mapping and the Taylor coefficients of its reciprocal function from the exterior of the Mandelbrot set to the complement of the unit disk. These coefficients are 2-adic rational numbers, and through statistical testing, we demonstrate that the numerators and denominators are a good fit for Benford’s law. We offer additional conjectures and observations about these coefficients. In particular, we highlight certain arithmetic subsequences related to the coefficients’ denominators, provide an estimate for their slope, and describe efficient methods to compute them.

1. Introduction

The Mandelbrot set M was first introduced and drawn by Brooks and Matelski. By analyzing the family of functions f c ( z ) = z 2 + c , Douady and Hubbard began the formal mathematical study of the Mandelbrot set as the set of parameters c, for which the orbit of 0 under f c remains bounded. We study Benford’s law in relation to the Mandelbrot set to both investigate the distribution’s extension to fractal geometry and search for patterns in the Mandelbrot set.
In 1980, Douady and Hubbard were able to prove the connectedness of M by constructing a conformal isomorphism
Φ : C \ M C \ D ¯
between the complement of the Mandelbrot set and the complement of the closed unit disk [1]. Using the Douady-Hubbard map Φ , we can define related conformal isomorphisms,
Ψ : C \ D ¯ C \ M , Θ : D C \ M 1 ,
where M 1 = 1 / c : c M , by setting Ψ = Φ 1 and Θ ( c ) = 1 / Ψ ( 1 / c ) . One of the most heavily studied questions in complex dynamics is whether or not M is locally connected (MLC). By a theorem of Caratheodory [2], these two maps can be extended continuously to the unit circle if and only if the Mandelbrot set is locally connected. As such, we focus on studying these maps and their respective Laurent and Taylor expansions, as outlined in [3,4], respectively:
Ψ ( z ) = z + m = 0 b m z m ,
Θ ( z ) = z + m = 2 a m z m .
In Section 3, we outline the methods [3,4,5] we used to compute the a m and b m coefficients. The computation time grows exponentially, so methods of improving computation are explored. Using recursion, we were able to compute the first 7168 coefficients.
In Section 4, we discuss Benford’s law along with statistical testing to determine whether the coefficients obey a Benford distribution. Given a base b 2 , a data set is Benford base b if the probability of observing a given leading digit, d, is log b ( d + 1 ) / d (see [6,7]).(We can write any positive x as S b ( x ) b k b ( x ) , where S b ( x ) [ 1 , b ) is the significand and k b ( x ) is an integer. If the probability of observing a significand of s [ 1 , b ) is log b ( s ) , we say the set is strongly Benford (or frequently just Benford).) In most cases, a data set is demonstrated to be Benford through statistical testing. There are few straightforward proofs for Benfordness, all of which rely heavily on understanding the structure and properties of the data. Well understood sets such as geometric series and the Fibonacci numbers have explicit proofs for Benfordness, but the structure and properties of coefficients we study are still the subject of active research in complex dynamics. Therefore, we rely on statistical testing for our results. We consider the standard χ 2 distribution and the sequence of the data’s logarithms modulo 1 for our statistical testing, and a standard goodness of fit test demonstrates that the numerators and denominators are a good fit for Benford’s law, while the decimal representations are not.
Section 5 deals with conjecture, observations, and theorems related to the coefficients. Section 5.1 and Section 5.2 are meant to tie together the most important of these that we have found from various authors for the a m and b m coefficients, respectively. In Section 5.3, we present new results and conjectures on the a m and b m coefficients from our work. Theorem 7 [3,8] states that they are 2-adic rational numbers; in other words, they are of the form p / 2 ν , where p is an odd integer. The integer ν is, by definition, the 2-adic valuation ν 2 of a m or b m . Therefore, we focus on the denominator’s exponents ν ( a m ) , ν ( b m ) . Setting m = 2 n m 0 , with m 0 odd and n fixed; the subsequences { ν ( a m ) } , { ν ( b m ) } appear to be near-arithmetic progressions. We present the results observed in the following conjecture.
Conjecture 1.
Let m be written as 2 n m 0 as above, with n = n ¯ fixed. Then, the sequence { ν ( a m ) } n = n ¯ is asymptotically linear, with slope
2 / ( 2 n ¯ + 1 1 ) .
We also present an efficient way to compute the denominator’s exponents for the cases n = 0 , 1 , 2 .
Our work offers a new approach to some classical problems in complex dynamics, and we do this through our statistical testing. These coefficients have been studied extensively, so we compile relevant observations from disparate authors to use as a basis for offering new conjectures and results. The idea of studying Benford’s law in complex dynamics and fractal sets is also original; to our knowledge, no other authors have attempted to study Benford’s law in this setting. Our approach offers both a new and unique way to study important series in complex dynamics, and it provides motivation for number theorists and statisticians to study Benford’s law in the new field of data, namely fractal sets.

2. Preliminaries in Complex Dynamics

We give a brief introduction to complex dynamics. For more detailed proofs see [9,10].
Let f : C ^ C ^ be a rational map. The Julia set  J f associated to the map f may be defined as the closure of the set of repelling periodic points of f. For a rational map of degree 2 or higher, the Julia set J f is non-empty. We now restrict our attention to polynomial maps of degree d 2 , which have a superattracting fixed point at infinity. We can thus define the filled Julia set K f as the complement of the basin of the attraction of infinity:
K f = { z C : { f n ( z ) } n 0 is bounded } .
Making use of the above, it is possible to redefine J f as the boundary of the filled Julia set.
Lemma 1.
Let f be a polynomial of degree d 2 . The filled Julia set K f C is compact, with boundary K f = J f equal to the Julia set. The complement C ^ \ K f is connected and equal to the basin of attraction A ( ) of the point ∞.
It follows that the Julia set J f of a polynomial f is precisely the boundary of the basin of attraction A f ( ) .
We may now characterize the connectedness of J f . This is determined entirely by the activity of the critical points of f.
Theorem 1.
The Julia set J f for a polynomial of degree d 2 is connected if and only if the filled Julia set K f contains every critical point of f.

The Mandelbrot Set

We now focus primarily on the family of quadratic functions of the form { f c ( z ) = z 2 + c } c C . Since f c has a single critical point 0, it follows from Theorem 1 that J f c is connected if and only if the orbit { f c n ( 0 ) n N } is bounded. This motivates the following definition of the Mandelbrot set, M .
Definition 1.
M C is the set of all the parameters c C such that the Julia set J f c is connected. Equivalently, M is the set of all the c such that the orbit of 0 under f c remains bounded:
{ M : = c C : there exists R > 0 such that for all n , | f c n ( 0 ) | < R } .
Remark 1.
It is possible to generalize this definition and most of the following results to the family of unicritical degree d polynomials f c , d ( z ) = z d + c , where d is an integer d 2 . In this case, M d is called the multibrot set of degree d. For simplicity, we focus only on M = M 2 , which has historically been the object of greatest interest. For more information on M d , see [3,8].
It is possible to demonstrate that the interior of M is nonempty. We utilized an escape time algorithm and computer graphics to obtain the visualization of M in presented in Figure 1.
When the first computer images of M were generated, Benoit Mandelbrot observed small regions that appeared to be separate from the main cardioid, and conjectured that M was disconnected, which was later disproved.
Theorem 2
(Douady, Hubbard). The Mandelbrot set is connected.
This result was first proved by Douady and Hubbard [1] by explicitly constructing a conformal isomorphism Φ : C ^ \ M C ^ \ D ¯ . Douady and Hubbard’s proof is significant not only for the result, but also since it provides an explicit formula for the uniformization of the complement of the Mandelbrot set.
A large amount of research has been devoted to the local connectivity of the Mandelbrot set, which is generally regarded as one of the most important open problems in complex dynamics. We recall that a set A in a topological space X is locally connected at p A if for every open set V X containing p, there is an open subset U with p U V such that U X is connected. The set A is said to be locally connected if it is locally connected at p for all p A .
As above, let Φ : C ^ \ M C ^ \ D ¯ be the conformal isomorphism constructed by Douady and Hubbard; notice that the map Ψ : C ^ \ D ¯ C ^ \ M is the Riemann mapping function of C ^ \ M . We consider it the Laurent expansion at
Ψ ( z ) = z + m = 0 b m z m .
Another possibility is to consider Θ ( z ) : = 1 / Ψ ( 1 / z ) , which is the Riemann mapping of the bounded domain C \ { 1 / z : z M } . We have the corresponding Taylor expansion for Θ at the origin:
Θ ( z ) = z + m = 2 a m z m .
For the general Multibrot set M d , we refer to the coefficients with the notation b d , m and a d , m . To underline the importance of these maps, we reference a lemma from Caratheodory [2].
Theorem 3
(Caratheodory’s continuity lemma). Let G C ^ be a simply connected domain and a function f maps D onto G via a conformal isomorphism. Then f has a continuous extension to D ¯ if and only if the boundary of G is locally connected.
Therefore, if the map Ψ or the map f can be extended continuously to the unit circle D , then M is locally connected. To demonstrate this extension, it would be sufficient to prove that one of the two series converges uniformly on D .
There have been numerous attempts to prove this result. For example, Ewing and Schober demonstrated that the inequality 0 < | b m | < 1/m holds for every m < 240 , 000 . A bound of the type | b m | < K / m ( 1 + ϵ ) would lead to the desired result; however, this would imply that the extension of Ψ is Hölder continuous, which does not happen, as expressed in [11]. Proving that | b m | < K/(m log 2 (m)) would prove that the series converges absolutely; however, modern computations suggest that such a bound does not exist [11]. Therefore, the MLC conjecture and its consequences remain the object of study.
Remark 2
(Consequences of local connectedness). This paper tackles problems related to the behaviour of the coefficients { a m } and { b m } , with the ultimate aim of improving the state-of-art on the local connectedness of M . In this section, we highlight the importance of this property of M . In particular, it implies a conjecture known as the Density of Hyperbolicity. A hyperbolic component is an interior connected component of M , in which the sequences { f c n ( 0 ) } have an attracting periodic cycle of period p. The Density of Hyperbolicity conjecture states that these are the only interior regions of M . Another consequence of MLC is related to a topologically equivalent description of M . In particular, the boundary of the Mandelbrot set can be identified with the unit circle S 1 under a specific relation ∼, known as the abstract Mandelbrot set [12]. More information and other implications of MLC and the Density of Hyperbolicity may be found in [1,13].

3. Algorithms and Complexity

There are algorithms to compute both the coefficients a m and b m . While these algorithms work for a generic degree d 2 , we focus on d = 2 , which is historically the most interesting case, since it is the one associated with M . For simplicity, we denote b m = b 2 , m and a m = a 2 , m . The behavior for other values is similar. Derivations for the explicit form of the b m coefficients may be found in [3]. Derivations for the explicit form of the a m coefficients may be found in [4]. There is also a formula to switch between the coefficients, outlined in [14].
Theorem 4.
Let n N and 1 m d n + 1 3 . Then
b d , m = 1 m C j 1 m d n C j 2 m d n 1 d j 1 C j 3 m d n 2 d 2 j 1 d j 2 C j n m d d n 1 j 1 d j n 1 ,
where the sum is over all non-negative indices j 1 , , j n such that
d n 1 j 1 + d n 1 1 j 2 + d n 2 1 j 3 + + d 1 j n = m + 1
and C j a is the generalized binomial coefficient
C j a = a a 1 a j + 1 j j 1 2 · 1 .
The a m coefficients can be obtained from the b m using the formula
a d , m = b d , m 2 j = 2 m 1 a d , j b d , m 1 j ,
or they can be directly calculated as in the following theorem.
Theorem 5.
Let n N and 2 m d n + 1 1 . Then
a d , m = 1 m C j 1 m d n C j 2 m d n 1 d j 1 C j 3 m d n 2 d 2 j 1 d j 2 C j n m d d n 1 j 1 d j n 1 ,
where the sum is over all non-negative indices j 1 , , j n such that
( d n 1 ) j 1 + ( d n 1 1 ) j 2 + ( d n 2 1 ) j 3 + + ( d 1 ) j n = m 1
and C j ( a ) is the generalized binomial coefficient
C j ( a ) = a ( a 1 ) ( a j + 1 ) j ( j 1 ) 2 · 1 .
While the above theorems give the explicit forms of the coefficients, the following theorem provides a recursive method to find b m , which is more suitable for computers. Once we find b m , we can apply the relationship between a m and b m outlined in Theorem 4 to find a m . More details can be found in [5].
Theorem 6.
Let n N and d = 2 . Then
b m = β 0 , m + 1
where the following holds true.
β j , 1 = 0 , j 1 ; β n , 0 = 1 , n N
β n , m = 0 , n 1 , 1 m 2 n + 1 2
β n 1 , m = 1 2 β n , m k = 2 n 1 m 2 n + 1 β n 1 , k β n 1 , m k β 0 , m 2 n + 1
Example 1.
For example, to calculate the first several b m coefficients, we can use Theorem 6 to obtain:
b 0 = β 0 , 1 = 1 2 β 1 , 1 β 0 , 0 = 1 2 0 1 = 1 2
b 1 = β 0 , 2 = 1 2 β 1 , 2 β 0 , 1 2 β 0 , 1 = 1 2 0 1 / 4 + 1 / 2 = 1 8
b 2 = β 0 , 3 = 1 2 β 1 , 3 2 ( β 0 , 1 β 0 , 2 ) β 0 , 2 = 1 2 β 1 , 3 + 1 / 8 1 / 8 = 1 4
since
β 1 , 3 = 1 2 β 2 , 3 β 0 , 0 = 1 2 0 1 = 1 2

3.1. Direct Computation

Our initial algorithm to generate these coefficients was to directly compute them, and we include our original methodology for reference as we cross checked our results. Most of the data was generated through the recursive algorithm, and details are provided in Section 3.2.
We wrote a program in Python to compute b 2 , m and a 2 , m based on the formulas given in Theorems 4 and 5, and we obtained the first 1024 exact values of both coefficients’ sequences.
Our methodology for computing the mth coefficient was to first generate the solutions j 1 , , j n to the Diophantine Equations (4) and (6). Then, we plug them into the exact formula of a m = a 2 , m and b m = b 2 , m to find the m th coefficient.
We improved the method to solve the Diophantine equations by first setting an upper limit on the degree for which to obtain coefficients, generated the solutions for the highest order coefficient, and created data structures for dynamic storage. We stored each individual solution as a tuple of length n, where the k th entry denoted the value of the coefficient j k . Every solution for the upper bound was then given a reference in a linked list, which we can use to find the highest order coefficient. The solution stored for the upper bound can then be modified through decrementing the value for j n for each tuple and then deleting the reference to the tuple and deallocating the memory in the linked list when the value for j n reaches zero.
To deal with the time sink in generating the binomial coefficients, we utilized multi-core parallel computing. Each coefficient can be computed independently after we obtain the solutions to the Diophantine equations. We structure our code for concurrent computation and use generator expressions so that we can use multiple cores where our code is executing simultaneously. In a multi-core setting, each core deals with one coefficient at a time. When one coefficient calculation is conducted, the core takes the next awaiting task that is not being taken by other cores. We chose a high-performance server machine and ran our code in a parallel environment with 72 valid cores. We obtained the first 1024 coefficients with a CPU time of 166 hours and a total run time of 7 days.

3.2. Recursive Computation

The direct computation runs in exponential time, and it is generally impractical for generating large degree coefficients. Therefore, we switched to a recursive method to generate these coefficients.
The method is described in [5,11,14], and we outline the formula for the computation in Theorem 6. This method is more efficient because it is able to reuse information from the previous coefficients to compute the next one.
We wrote a Python program to implement the recursion to find b m and then use Equation (5) to find the corresponding a m . We were able to obtain 10,240 coefficients for both series within 82 hours with a single core. We are also able to cross-check our computation results with the direct computation method before starting the statistical analysis. All codes and results can be found at https://github.com/DannyStoll1/polymath-fractal-geometry (accessed on 2 July 2022). Detailed instructions can be found in the README file.

4. Benford’s Law

Frank Benford’s 1938 paper, The Law of Anomalous Numbers [6], illustrated a profound result, in which the first digits of numbers in a given data set are not uniformly distributed in general. Benford applied statistical analysis to a variety of well-behaved but uncorrelated data sets, such as the areas of rivers, financial returns, and lists of physical constants; in an overwhelming amount of the data, 1 appeared as the leading digit around 30% of the time, and each higher digit was successively less likely [6,7]. He then outlined the derivation of a statistical distribution which maintained that the probability of observing a leading digit, d, for a given base, b, is log b ( d + 1 ) / d for such data sets [6,7]. This logarithmic relation is referred to as Benford’s Law, and its resultant probability measure for base 10 is outlined in Figure 2. Benford’s Law has been the subject of intensive research over the past several decades, arising in numerous fields; see [7] for an introduction to the general theory and numerous applications.
Benford’s law appears throughout purely mathematical constructions as well as geometric series, recurrence relations, and geometric Brownian motion. Its ubiquity makes it one of the most interesting objects in modern mathematics, as it arises in many disciplines. Therefore, it is worthwhile to consider non-traditional data sets where the law may appear, such as fractal sets. Basic fractals such as the Cantor set and Sierpinski triangle are obtained as the limiting iterations of sets, and as a result, their unique component measures follow a geometric distribution, which is Benford in most bases. Building on these results, it is plausible that more complicated fractals obey this distribution as well. We studied the Riemann mapping of the exterior of the Mandelbrot set to the complement of the unit disk, along with its reciprocal. These mappings are given by a Taylor and Laurent series, respectively, and we studied their coefficients to determine their fit for Benford’s law. These coefficients are of interest as their asymptotic convergence is intimately related to the conjectured local connectivity of the Mandelbrot set, which is an important open problem in complex dynamics.

4.1. Statistical Testing for Benford’S Law

A common statistical methodology for evaluating whether a data set is distributed according to Benford’s law is to utilize the standard χ 2 goodness of fit test. As we are investigating Benford’s law in base 10, we utilize 8 degrees of freedom for our χ 2 testing.(There are nine possible first digits, but once we know the proportion that is digits 1 through 8 the percentage that starts with a 9 is forced, and thus we lose one degree of freedom.) If there are N observations, letting p d = log 10 ( d + 1 ) / d be the Benford probability of having a first digit of d, we expect p d N values to have a first digit of d. If we let O d be the observed number with a first digit of d, the χ 2 value is
χ 2 : = d = 1 9 ( O d p d N ) 2 p d N .
If the data are Benford, with 8 degrees of freedom, then 95% of the time the χ 2 value will be at most 15.5073; this corresponds to a significance level of α = 0.05 . We perform multiple testing by creating a distribtuion of χ 2 values as a function of the sample size by calculating the χ 2 values of the first m values in the dataset. This is standard practice for studying Benford sequences, and we conduct this account for periodicity in the χ 2 values, which is typical for certain Benford data sets, such as the integer powers of 2. To account for this multiplicity, we also incorporate the standard Bonferroni correction so that the overall testing is conducted at the same level of significance while giving equal weight in terms of significance to each individual test. We do this by making each test be completed at a significance level of α / m . In particular, we perform 10,045 tests for the a m and 10,046 tests for the b m coefficients as we compute the χ 2 statistic each time we add a new non-zero coefficient to our data set. This brings our final corrected threshold value to 38.9706 and 38.9708, respectively. This corresponds to α = 0.05/10,045 = 4.978 × 10 6 for each individual hypothesis in the a m dataset and α = 0.05/10,046 = 4.977 × 10 6 for each individual hypothesis in the b m dataset.
Each data point is not independent, as the χ 2 values are computed by using a running total of the data, and as such, each point is built on the previous one. This results in a high correlation between the data, and Bonferroni correction likely overcompensates for the increase in type I error in our multiple testing. Still, it is one of the most plausible methods of dealing with the increase in multiplicity, since it is one of the simplest and most conservative estimates, and a value above the Bonferroni correction provides strong evidence that the data are not Benford. Using independent increments to compute each χ 2 statistic for Benford’s law would fix the issue of independence, but is not recommended since periodic behavior can be missed if the increments are chosen poorly relative to the frequency in the data, and this has the potential to lead to erroneous conclusions.
Since the data set is built iteratively, we also provide a curve to represent the Bonferroni correction for m tests. The rationale is to keep the significance of the overall test constant with respect to the number of tests performed. As we increase the total number of tests performed, we wish to increase our correction accordingly. As such, it does not make sense to consider every value on our plot of χ 2 values with the same threshold, and this curve is provided as a reference for the tests performed on subsets of our data.
We considered the distribution of the χ 2 values to account for random fluctuations and periodic effects. The distribution’s limiting behavior provides our benchmark for the fit to Benford’s law. In addition, we provide the provide the p-value of our computed χ 2 statistic to provide the type I error rate for our conclusions, and we compute the powers of the χ 2 tests relative to our null hypothesis that the data are Benford by using the noncentral chi-squared distribution [15]. We also conducted simulations to estimate the sampling error relative to our null hypothesis. We wish to see how likely it is that a random sample falls in the rejection region for our testing; based on our significance level of α = 0.05 , we expect the sampling error to be roughly 5% if the data are Benford. We randomly sample 1000 coefficients from our data sets and calculate the χ 2 value for this sample data. We repeat this simulation 1000 times, and take the ratio of χ 2 values in the rejection region to the total number of sample statistics calculated, to estimate the sampling error.
An equivalent test is to consider the distribution of the base 10 logarithms for the absolute value of the data set modulo 1; as a necessary and sufficient condition for a Benford distribution, this sequence converges to the uniform distribution [16]. As a measure of quantifying the uniformity of this distribution, we may again consider the standard χ 2 test. We only perform a single test for the total data set, so we do not need to account for multiplicity. Specifically, we split the interval [ 0 , 1 ] into 10 equal bins. If the data are uniform, we expect that each bin obtains 1 / 10 of the total data. Therefore, for each bin in the a m data we expect a value of 10 , 045 / 10 = 1004.5 and for each bin in the b m data we expect a value of 10 , 046 / 10 = 1004.6 . Since there are 10 possible observations, we have 9 degrees of freedom for the data. If the percentage of the data in the first nine bins is known, then the percentage of data in the last bin is forced, and we lose one degree of freedom. We again take α = 0.05 and for nine degrees of freedom, this corresponds to a χ 2 value of 16.919. These χ 2 results are generated by cells 16, 17, and 18 by the Jupyter notebooks amLogData.ipynb and bmLogData.ipynb, respectively, which may be found under the Data Analysis folder. We also provide the associated p-values and powers of the χ 2 statistic relative to the null hypothesis that this data are uniform.
The coefficients we studied are 2-adic rational numbers, so we considered the distributions of the numerators, denominators, and decimal expansions separately. We considered only the non-zero coefficients, since zero is not defined for our probability measure, and certain theorems and conjectures outlined by Shiamuchi in [4] already describe the distribution of the zeroes in the coefficients. Our goal is to identify which components of these coefficients are a good fit for Benford’s law through statistical testing, and it is hoped that this will provide motivation for a formal proof.
Table 1 provides examples of the coefficients computed. When the coefficient is 0, numerators are set to 0 and denominators to - for readability. We then use them to compute the exact values in decimal expansion for a m and b m .

4.2. Benfordness of the Taylor and Laurent Coefficients

We examine the distribution of the first digits of the a m and b m coefficients. As mentioned earlier, we restrict our discussion to the non-zero coefficients. We conduct the χ 2 test and distribution of the base 10 logarithms modulo 1 to evaluate the data. The plots of the χ 2 values are shown in Figure 3, Figure 4 and Figure 5; these figures were generated by cells 16, 17, and 18 in the Jupyter notebooks amChiSquareData.ipynb and bmChiSquareData.ipynb respectively, which may be found under the Data Analysis folder. The plots of the logarithms modulo 1 are provided in Figure 6, Figure 7 and Figure 8; these figures were generated by cells 10,11, and 12 in the Jupyter notebooks amLogData.ipynb and bmLogData.ipynb, respectively, which may be found under the Data Analysis folder. Simulation results can be reproduced using a sampling_simulation.ipynb notebook in the same folder.
On the plots of the χ 2 values, the numerators stay below the uncorrected threshold for statistical significance. The a m coefficients seem to settle towards the end of the distribution while the b m coefficients end during a period of relative inflation with many fluctuations. Bearing this in mind, the results for the b m numerators will be slightly worse than the a m numerators due to the behavior of the distribution at the point where we finished collecting our data. The final χ 2 values for the a m and b m numerators are 5.968 and 12.785, respectively. The type I error rates (p-values) for the final χ 2 values are 0.651 and 0.119, both of which are greater than our critical values of α = 0.05 / 10045 for the individual hypotheses and α = 0.05 for the overall test. The sampling errors we obtain from our simulations for a m and b m are 0.043 and 0.058. The powers relative to the null hypothesis for a m and b m are 0.355 and 0.718, respectively. As a result, there is not sufficient evidence to reject our null hypothesis that the data are Benford.
The denominators stay well below the original threshold. This is expected, as they consist of a random sampling of a geometric series, which is known to be Benford in most bases [16]. The final χ 2 values for the a m and b m denominators are 6.148 and 3.093, respectively. The type I error rates (p-values) for the final χ 2 values are 0.631 and 0.928, both of which are greater than our critical values of α = 0.05 / 10 , 045 for the individual hypotheses and α = 0.05 for the overall test. The sampling errors we obtain from our simulations for a m and b m are 0.047 and 0.028. The powers relative to the null hypothesis for a m and b m are 0.366 and 0.187, respectively. As a result, there is not sufficient evidence to reject our null hypothesis that the data are Benford based on this testing.
The χ 2 values for the decimal expansions are the most interesting. The χ 2 values for the a m coefficients seem to increase in general, but there are erratic periods where the values fall. The sequence ends above the final corrected threshold for significance, and it is plausible that the χ 2 values will continue to increase more quickly than the correction allows with extra data. The b m meanwhile surpasses the corrected threshold, which is strong evidence that they are not Benford; the values in the data do decrease towards the end of the distribution, but it is still well above the threshold for significance. The final χ 2 values for the a m and b m data are 58.054 and 51.934, respectively. The type I error rates (p-values) for the final χ 2 values are 1.121 × 10 9 and 1.733 × 10 8 , both of which are less than our critical values of α = 0.05 / 10 , 045 for the individual hypotheses and α = 0.05 for the overall test. The sampling errors we obtain from our simulations for a m and b m are 0.235 and 0.243. The powers relative to the null hypothesis for a m and b m are 0.999992 and 0.999956, respectively. As a result, there is sufficient evidence to reject our null hypothesis that the data are Benford based on this testing.
As for the distribution of the base 10 logarithms modulo 1, the numerators have a skew towards zero in the distribution of their logarithms. There is a pattern in the a m coefficients that explains their behavior, we have observed that when m = 2 n , a m = 1 / m . This result seems to generalize for a d , m , such that when m = d n , a d , m = 1/m, which can be observed in the tables provided by Shiamuchi in [4], and we have not found a counterexample in our computations. There is regularity in the b m numerators as discussed by Bielefeld, Fisher, and von Haeseler in [11], and it is likely that similar regularities are present in the a m coefficients as well. It seems that these patterns become less frequent as the sample size increases. The extra regularity in the a m coefficients also helps explain why their distribution is more skewed towards 0 relative to the one provided for the b m coefficients as well. Still, for a finite number of iterations, we do not expect perfect agreement, and convergence is likely with larger samples. The χ 2 values for the a m and b m data are 8.482 and 10.203, respectively. These correspond to p-values of 0.486 and 0.334; the powers relative to the null hypothesis are 0.482 and 0.574. There is not sufficient evidence to reject the null hyopthesis that the data are uniform. In turn, this provides further evidence that the data are Benford.
The denominators consist of a sampling of integer powers of 2. Since log 10 ( 2 ) is irrational, the sequence x n = 2 n is Benford in base 10, and log 10 ( 2 n ) (mod 1) converges to the uniform distribution [16]. Since the denominators span many orders of magnitude, it is expected that they will similarly converge in distribution. The χ 2 values for the a m and b m data are 6.334 and 4.416, respectively. These correspond to p-values of 0.706 and 0.882; the powers relative to the null hypothesis are 0.358 and 0.248. There is not sufficient evidence to reject the null hyopthesis that the data are uniform. In turn, this provides further evidence that the data are Benford.
The χ 2 values for the a m and b m decimals are 64.261 and 60.757, respectively. These correspond to p-values of 2.008 × 10 10 and 9.580 × 10 10 ; the powers relative to the null hypothesis are 0.99998 and 0.99994. There is sufficient evidence to reject the null hyopthesis that the data are uniform. This provides further evidence that the data are not Benford. It is worth noting that the a m distribution is skewed towards the second half of the interval, while the b m distribution is skewed towards the first half of the interval. This asymmetry may be related to how the series represent coefficients of reciprocal functions and how they may be computed from each other.
We may also investigate how many orders of magnitude a data set spans on average by computing the arithmetic mean and standard deviation of log 10 | x n | to tell where the data set is centered and how spread out it is. It is typical, but not necessary, for a data set to be Benford if it spans many orders of magnitude (see Chapter 2 of [7,17] for an analysis that a sufficiently large spread is not enough to ensure Benfordness). Our findings for the Taylor and Laurent coefficients are summarized in Table 2. The data are generated by cell 8 in the Jupyter notebooks amLogData.ipynb and bmLogData.ipynb, respectively, which may be found under the Data Analysis folder.
The numerators and denominators span many orders of magnitude, but the decimal expansions cluster. The mean for the decimal expansions being negative indicates that the denominators are larger than the numerators, on average. The ratio between the growth rates of the numerators and denominators likely has some form of regularity as well to account for the small standard deviation, but more analysis would be needed to determine the exact relationship. These observations are consistent with the previously discussed conjecture that 0 < | b m | < 1 / m for all m, and it is plausible a similar relation holds for the a m coefficients as well. Ultimately, this provides insight into the growth of the coefficients and the shape of the data, and this motivates our conclusion that the numerators and denominators appear to converge to a Benford distribution, while there is no strong evidence for convergence in the decimal expansions.

5. On the Taylor and Laurent Coefficients

This Section deals with observations and theorems on the coefficients from various authors. Our goal is to compile and highlight important results from disparate sources. We link to the papers where the original observations may be found and their proofs when applicable. Section 5.1 deals specifically with observations related to the a m coefficients, Section 5.2 deals with the b m coefficients, and Section 5.3 deals specifically with new observations we make.
Theorem 3 highlights the relevance of the study of the Riemann mappings Ψ and Φ . Much effort has been put into the understanding of the behavior of both series. We now refer to a few important results and introduce new conjectures on the behavior of the coefficients. One of the most important theorems relating to both sets of coefficients is the following.
Theorem 7.
The a d , m and b d , m coefficients are d-adic rational numbers. In other words, a d , m and b d , m are of the form
p d v ,
where p is an integer indivisible by d. The integer ν is, by definition, the d-adic valuation ν d of a d , m or b d , m .
This is a combination of Theorems outlined by Shiamuchi in [3,8], and it provided our motivation for studying the numerators, denominators, and decimal expansions, separately. We consider only the case d = 2 . The majority of the following results hold also for a general integer d 2 , under simple modifications.

5.1. Results for the Taylor Coefficients

Remark 3.
For any integers k and ν satisfying k 1 and 2 ν k + 1 , let m = ( 2 k + 1 ) 2 ν . Then a 2 , m = 0 .
It is unknown whether the converse is true. The proof may be found in [18]. The authors have reported that their computation of 1000 terms of a 2 , m has not produced a zero-coefficient besides those indicated by the theorem, which is consistent with our observations. The result may also be expressed as the following corollary:
Corollary 1.
Let m = m 0 2 n with n 0 , and m 0 odd. If 3 m 0 2 n + 1 , then a 2 , m = 0 .
Making use of the 2-adic valuation, it is possible to obtain the following theorem, as outlined in [4].
Theorem 8.
We have ν 2 ( a m ) ν 2 ( ( 2 m 2 ) ! ) for all m, with equality attained exactly when m is odd.
In the following, since our interest is only for the 2-adic valuation, we will make use of the notation ν ( x ) : = ν 2 ( x ) , and we immediately obtain the following remark.
Remark 4.
In the case that m is odd, we may obtain an efficient algorithm to compute ν ( a m ) through ν ( ( 2 m 2 ) ! ) . Following immediately from Theorem 8 by the properties of the d-adic evaluation outlined in [3], we have,
ν ( ( 2 m 2 ) ! ) = l = 1 2 m 2 2 l = l = 0 m 1 2 l .
Therefore, if we set a value, N, the denominator’s exponent for every odd number m < 2 N is given by
ν ( a m ) = ν ( ( 2 m 2 ) ! ) = l = 0 N m 1 2 l .
We may also summarize Theorem 3.1 and Corollary 3.5 from [8] for the case that d = 2 .
Theorem 9.
Given m N \ { 1 } , we have that ν ( a m ) x ( m ) , where
x ( m ) = ν ( ( m 1 ) ! ) + m 1 .
Under the same assumptions, the result is also true with
x ( m ) = ν ( m 1 ) + m 1 .

5.2. Results for the Laurent Coefficients

Similar results hold for the b m coefficients. We use the notation m = 2 n m 0 , where m 0 is odd. The first result was presented in [19] in 1988.
Remark 5.
If n 2 and m 0 2 n + 1 5 , then b m = 0 .
It is still unknown whether the converse of this theorem is true. In [5], the only coefficients that have been observed to be zero are those mentioned in this theorem. The following result, from [11], underlines that a result similar to the one for the a m holds.
Theorem 10.
We have ν ( b m ) ν ( ( 2 m + 2 ) ! ) for all m, and equality is attained exactly when m is odd.
Don Zagier has made several observations and conjectures about the exponents of b m . We shall later extend them to the a m coefficients. The original conjectures are outlined in [11].
Conjecture 1. 
For the b m coefficients, we have the following.
  • If n = 0 , then ν ( b m ) = ν ( ( 2 m + 2 ) ! ) .
  • If n = 1 , then ν ( b m ) = ν ( ( ( 2 m + 2 ) / 3 ) ! ) + ϵ ( m 0 ) , where ϵ ( m 0 ) = 0 if m 0 11 mod 12 and 1 otherwise.
  • If n = 2 , then ν ( b m ) = ν ( ( ( 2 m 25 ) / 7 ) ! ) + ϵ ( m 0 ) , where ϵ ( m 0 ) moves with periodicity of 28, as expressed in [11].

5.3. New Remarks on the Coefficients

Remark 6.
Theorem 10 and the results on ν ( x ! ) reported in Remark 4 imply that the denominator’s coefficients for b m have regularity in terms of consecutive differences for odd m. Zagier’s conjectures posit similar patterns for even m. This gives a constant discrete derivative in the b m denominator’s exponents for m 2 mod 4 ( n = 1 ) , m 4 mod 8 ( n = 2 ) , m 8 mod 16 ( n = 3 ) ; m 2 N mod 2 N + 1 ( n = N ) . Note that for each of these subsequences, there are some starting values of 0. These correspond exactly with the values predicted by Corollaries 1 and 5, respectively; when a m = 0 , the algorithm gives 0 for the denominator’s exponent. In general, for each n there is a partial periodicity with period 2 ( 2 n + 1 1 ) in m 0 , and equivalently, 2 n + 1 ( 2 n + 1 1 ) in m.
Another direction is to calculate the slope of each of the subsequences, since they seem to grow linearly. Our observations make use of the previous remark and have led to the following conjecture.
Conjecture 2. 
Conjecture 1 Let m be written as 2 n m 0 as before, with n = n ¯ fixed. Then, the sequence { ν ( a m ) } n = n ¯ is asymptotically linear, with slope 2 / ( 2 n ¯ + 1 1 ) .
The numerators tend to follow similar behavior as the denominators. In particular, the modulus of the numerators in the subsequences tend to organize as follows: { a m } n = 0 > { a m } n = 1 > ⋯ { a m } n = N . The possibility of bounding the numerators by making use of its associated denominator is a subject of further study.
We now extend Conjecture Section 5.2 to the a m coefficients, as follows:
Conjecture 3. 
For the a m coefficients we have:
  • If n = 0 , ν ( a m ) = ν ( ( 2 m 2 ) ! ) , and
  • If n = 1 , ν ( a m ) = ν ( ( ( 2 m 2 ) / 3 ) ! ) + ϵ ( m 0 ) , where ϵ ( m 0 ) = 1 if m 0 ¬ 3 mod 12. Otherwise, it follows the pattern in Table 3.
This suggests a partial periodicity with period 2 4 ( 2 n + 1 1 ) in m 0 , or of 2 n + 4 ( 2 n + 1 1 ) in m. As before, it is possible to write m 0 as 2 ( 2 n + 1 1 ) k + l , but it is more difficult to identify a general pattern in this case.

6. Future Work

The most natural extension of our work would be to generate more coefficients, which would allow more thorough statistical testing. Given more data, we could look at the coefficients over a certain zoom or average the coefficients over certain subsets. It would be particularly interesting if certain subsets of the coefficients also converge to the Benford distribution. We may also look at the powers of the denominators to observe whether they follow a Benford distribution.
The algorithms for computing the coefficients of the Mandelbrot can also be easily generalized to obtain other abstract Multibrot sets, which could be analyzed using the same methods. We could look at the data in different bases to observe whether Benfordness holds there. It would be interesting to se the numerators and decimal expansions of the coefficients for the Multibrot set of degree d follow a Benford distribution in base d while it is trivial to observe that the denominators will not.
The results of Section 5 also present interesting extensions for future work. In particular, Remark 6 suggests that dividing the coefficients into subsequences to be bounded separately may be the best approach to study the convergence of the Laurent series of the coefficients. This approach, which has not been followed in the past, to the best of our knowledge, could provide valuable results in the study of the local connectedness of M .

Author Contributions

This paper is a result of a collaborative effort of all the authors on all aspects of the work, with J.D. on visualization and formal analysis, W.F. on data curation and software, and F.B. on formal analysis and conceptualization. The research was performed as part of the 2021 Polymath Jr program, advised by T.C.M., S.J.M. and D.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by NSF Grant DMS2113535.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The codes and data can be found here https://github.com/DannyStoll1/polymath-fractal-geometry (accessed on 2 July 2022).

Acknowledgments

We thank our colleagues from the 2021 Polymath Jr REU program for helpful comments on earlier drafts.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Douady, A.; Hubbard, J. Exploring the Mandelbrot set. Publ. Math. Orsay 1985. Available online: https://pi.math.cornell.edu/~hubbard/OrsayEnglish.pdf (accessed on 2 July 2022).
  2. Luo, J.; Yao, X. To Generalize Carathéodory’s Continuity Theorem. arXiv 2020, arXiv:2002.08644. [Google Scholar]
  3. Shiamuchi, H. A remark on Zagier’s observation of the Mandelbrot set. Osaka J. Math. 2015, 52, 737–747. [Google Scholar]
  4. Shiamuchi, H. On the coefficients of the Riemann mapping function for the exterior of the Mandelbrot set. RIMS KôKyûRoku Bessatsu 2012, 1807, 21–30. [Google Scholar]
  5. Ewing, J.H.; Schober, G. The area of the Mandelbrot set. Numer. Math. 1992, 61, 59–72. [Google Scholar] [CrossRef]
  6. Benford, F. The Law of Anomalous Numbers. Proc. Am. Philos. Soc. 1938, 78, 551–572. [Google Scholar]
  7. Miller, S.J. Theory and Applications of Benford’s Lawn; Princeton University Press: Princeton, NJ, USA, 2015. [Google Scholar]
  8. Shiamuchi, H. Taylor Coefficients of the Conformal Map for the Exterior of the Reciprocal of the Multibrot Set. Interdiscip. Inf. Sci. 2019, 25, 33–44. [Google Scholar] [CrossRef]
  9. Milnor, J. Dynamics in One Complex Variable, 3th ed; Princeton University Press: Princeton, NJ, USA, 2011. [Google Scholar]
  10. Stoll, D. A Brief Introduction to Complex Dynamics; University of Chicago: Chicago, IL, USA, 2015; pp. 1–23. [Google Scholar]
  11. Bielefeld, B.; Fisher, Y.; von Haeseler, Y. Computing the Laurent Series of the Map Ψ: C\D¯ → C\M. Adv. Appl. Math. 1993, 14, 25–38. [Google Scholar] [CrossRef]
  12. Bandt, C.; Keller, K. Symbolic dynamics for angle-doubling on the circle. II. Symbolic description of the abstract Mandelbrot set. Nonlinearity 1993, 6, 377–392. [Google Scholar] [CrossRef]
  13. Benini, A.M. A survey on MLC, Rigidity and related topics. arXiv 2018, arXiv:1709.09869. [Google Scholar]
  14. Shiamuchi, H. On the coefficients of the Riemann mapping function for the complement of the Mandelbrot set. RIMS Kôkyûroku Bessatsu 2011, 1772, 109–113. [Google Scholar]
  15. Guenther, W.C. Power and Sample Size for Approximate Chi-Square Tests. Am. Stat. 1977, 31, 83–85. [Google Scholar]
  16. Miller, S.J.; Takloo-Bighash, R. An Invitation to Modern Number Theory; Princeton University Press: Princeton: NJ, USA, 2006. [Google Scholar]
  17. Berger, A.; Hill, T.P. Benford’s law strikes back: No simple explanation in sight for mathematical gem. Math. Intell. 2011, 33, 21–30. [Google Scholar] [CrossRef]
  18. Ewing, J.H.; Schober, G. Coefficients associated with the reciprocal of the Mandelbrot Set. J. Math. Anal. Appl. 1992, 1, 104–114. [Google Scholar] [CrossRef]
  19. Levin, G.M. On the arithmetic properties of a certain sequence of polynomials. Commun. Mosc. Math. Soc. 1988, 43, 245–246. [Google Scholar] [CrossRef]
Figure 1. The Mandelbrot set M in the complex plane.
Figure 1. The Mandelbrot set M in the complex plane.
Fractalfract 06 00534 g001
Figure 2. Probability Measure for Benford’s Law in Base 10.
Figure 2. Probability Measure for Benford’s Law in Base 10.
Fractalfract 06 00534 g002
Figure 3. χ 2 distribution for the a m and b m numerators. Legend Guide: FBC-Final Bonferroni Correction, BCC-Bonferroni Correction Curve.
Figure 3. χ 2 distribution for the a m and b m numerators. Legend Guide: FBC-Final Bonferroni Correction, BCC-Bonferroni Correction Curve.
Fractalfract 06 00534 g003
Figure 4. χ 2 distribution for the a m and b m denominators. Legend Guide: FBC-Final Bonferroni Correction, BCC-Bonferroni Correction Curve.
Figure 4. χ 2 distribution for the a m and b m denominators. Legend Guide: FBC-Final Bonferroni Correction, BCC-Bonferroni Correction Curve.
Fractalfract 06 00534 g004
Figure 5. χ 2 distribution for the a m and b m decimal expansions. Legend Guide: FBC-Final Bonferroni Correction, BCC-Bonferroni Correction Curve.
Figure 5. χ 2 distribution for the a m and b m decimal expansions. Legend Guide: FBC-Final Bonferroni Correction, BCC-Bonferroni Correction Curve.
Fractalfract 06 00534 g005
Figure 6. a m and b m numerators.
Figure 6. a m and b m numerators.
Fractalfract 06 00534 g006
Figure 7. a m and b m denominators.
Figure 7. a m and b m denominators.
Fractalfract 06 00534 g007
Figure 8. a m and b m decimal expansions.
Figure 8. a m and b m decimal expansions.
Fractalfract 06 00534 g008
Table 1. The table of first 11 coefficients computed.(codes/coeff_compute.py).
Table 1. The table of first 11 coefficients computed.(codes/coeff_compute.py).
m a m Num a m Denom b m Num b m Denom
00-−12
10-18
212−14
31815128
4140-
515128−471024
60-−116
781102498732,768
8180-
9149932,768−3673262,144
10132132
Table 2. Parameters for the log 10 distribution of the data sets.
Table 2. Parameters for the log 10 distribution of the data sets.
Data Set μ σ
a m numerators1899.0331793.427
a m denominators1903.5451793.680
a m decimals−4.5130.679
b m numerators1899.2841793.752
b m denominators1904.1321793.994
b m decimals−4.8480.689
Table 3. The distribution of ϵ ( m 0 ) when m 0 12 3 has periodicity 16 × 12 = 192 . From m 0 = 195 , it repeats itself, and will have the same ϵ ( m 0 ) of m 0 = 3 and following.
Table 3. The distribution of ϵ ( m 0 ) when m 0 12 3 has periodicity 16 × 12 = 192 . From m 0 = 195 , it repeats itself, and will have the same ϵ ( m 0 ) of m 0 = 3 and following.
m 0 (mod 192)
ϵ ( m 0 )
3−2
15−2
27−4
39−3
51−2
63−2
75−2
87−3
99−4
111−2
123−2
135−6
147−3
159−2
171−2
183−3
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Beretta, F.; Dimino, J.; Fang, W.; Martinez, T.C.; Miller, S.J.; Stoll, D. On Benford’s Law and the Coefficients of the Riemann Mapping Function for the Exterior of the Mandelbrot Set. Fractal Fract. 2022, 6, 534. https://doi.org/10.3390/fractalfract6100534

AMA Style

Beretta F, Dimino J, Fang W, Martinez TC, Miller SJ, Stoll D. On Benford’s Law and the Coefficients of the Riemann Mapping Function for the Exterior of the Mandelbrot Set. Fractal and Fractional. 2022; 6(10):534. https://doi.org/10.3390/fractalfract6100534

Chicago/Turabian Style

Beretta, Filippo, Jesse Dimino, Weike Fang, Thomas C. Martinez, Steven J. Miller, and Daniel Stoll. 2022. "On Benford’s Law and the Coefficients of the Riemann Mapping Function for the Exterior of the Mandelbrot Set" Fractal and Fractional 6, no. 10: 534. https://doi.org/10.3390/fractalfract6100534

APA Style

Beretta, F., Dimino, J., Fang, W., Martinez, T. C., Miller, S. J., & Stoll, D. (2022). On Benford’s Law and the Coefficients of the Riemann Mapping Function for the Exterior of the Mandelbrot Set. Fractal and Fractional, 6(10), 534. https://doi.org/10.3390/fractalfract6100534

Article Metrics

Back to TopTop