Base Dependence of Benford Random Variables

Benford, Frank

doi:10.3390/stats4030034

Open AccessArticle

Base Dependence of Benford Random Variables

by

Frank Benford

Benford Applied Math, Salem, OR 97304, USA

Stats 2021, 4(3), 578-594; https://doi.org/10.3390/stats4030034

Submission received: 4 May 2021 / Revised: 5 June 2021 / Accepted: 7 June 2021 / Published: 2 July 2021

(This article belongs to the Special Issue Benford's Law(s) and Applications)

Download

Browse Figure

Versions Notes

Abstract

:

A random variable X that is base b Benford will not in general be base c Benford when

c \neq b

. This paper builds on two of my earlier papers and is an attempt to cast some light on the issue of base dependence. Following some introductory material, the “Benford spectrum” of a positive random variable is introduced and known analytic results about Benford spectra are summarized. Some standard machinery for a “Benford analysis” is introduced and combined with my method of “seed functions” to yield tools to analyze the base c Benford properties of a base b Benford random variable. Examples are generated by applying these general methods to several families of Benford random variables. Berger and Hill’s concept of “base-invariant significant digits” is discussed. Some potential extensions are sketched.

Keywords:

significand function; Benford random variable; base b first digit law; Benford spectrum; Benford analysis; seed functions; base-invariant significant digits

1. Introduction

My grandfather, the physicist Frank Benford for whom Benford’s Law is named, considered his “law of anomalous numbers” as evidence of a “real world” phenomenon. He realized that geometric sequences and exponential functions are generally base 10 Benford, and on this basis he wrote [1]:

“If the view is accepted that phenomena fall into geometric series, then it follows that the observed logarithmic relationship is not a result of the particular numerical system, with its base, 10, that we have elected to use. Any other base, such as 8, or 12, or 20, to select some of the numbers that have been suggested at various times, would lead to similar relationships; for the logarithmic scales of the new numerical system would be covered by equally spaced steps by the march of natural events. As has been pointed out before, the theory of anomalous numbers is really the theory of phenomena and events, and the numbers but play the poor part of lifeless symbols for living things.”

This argument seems compelling, and it might seem to apply to Benford random variables as well as to geometric sequences and exponential functions. It is therefore somewhat surprising to observe that a random variable that is base b Benford is not generally base c Benford when

c \neq b

. We’ll see some examples shortly.

This paper builds on two of my earlier papers [2,3] and is an attempt to cast some light on the issue of base dependence. It’s organized as follows. Section 2 introduces the significand function and the fractional part notation and gives several logically equivalent definitions of “Benford random variable.” The base b first digit law is introduced, and several examples of random variables are presented that are Benford relative to one base but not to another. Section 3 introduces the “Benford spectrum”

B_{X}

of a positive random variable X and summarizes some of the known analytical results that involve

B_{X}

. Section 4 is a brief digression listing some facts about Fourier transforms that are needed in subsequent sections. Section 5 introduces some fundamental notation and results that provide a framework for the “Benford analysis” of a positive random variable. Section 6 combines the framework of Section 5 with my method of “seed functions” to develop the theory of the base c Benford properties of random variables X that are known to be Benford relative to base b, and Section 7 gives several examples of such random variables. Section 8 discusses Berger and Hill’s concept of “base-invariant significant digits.” Section 9 is a summary and a look ahead.

2. Benford Random Variables

The best way to define Benford random variables is via the significand function. Let

b > 1

be a fixed “base.” Any

x > 0

may be written uniquely in the form

x = s \times b^{k} where s \in [1, b) and k \in Z,

and the base b significand of x, written

S_{b} (x)

, is defined as this s. Hence,

x = S_{b} (x) \times b^{k} where S_{b} (x) \in [1, b) and k \in Z .

(1)

(Berger and Hill [4] define the significand of x for all

x \in R

, but we don’t require this generality.)

Now let X be a positive random variable; that is,

Pr (X > 0) = 1 .

Assume that X is continuous with a probability density function (pdf).

Definition 1.

X is base b Benford (or X is b-Benford) if and only if the distribution function of

S_{b} (X)

is given by

Pr (S_{b} (X) \leq s) = {log}_{b} (s) for all s \in [1, b) .

(2)

Nothing written above requires that b be an integer. For this paragraph alone, we assume that b is an integer greater than or equal to 3. Let

D_{1} (X)

denote the first (i.e., leftmost or most significant) digit of X in the base b representation of X, so

D_{1} (X) \in \{1, \dots, b - 1\}

. (Leading zeros, if there are any, are ignored.)

Proposition 1.

If X is b-Benford, then

Pr (D_{1} (X) = d) = {log}_{b} (\frac{d + 1}{d})

(3)

for all

d \in \{1, \dots, b - 1\}

. This is the “base b First Digit Law.” To prove it, it is sufficient to observe that

D_{1} (X) = d

if and only if

d \leq S_{b} (X) < d + 1

.

It’s useful at this point to introduce some non-standard notation. Let

y \in R

and recall that the “floor” of y, written

⌊y⌋

, is defined as the largest integer that is less than or equal to y. Define

〈 y 〉

as:

〈 y 〉 \equiv y - ⌊y⌋

(4)

and note that

0 \leq 〈 y 〉 < 1

for every

y \in R

. We’ll call

〈 y 〉

the fractional part of y, though if

y < 0

this description is misleading.

If we take the logarithm base b of Equation (1) we obtain

{log}_{b} (x) = {log}_{b} (S_{b} (x)) + k .

(5)

On the other hand,

{log}_{b} (x) = 〈 {log}_{b} (x) 〉 + ⌊{log}_{b} (x)⌋ .

(6)

As

⌊{log}_{b} (x)⌋

is necessarily an integer and

0 \leq {log}_{b} (S_{b} (x)) < 1

, comparison of Equations (5) and (6) shows that

{log}_{b} (S_{b} (x)) = 〈 {log}_{b} (x) 〉 and k = ⌊{log}_{b} (x)⌋

(7)

for any

x > 0 .

Using Equation (7), we may rephrase Definition 1 in several logically equivalent ways.

Proposition 2.

X is b-Benford if and only if any one of the following four conditions is met.

\begin{matrix} (1) & Pr ({log}_{b} (S_{b} (X)) \leq {log}_{b} (s)) = {log}_{b} (s) f o r a l l s \in [1, b), \\ (2) & Pr (〈 {log}_{b} (X) 〉 \leq u) = u f o r e v e r y u \in [0, 1), \\ (3) & 〈 {log}_{b} (X) 〉 \sim U [0, 1), \\ (4) & X = b^{Y} w h e r e 〈 Y 〉 \sim U [0, 1), \end{matrix}

where the notation “

W \sim U [0, 1)

” means that W is uniformly distributed on the half open interval

[0, 1)

. (More generally, I use the symbol “∼” to mean “is distributed as.” Hence, for example, “

X \sim f

” means that X is distributed with pdf f, and “

X_{1} \sim X_{2}

” means that

X_{1}

and

X_{2}

have the same distribution.)

For any random variable Y, if

〈 Y 〉 \sim U [0, 1)

we sometimes say that Y is “uniformly distributed modulo one,” abbreviated “u.d. mod 1.” Hence X is b-Benford if and only if

{log}_{b} (X)

is u.d. mod 1.

With this background we can now give a couple of examples of random variables that are Benford with respect to one base but not to another. Let

Y \sim U [0, 1)

.

Example 1.

Let

X \equiv 10^{Y}

, so X is 10-Benford. But it’s not 8-Benford as it fails to satisfy the base 8 First Digit Law. To see this, note that the support of X is

[1, 10)

, and let

D_{1} (X)

denote the first digit in the base 8 representation of X. Then

\begin{matrix} Pr (D_{1} (X) = 1) = Pr (1 \leq X < 2) + Pr (8 \leq X < 10) \\ = {log}_{10} (2) + {log}_{10} (5 / 4) \approx 0.3979, \end{matrix}

whereas

{log}_{8} (\frac{1 + 1}{1}) = {log}_{8} (2) = \frac{1}{3} .

Example 2.

Let Y be as above, but now let

X \equiv 8^{Y}

, so X is 8-Benford. Note that the support of X is

[1, 8)

. Let

D_{1} (X)

denote the first digit in the base 10 representation of X. Hence

Pr (D_{1} (X) \in \{8, 9\}) = 0,

whereas

{log}_{10} (\frac{9}{8}) + {log}_{10} (\frac{10}{9}) \approx 0.09691 .

Hence, X fails to satisfy the base 10 First Digit Law.

3. The Benford Spectrum

Let X be a positive random variable.

Definition 2.

Following Wójcik [5], the “Benford spectrum” of X, denoted

B_{X}

, is defined as

B_{X} \equiv \{b \in (1, \infty) : X is b - Benford\} .

(8)

The Benford spectrum of X may be empty. In fact, the Benford spectra of essentially all the standard random variables used in statistics are empty.

This section summarizes some of the known facts about Benford spectra. While proofs are provided for Proposition 4 and 6, I’m just going to provide citations for proofs of the other propositions.

Proposition 3

(Berger and Hill [4], page 44, Proposition 4.3 (iii)). A random variable Y is u.d. mod 1 if and only if

k Y + c

is u.d. mod 1 for every integer

k \neq 0

and every

c \in R

.

Proposition 4

(Whittaker [6]). If

b \in B_{X}

, then

\sqrt[m]{b} \in B_{X}

for all

m \in N

. In other words, if X is b-Benford, then X is

\sqrt[m]{b}

-Benford for all

m \in N

.

Proof.

Suppose that X is b-Benford, so

X = b^{Y}

where Y is u.d. mod 1. Hence, for any

m \in N,

X = {(b^{1 / m})}^{m Y} .

As

b^{1 / m} > 1

and

m Y

is u.d. mod 1 by Proposition 3, it follows that

b^{1 / m} \in B_{X} .

□

Proposition 5.

If

B_{X}

is non-empty, then it is bounded above. In other words, no random variable can be b-Benford for arbitrarily large b. Citations: Refs. [3,5,6].

Proposition 6.

If X is b-Benford and

c > 0

, then

c X

is b-Benford.

Proof.

As X is b-Benford,

Y \equiv {log}_{b} (X)

is u.d. mod 1. As

{log}_{b} (c X) = Y + {log}_{b} (c)

is u.d. mod 1 from Proposition 3, it follows that

c X

is b-Benford. □

We say of this result that the Benford property of a random variable is “scale-invariant.”

Proposition 7.

Suppose that X and W are independent positive random variables and that X is b-Benford. Then the product

X W

is also b-Benford. Citations: Refs. [3,4,5].

Proposition 8

(a corollary of Proposition 7). If X and W are independent positive random variables, then

B_{X} \cup B_{W} \subseteq B_{X Y} .

So far, the spectra we’ve seen are at most countably infinite. One may wonder if there exists a random variable with an uncountable spectrum. Whittaker showed by an example that such a random variable exists. Let

b > 1

be given. Define

g : R \to R

by

g (y) \equiv \frac{1 - cos (2 π y)}{2 π^{2} y^{2}} .

(9)

It may be shown that g is a legitimate pdf, and Y is u.d. mod 1 if

Y \sim g

. Hence

X \equiv b^{Y}

is b-Benford. (This is what I’ve called Whittaker’s random variable.) For any

c > 1

, define

Y_{c} \equiv {log}_{c} (X)

. It may then be shown that

Y_{c}

is u.d. mod 1 (and hence that X is c-Benford) if and only if

c \leq b

. In summary,

B_{X} = (1, b]

. Citations: Refs. [3,5,6].

4. Digression: Fourier Transforms

Before going much further, we need to list some facts about Fourier transforms. Let g denote the pdf of a real valued random variable

Y

. The Fourier transform of g is the function

\hat{g} : R \to C

defined as

\begin{matrix} \hat{g} (ξ) & \equiv \int_{- \infty}^{\infty} e^{- 2 π i ξ y} g (y) d y = E (e^{- 2 π i ξ Y}) = u (ξ) - i v (ξ) \end{matrix}

(10)

for all

ξ \in R

, where

\begin{matrix} u (ξ) \equiv \int_{- \infty}^{\infty} cos (2 π ξ y) g (y) d y = E (cos (2 π ξ Y)) and \\ v (ξ) \equiv \int_{- \infty}^{\infty} sin (2 π ξ y) g (y) d y = E (sin (2 π ξ Y)) . \end{matrix}

(11)

Note that u is an even function and v is an odd function, and hence that

\hat{g} (- ξ) = \bar{\hat{g} (ξ)}

where the overbar denotes complex conjugation. Though the Fourier transform

\hat{g} (ξ)

is generally complex valued, it is real valued if g is an even function, i.e., if Y is symmetrically distributed around the origin. Hence, if g is an even function, then

\hat{g}

is an even function. Finally, note that

\hat{g} (0) = \int_{- \infty}^{\infty} g (y) d y = 1

.

The following fact is very useful.

Proposition 9

(shift and scale with random variables). Suppose that

W = σ Y + μ

where

σ > 0

. Suppose that

Y \sim g

and let h denote the pdf of W. We may obtain h from g and

\hat{h}

from

\hat{g}

as follows:

h (w) = \frac{1}{σ} g (\frac{w - μ}{σ})

(12)

(proof left to reader) and

\hat{h} (ξ) = E (e^{- 2 π i ξ W}) = E [e^{- 2 π i ξ (σ Y + μ)}] = e^{- 2 π i ξ μ} \hat{g} (σ ξ) .

(13)

If

μ = 0

, Equation (13) becomes

\hat{h} (ξ) = \hat{g} (σ ξ)

.

Appendix A of this paper contains a table of selected Fourier transforms.

5. A Framework for Benford Analysis

Suppose that X is a positive random variable and that

b > 1

. We may wish to know if X is b-Benford, and if it’s not by how far does it differ from “Benfordness.” I call an attempt to answer these and related questions a “Benford analysis.” In this section I establish some notation I’ll use for a Benford analysis, and give some fundamental results that allow us to proceed.

First, define

Y \equiv {log}_{b} (X) = Λ_{b} ln (X) where Λ_{b} \equiv \frac{1}{ln (b)} .

(14)

Next, let

\begin{matrix} g & denote the pdf of Y, \\ \tilde{g} & denote the pdf of 〈 Y 〉 . \end{matrix}

Given

\tilde{g}

we may answer the two questions given above. (1) X is b-Benford if and only if

\tilde{g} (u) = 1

for almost all

u \in [0, 1)

. (2) If X is not b-Benford, we may measure its deviation from Benfordness by any measure of the deviation of

\tilde{g}

from a uniform distribution. For example, if

\tilde{g}

is continuous, or if its only discontinuities are “jumps,” we could use the infinity norm:

{∥\tilde{g} - 1∥}_{\infty} \equiv sup (|\tilde{g} (u) - 1| : 0 \leq u < 1) .

(15)

We need a way to find

\tilde{g}

from g. Under a reasonable assumption, it may be shown that

\tilde{g} (u) = \sum_{k \in Z} g (k + u)

(16)

for all

u \in [0, 1)

. The “reasonable assumption” is described in [2]. In this paper we’ll just accept Equation (16) as given.

Although Equation (16) is fundamental for a Benford analysis of X, it is not very useful for finding the answers to some analytical questions one may ask. Fourier analysis provides the tools needed to continue the analysis. It may be shown [3] that the Fourier series representation of

\tilde{g} (u)

is

\tilde{g} (u) = \sum_{n \in Z} \hat{g} (n) e^{2 π i n u} for all u \in [0, 1) .

(17)

At first sight this expression may not seem very useful; the series of real valued functions in Equation (16) has been replaced by a series of complex valued functions multiplied by complex coefficients. But

\hat{g} (0) = 1

, and Equation (17) may be written as

\tilde{g} (u) = 1 + \sum_{n \in N} [\hat{g} (- n) e^{- 2 π i n u} + \hat{g} (n) e^{2 π i n u}] .

(18)

As

\hat{g} (- n) e^{- 2 π i n u}

is the complex conjugate of

\hat{g} (n) e^{2 π i n u}

, it follows that each term in brackets in Equation (18) is real valued. In fact,

\hat{g} (- n) e^{- 2 π i n u} + \hat{g} (n) e^{2 π i n u} = a_{n} cos (2 π n u) + b_{n} sin (2 π n u)

(19)

where

\begin{matrix} a_{n} = \hat{g} (- n) + \hat{g} (n) = 2 \int_{- \infty}^{\infty} cos (2 π n y) g (y) d y, \\ b_{n} = - i \hat{g} (- n) + i \hat{g} (n) = 2 \int_{- \infty}^{\infty} sin (2 π n y) g (y) d y . \end{matrix}

(20)

Combining Equations (18) and (19) yields

\tilde{g} (u) = 1 + \sum_{n \in N} [a_{n} cos (2 π n u) + b_{n} sin (2 π n u)] .

(21)

In practice, it is often convenient to go one step further and rewrite Equation (21) as

\tilde{g} (u) = 1 + \sum_{n \in N} A_{n} cos [2 π n (u - θ_{n})]

(22)

where

A_{n}

satisfies

A_{n}^{2} = a_{n}^{2} + b_{n}^{2}

(23)

and

θ_{n}

is any solution to

cos (2 π n θ_{n}) = \frac{a_{n}}{A_{n}} and sin (2 π n θ_{n}) = \frac{b_{n}}{A_{n}} .

(24)

The parameters

A_{n}

and

θ_{n}

are not uniquely determined by Equations (23) and (24), but in practice natural candidates for

A_{n}

and

θ_{n}

often present themselves. I’ll call

A_{n}

an “amplitude” (though this term generally refers to

|A_{n}|

) and

θ_{n}

a “phase.”

Proposition 10.

The pdf

\tilde{g}

is that of a

U [0, 1)

random variable if and only if

\hat{g} (n) = 0

for all

n \in N

. Equivalently, the pdf

\tilde{g}

is that of a

U [0, 1)

random variable if and only if

A_{n} = 0

for all

n \in N

.

Proof.

The first assertion follows fromEquation (18) combined with

\hat{g} (- n) = \bar{\hat{g} (n)}

for any

n \in N

. The second assertion follows from Equation (22). □

Proposition 11.

|A_{n}| = 2 |\hat{g} (n)| for all n \in N .

(25)

Proof.

Solving Equation (20) for

\hat{g} (- n)

and

\hat{g} (n)

, we find

\hat{g} (n) = \frac{1}{2} (a_{n} - i b_{n}), \hat{g} (- n) = \frac{1}{2} (a_{n} + i b_{n}) .

(26)

It follows that

A_{n}^{2} = a_{n}^{2} + b_{n}^{2} = 4 \hat{g} (n) \hat{g} (- n) = 4 {|\hat{g} (n)|}^{2} \Rightarrow |A_{n}| = 2 |\hat{g} (n)| .

□

6. Base Dependence: Theory

Suppose we’re given a u.d. mod 1 random variable Y with pdf g and

b > 1

. Then

X \equiv b^{Y}

is b-Benford. Now let

c > 1

be another possible base and define

Y_{c} \equiv {log}_{c} (X)

. Let

g_{c}

and

{\tilde{g}}_{c}

denote the pdfs of

Y_{c}

and

〈 Y_{c} 〉

, respectively, and let

{\hat{g}}_{c}

denote the Fourier transform of

g_{c}

. My aim in this section is to present tools that allow one to study how

{\tilde{g}}_{c}

varies as a function of c.

The first thing to observe is that

Y_{c}

is proportional to Y:

Y_{c} = \frac{ln (X)}{ln (c)} = \frac{ln (b)}{ln (c)} \cdot \frac{ln (X)}{ln (b)} = ρ Y where ρ \equiv \frac{ln (b)}{ln (c)} .

(27)

It then follows from Proposition 9 that

{\hat{g}}_{c} (ζ) = \hat{g} (ρ ζ)

(28)

for any

ζ \in R

.

To use Equation (28) we first need to say something about g. I introduced “seed functions” in [2] and showed that every pdf g of a u.d. mod 1 random variable may be written

g (y) = H (y) - H (y - 1)

(29)

for every

y \in R

, where H is a seed function. Hence

\hat{g} (ξ) = \int_{- \infty}^{\infty} e^{- 2 π i ξ y} [H (y) - H (y - 1)] d y .

(30)

Under various assumptions about H, we may combine Equations (28) and (30) to compute

{\hat{g}}_{c} (n)

for all

n \in Z

, and given

{\hat{g}}_{c} (n)

we may compute

A_{n}

and

θ_{n}

in the expression

{\hat{g}}_{c} (- n) e^{- 2 π i n u} + {\hat{g}}_{c} (n) e^{2 π i n u} = A_{n} cos [2 π n (u - θ_{n})]

for all

n \in N

, and thereby derive

\tilde{g}

. In this section I’ll partially carry out this program for two broad classes of seed functions: (1)

H

is a step function, and (2)

H

is increasing and absolutely continuous.

Example 3.

Suppose that

H

is the following step function:

H (y) = \{\begin{matrix} 0 & if y < - \frac{1}{2}, \\ 1 & if y \geq - \frac{1}{2} . \end{matrix}

This seed function implies that

g (y) = \{\begin{matrix} 1 & if y \in [- \frac{1}{2}, \frac{1}{2}), \\ 0 & otherwise . \end{matrix}

(31)

Hence, from the table of Fourier transforms in Appendix A,

\hat{g} (ξ) = \frac{sin (π ξ)}{π ξ}

(32)

(where it is understood that

\hat{g} (0) = 1

). Combining this with Equation (28) yields

{\hat{g}}_{c} (n) = \frac{sin (π ρ n)}{π ρ n}

(33)

for any

n \neq 0

. From Proposition 10 we know that

Y_{c}

will be c-Benford if and only if

{\hat{g}}_{c} (n) = 0

for every

n \in N

, and from Equation (33) it’s clear that this happens if and only if ρ is an integer. But

ρ = \frac{ln (b)}{ln (c)} = m \Leftrightarrow c = b^{1 / m}

(34)

for every

m \in N

. Hence,

Y_{c}

is c-Benford if and only if c is an integral root of b. This result agrees with Proposition 4.

Certain features of this result are repeated with every seed function H we consider. In particular, we always find that

{\hat{g}}_{c} (n) = 0

for all

n \in N

whenever c is an integral root of b. Also, note that

{\hat{g}}_{c} (n)

depends on c entirely through the parameter

ρ

.

Equation (33) implies that

A_{n} = \frac{2 sin (π ρ n)}{π ρ n}, θ_{n} = 0

(35)

for this example.

Example 4.

To generalize Example 3 slightly, suppose that H jumps from 0 to 1 at

μ - \frac{1}{2}

for some

μ \in R

. The pdf g implied by this seed function is just that given by Equation (31) shifted right by μ. From Proposition 9 and Equation (32) we obtain

\hat{g} (ξ) = e^{- 2 π i ξ μ} \frac{sin (π ξ)}{π ξ}

and hence

{\hat{g}}_{c} (n) = e^{- 2 π i ρ n μ} \frac{sin (π ρ n)}{π ρ n}

(36)

for any

n \in N

. Note that

{\hat{g}}_{c} (n) = 0

if and only if

c = b^{1 / m}

for some

m \in N

. Equation (36) implies that

A_{n} = \frac{2 sin (π ρ n)}{π ρ n}, θ_{n} = ρ μ

(37)

for this example. The only effect of including μ is to change the phase. Note that the phase does not depend on n.

Now assume that H is increasing and absolutely continuous. This assumption makes H mathematically equivalent to the distribution function of an absolutely continuous random variable. Under this assumption H is differentiable almost everywhere and

h (y) \equiv H^{'} (y) \geq 0

. We want to evaluate the integral

\hat{g} (ξ) = \int_{- \infty}^{\infty} e^{- 2 π i ξ y} [H (y) - H (y - 1)] d y = E (e^{- 2 π i ξ Y}) .

It’s clear from the rightmost expression in this equation that

\hat{g} (0) = 1

. When

ξ \neq 0

, an initial integration by parts yields

\hat{g} (ξ) = \frac{1}{2 π i ξ} \int_{- \infty}^{\infty} e^{- 2 π i ξ y} [h (y) - h (y - 1)] d y .

Evaluating this integral,

\begin{matrix} \begin{matrix} \hat{g} (ξ) & = \frac{1}{2 π i ξ} (1 - e^{- 2 π i ξ}) \hat{h} (ξ) \\ = \frac{e^{- i π ξ}}{2 π i ξ} (e^{i π ξ} - e^{- i π ξ}) \hat{h} (ξ) = \frac{e^{- i π ξ}}{π ξ} sin (π ξ) \hat{h} (ξ) . \end{matrix} \end{matrix}

Hence,

{\hat{g}}_{c} (n) = \frac{e^{- i π ρ n}}{π ρ n} sin (π ρ n) \hat{h} (ρ n)

(38)

for any

n \neq 0

. We see once again that

{\hat{g}}_{c} (n) = 0

for all

n \in N

whenever c is an integral root of b. In addition, there’s another possibility;

{\hat{g}}_{c} (n) = 0

for all

n \in N

if

\hat{h} (ρ n) = 0

for all

n \in N

. This is essentially the possibility that was exploited in the construction of Whittaker’s random variable. We’ll return to this point in a moment.

Example 5.

Still working with the assumption that H is increasing and absolutely continuous, we now make the additional assumption that h is an even function, which implies that

\hat{h}

is an even function. Under these assumptions, Equation (38) implies that

A_{n} = \frac{2 sin (π ρ n) \hat{h} (ρ n)}{π ρ n}, θ_{n} = \frac{1}{2} ρ .

(39)

Example 6.

In Example 5 we assume that h is even, so that it’s symmetrical around the point

y = 0

. Now assume that h is symmetrical around the point

y = μ

for some

μ \in R

. Define

h_{0} (y) \equiv h (y + μ)

so

h_{0}

is an even function. It is easy to show that

\hat{h} (ξ) = e^{- 2 π i ξ μ} {\hat{h}}_{0} (ξ)

. Combining this fact with Equation (38) yields

\begin{matrix} {\hat{g}}_{c} (n) & = \frac{e^{- i π ρ n}}{π ρ n} sin (π ρ n) e^{- 2 π i ρ n μ} {\hat{h}}_{0} (ρ n) \\ = \frac{e^{- 2 π i ρ n (\frac{1}{2} + μ)}}{π ρ n} sin (π ρ n) {\hat{h}}_{0} (ρ n) . \end{matrix}

(40)

Equation (40) implies that

A_{n} = \frac{2 sin (π ρ n) {\hat{h}}_{0} (ρ n)}{π ρ n}, θ_{n} = ρ (\frac{1}{2} + μ)

(41)

for Example 6. We observe that the phase depends on μ and ρ, but not on n.

Note that

A_{n}

and

θ_{n}

depend on c only through

ρ

in all of these examples. It’s useful to keep in mind that

ρ

depends on c as shown in Figure 1 (where I’ve let

b \equiv 16

). In words,

ρ

increases from 1 to ∞ as c decreases from b towards 1.

7. Base Dependence: Examples

Equations (38) and (41) provide the scaffolding for the construction of

\tilde{g}

, but require insertion of an actual formula for

\hat{h}

in Equation (38) or

{\hat{h}}_{0}

in Equation (41) for completion. This section completes this construction using the table of Fourier transforms in Appendix A.

Every distribution function is a legitimate seed function. Hence every Fourier transform given in Appendix A is a legitimate candidate for

\hat{h}

. Moreover, four of the distributions in Appendix A (the normal, Laplace, Cauchy, and logistic) are even functions, and their Fourier transforms are therefore legitimate candidates for

{\hat{h}}_{0}

. All four of these distributions have fixed variances, however, and it is desirable to append a “scale” parameter

σ

that allows these variances to be adjusted. Proposition 9 justifies the following expanded table of Fourier transforms.

Example 7.

Gauss-Benford random variables. Suppose that H is the distribution function of a

N (μ, σ)

random variable, i.e., a

N (0, σ)

random variable shifted μ to the right. I’ll call the random variable X implied by this seed function a “Gauss-Benford” random variable. Combining Equation (41) with the appropriate entry from Table 1, we obtain

A_{n} = \frac{2 sin (π ρ n)}{π ρ n} exp (- 2 π^{2} σ^{2} ρ^{2} n^{2}), θ_{n} = ρ (\frac{1}{2} + μ) .

(42)

As

exp (- 2 π^{2} σ^{2} ρ^{2} n^{2}) > 0

, it follows that

B_{X} = \{b^{1 / m} : m \in N\}

. Let

A_{n}^{*} = \frac{2}{π ρ n} exp (- 2 π^{2} σ^{2} ρ^{2} n^{2}),

(43)

so

A_{n} = sin (π ρ n) A_{n}^{*}

. Viewed as a function of n or ρ,

A_{n}

oscillates within an envelope

[- A_{n}^{*}, A_{n}^{*}]

, and

|A_{n}| \leq A_{n}^{*}

for all

n,

σ,

and ρ. Asymptotically, letting any of the parameters n, ρ, or

σ \to \infty

implies that

A_{n}^{*} ↓ 0

. Equation (43) implies that

A_{1}^{*} > A_{2}^{*} > \dots

. The descent of

A_{n}^{*}

towards zero with increases in n, ρ, or σ is extremely rapid, and

A_{1}^{*}

can be small with even low values of ρ and σ. For example, letting

ρ = σ = 1

implies that

A_{1}^{*} \approx 1.7 \times 10^{- 9}

. In this case, the graph of

\tilde{g}

is visually indistinguishable from that of a uniform distribution on

[0, 1)

and we would have to conclude that X is “effectively” c-Benford for all

c \leq b

.

Example 8.

Laplace-Benford random variables. Now suppose that H is the distribution function of a Laplace

(μ, σ)

random variable. I’ll call the random variable X implied by this seed function a “Laplace-Benford” random variable. Combining Equation (41) with the appropriate entry from Table 1, we obtain

A_{n} = \frac{2 sin (π ρ n)}{π ρ n} \cdot \frac{1}{1 + 4 π^{2} σ^{2} ρ^{2} n^{2}}, θ_{n} = ρ (\frac{1}{2} + μ) .

(44)

As

{(1 + 4 π^{2} σ^{2} ρ^{2} n^{2})}^{- 1} > 0

, it follows that

B_{X} = \{b^{1 / m} : m \in N\}

. Let

A_{n}^{*} = \frac{2}{π ρ n} \cdot \frac{1}{1 + 4 π^{2} σ^{2} ρ^{2} n^{2}},

(45)

so

A_{n} = sin (π ρ n) A_{n}^{*}

and

|A_{n}| \leq A_{n}^{*}

for all

n,

σ,

and ρ. Asymptotically, letting any of the parameters n, ρ, or

σ \to \infty

implies that

A_{n}^{*} ↓ 0

. Though the asymptotic limits of Equations (43) and (45) are identical, the approach of

A_{n}^{*}

to zero (as n, ρ, or σ increases) is very much slower for Equation (45) than it is for Equation (43).

Example 9.

Cauchy-Benford random variables. Now suppose that H is the distribution function of a Cauchy

(μ, σ)

random variable. I’ll call the random variable X implied by this seed function a “Cauchy-Benford” random variable. Combining Equation (41) with the appropriate entry from Table 1, we obtain

A_{n} = \frac{2 sin (π ρ n)}{π ρ n} e^{- 2 π σ ρ n}, θ_{n} = ρ (\frac{1}{2} + μ) .

(46)

As

e^{- 2 π σ ρ n} > 0

, it follows that

B_{X} = \{b^{1 / m} : m \in N\}

. Let

A_{n}^{*} = \frac{2}{π ρ n} e^{- 2 π σ ρ n}

(47)

so

A_{n} = sin (π ρ n) A_{n}^{*}

. The asymptotic behavior for this

A_{n}^{*}

is identical to that of Equations (43) or (45). The rate of descent of

A_{n}^{*}

towards zero is intermediate between that of a Gauss-Benford random variable and that of a Laplace-Benford random variable.

Example 10.

Logistic-Benford random variables. For our final example of a symmetric seed function, let H be the distribution function of a logistic

(μ, σ)

random variable. I’ll call the random variable X implied by this seed function a “Logistic-Benford” random variable. Combining Equation (41) with the appropriate entry from Table 1, we obtain

A_{n} = \frac{2 sin (π ρ n)}{π ρ n} \cdot \frac{2 π^{2} σ ρ n}{sinh (2 π^{2} σ ρ n)} = sin (π ρ n) A_{n}^{*}, θ_{n} = ρ (\frac{1}{2} + μ)

(48)

where

A_{n}^{*} = \frac{4 π σ}{sinh (2 π^{2} σ ρ n)} > 0 .

(49)

The asymptotic behavior for this

A_{n}^{*}

is identical to that of the previous three random variables. The rate of convergence of

A_{n}^{*}

to zero is comparable to that of a Cauchy-Benford random variable.

Example 11.

Gamma-Benford random variables. Suppose that the seed function H is the distribution function of a

Γ (α, β)

random variable. I’ll call the random variable X implied by this seed function a “Gamma-Benford” random variable. This seed function is increasing and absolutely continuous, but h isnot symmetrically distributed around any point μ, so Equation (41) does not apply. Combining Equation (38) with the appropriate entry from the table of Fourier transforms found in Appendix A, we obtain

{\hat{g}}_{c} (n) = \frac{e^{- i π ρ n}}{π ρ n} sin (π ρ n) {(1 + 2 π i β ρ n)}^{- α}

(50)

for every integer

n \neq 0

. To make headway, define

z_{n} \equiv 1 + 2 π i β ρ n = 1 + i y_{n} where y_{n} \equiv 2 π β ρ n,

(51)

and rewrite

z_{n}

in polar form, so

z_{n} = r_{n} e^{i ϕ_{n}} where r_{n} \equiv \sqrt{1 + y_{n}^{2}}, tan (ϕ_{n}) = y_{n} .

(52)

Hence,

{\hat{g}}_{c} (n) = \frac{e^{- i π ρ n}}{π ρ n} sin (π ρ n) r_{n}^{- α} e^{- i α ϕ_{n}} = \frac{sin (π ρ n)}{π ρ n r_{n}^{α}} e^{- 2 π i n θ_{n}}

(53)

where

θ_{n} \equiv \frac{1}{2} ρ + \frac{α ϕ_{n}}{2 π n} .

(54)

Hence,

\begin{matrix} {\hat{g}}_{c} (- n) e^{- 2 π i n u} + {\hat{g}}_{c} (n) e^{2 π i n u} = \frac{sin (π ρ n)}{π ρ n r_{n}^{α}} (e^{- 2 π i n (u - θ_{n})} + e^{2 π i n (u - θ_{n})}) \\ = \frac{2 sin (π ρ n)}{π ρ n r_{n}^{α}} cos [2 π n (u - θ_{n})] = A_{n} cos [2 π n (u - θ_{n})] \end{matrix}

(55)

where

A_{n} \equiv \frac{2 sin (π ρ n)}{π ρ n r_{n}^{α}} .

(56)

To “compare and contrast” these results with those with symmetric distributions, we make the following observations. (1) The presence of

sin (π ρ n)

in the numerator of Equation (56), combined with

r_{n}^{- α} > 0

, implies that

A_{n} = 0

for all

n \in N

if and only if ρ is an integer, i.e., if and only if c is an integral root of b. (2) Unlike our earlier results, where the phase

θ_{n}

is given by Equation (41) and does not depend on n, for a Gamma-Benford random variable the phase is given by Equation (54). It’s easy to show that

ϕ_{n} \to \frac{1}{2} π

as

n \to \infty

, and hence that

θ_{n} ↓ \frac{1}{2} ρ .

(3) It’s easy to show that

A_{n}^{*} \equiv \frac{2}{π ρ n r_{n}^{α}} ↓ 0 as ρ n \to \infty .

Example 12.

Whittaker-Benford random variables: For our final example, we return to Equation (38),

{\hat{g}}_{c} (n) = \frac{e^{- i π ρ n}}{π ρ n} sin (π ρ n) \hat{h} (ρ n),

which holds for all increasing and absolutely continuous seed functions. All of our previous examples have made use of the fact that

sin (π ρ n) = 0

for all

n \in N

whenever ρ is an integer. We now consider another possibility:

{\hat{g}}_{c} (n) = 0

for all

n \in N

if

\hat{h} (ρ n) = 0

for all

n \in N

. I’ll say that a b-Benford random variable X satisfying this condition is a “Whittaker-Benford” random variable. The key here is to find

\hat{h}

with bounded support, and the simplest such

\hat{h}

is triangular:

\hat{h} (ξ) = max (0, 1 - \frac{|ξ|}{γ}),

(57)

where

γ > 0

. With this

\hat{h}

it’s clear that

\hat{h} (ρ n) = 0

for all

n \in N

if

ρ \geq γ

. Note that

ρ \geq γ \Leftrightarrow c \leq b^{1 / γ}

. Therefore, the Benford spectrum

B_{X}

of a Whittaker-Benford random variable X with

\hat{h}

given by Equation (57) has two (overlapping) components:

B_{X} = B_{X}^{d} \cup B_{X}^{c}

where

\begin{matrix} B_{X}^{d} \equiv \{b^{1 / m} : m \in N\}, \\ B_{X}^{c} \equiv (1, b^{1 / γ}] . \end{matrix}

(58)

(The superscript d stands for “discrete,” and the superscript c stands for “continuous.”) If

γ \leq 1

, then

B_{X}^{d} \subset B_{X}^{c}

. For example, if

γ = \frac{1}{2}

then

B_{X} = B_{X}^{c} = (1, b^{2}]

. On the other hand, if

γ > 1

, then

B_{X}^{c} = (1, b^{1 / γ}] \subset (1, b]

, so

B_{X}

equals the disjoint union of the discrete set (

B_{X}^{d} - B_{X}^{c}

) and the continuous set

B_{X}^{c}

.

The function h that yields

\hat{h}

given by Equation (57) is

h (y) = \frac{1 - cos (2 π γ y)}{2 γ π^{2} y^{2}} .

(59)

8. On “Base-Invariant Significant Digits”

I wish to acknowledge that I first encountered many of the ideas discussed in this section in Michal Wójcik’s admirable paper [5]. All citations to Berger and Hill in this section are to their text, reference [4].

Proposition 12.

If X is b-Benford, then

X^{n}

is b-Benford for any

n \in N .

Proof.

As X is b-Benford,

X = b^{Y}

where Y is u.d. mod 1. Hence

X^{n} = b^{n Y}

. But

n Y

is u.d. mod 1 by Proposition 3. Therefore

X^{n}

is b-Benford. □

Corollary 1.

As

X^{n} = {(b^{n})}^{Y}

, it follows that

X^{n}

is

b^{n}

-Benford if X is b-Benford.

Corollary 2.

If X is b-Benford, then

S_{b} (X) \sim S_{b} (X^{n})

for any

n \in N

. This follows from Definition 1.

One may wonder if the converse of Corollary 2, namely

if S_{b} (X) \sim S_{b} (X^{n}) for all n \in N, then X is b - Benford,

is true. The answer is “no.” Here’s a counterexample. If

X \equiv 1

, then

S_{b} (X) \sim S_{b} (X^{n})

for all

n \in N

, but X is not b-Benford. In fact, any X of the form

b^{m}

where

m \in Z

is a counterexample, as

S_{b} (X) = 1 = S_{b} (X^{n})

. However, we may show the following:

Proposition 13.

If

S_{b} (X) \sim S_{b} (X^{n})

for all

n \in N,

then either X is b-Benford, or

S_{b} (X) = 1

. We’ll provide a proof in a moment.

Definition 3.

Let

S_{b} (X) \sim S_{b} (X^{n}) for all n \in N

(60)

be called Wójcik’s condition.

Here’s another way to state Proposition 13. (This is Wójcik’s Theorem 19.)

Proposition 14.

X satisfies Wójcik’s condition if and only if the distribution function of

S_{b} (X)

is given by

Pr (S_{b} (X) \leq s) = q + (1 - q) {log}_{b} (s)

(61)

for some

q \in [0, 1]

and for all

s \in [1, b)

.

To prove Proposition 13 or 14, we first massage Wójcik’s condition into an alternative form. Let X be a positive random variable and define

Y \equiv {log}_{b} (X)

. For all

n \in N,

\begin{matrix} S_{b} (X) \sim S_{b} (X^{n}) \Leftrightarrow {log}_{b} (S_{b} (X)) \sim {log}_{b} (S_{b} (X^{n})) \\ \Leftrightarrow 〈 {log}_{b} (X) 〉 \sim 〈 {log}_{b} (X^{n}) 〉 = 〈 n {log}_{b} (X) 〉 \\ \Leftrightarrow 〈 Y 〉 \sim 〈 n Y 〉 = 〈 n 〈 Y 〉 〉 \end{matrix}

(62)

where the last equality follows from the identity

〈 n y 〉 = 〈 n 〈 y 〉 + n ⌊y⌋ 〉 = 〈 n 〈 y 〉 〉

for any

y \in R

.

Berger and Hill ([4], Lemma 5.15, page 77) show the following.

Proposition 15.

For any random variable Y, the relation

〈 Y 〉 \sim 〈 n 〈 Y 〉 〉

for all

n \in N

holds if and only if

Pr (〈 Y 〉 \leq u) = q + (1 - q) u for all u \in [0, 1)

(63)

for some

q \in [0, 1]

.

Propositions 13 and 14 are straightforward corollaries of Proposition 15.

I bring these facts to the reader’s attention because Wójcik’s condition is effectively equivalent to Berger and Hill’s notion of “base-invariant significant digits” and sheds some light on this notion. (I say “effectively equivalent” as Berger and Hill’s concept applies to a probability measure P, whereas Wójcik’s condition applies to a random variable X.)

Here’s Berger and Hill’s definition (Definition 5.10, page 75). Let

A \supseteq S

be a

σ

-algebra on

R^{+} .

A probability measure P on

(R^{+}, A)

has base-invariant significant digits if

P (A) = P (A^{1 / n})

for all

A \in S

and all

n \in N

.

Here’s a guide to the symbols used in this definition. (1)

S

is the

σ

-algebra generated by the significand function

S_{b}

. (2)

R^{+} \equiv (0, \infty)

, the set of strictly positive real numbers. (3) For any

A \subseteq R^{+}

and

n \in N, A^{1 / n} \equiv \{x > 0 : x^{n} \in A\}

. Also, it’s useful at this point to introduce one more bit of non-standard notation used by Berger and Hill: for every

x \in R

and every set

C \subseteq R

, let

x C \equiv \{x c : c \in C\}

.

The following proposition (showing the effective equivalence of Wójcik’s condition and Berger and Hill’s base-invariant significant digits) is the major result of this section.

Proposition 16.

Suppose that

X \sim (R^{+}, B (R^{+}), P)

. Then

S_{b} (X) \sim S_{b} (X^{n})

for all

n \in N

if and only if P has base-invariant significant digits.

Proof.

We begin by proving that Wójcik’s condition holds whenever P has base-invariant significant digits. Suppose that

A \in S

. From the definition of

A^{1 / n}

, we have

X \in A^{1 / n} \Leftrightarrow X^{n} \in A

. Hence,

P (A^{1 / n}) = Pr (X \in A^{1 / n}) = Pr (X^{n} \in A) .

(64)

If P has base-invariant significant digits, then

P (A^{1 / n}) = P (A) = Pr (X \in A) .

(65)

Combining Equations (64) and (65), we see that

Pr (X \in A) = Pr (X^{n} \in A)

(66)

whenever P has base-invariant significant digits. As

A \in S

there exists a set

A_{0} \in B [1, b)

such that

A = ⋃_{k \in Z} b^{k} A_{0} .

(67)

In fact,

A_{0} = S_{b} (A) \equiv \{S_{b} (x) : x \in A\}

. Hence

\begin{matrix} X \in A \Leftrightarrow S_{b} (X) \in A_{0}, \\ X^{n} \in A \Leftrightarrow S_{b} (X^{n}) \in A_{0} . \end{matrix}

(68)

Combining Equations (66) and (68), we conclude that

Pr (S_{b} (X) \in A_{0}) = Pr (S_{b} (X^{n}) \in A_{0}) .

(69)

As this equation holds for every

A_{0} \in B [1, b)

, we conclude that

S_{b} (X) \sim S_{b} (X^{n})

whenever

X \sim (R^{+}, B (R^{+}), P)

and P has base-invariant significant digits.

To prove that Wójcik’s condition implies that P has base-invariant significant digits, we essentially reverse this chain of logic. Wójcik’s condition implies Equation (69) for any

A_{0} \in B [1, b)

, which in turn implies Equation (66) for A given by Equation (67). But

Pr (X \in A) = P (A)

and

Pr (X^{n} \in A) = P (A^{1 / n})

, so Equation (66) implies that

P (A) = P (A^{1 / n})

. As

A_{0}

was an arbitrary element of

B [1, b)

, the equation

P (A) = P (A^{1 / n})

holds for A, an arbitrary element of

S

, and the proof is complete. □

Berger and Hill state the following theorem (Theorem 5.13, page 76). A probability measure P on

(R^{+}, A)

with

A \supseteq S

has base-invariant significant digits if and only if, for some

q \in [0, 1],

P (A) = q δ_{1} (A) + (1 - q) B (A) for every A \in S .

(The meaning of the “Dirac measure”

δ_{1}

is given on page 22 of their book, and the meaning of the “Benford measure”

B

is given on page 32.)

In the light of Proposition 16, it can be seen that Berger and Hill’s Theorem 5.13 is equivalent to Proposition 14 given above.

I conclude this section with a personal opinion about Berger and Hill’s exposition. I think that the terminology “base-invariant” they chose for their concept is a misnomer. There is only one base (b) in the definition, and their concept of “base-invariant” significant digits tells us nothing about the Benford properties of alternative bases for a b-Benford random variable. Hence, the label “base-invariant” they chose for their concept seems misleading and I believe they really should give it a different name.

9. Conclusions and Prospect

Let Y be a u.d. mod 1 random variable with pdf g, let

b > 1

, and define

X \equiv b^{Y}

, so X is b-Benford. Without loss of generality we may assume that

g (y) = H (y) - H (y - 1) for any y \in R,

where

H

is a seed function. Let

c > 1

. In principle, the machinery introduced in Section 6 allows one to investigate the dependence of the distribution of

〈 {log}_{c} (X) 〉

on c. In practice, I’ve carried out this investigation only for seed functions of the first two types in the following list of classes of seed functions.

(1): Step functions that jump from 0 to 1 in a single step.
(2): Increasing functions that are absolutely continuous.
(3): Step functions that increase from 0 to 1 at a finite or countably infinite number of “points of jump.”
(4): Convex combinations of seed functions in classes (2) and (3).
(5): “Singular” distribution functions. These functions are increasing and continuous, but not absolutely continuous. The Cantor function is the best known example.
(6): Seed functions satisfy a condition I call “unit interval increasing.” Every increasing function is unit interval increasing, but not conversely. That is, a function H may be unit interval increasing, but not everywhere increasing. Several examples of such seed functions are given in [2].

My intuition suggests that seed functions of types (3) and (4) will offer no additional conceptual difficulties, though they will certainly complicate the algebra. I’ll leave the investigation of seed functions of classes (5) and (6) to the reader.

With X and c defined as above, let

{\tilde{g}}_{c}

denote the pdf of

〈 {log}_{c} (X) 〉

. If X is c-Benford, and if

{\tilde{g}}_{c}

is continuous or has only “jump” discontinuities, then

{∥{\tilde{g}}_{c} - 1∥}_{\infty} = 0 .

(70)

Hence, c is in the Benford spectrum

B_{X}

if and only if Equation (70) is satisfied. For almost all random variables X, the Benford spectrum

B_{X}

is empty. We might want to say that X is “effectively” c-Benford if

{∥{\tilde{g}}_{c} - 1∥}_{\infty} < ϵ

(71)

for some small number

ϵ

. If we define the “effective” Benford spectrum of X to be the set

B_{X, ϵ} \equiv \{c > 1 : {∥{\tilde{g}}_{c} - 1∥}_{\infty} < ϵ\},

(72)

then

B_{X} \subseteq B_{X, ϵ}

. In general, I suggest, the effective spectrum will be a much larger set than the spectrum.

The machinery described in Section 5 to carry out a “Benford analysis” helps us determine whether or not the criterion of Equation (71) is satisfied. In Section 7 I suggested that a Gauss-Benford random variable should be regarded as effectively c-Benford if the product

ρ σ

is large enough. In [3] I suggested that a lognormal random variable, which is not b-Benford for any b, should be regarded as effectively c-Benford if

Λ_{c} σ

is large enough.

I leave further investigation of effectively Benford random variables to the reader.

Funding

This research received no external funding.

Acknowledgments

I’d like to thank Kenneth Ross for thoughtful comments on earlier drafts of this paper, and three anonymous referees for useful suggestions. I’d also like to thank William Davis and Don Lemons for their heroic efforts to convert my EXP document into an acceptable LaTeX form.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. A Small Table of Fourier Transforms

Feller [7] gives a table of characteristic functions of selected probability density functions. I’ve adapted his table to give the Fourier transforms of 8 of his 10 densities, and added a row for an additional pdf (the logistic).

Table A1. Fourier transforms of selected probability density functions.

No.	Name	Density $g (x)$	Interval	Fourier Transform $\hat{g} (ξ)$
1	$N (0, 1)$	${(2 π)}^{- 1 / 2} e^{- x^{2} / 2}$	$R$	$exp (- 2 π^{2} ξ^{2})$
2	$U [- a, a]$	$1 / 2 a$	$[- a, a]$	$\frac{sin (2 π a ξ)}{2 π a ξ}$
3	$U [0, a]$	$1 / a$	$[0, a]$	$\frac{1 - exp (- 2 π i a ξ)}{2 π i a ξ}$
4	Triangular	$\frac{1}{a} (1 - \frac{\|x\|}{a})$	$\|x\| \leq a$	$\frac{1 - cos (2 π a ξ)}{2 π^{2} a^{2} ξ^{2}}$
5	Dual of 4	$\frac{1 - cos (2 π a x)}{2 a π^{2} x^{2}}$	$R$	$max (0, 1 - \frac{\|ξ\|}{a})$
6	$Γ (α, β)$	$\frac{1}{Γ (α) β^{α}} x^{α - 1} e^{- x / β}$	$x > 0$	${(1 + 2 π i β ξ)}^{- α}$
7	Laplace $(0, 1)$	$\frac{1}{2} e^{- \|x\|}$	$R$	$\frac{1}{1 + 4 π^{2} ξ^{2}}$
8	Cauchy $(0, 1)$	$\frac{1}{π} \frac{1}{1 + x^{2}}$	$R$	$e^{- 2 π \|ξ\|}$
9	Logistic $(0, 1)$	${(e^{x / 2} + e^{- x / 2})}^{- 2}$	$R$	$\frac{2 π^{2} ξ}{sinh (2 π^{2} ξ)}$

References

Benford, F. The Law of Anomalous Numbers. Proc. Am. Philos. Soc. 1938, 78, 551–572. [Google Scholar]
Benford, F.A. Construction of Benford Random Variables: Generators and Seed Functions. arXiv 2020, arXiv:1609.04852. [Google Scholar]
Benford, F.A. Fourier Analysis and Benford Random Variables. arXiv 2020, arXiv:2006.07136. [Google Scholar]
Berger, A.; Theodore, H. An Introduction to Benford’s Law; Princeton University Press: Princeton, NJ, USA, 2015. [Google Scholar]
Wójcik, M. Notes on Scale-Invariance and Base-Invariance for Benford’s Law. arXiv 2013, arXiv:1307.3620. [Google Scholar]
Whittaker, J. On Scale-Invariant Distributions. SIAM J. Appl. Math. 1983, 43, 257–267. [Google Scholar] [CrossRef]
Feller, W. An Introduction to Probability Theory and Its Applications, 2nd ed.; John Wiley & Sons: New York, NY, USA, 1971; Volume II. [Google Scholar]

Figure 1.

ρ

as a function of c.

Figure 1.

ρ

as a function of c.

Table 1. Fourier transforms of selected even density functions with a scale parameter.

Name	$h_{0} (y)$	${\hat{h}}_{0} (ξ)$
$N (0, σ)$	${(2 π σ^{2})}^{- 1 / 2} e^{- y^{2} / (2 σ^{2})}$	$exp (- 2 π^{2} σ^{2} ξ^{2})$
Laplace $(0, σ)$	$\frac{1}{2 σ} e^{- \|y\| / σ}$	$\frac{1}{1 + 4 π^{2} σ^{2} ξ^{2}}$
Cauchy $(0, σ)$	$\frac{1}{π σ} {(1 + \frac{y^{2}}{σ^{2}})}^{- 1}$	$e^{- 2 π σ \|ξ\|}$
Logistic $(0, σ)$	$\frac{1}{σ} {(e^{y / (2 σ)} + e^{- y / (2 σ)})}^{- 2}$	$\frac{2 π^{2} σ ξ}{sinh (2 π^{2} σ ξ)}$

Note: Among these four distributions,

σ

is the standard deviation of the rescaled random variable only for the normal distribution

N (0, σ)

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Benford, F. Base Dependence of Benford Random Variables. Stats 2021, 4, 578-594. https://doi.org/10.3390/stats4030034

AMA Style

Benford F. Base Dependence of Benford Random Variables. Stats. 2021; 4(3):578-594. https://doi.org/10.3390/stats4030034

Chicago/Turabian Style

Benford, Frank. 2021. "Base Dependence of Benford Random Variables" Stats 4, no. 3: 578-594. https://doi.org/10.3390/stats4030034

APA Style

Benford, F. (2021). Base Dependence of Benford Random Variables. Stats, 4(3), 578-594. https://doi.org/10.3390/stats4030034

Article Menu

Base Dependence of Benford Random Variables

Abstract

1. Introduction

2. Benford Random Variables

3. The Benford Spectrum

4. Digression: Fourier Transforms

5. A Framework for Benford Analysis

6. Base Dependence: Theory

7. Base Dependence: Examples

8. On “Base-Invariant Significant Digits”

9. Conclusions and Prospect

Funding

Acknowledgments

Conflicts of Interest

Appendix A. A Small Table of Fourier Transforms

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI