Benford Behavior in Stick Fragmentation Problems

Fang, Bruce; Irons, Ava; Lippelman, Ella; Miller, Steven J.

doi:10.3390/stats8040091

Open AccessArticle

Benford Behavior in Stick Fragmentation Problems

¹

Department of Mathematics, Williams College, Williamstown, MA 01267, USA

²

Department of Mathematics and Computer Science, Colorado College, Colorado Spring, CO 80903, USA

^*

Author to whom correspondence should be addressed.

Stats 2025, 8(4), 91; https://doi.org/10.3390/stats8040091

Submission received: 23 August 2025 / Revised: 30 September 2025 / Accepted: 6 October 2025 / Published: 8 October 2025

(This article belongs to the Special Issue Benford's Law(s) and Applications (Second Edition))

Download

Browse Figures

Versions Notes

Abstract

Benford’s law states that in many real-world datasets, the probability that the leading digit is d equals

{log}_{10} ((d + 1) / d)

for all

1 \leq d \leq 9

. We call this weak Benford behavior. A dataset is said to follow strong Benford behavior if the probability that its significand (i.e., the significant digits in scientific notation) is at most s equals

{log}_{10} (s)

for all

s \in [1, 10)

. We investigate Benford behavior in a multi-proportion stick fragmentation model, where a stick is split into m substicks according to fixed proportions at each stage. This generalizes previous work on the single proportion stick fragmentation model, where each stick is split into two substicks using one fixed proportion. We provide a necessary and sufficient condition under which the lengths of the stick fragments converge to strong Benford behavior in the multi-proportion model.

Keywords:

Benford’s law; high-dimensional fragmentation; multinomial distribution

1. Introduction

In the late nineteenth century, astronomer Simon Newcomb observed that in the logarithmithic books at his workplace, certain pages were “more worn than others” [1]. In particular, there was more wear and tear in the earlier pages than the later pages. He deduced that there is a “bias” towards smaller leading digits, with the digit 1 showing up roughly

30 %

of the time, the digit 2 showing up roughly

18 %

of the time, and so on. Newcomb’s findings were practically ignored until about fifty years later, when physicist Frank Benford published his own research on the distribution of leading digits in Reader’s Digest [2]. Benford displayed a table of roughly 20,000 observations from twenty different sets of data, shown in Figure 1.

Benford called this distribution the “law of Anomalous Numbers”. However, due to the popularity of his publication, the phenomena eventually became known as “Benford’s law”; see Figure 2 for probabilities. Benford’s law is a powerful phenomena that occurs in a variety of data, including accounting, elections, finance, geosciences, physics, population data, street addresses, and more. The law’s prevalence makes it a valuable tool for ensuring data integrity. For example, it is often used in fraud detection for tax returns, insurance claims, and expense reports [3,4]. For further details on the history of Benford’s law, see [5,6,7,8,9].

There is extensive research on fragmentation problems related to Benford’s law, which is the subject of our paper. In [10], Kakutani considered the following fragmentation problem: Start with the initial set

Q_{0} = {0, 1}

and a fixed constant

α \in (0, 1)

. At each stage, from

Q_{k} = {x_{1}, \dots, x_{k + 2} ∣ x_{1} < \dots < x_{k + 2}}

, construct

Q_{k + 1}

by adding the point

x_{i} + α (x_{i + 1} - x_{i})

, where

x_{i + 1} - x_{i} = {max}_{1 \leq j \leq k + 1} | x_{j + 1} - x_{j} |

. Kakutani was able to show that as

k \to \infty

, the points of

Q_{k}

converge to a uniform distribution on

[0, 1]

and are therefore non-Benford. This problem has been generalized to various setting; see [11,12,13,14,15,16] for examples. For other decomposition problems, see [17,18] for fragmentation in two dimensions, [19] for fragmentation in a fractal setting, and [20] for discrete fragmentation.

In this paper, we investigate a fragmentation problem inspired by the work by Becker et al. [21], who studied the stick fragmentation problem. Their model begins with a stick of length L and a density function f on

[0, 1]

from which proportions are sampled. At Stage 1, the stick is split into two substicks at proportion

p_{1}

. At Stage 2, the left substick is split into two substicks at proportion

p_{2}

, while the right substick is split into two substicks at proportion

p_{3}

. Iterating this procedure for N stages can produce up to

2^{N}

sticks.

Becker et al. analyzed three versions of this process: (i) the unrestricted case, in which a new proportion is drawn from f at each stage and all sticks split; (ii) the restricted case, in which a new proportion is drawn at each stage but only one of the two substicks splits; and (iii) the fixed-proportion case, in which all sticks split but a single proportion p is fixed in advance and applied throughout. For the restricted and unrestricted cases, they established convergence to strong Benford behavior under mild conditions on the Mellin transform of f and on the mean and variance of

{log}_{10} X

for

X \sim f

. For the fixed-proportion case, they proved that convergence occurs if and only if

{log}_{10} (p / (1 - p))

is irrational.

We extend the fixed single-proportion stick fragmentation problem to a fixed multi-proportion setting. Specifically, we begin with a stick of length L, an integer

m \geq 2

, and any proportions

p_{1}, p_{2}, \dots, p_{m} \in (0, 1)

satisfying

p_{1} + p_{2} + \dots + p_{m} = 1

, independent of L. At Stage 1, the stick is split into m substicks of lengths

p_{1} L, p_{2} L, \dots, p_{m} L

. At Stage 2, each of these substicks is split into m smaller substicks according to the same proportions. In general, at Stage N, every substick from Stage

N - 1

is split into m substicks using

p_{1}, p_{2}, \dots, p_{m}

. After N stages, the process produces

m^{N}

substicks.

Our principal question is whether the stick lengths exhibit Benford behavior. The novelty of this model lies in dividing the subdivision into more than two substicks at each stage, which renders the techniques used in Becker et al. [21] inapplicable. Our main result is a necessary and sufficient condition under which this fixed multi-proportion stick fragmentation model yields stick lengths converging to strong Benford’s behavior.

Theorem 1.

Let

m > 2

be an integer, and choose

p_{1}, p_{2}, \dots, p_{m} \in (0, 1)

such that

p_{1} + p_{2} + \dots + p_{m} = 1

. At each stage, a stick is split into m substicks according to the proportions

p_{1}, p_{2}, \dots, p_{m - 1}

. After N stages, the process produces

m^{N}

sticks, whose lengths are

A_{k_{1}, k_{2}, \dots, k_{m}} = L p_{1}^{k_{1}} p_{2}^{k_{2}} \dots p_{m}^{k_{m}}

for all non-negative integers

k_{1}, k_{2}, \dots, k_{m}

with

k_{1} + k_{2} + \dots + k_{m} = N

. For

1 \leq i \leq m - 1

, set

y_{i} = {log}_{10} (p_{i} / p_{i + 1})

. Then the stick lengths converge to strong Benford behavior if and only if

y_{i}

is irrational for some

1 \leq i \leq m - 1

.

1.1. Definitions and Theory of Benford’s Law

We begin with the definition of Benford’s law (see, for example, [8,22]).

Definition 1

(Benford’s Law for the Leading Digit). A dataset is said to satisfy Benford’s law for the leading digit if the frequency of having the leading digit d is given by

{log}_{10} ((d + 1) / d)

for all

1 \leq d \leq 9

.

There are several approaches to proving that a dataset follows Benford’s law. A standard method is to use the Uniform Characterization Theorem. For this, we first introduce the notion of the significand of a real number.

Definition 2

(The Significand). For any

x > 0

, we can uniquely write

x = S_{10} (x) \cdot 10^{k_{10} (x)},

(1)

where

S_{10} (x) \in [1, 10)

and

k_{10} (x) \in Z

. Equivalently,

k_{10} (x) = ⌊ {log}_{10} (x) ⌋

and

S_{10} (x) = x / 10^{k_{10} (x)}

. We call

S_{10} (x)

the significand of x.

The following definition generalizes Benford’s law from the leading digit to the entire significand.

Definition 3.

A sequence of random variables

{X^{(N)}}_{N = 1}^{\infty}

is said to converge to strong Benford’s law if

lim_{N \to \infty} P (S_{10} (X^{(N)}) \leq s) = {log}_{10} (s)

(2)

for all

s \in [1, 10)

. Note that the index N typically represents the size of the dataset, so convergence to strong Benford’s law should be interpreted as an asymptotic property.

Definition 4

(Uniform Distribution Modulo 1). A sequence of random variables

{X^{(N)}}_{N = 1}^{\infty}

is said to converge to being equidistributed mod 1 if

lim_{N \to \infty} P (X^{(N)} \mod 1 \leq s) = s

(3)

for all

s \in [0, 1]

.

We are now ready to state the Uniform Characterization Theorem [8].

Theorem 2

(Uniform Characterization Theorem). A sequence of random variables converges to strong Benford’s law if and only if the sequence of their base-10 logarithms converges to being equidistributed mod 1.

Thus, convergence to strong Benford’s law is equivalent to convergence to equidistribution modulo 1. Consequently, proving or disproving convergence to strong Benford’s law can be reduced to proving or disproving convergence to equidistribution modulo 1. In what follows, we illustrate the above definitions with an example of random variables that converge to strong Benford behavior, as well as an example of random variables that do not.

Example 1.

Let

X^{(N)} = e^{Y^{(N)}}

, where

Y^{(N)}

is normally distributed with mean 0 and variance N. No assumptions are made on the dependence structure among the

Y^{(N)}

. Then the sequence

{X^{(N)}}_{N = 1}^{\infty}

converges to strong Benford’s law. This follows from the fact that

{Y^{(N)}}_{N = 1}^{\infty}

is equidistributed modulo 1. For details, see [8].

Example 2.

Let

X^{(N)}

be uniformly distributed on

[0, 1]

for all

N \geq 1

, with no assumptions on dependence among the

X^{(N)}

. Then for every N,

\begin{matrix} P (S_{10} (X^{(N)}) \leq s) = s . \end{matrix}

(4)

By comparison with Definition 3, it follows that

{X^{(N)}}_{N = 1}^{\infty}

does not converge to strong Benford’s law.

1.2. Fixed-Proportion Stick Fragmentation Model and the Multinomial Distribution

Now that we have a proper foundation for Benford’s law, it is time to describe in detail the model used in this paper. The model can be viewed as an extension of a model introduced by Becker et al., called the fixed single-proportion stick fragmentation model [21].

Suppose we start with a stick of length L. We split the stick at a fixed proportion of

0 < p < 1

. After the first break, we obtain two sticks of lengths

L p

and

L (1 - p)

. We then split each of these two sticks again at the same fixed proportion p, producing four sticks of lengths

L p^{2}

,

L p (1 - p)

,

L (1 - p) p

, and

L {(1 - p)}^{2}

. Repeating this process for N stages, we obtain

2^{N}

sticks in total, with

N + 1

distinct lengths. These lengths follow a binomial distribution.

Becker et al. were interested in whether the leading digits of the significands of the stick lengths converge to strong Benford’s law. They discovered a necessary and sufficient condition for this convergence and proved it in [21].

Theorem 3

(Fixed Single-Proportion Stick Fragmentation Theorem [21]). Consider the fixed single-proportion stick fragmentation model. Choose y so that

10^{y} = (1 - p) / p

. The fragmentation model produces stick lengths that converge to strong Benford’s law if and only if y is irrational.

In the non-Benford case, Becker et al. observed cyclic behavior in the significands and used the multisection formula [23]. By contrast, establishing the Benford case was more difficult. They adopted methods from [22,24,25], applying truncation to demonstrate roughly equal probability across intervals, and ultimately proved equidistribution modulo 1. In summary, they showed that stick lengths governed by a binomial distribution converge to strong Benford’s law when the ratio equals 10 and is raised to an irrational power and fail to converge when the ratio equals 10 and is raised to a rational power.

We now extend their fixed single-proportion stick fragmentation model to what we call the fixed multi-proportion stick fragmentation model. Becker et al. considered only the case in which the stick L is cut at a single fixed proportion p at each stage. We generalize this by cutting the stick at multiple distinct fixed proportions

p_{1}, p_{2}, \dots, p_{m - 1}

at every stage. Our model is as follows:

Suppose we have a stick of length L. We split the stick simultaneously at fixed proportions

p_{1}, p_{2}, \dots, p_{m - 1} \in (0, 1)

, with

p_{1} + \dots + p_{m - 1} < 1

. Define

p_{m} : = 1 - (p_{1} + \dots + p_{m - 1})

. Thus, after Stage 1, we obtain sticks of lengths

L p_{1}, L p_{2}, \dots, L p_{m}

. At Stage 2, we cut each stick obtained from the previous stage at the same fixed proportions

p_{1}, p_{2}, \dots, p_{m - 1}

. The resulting stick lengths from Stage 2 are

L p_{1}^{2}, L p_{1} p_{2}, \dots, L p_{1} p_{m}

,

L p_{2} p_{1}, L p_{2}^{2}, \dots, L p_{2} p_{m}

, and so on. After Stage N, we are left with

m^{N}

sticks in total, with

(\binom{m + N - 1}{N})

distinct lengths (see Figure 3 for the case where

m = 3

and

N = 2

). The number of distinct lengths arises because the process can be interpreted as unordered sampling with replacement.

Moreover, the stick lengths are distributed according to a generalization of the binomial distribution known as the multinomial distribution, which is defined in terms of the multinomial coefficient, which itself is a generalization of the binomial coefficient. We recall the following definition and result (see [26]).

Definition 5

(Multinomial Coefficient). For any non-negative integer N and positive integer m, the multinomial coefficient is

(\binom{N}{k_{1}, k_{2}, \dots, k_{m}}) = \frac{N!}{k_{1}! k_{2}! \dots k_{m}!}

(5)

for

k_{1} + k_{2} + \dots + k_{m} = N

. A random vector

(X) = (X_{1}, X_{2}, \dots, X_{m})

follows a multinomial distribution with parameters N and

(p) : = (p_{1}, p_{2}, \dots, p_{m})

if

\begin{matrix} P ((X) = (k)) = (\binom{N}{k_{1}, k_{2}, \dots, k_{m}}) p_{1}^{k_{1}} p_{2}^{k_{2}} \dots p_{m}^{k_{m}} \end{matrix}

(6)

for all

(k) : = (k_{1}, k_{2}, \dots, k_{m})

with

k_{1} + k_{2} + \dots + k_{m} = N

.

Remark 1.

Formula (5) gives the number of ways to choose N objects with exactly

k_{j}

objects of type j, which is when order does not matter.

Theorem 4

(Multinomial Theorem). Let N be any non-negative integer and

p_{1}, p_{2}, \dots, p_{m}

be real numbers. Then

{(p_{1} + p_{2} + \dots + p_{m})}^{N} = \sum_{k_{1} + k_{2} + \dots + k_{m} \geq 0} (\binom{N}{k_{1}, k_{2}, \dots, k_{m}}) p_{1}^{k_{1}} p_{2}^{k_{2}} \dots p_{m}^{k_{m}},

(7)

where the

k_{j}

s are non-negative integers summing to N.

2. Fixed Multi-Proportion Stick Fragmentation Model

Recall that we are interested in studying whether or not a stick fragmentation process results in stick lengths that converge to strong Benford’s law. For Becker et al., they were able to prove that if the ratio

(1 - p) / p

is equal to 10 and to an irrational power, the stick lengths will follow strong Benford’s law, but if the ratio is equal to 10 and to a rational power, then the distribution of stick lengths will not follow strong Benford’s law.

In what follows, we generalize the results of Becker et al. to the fixed multi-proportion stick fragmentation model we introduced in Section 1.2. In particular, we present Theorem 1 again along with the proof for the necessity of the condition.

Theorem 1.

Let

m > 2

be an integer, and choose

p_{1}, p_{2}, \dots, p_{m} \in (0, 1)

such that

p_{1} + p_{2} + \dots + p_{m} = 1

. At each stage, a stick is split into m substicks according to the proportions

p_{1}, p_{2}, \dots, p_{m - 1}

. After N stages, the process produces

m^{N}

sticks, whose lengths are

A_{k_{1}, k_{2}, \dots, k_{m}} = L p_{1}^{k_{1}} p_{2}^{k_{2}} \dots p_{m}^{k_{m}}

for all non-negative integers

k_{1}, k_{2}, \dots, k_{m}

with

k_{1} + k_{2} + \dots + k_{m} = N

. For

1 \leq i \leq m - 1

, set

y_{i} = {log}_{10} (p_{i} / p_{i + 1})

. Then the stick lengths converge to strong Benford behavior if and only if

y_{i}

is irrational for some

1 \leq i \leq m - 1

.

We prove the necessity of the condition; i.e., if

y_{i}

is rational for all

1 \leq i \leq m - 1

, then the stick fragmentation model produces lengths that do not converge to strong Benford’s law. The sufficiency of the condition, i.e., if

y_{i}

is irrational for some

1 \leq i \leq m - 1

, then the model produces lengths that converge to strong Benford’s law, is more technical; see [27] for details. Before presenting the proof, we refer the reader to numerical simulations supporting the theorem. See Figure A1 for rational cases and Figure A2 for irrational cases.

Proof.

By the Uniform Characterization Theorem 2, it suffices to show that

\begin{matrix} {log}_{10} (A_{k_{1}, k_{2}, \dots, k_{m}}) = L {log}_{10} (p_{1}^{k_{1}} p_{2}^{k_{2}} \dots p_{m}^{k_{m}}) \end{matrix}

(8)

is not equidistributed mod 1. Since Benford’s law is scale-invariant [6], we may assume

L = 1

without the loss of generality. Each stick length

A_{k_{1}, k_{2}, \dots, k_{m}}

factors as

\begin{matrix} A_{k_{1}, k_{2}, \dots, k_{m}} = p_{1}^{k_{1}} p_{2}^{k_{2}} \dots p_{m}^{k_{m}} = {(\frac{p_{1}}{p_{2}})}^{k_{1}} {(\frac{p_{2}}{p_{3}})}^{k_{1} + k_{2}} \dots {(\frac{p_{m - 1}}{p_{m}})}^{\sum_{j = 1}^{m - 1} k_{j}} {(p_{m})}^{N} . \end{matrix}

(9)

Since

y_{i}

is rational for all

1 \leq i \leq m - 1

, write

y_{i} = a_{i} / b_{i}

with

a_{i} \in Z

,

b_{i} \in Z_{> 0}

, and

\gcd (a_{i}, b_{i}) = 1

. Then

\begin{matrix} A_{k_{1}, k_{2}, \dots, k_{m}} = {(10^{\frac{a_{1}}{b_{1}}})}^{k_{1}} {(10^{\frac{a_{2}}{b_{2}}})}^{k_{1} + k_{2}} \dots {(10^{\frac{a_{m - 1}}{b_{m - 1}}})}^{\sum_{j = 1}^{m - 1} k_{j}} {(p_{m})}^{N} . \end{matrix}

(10)

Taking

{log}_{10}

gives

\begin{matrix} {log}_{10} (A_{k_{1}, k_{2}, \dots, k_{m}}) = k_{1} (\frac{a_{1}}{b_{1}}) + (k_{1} + k_{2}) (\frac{a_{2}}{b_{2}}) + \dots + (\sum_{j = 1}^{m - 1} k_{j}) (\frac{a_{m - 1}}{b_{m - 1}}) + N {log}_{10} (p_{m}) . \end{matrix}

(11)

The term

k_{1} (a_{1} / b_{1})

has at most

b_{1}

distinct values under mod 1, since it is periodic with a period of at most

b_{1}

. More generally,

(\sum_{j = 1}^{i} k_{j}) (a_{i} / b_{i})

has at most

b_{i}

distinct values under mod 1. The term

N {log}_{10} (p_{m})

is constant across all

A_{k_{1}, k_{2}, \dots, k_{m}}

. Hence,

{log}_{10} (A_{k_{1}, k_{2}, \dots, k_{m}})

has at most

\prod_{i = 1}^{m - 1} b_{i}

distinct values, independent of N. This number is also finite for fixed m,

b_{1}, \dots, b_{m - 1}

.

Since a uniform distribution on

[0, 1]

is continuous and thus not discrete,

{log}_{10} (A_{k_{1}, k_{2}, \dots, k_{m}})

cannot be equidistribted mod 1. Thus, the fragmentation process produces stick lengths that do not converge in distribution to strong Benford’s law. □

3. Conclusions

In this paper, we extended the fixed single-proportion stick fragmentation model of Becker et al. [21] to the fixed multi-proportion setting and determined precisely when Benford behavior arises. We proved the necessity of the condition by showing that in the purely rational case, the logarithms of the stick lengths lie on a finite lattice and convergence fails. This provides a sharp criterion that fully characterizes Benford behavior in the multi-proportion model.

Our results show that the emergence of Benford’s law is governed by the arithmetic properties of the proportions. Numerical simulations confirm these results, illustrating clear non-Benford behavior in rational cases and rapid convergence otherwise. This work thus generalizes earlier results and clarifies the precise mechanism by which Benford behavior arises in a deterministic fragmentation process.

4. Future Work

Our proof of necessity in Theorem 1 shows that

{log}_{10} (A_{k_{1}, k_{2}, \dots, k_{m}})

takes at most

\prod_{i = 1}^{m - 1} b_{i}

distinct values and therefore is not equidistributed mod 1. However, it does not explicitly determine the exact distribution. An interesting direction for future work is to ask whether the distribution converges to a discrete uniform distribution and, if not, to characterize it under various assumptions.

Author Contributions

Conceptualization, A.I., B.F., E.L. and S.J.M.; methodology, A.I., B.F., E.L. and S.J.M.; software, A.I. and E.L.; validation, B.F., A.I. and E.L.; formal analysis, B.F., A.I. and E.L.; investigation, B.F., A.I. and E.L.; resources, S.J.M.; data curation, A.I. and E.L.; writing—original draft preparation, writing, B.F., A.I. and E.L.; writing—review and editing, B.F., A.I., E.L. and S.J.M.; visualization, A.I. and E.L.; supervision, S.J.M.; project admistration, S.J.M.; funding acquisition, S.J.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the Williams College Summer Science Program Research Fellowship, the Finnerty Fund, and NSF Grant DMS2241623.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The third author thanks her thesis advisor Molly Moran for guidance and encouragement.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. Numerical Evidence for Theorem 1

In this section, we provide numerical evidence for Theorem 1. We adopt a method used by Nigrini in [3] for testing Benford behavior, called the Mean Absolute Deviation (MAD). This statistic differs slightly from the chi-square goodness-of-fit test and is appropriate in our case because the expected frequencies under Benford’s law for the leading digit are biased towards smaller digits, and squaring the differences (as in chi-square) can distort the results. We recall the definition of MAD here.

Definition A1.

\begin{matrix} MAD = \frac{1}{9} \sum_{d = 1}^{9} | P_{o b s} (d) - P_{e} (d) |, \end{matrix}

(A1)

where

P_{e}

is the expected frequency under Benford’s law and

P_{o b s}

is the observed frequency.

Nigrini classifies conformity levels in terms of MAD as follows: high conformity

(0.000 \leq MAD < 0.006)

, acceptable conformity

(0.006 \leq MAD < 0.012)

, marginally acceptable conformity

(0.012 \leq MAD \leq 0.016)

, and no conformity

(MAD > 0.016)

.

In the remainder of this section, we present visualizations of the distribution of observed leading digits for various choices of proportions, compared against the expected frequencies from Benford’s law for the leading digit. For each case, we also compute the MAD and report the corresponding conformity level. All computations and simulations were carried out in Mathematica.

For given parameters

m \geq 2

(the number of proportions) and N (the number of stages), the computational complexity is on the order of

m^{N}

, since we must check the leading digit of each stick length. This makes calculations infeasible for large m and N. Hence, in the experiments, we take

N = 1000

when

m = 3

and

N = 100

when

m = 4

.

Appendix A.1. Evidence for Non-Benford Behavior

Figure A1. (a)

y_{1} = - 1 / 3

,

y_{2} = - 1 / 2

, and

N = 1000

. (b)

y_{1} = - 1 / 4

,

y_{2} = - 1 / 6

, and

N = 1000

. (c)

y_{1} = - 1 / 2

,

y_{2} = - 1 / 3

,

y_{3} = - 1 / 4

, and

N = 100

. (d)

y_{1} = - 1 / 4

,

y_{2} = - 1 / 2

,

y_{3} = - 1 / 6

, and

N = 100

.

Figure A1. (a)

y_{1} = - 1 / 3

,

y_{2} = - 1 / 2

, and

N = 1000

. (b)

y_{1} = - 1 / 4

,

y_{2} = - 1 / 6

, and

N = 1000

. (c)

y_{1} = - 1 / 2

,

y_{2} = - 1 / 3

,

y_{3} = - 1 / 4

, and

N = 100

. (d)

y_{1} = - 1 / 4

,

y_{2} = - 1 / 2

,

y_{3} = - 1 / 6

, and

N = 100

.

Appendix A.2. Evidence for Benford Behavior

Figure A2. (a)

y_{1} = - 1 / 2

,

y_{2} = - \sqrt{2}

, and

N = 1000

. (b)

y_{1} = - 1 / 3

,

y_{2} = - \sqrt{3}

, and

N = 1000

. (c)

y_{1} = - \sqrt{5}

,

y_{2} = - 1 / 5

,

y_{3} = - 1 / 3

, and

N = 100

. (d)

y_{1} = - \sqrt{3}

,

y_{2} = - 1 / 10

,

y_{3} = - 1 / 8

, and

N = 100

.

Figure A2. (a)

y_{1} = - 1 / 2

,

y_{2} = - \sqrt{2}

, and

N = 1000

. (b)

y_{1} = - 1 / 3

,

y_{2} = - \sqrt{3}

, and

N = 1000

. (c)

y_{1} = - \sqrt{5}

,

y_{2} = - 1 / 5

,

y_{3} = - 1 / 3

, and

N = 100

. (d)

y_{1} = - \sqrt{3}

,

y_{2} = - 1 / 10

,

y_{3} = - 1 / 8

, and

N = 100

.

References

Newcomb, S. Note on the Frequency of Use of the Different Digits in Natural Numbers. Am. J. Math. 1881, 4, 39–40. [Google Scholar] [CrossRef]
Benford, F. The Law of Anomalous Numbers. Proc. Am. Philos. Soc. 1938, 78, 551–572. [Google Scholar]
Nigrini, M. Benford’s Law: Applications for Forensic Accounting, Auditing, and Fraud Detection, 1st ed.; John Wiley & Sons: Hoboken, NJ, USA, 2012; ISBN 978-1-119-20309-4. [Google Scholar]
Pietronero, L.; Tosatti, E.; Tosatti, V.; Vespignani, A. Explaining the Uneven Numbers in Nature: The Laws of Benford and Zipf. Phys. A Stat. Mech. Appl. 2001, 293, 297–304. [Google Scholar] [CrossRef]
Berger, A.; Hill, T.P. An Introduction to Benford’s Law, 1st ed.; Princeton University Press: Princeton, NJ, USA, 2015; ISBN 978-0-691-16306-2. [Google Scholar]
Hill, T.P. The First-Digit Phenomenon. Am. Sci. 1996, 86, 358–363. [Google Scholar] [CrossRef]
Hill, T.P. A Statistical Derivation of the Significant-Digit Law. Stat. Sci. 1995, 10, 354–363. [Google Scholar] [CrossRef]
Miller, S.J. Benford’s Law: Theory and Applications, 1st ed.; Princeton University Press: Princeton, NJ, USA, 2015; ISBN 978-0-691-14761-1. [Google Scholar]
Raimi, R.A. The First Digit Problem. Am. Math. Mon. 1976, 83, 521–538. [Google Scholar] [CrossRef]
Kakutani, S. A Problem of Equidistribution on the Unit Interval [0, 1]. In Measure Theory. Lecture Notes in Mathematics; Bellow, A., Kölzow, D., Eds.; Springer: Berlin/Heidelberg, Germany, 1976; Volume 541, ISBN 978-3-540-38107-5. [Google Scholar]
Adler, R.L.; Flatto, L. Uniform Distribution of Kakutani’s Interval Splitting Procedure. Z. Wahrscheinlichkeitstheorie Verw. Geb. 1977, 38, 253–259. [Google Scholar] [CrossRef]
Carbone, I. Discrepancy of LS-Sequences of Partitions and Points. Ann. Mat. Pura Appl. 2012, 191, 819–844. [Google Scholar] [CrossRef]
Lootgieter, J.C. Sur la Répartition des Suites de Kakutani. I. Ann. L’Institut Henri Poincaré Sect. B 1977, 13, 385–410. [Google Scholar]
Pyke, R.; van Zwet, W.R. Weak Convergence Results for the Kakutani Interval Splitting Procedure. Ann. Probab. 2004, 32, 380–423. [Google Scholar] [CrossRef]
Slud, E. Entropy and Maximal Spacings for Random Partitions. Z. Wahrscheinlichkeitstheorie Verw. Geb. 1978, 41, 341–352. [Google Scholar] [CrossRef]
van Zwet, W.R. A Proof of Kakutani’s Conjecture on Random Subdivision of Longest Intervals. Ann. Probab. 1978, 6, 133–137. [Google Scholar]
Carbone, I.; Volčič, A. Kakutani’s Splitting Procedure in Higher Dimension. Rend. Istit. Mat. Univ. Trieste 2007, 39, 119–126. [Google Scholar]
Olli, J. Division Point Measures Resulting from Triangle Subdivisions. Geom. Dedicata 2012, 158, 69–86. [Google Scholar] [CrossRef]
Infusino, M.; Volčič, A. Uniform Distribution on Fractals. Unif. Distrib. Theory 2009, 4, 47–58. [Google Scholar]
Iafrate, J.; Miller, S.J.; Strauch, F. Equipartitions and a Distribution for Numbers: A Statistical Model for Benford’s Law. Phys. Rev. E 2015, 91, 062138. [Google Scholar] [CrossRef]
Becker, T.; Burt, D.; Corcoran, C.; Greaves-Tunnell, A.; Iafrate, J.R.; Jing, J.; Miller, S.J.; Porfilio, J.D.; Ronan, R.; Samranvedhya, J.; et al. Benford’s Law and Continuous Dependent Random Variables. Ann. Phys. 2018, 388, 350–381. [Google Scholar] [CrossRef]
Diaconis, P. The Distribution of Leading Digits and Uniform Distribution Mod 1. Ann. Probab. 1979, 5, 72–81. [Google Scholar] [CrossRef]
Chen, H. On the Summation of First Subseries in Closed Form. Int. J. Math. Educ. Sci. Technol. 2010, 41, 538–547. [Google Scholar] [CrossRef]
Kuipers, L.; Niederreiter, H. Uniform Distribution of Sequences, 1st ed.; John Wiley & Sons: Hoboken, NJ, USA, 1974; ISBN 978-0-471-51045-1. [Google Scholar]
Miller, S.J.; Takloo-Bighash, R. Invitation to Modern Number Theory, 1st ed.; Princeton University Press: Princeton, NJ, USA, 2006; ISBN 978-0-691-12060-7. [Google Scholar]
Kataria, K.K. A Probabilistic Proof of the Multinomial Theorem. Am. Math. Mon. 2016, 123, 94–96. [Google Scholar] [CrossRef]
Fang, B.; Miller, S.J. Benford Resulting From Stick and Box Fragmentation Processes. arXiv 2025. [Google Scholar] [CrossRef]

Figure 1. From [1]: Benford’s 20,000 observations.

Figure 2. The distribution of Benford’s law.

Figure 3. A trinomial stick fragmentation with

m = 3

and

N = 2

.

Figure 3. A trinomial stick fragmentation with

m = 3

and

N = 2

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fang, B.; Irons, A.; Lippelman, E.; Miller, S.J. Benford Behavior in Stick Fragmentation Problems. Stats 2025, 8, 91. https://doi.org/10.3390/stats8040091

AMA Style

Fang B, Irons A, Lippelman E, Miller SJ. Benford Behavior in Stick Fragmentation Problems. Stats. 2025; 8(4):91. https://doi.org/10.3390/stats8040091

Chicago/Turabian Style

Fang, Bruce, Ava Irons, Ella Lippelman, and Steven J. Miller. 2025. "Benford Behavior in Stick Fragmentation Problems" Stats 8, no. 4: 91. https://doi.org/10.3390/stats8040091

APA Style

Fang, B., Irons, A., Lippelman, E., & Miller, S. J. (2025). Benford Behavior in Stick Fragmentation Problems. Stats, 8(4), 91. https://doi.org/10.3390/stats8040091

Article Menu

Benford Behavior in Stick Fragmentation Problems

Abstract

1. Introduction

1.1. Definitions and Theory of Benford’s Law

1.2. Fixed-Proportion Stick Fragmentation Model and the Multinomial Distribution

2. Fixed Multi-Proportion Stick Fragmentation Model

3. Conclusions

4. Future Work

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Numerical Evidence for Theorem 1

Appendix A.1. Evidence for Non-Benford Behavior

Appendix A.2. Evidence for Benford Behavior

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI