Article

Entropy, Periodicity and the Probability of Primality

by
Grenville J. Croll
Alternative Natural Philosophy Association, Bury St Edmunds IP30 9QX, UK
Entropy 2025, 27(12), 1204; https://doi.org/10.3390/e27121204
Submission received: 31 October 2025 / Revised: 24 November 2025 / Accepted: 24 November 2025 / Published: 27 November 2025

Abstract

The distribution of prime numbers has long been viewed as a balance between order and randomness. In this work, we investigate the relationship between entropy, periodicity, and primality through the computational framework of the binary derivative. We prove that periodic numbers are composite in all bases except for a single trivial case and establish a set of twelve theorems governing the behavior of primes and composites in terms of binary periodicity. Building upon these results, we introduce a novel scale-invariant entropic measure of primality, denoted p(s′), which provides an exact and unconditional entropic probability of primality derived solely from the periodic structure of a binary number and its binary derivatives. We show that p(s′) is quadratic, statistically well-defined, and strongly correlated with our earlier BiEntropy measure of binary disorder. Empirical analyses across several numerical ranges demonstrate that the variance in prime density relative to quadratic expectation is small, binormal, and constrained by the central limit theorem. These findings reveal a deep connection between entropy and the randomness of the primes, offering new insights into the entropic structure of number theory, with implications for the Riemann Hypothesis, special classes of primes, and computational applications in cryptography.

1. Introduction

Entropy provides a fundamental framework for quantifying order, disorder, and randomness across natural and computational systems. Since Shannon’s original formulation [1], entropy has been widely applied to measure the uncertainty and information content of symbolic sequences, with extensions into fields as diverse as physics, cryptography, biology, and algorithmic complexity.
Recent research has increasingly explored connections between entropy and prime distribution. Kontoyiannis [2] demonstrated that elementary information-theoretic arguments can provide new proofs for classical results in number theory, including Chebyshev’s theorem on prime density, by quantifying how many bits of information about an integer we learn from each of its prime factors. Billingsley’s [3] influential 1973 work established probabilistic connections between prime numbers and Brownian motion, introducing entropy-based methods to number theory, an approach that earned the Lester R. Ford Award for mathematical exposition. Wolf [4] and Szpiro [5] investigated fractal and statistical properties of prime gaps, revealing 1/f noise patterns and hidden structure in prime distributions through entropy-related analysis. More recently, quantum information approaches have identified prime numbers through singular behaviors in linear entanglement entropy [6]. These studies collectively suggest that entropic approaches provide valuable complementary insights into the structure and apparent randomness of prime distributions alongside traditional analytic methods.
In this paper, we explore how basic entropic principles can further illuminate one of mathematics’ most elusive structures—the distribution of prime numbers.
The apparent irregularity of the primes has long invited probabilistic and statistical interpretation. Classical approaches, such as the Prime Number Theorem and the Riemann Hypothesis, express the distribution of primes through asymptotic density and analytic continuation, yet provide little insight into the local structure and apparent randomness of the prime sequence. Recent research has attempted to capture this complexity through entropy-based measures of randomness and symbolic dynamics. Our study contributes to this growing perspective by developing a binary entropic framework that connects periodicity, entropy, and primality.
In earlier work, we introduced the BiEntropy function as a measure of order and disorder in finite binary strings [7]. This measure, later extended through the TriEntropy framework [8], has been applied to diverse domains including knot theory [9], cryptography [10], cellular automata [11], computational neuroscience [12,13] and surface science [14]. Here, we build upon these foundations to develop an entropic probability of primality, p(s′), derived from the binary derivative—a discrete operator that transforms a binary sequence into its pairwise XOR-based derivatives.
The binary derivative possesses an important property: it preserves statistical independence and thus acts as an entropy-conserving transformation. We exploit this property to define a set of theorems relating periodicity and compositeness in binary numbers. Specifically, we show that numbers exhibiting periodic structure in their binary representation are necessarily composite, and that primes, in contrast, are asymptotically equidistributed across the binary derivative space. From these results, we derive p(s′), a quadratic, scale-invariant function that quantifies the entropic likelihood that a number is prime.
Empirical investigations of p(s′) over several domains (up to 2^32) reveal a close correspondence between predicted and actual prime frequencies, with correlation coefficients exceeding 0.95. The distribution of residuals in entropic prime density is binormal and consistent with the central limit theorem, suggesting that the primes exhibit bounded stochastic variation rather than true randomness. These findings provide a new perspective on the statistical and entropic foundations of primality and open pathways toward entropy-based number classification and cryptographic modeling.
The remainder of this paper is organized as follows. Section 2 defines periodicity in natural numbers and establishes a set of related compositeness theorems. Section 3 introduces the binary derivative and explores its entropic properties. Section 4 defines the entropic probability of primality, p(s′), and relates it to the Prime Number Theorem. Section 5 presents empirical analyses across multiple numerical ranges. Section 6 discusses broader implications for entropy, randomness, and the Riemann Hypothesis.

2. Periodicity and Compositeness in Binary Numbers

2.1. Definition of Periodic Numbers

Consider the numeric string s = s1s2…s2n, where s ∈ N0. Let s = ab, where a and b are similar numeric strings of equal length n ≥ 1, so that |s| = 2n. The concatenation s = ab is said to be periodic when a = b. Periodic numbers represent elementary cases of iterants [15,16], reflecting internal repetition within a numeric structure.

2.2. Periodic Numbers Are Composite

Theorem 1.
Periodic numbers are composite in all bases, except for a single trivial case.
Proof.
Let k = ab in base m, with a = b and a > 1. Then:
k = (m^n · a) + b
Since a = b, we obtain
k = (m^n + 1) · a
and because a > 1, k must be composite.
The exception occurs when a = b = 1, yielding numbers of the form 11, 0101, 001001, in any base, i.e., k = m^n + 1. Every natural number can be represented in this way. □
Hence, periodic numbers are composite in all bases, except for one specific form. In what follows, we restrict our analysis to the binary base m = 2, which is particularly well-suited for the analysis of entropy.
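Theorem 1 lends itself to direct numerical verification. The following sketch (function names are illustrative, not from the paper) checks, by trial division, that every periodic number with a > 1 is composite across several bases:

```python
def periodic_value(a: int, n: int, m: int) -> int:
    """Value of the base-m concatenation s = ab with b = a and |a| = n digits."""
    return m**n * a + a  # equals (m**n + 1) * a

def is_prime(k: int) -> bool:
    """Trial division, adequate for this small-scale check."""
    if k < 2:
        return False
    d = 2
    while d * d <= k:
        if k % d == 0:
            return False
        d += 1
    return True

# Every periodic number with a > 1 is composite, in any base (Theorem 1).
for m in (2, 3, 10):
    for n in range(1, 4):
        for a in range(2, m**n):
            assert not is_prime(periodic_value(a, n, m))
```

The trivial exception a = b = 1 is visible here too: periodic_value(1, 1, 2) = 3, which is prime.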

2.3. n-Periodic Binary Numbers Are Composite

Theorem 2.
Binary numbers of the form 00111100 or 10010110, where the second half is the one’s complement of the first half, also exhibit compositeness.
Let
b = a′ = 2^n − a − 1.
Then all numbers of the form s = ab are composite.
Proof.
k = (2^n · a) + b = (2^n · a) + (2^n − a − 1)
Simplifying gives:
k = (a + 1) · (2^n − 1)
If a ≥ 1, k is composite.
If a = 0, b is a binary string of all ones. When n is odd, such numbers may correspond to Mersenne primes; otherwise, they remain composite. □
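The factorization in the proof can be checked exhaustively for small n; a brief illustrative sketch:

```python
# Verify k = 2^n * a + (2^n - a - 1) == (a + 1) * (2^n - 1) for Theorem 2.
for n in range(1, 12):
    for a in range(0, 2**n):
        b = 2**n - a - 1                  # b is the one's complement of a
        k = 2**n * a + b                  # value of the concatenation s = ab
        assert k == (a + 1) * (2**n - 1)
```

The a = 0 case reduces to k = 2^n − 1, the Mersenne form mentioned above.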

2.4. z-Periodic Numbers Are Composite

Theorem 3.
Numbers of the form zab, where a = b and z is an even number of leading zeros, are also composite.
Proof.
Removing the leading zeros reduces the number to the periodic form described in Theorem 1. □

2.5. Periodic Numbers Are Composite by Construction

Empirical enumeration of all 8-bit binary numbers (reproduced in the Supplementary Materials) confirms that approximately 20% of them exhibit periodicity as defined above and are therefore composite by construction. We ignore the even numbers (s = abz, where z denotes trailing zeros) and note certain previously observed periodic special cases [17].

2.6. Periodicity and Shannon Entropy

The Shannon entropy of a binary sequence s = s1s2…sn, where P(si = 1) = p, is:
H(p) = −p log2 p − (1 − p) log2(1 − p)
The Shannon entropy is zero when all the bits are identical (i.e., it is periodic with a period of one) and maximal when p = 0.5, corresponding to maximum variety.
Thus, the absence of periodicity corresponds to greater entropy, linking compositeness and entropy in a direct way.

3. The Binary Derivative and Its Entropic Properties

3.1. Definition of the Binary Derivative

Given a binary string, the first binary derivative, d1(s), is defined as the XOR of the adjacent bits:
d1(s)i = si ⊕ si+1,  1 ≤ i < n
The k-th binary derivative dk(s) is here defined iteratively:
dk(s) = d1(dk−1(s)),  1 ≤ k ≤ n − 1
The sequence {d0(s), d1(s), …, dn−1(s)} forms a binary derivative hierarchy D(s), where d0(s) = s and dn−1(s) is a single bit.
The binary derivative behaves as a finite-difference operator under XOR addition, making it a natural discrete analog of continuous differentiation.
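The definitions above can be transcribed directly into code; a minimal sketch with illustrative names:

```python
def binary_derivative(bits: list) -> list:
    """First binary derivative d1(s): XOR of each pair of adjacent bits."""
    return [bits[i] ^ bits[i + 1] for i in range(len(bits) - 1)]

def derivative_hierarchy(bits: list) -> list:
    """D(s) = {d0(s), d1(s), ..., d(n-1)(s)}; d0(s) = s, the last is one bit."""
    levels = [bits]
    while len(levels[-1]) > 1:
        levels.append(binary_derivative(levels[-1]))
    return levels

# The periodic string 1010 collapses to zero-entropy derivatives:
# derivative_hierarchy([1, 0, 1, 0]) -> [[1, 0, 1, 0], [1, 1, 1], [0, 0], [0]]
```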

3.2. Periodicity and the Zero Derivative

Theorem 4.
A binary number is periodic with period 2^n if and only if one of its derivatives, dk(s), equals zero for some 0 ≤ k < 2^n.
Proof.
The proof exactly follows Nathanson’s Lemma 6 [18]. □
This connects periodicity to a total loss of informational entropy, since dk(s) = 0 equates to zero Shannon entropy within the binary derivative hierarchy.
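The forward direction of Theorem 4 can be illustrated exhaustively for 8-bit strings. The sketch below (assuming the bit-list conventions of Section 3.1) confirms that every periodic string s = aa reaches an all-zero derivative; over GF(2), d4(s)i reduces to si ⊕ si+4, which vanishes when the period is 4:

```python
def hierarchy(bits):
    """All binary derivatives of a bit list, down to a single bit."""
    levels = [bits]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append([prev[i] ^ prev[i + 1] for i in range(len(prev) - 1)])
    return levels

# Every 8-bit periodic string s = aa has an all-zero derivative (Theorem 4).
for a in range(16):
    half = [int(b) for b in format(a, "04b")]
    s = half + half
    assert any(not any(level) for level in hierarchy(s)[1:])
```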

3.3. Statistical Independence of the Binary Derivatives

Theorem 5.
If the bits of s follow a Bernoulli process with P(si = 1) = 0.5, then the bits of each derivative dk(s) are statistically uncorrelated and independent, i.e.,
E[si] = 0.5, Var(si) = 0.25, Corr(si, si+j) = 0, ∀ j > 0
Proof.
This was proven by Davies et al. in 1995 [19]. □
Thus, the binary derivative acts as an entropy-preserving transformation, maintaining the Shannon entropy of the underlying sequence under successive XOR operations.

3.4. Independence of Successive Derivatives

Theorem 6.
Because each derivative is composed of statistically independent bits, successive binary derivatives are also independent random variables, preserving the entropy structure across successive levels of differentiation.
Proof.
This was again proven by Davies et al. in 1995 [19]. □
This property implies that differentiation does not introduce correlation; rather, it disperses structure—much like a whitening transformation in information theory. Consequently, the binary derivative provides a stable computational framework for analyzing the entropy of binary representations of natural numbers.

3.5. Binary Integrands, Complements and Derivatives

Theorem 7.
Every binary number is the binary derivative of exactly two binary numbers of one bit greater length.
Theorem 8.
The two binary integrands [20] are one’s complements of each other.
Proof.
By definition, the XOR operator and its iterative extension to strings of arbitrary length yield the same derivative from a string and its one's complement. Hence, each binary derivative of length n corresponds to a binary number and its one's complement, each of length n + 1. Conversely, each pair of complementary binary integrands of length n + 1 corresponds to a single binary derivative of length n. Since there are exactly 2^n binary derivatives of length n, the mapping of the 2^(n+1) binary integrands to the 2^n binary derivatives must be exactly and uniquely 2:1; otherwise, Theorems 7 and 8 would be contradicted. □
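Theorems 7 and 8 can be verified exhaustively for short strings; a sketch, assuming bit-tuple representations and an illustrative length n = 6:

```python
from itertools import product

def derivative(bits):
    """Binary derivative as a tuple, one bit shorter than its input."""
    return tuple(bits[i] ^ bits[i + 1] for i in range(len(bits) - 1))

n = 6
preimages = {}
for s in product((0, 1), repeat=n + 1):      # all strings one bit longer
    preimages.setdefault(derivative(s), []).append(s)

for d, pair in preimages.items():
    assert len(pair) == 2                             # exactly two integrands
    a, b = pair
    assert all(x ^ y == 1 for x, y in zip(a, b))      # one's complements
assert len(preimages) == 2**n                         # every derivative occurs
```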
Corollary 1.
Every prime number > 2 expressed in binary has two binary integrands, each one bit longer, and a single binary derivative, one bit shorter.
Corollary 2.
Following also from Theorems 5 and 6, the primality of a binary number is independent of the primality of its single binary derivative and its two binary integrands.
Corollary 3.
Every prime expressed in binary has a one’s complement composite twin.

3.6. Termination of the Derivative Chain

Theorem 9.
If dk(s) = 111…1, then dk+1(s) = 0.
Theorem 10.
If dk(s) = 0, then dk+j(s) = 0 for all 0 < j ≤ n − k − 1.
Proof.
The proof of Theorems 9 and 10 follows directly from the defined properties of the XOR operation and completes the closure of the derivative sequence. □
Corollary 4.
Due to Theorems 4, 9 and 10, the final binary derivative of every periodic number is zero.

3.7. The Primes Are Equidistributed Across the Binary Derivatives

Theorem 11.
The primes are asymptotically equidistributed across all levels of the binary derivative hierarchy. As n → ∞, the set Pn = {s ∈ ℕ: s is prime, |s| = n} is uniformly distributed over all possible derivative states dk(s).
Proof.
By Corollary 2, the primality of a binary number is independent of the primality of both its single binary derivative and its two binary integrands. Crucially, once the first derivative d1(s) of a prime s is obtained, all connection with its origin is lost—d1(s) provides no information about whether s was prime or composite.
From Theorem 5, the bits of each derivative dk(s) are statistically independent with E[si] = 0.5 and Corr(si, si+j) = 0 for all j > 0. By Theorem 6, successive binary derivatives are independent random variables, with XOR-based differentiation acting as a whitening transformation that disperses structure. From Theorems 7 and 8, every binary number is the derivative of exactly two binary numbers (one’s complements), creating a precise 2:1 mapping.
If primes were not equidistributed across derivative states, some derivative pattern dk would preferentially correspond to more than one prime. However, the derivative operation destroys all information about primality while preserving statistical independence through a symmetric, entropy-preserving mapping. Therefore, as n → ∞, any deviation from uniform distribution vanishes, and primes become equidistributed across all 2^(n−k) possible states at each derivative level k. □
Corollary 5.
The primes are equidistributed across the two states {0, 1} of the final binary derivative dn−1(s).

4. The Entropic Probability of Primality

4.1. Definition of the Entropic Probability of Primality

A binary number may not be prime if any of its derivatives dk(s) equals zero (Theorems 1, 2 and 3). The entropic probability of primality therefore measures the extent to which a number avoids zero-entropy, fully periodic states. The final binary derivative is assigned the highest weight due to Theorem 4: periodicity emerges gradually through the derivatives. The earlier derivatives capture shorter periodicity (e.g., 11111111), while the later derivatives capture more complex, longer periodicity, which takes numerous derivative steps, up to and including the final derivative, to evolve to a zero-entropy state. Let
zk(s) = 0, if H(dk(s)) = 0 (i.e., dk(s) = 0); zk(s) = 1, otherwise.
Then we define
p(s′) = Σ_{k=0}^{n−1} zk(s)/2^(n−k) + 1/2^n
where the final term corrects for the special case of the Mersenne numbers (s = 11…1).
This function, in general, satisfies
0 < p(s′) ≤ 1,    μ(p(s′)) = 2/3,    σ²(p(s′)) = 1/8
p(s′) can thus be interpreted as a scale-invariant entropic likelihood that a given number is prime, derived solely from the entropic structure of its binary representation and its binary derivatives. p(s′) is a hierarchical weighted average of the Shannon zero-entropy states of the binary derivatives of s.
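The definition of p(s′) in Section 4.1 can be implemented directly; a minimal sketch, assuming the bit-string conventions of Section 3:

```python
def p_s_prime(s: int) -> float:
    """Entropic probability of primality p(s').
    z_k = 0 when the k-th derivative d_k(s) is the all-zero string, else 1."""
    bits = [int(b) for b in bin(s)[2:]]
    n = len(bits)
    level, total = bits, 0.0
    for k in range(n):                          # d0(s) ... d(n-1)(s)
        z_k = 1 if any(level) else 0
        total += z_k / 2 ** (n - k)
        level = [level[i] ^ level[i + 1] for i in range(len(level) - 1)]
    return total + 1 / 2 ** n                   # Mersenne correction term

# 11 = 1011 in binary: no derivative is zero, so p(s') = 1.0
assert p_s_prime(11) == 1.0
# 10 = 1010 is periodic (s = ab with a = b = 10): p(s') = 0.25
assert abs(p_s_prime(10) - 0.25) < 1e-12
```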

4.2. Connection to the Prime Number Theorem

Because p(s′) is independent of numerical magnitude and derived only from structural entropy, it complements the analytic form of the Prime Number Theorem (PNT). For numbers of k bits:
P(s is prime) ≈ p(s′) · Li(2^k) · 3/2^(k+1)
and, exactly:
P(s is prime) = p(s′) · π(2^k) · 3/2^(k+1)
This formulation integrates entropic structure (via p(s′)) with analytic density (via the PNT), providing an entropic, unconditional and computable probability of primality.
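As a numerical consistency check of this formulation, the per-number probabilities can be summed over all s < 2^k: since μ(p(s′)) ≈ 2/3, the total should come out close to π(2^k). A self-contained sketch for k = 8 (it recomputes p(s′) locally, and trial division stands in for a sieve):

```python
def p_s_prime(s: int) -> float:
    """Entropic probability of primality (local copy, per Section 4.1)."""
    bits = [int(b) for b in bin(s)[2:]]
    n = len(bits)
    level, total = bits, 0.0
    for k in range(n):
        total += (1 if any(level) else 0) / 2 ** (n - k)
        level = [level[i] ^ level[i + 1] for i in range(len(level) - 1)]
    return total + 1 / 2 ** n  # Mersenne correction term

k = 8
primes = [x for x in range(2, 2 ** k)
          if all(x % d for d in range(2, int(x ** 0.5) + 1))]
pi_2k = len(primes)  # pi(256) = 54

# Sum of P(s is prime) = p(s') * pi(2^k) * 3 / 2^(k+1) over all s < 2^k
expected = sum(p_s_prime(s) * pi_2k * 3 / 2 ** (k + 1) for s in range(1, 2 ** k))
assert abs(expected - pi_2k) < 1.0  # close to the 54 actual primes below 256
```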

4.3. Entropic Interpretation

The metric p(s′) quantifies the proportion of derivative states that retain non-zero entropy. Primes correspond to high-entropy regions in the derivative space D(s), while composites occupy low-entropy basins associated with periodicity. The independence of p(s′) from numerical magnitude implies that primality is an intrinsic property of entropy distribution as well as numerical size.

4.4. Bounded Stochastic Variance of the Prime Distribution in p(s′) Space

Theorem 12.
Let Pn(p) denote the set of primes of length n with entropic probability p(s′) = p. Then the variance of prime density across p(s′) partitions is bounded:
Var[|Pn(p)|/|Nn(p)|] = O(1/n)
with residuals following a binormal distribution constrained by the Central Limit Theorem.
Proof.
Since p(s′) = Σ zk(s)/2^(n−k), where zk(s) ∈ {0, 1}, and each zk(s) is an independent Bernoulli variable (Theorems 5 and 6), the Central Limit Theorem applies: p(s′) ~ N(2/3, 1/8n) as n → ∞.
From Theorem 11, primes are equidistributed across derivative space, imposing |Pn(p)| = |Nn(p)| · π(2^n)/2^n + εn(p), where εn(p) represents stochastic fluctuation with E[εn(p)] = 0.
The relative variance is Var(|Pn(p)|/|Nn(p)|) = (π(2^n)/2^n) · (1 − π(2^n)/2^n)/|Nn(p)|. Since |Nn(p)| is proportional to 2^n and π(2^n) ~ 2^n/n by the Prime Number Theorem, this yields a variance of O(1/n).
The partition p(s′) = 1.0 contains exactly half of all numbers. By Corollary 5, primes are equidistributed across the final derivative state, creating independent normal distributions in the upper and lower partitions—a binormal distribution. □
Corollary 6.
Prime distribution exhibits bounded stochastic variance rather than true randomness, with deviations that are asymptotically normal, bounded in magnitude, balanced across binary partitions, and diminishing relative to total prime count.
Corollary 7.
The bounded variance Var = O(1/n) in p(s′) space imposes a stronger constraint on prime density fluctuations than the Riemann Hypothesis bound |π(x) − Li(x)| = O(√x log x). For x = 2^n, the entropic variance bound O(1/n) = O(1/log x) indicates that actual deviations are substantially smaller than the von Koch [21] bound suggests. This reveals that equidistribution across derivative space forces prime density to track Li(x) with tighter regulation than analytic continuation alone predicts.
Interpretation: Theorems 11 and 12 establish that apparent prime “randomness” is actually regulated stochastic behavior, constrained by equidistribution across derivative space, statistical independence, and the Central Limit Theorem.

5. Empirical Results

5.1. Numerical Evaluation of p(s′)

The metric p(s′) was computed for all integers s < 65,536 using the algorithmic procedure defined in Equation (11). The calculations were implemented in Microsoft Excel. All worksheets, graphics and tables, together with further supporting materials, are available in the Supplementary Materials. For illustration, we first examined a small domain (s < 256) to permit direct visual inspection of binary patterns and periodicities. We illustrate the algorithm in Table 1.
Each number < 256 was expressed in binary form, its successive derivatives were computed by XOR differentiation, and the fraction of derivatives with non-zero entropy was accumulated according to Equation (11). The resulting p(s′) values were then compared with the actual occurrence of primes in the same intervals and with the predictions of both π(x) and Li(x).

5.2. Primes Below 256

For the 8-bit domain (s < 256), 54 numbers are prime. Figure 1 below shows the calculated p(s′) values, and Table 2 lists the corresponding observed prime counts.
A strong linear relationship was obtained between expected and actual primes, with
R² = 0.95   and   Spearman's ρ = 0.94.
Figure 1 shows the distribution of primes within the p(s′) framework for s < 256.
Figure 1 is structured such that integers with p(s′) ≥ 0.5 are coloured red and integers with p(s′) ≥ 0.25 are coloured yellow. Integers with p(s′) < 0.25 are white. The horizontal and vertical borders give the decimal (white) and binary (blue) values of the most significant (MSD) and least significant (LSD) digits. Primes (shaded purple) cluster preferentially within the region p(s′) = 1.0, corresponding to maximal entropy across all derivative levels. In contrast, numbers with p(s′) < 0.5 show a marked deficit of primes, consistent with the hypothesis that low-entropy (periodic) binary structures are more likely to be composite.

5.3. Extension to s < 65,536

The procedure was repeated for 16-bit numbers. Table 3, Table 4 and Table 5 summarize actual and expected prime counts across discrete p(s′) intervals for each power-of-two boundary (2^4 to 2^16).
These observed prime distributions display three notable features:

5.3.1. Scale Invariance

The correlation between predicted and observed prime frequencies remains stable (R² > 0.93) across all magnitudes, confirming the scale-independent nature of p(s′).

5.3.2. Balanced Stochastic Asymmetry

Deviations of actual primes from expectation are evenly but asymmetrically distributed above and below p(s′) ≥ 0.5. This indicates bounded stochastic behavior rather than truly random variation.

5.3.3. Diminishing Imbalance

A small stochastic imbalance in the number of primes between the upper and lower binary halves (partitioned by the final derivative value, p(s′) = 1.0) is clearly observed for small s but diminishes proportionally as s → ∞, as required by Theorem 11.

5.4. Large-Scale Sampling (s < 232)

Ten samples of 2000 randomized 32-bit integers were evaluated to assess the persistence of the predicted and observed trends in p(s′). The empirical distribution of p(s′) values among the sampled primes was almost identical to p(s′) and PNT theoretical expectations. Near-zero residuals between predicted and actual primes in the upper and lower halves of the p(s′) distribution were observed.
These findings demonstrate that p(s′) functions as a statistically consistent estimator of prime probability across multiple scales.

5.5. Prime Density for p(s′)

Prime density for p(s′) for s < 65,536 was again calculated [22]. A quadratic curve was fitted to the obtained prime density, and the difference between the two was computed. The distribution of the difference e was plotted in Figure 2 and shown to be small and binormal (μ(e) ≈ 0 and σ(e) ≈ 56). This was as expected, as the value of p(s′) is 1.0 for exactly half of the sample.

6. Discussion

6.1. Entropy, Periodicity and Primality

Our results reinforce the conceptual view that primes are high-entropy entities within the binary domain. Periodic or low-entropy patterns correspond to compositeness, while the random-like, high-entropy patterns are characteristic of primes. The binary derivative provides a natural operational link between these domains: its XOR-based differentiation process preserves Shannon entropy and reveals any periodic structure through the eventual emergence of zero-entropy derivatives.
From an information-theoretic perspective, composite numbers can be viewed as compressible sequences—they exhibit internal redundancy through repetition—whereas almost all primes behave as incompressible sequences exhibiting continuing maximal disorder under binary differentiation.

6.2. Entropic Structure of Number Space

We can consequently divide the number space into differing entropic sets and establish clear relationships between them. We illustrate this in Table 6 below.

6.3. Statistical Structure of Number Space

The empirical data indicate that the apparent randomness of primes is statistically constrained. Deviations from analytic expectations in p(s′) based prime density analysis are binormal, small, and bounded by the central limit theorem, implying a regulated stochastic process rather than pure randomness. The decreasing asymmetric prime imbalance across binary partitions (which has all but disappeared by 232) further supports the hypothesis that prime variance is asymptotically zero, consistent with the Riemann Hypothesis’ analytic predictions and Theorem 11.

6.4. Computational and Cryptographic Implications

Because p(s′) can be evaluated directly from a number's binary representation, it provides a computationally efficient heuristic for testing primality likelihood. While not a deterministic test, it correlates strongly with exact primality and could be combined with conventional probabilistic algorithms (e.g., Miller–Rabin [23] or AKS [24]) to optimize candidate selection in large-scale prime generation.
The entropic interpretation also links number theory with cryptographic entropy assessment, suggesting possible further applications in randomness generation and testing, key validation, and entropy-driven encryption schemes.

6.5. Comparison Between p(s′) and BiEntropy

The entropic probability metric p(s′) exhibits an almost perfect correlation with the previously defined BiEntropy function (R² = 0.9992 for s < 256), confirming that both measures capture the same underlying structural information within binary representations. While BiEntropy provides a smooth, almost continuous measure of disorder, p(s′) offers a discrete and computationally more efficient formulation. Crucially, p(s′) incorporates the final binary derivative, which BiEntropy necessarily omits, thereby revealing the small stochastic asymmetry in the distribution of primes across their terminal derivative states. This capability makes p(s′) not only a more precise estimator of primality but also a clearer and simpler analytical expression of the entropic structure of the natural numbers, rendering Theorems 5–12 easier to argue.
The high correlation between p(s′) and BiEntropy demonstrates conceptual continuity with prior work, yet p(s′) extends the framework by providing an exact analytic expression rooted in Shannon's entropy and binary periodicity. Whereas BiEntropy evaluates disorder across derivative layers, p(s′) translates that same structure into an exact probabilistic measure with closed-form parameters (μ, σ²). This unification of entropy and periodicity establishes a quantitative bridge between information theory and analytic number theory. The simpler definition of p(s′) facilitated the development of the theoretical framework.

6.6. Comparison with Miller–Rabin and AKS Primality Tests

The metric p(s′) is conceptually distinct from traditional primality tests such as Miller–Rabin [23] and AKS [24]. Whereas the Miller–Rabin test evaluates compositeness probabilistically through modular exponentiation and AKS verifies primality deterministically via polynomial congruences, p(s′) is entirely analytic and derived from the intrinsic periodic structure of binary sequences. It requires neither random witnesses nor modular arithmetic, instead estimating the likelihood of primality through the entropy of a number's binary derivatives. The computation of p(s′) is scale-invariant and of complexity O(log²(s)), making it well-suited for large-scale numerical analysis. Although not a deterministic test, p(s′) provides a structural probability of primality that complements, rather than replaces, conventional algorithms such as Miller–Rabin and AKS in computational number theory.

6.7. Twin, Fermat, and Mersenne Primes

Application of p(s′) to special prime classes demonstrates its consistency with established prime distributions. Under the equidistribution of primes across binary derivatives (Theorem 11), the expected and actual frequencies of twin primes before and after an arbitrary threshold z (which might correspond to the last twin prime) must remain in agreement, thereby supporting the twin primes conjecture within the entropic framework. Similarly, Fermat and Mersenne primes arise naturally at densities consistent with p(s′) and the Prime Number Theorem, which we demonstrated in the appendices of our prior work. The entropic formulation thus accommodates these prime families seamlessly, reinforcing the generality of p(s′) as a unifying descriptor of primality.

6.8. The Riemann Hypothesis and Skewes’ Number

The analysis of p(s′) provides a complementary entropic interpretation of the Riemann Hypothesis (RH) and related prime-distribution phenomena. The variance of the difference π(s) − Li(s) is shown to diminish asymptotically as s → ∞, implying a tighter convergence than the classical von Koch equivalence π(s) = Li(s) + O(√s log s) ⇔ RH. This behavior arises from the stochastic fluctuations in prime density across p(s′) partitions, consistent with Littlewood’s theorem [25] that the sign of π(s) − Li(s) changes infinitely often beyond Skewes’ number [26]. Skewes’ number corresponds to the first zero value of the entropic prime excess in the highest p(s′) partition. These findings position p(s′) as a scale-invariant and computationally tractable construct linking entropy, periodicity, and the statistical regularities underlying the prime number distribution.

6.9. Limitations and Future Work

This study has established an entropic framework linking periodicity and primality in the binary domain. Previous work has already extended these findings to the ternary domain, confirming that the relationship between entropy and primality generalizes across numerical bases. Future research may explore higher-order derivative systems and cross-base entropic interactions to identify deeper invariances in integer structure.
Given that p(s′) is scale-invariant with known variance, further investigation could refine bounds for the first occurrence where the entropic prime excess in the highest binary partition equals zero, thereby improving estimates and bounds [27] of Skewes’ number.
Although not a deterministic test of primality, p(s′) remains a computationally efficient and analytically grounded heuristic for cryptographic key generation, entropy estimation, and prime-density modeling.
Finally, the transparent spreadsheet-based framework in the Supplementary Materials supporting p(s′) offers a set of practical pedagogical tools for demonstrating the interplay between entropy, periodicity, and primality in both research and educational [28] contexts.

7. Conclusions

This study introduced an entropic probability of primality, p(s′), derived from the periodic and differential structure of binary numbers. Theoretical proofs and empirical analyses collectively demonstrate that p(s′) is scale-invariant, analytically well-defined, and strongly predictive of primality. Primes emerge as maximal-entropy states within the binary derivative hierarchy space D(s), while composites correspond to entropy minima associated with periodicity.
These findings strengthen the bridge between entropy, randomness, and number theory, providing both conceptual and computational mechanisms for further understanding the mathematical and statistical architecture of the primes.

Supplementary Materials

The following supporting information is available online at https://figshare.com/articles/dataset/Aperiodicity_of_the_Primes/21187339, accessed on 23 November 2025. Aperiodicity_Primality_of_Nat_Num_256.xls: Complete numerical evaluation of the binary derivatives for all integers s < 256. Includes cell-by-cell computation of p(s′), zk(s). Graphic illustration of the distribution of the primes by p(s′). Means, counts and variances of key parameters, including derivatives, primes, twin primes and associated graphics. P_s_prime_64k_calc_v1.45.xls: Complete numerical evaluation of p(s′) for all integers < 65,536. Includes cell-by-cell computation of p(s′). Graphic illustrations of the distribution of the primes by p(s′). Means, counts, variances and regressions of key parameters. Contains comparison with π(s), Li(s) and Prime Number Theorem predictions. Binormal error distribution of p(s′). P_S_Prime_32_Bit_Calculator_v1.00.xls: Full 32-bit p(s′) calculator including primality test. P_S_Prime_32_Bit_Test_V1.00.xls: Test set generator for 32-bit randomized samples. P_S_Prime_32_bit_Test_Results.xls: Test set results for 20,000 32-bit samples. Appendix_B_v1.01.xls: Illustration of the periodicity, n-periodicity and z-periodicity of the 8-bit integers. Appendix_C_v1.01.xls: Enumeration of the counts of integers < 65,536 deterministically precluded from primality due to Theorems 1, 2 and 3. Together, these materials permit full reproduction and validation of every result, figure, table and comment reported in the main text. They provide additional datasets and analyses.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are openly available in Figshare at [https://figshare.com/articles/dataset/Aperiodicity_of_the_Primes/21187339], accessed on 23 November 2025.

Acknowledgments

The author thanks the anonymous reviewers of this paper, and thanks the late John Amson, the late Mike Manthey, Kojiro Kobayashi, Louis Kauffman, Divyamaan Sahoo and James Flagg for their reviews of earlier versions. The author also thanks his wife, Deborah, for her constant support of this work over many years.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  2. Kontoyiannis, I. Counting the primes using entropy. In Proceedings of the 2008 IEEE Information Theory Workshop (ITW), Porto, Portugal, 5–9 May 2008; IEEE: Piscataway, NJ, USA, 2008; p. 268. [Google Scholar] [CrossRef]
  3. Billingsley, P. Prime numbers and Brownian motion. Am. Math. Mon. 1973, 80, 1099–1115. [Google Scholar] [CrossRef]
  4. Wolf, M. 1/f noise in the distribution of prime numbers. Phys. A Stat. Mech. Its Appl. 1997, 241, 493–499. [Google Scholar] [CrossRef]
  5. Szpiro, G.G. The gaps between the gaps: Some patterns in the prime number sequence. Phys. A Stat. Mech. Its Appl. 2004, 341, 607–617. [Google Scholar] [CrossRef]
  6. Southier, A.L.M.; Santos, L.F.; Ribeiro, P.H.S.; Ribeiro, A.D. Identifying primes from entanglement dynamics. Phys. Rev. A 2023, 108, 042404. [Google Scholar] [CrossRef]
  7. Croll, G.J. BiEntropy—The Approximate Entropy of a Finite Binary String. arXiv 2013, arXiv:1305.0954. [Google Scholar] [CrossRef]
  8. Croll, G.J. BiEntropy, TriEntropy and Primality. Entropy 2020, 22, 311. [Google Scholar] [CrossRef]
  9. Croll, G.J. BiEntropy of Knots on the Simple Cubic Lattice. In Unified Field Mechanics II: Formulations and Empirical Tests, Proceedings of the Xth Symposium Honoring Noted French Mathematical Physicist Jean-Pierre Vigier, Porto Novo, Italy, 25–28 July 2016; World Scientific: Singapore, 2018; p. 447. [Google Scholar] [CrossRef]
  10. Saini, A.; Tsokanos, A.; Kirner, R. CryptoQNRG: A new framework for evaluation of cryptographic strength in quantum and pseudorandom number generation for key-scheduling algorithms. J. Supercomput. 2023, 79, 12219–12237. [Google Scholar] [CrossRef]
  11. Goyal, R. Genealogy Interceded Phenotypic Analysis (GIPA) of ECA Rules. In Proceedings of the Second Asian Symposium on Cellular Automata Technology, West Bengal, India, 2–4 March 2023; Springer Nature: Singapore, 2023; pp. 177–191. [Google Scholar] [CrossRef]
  12. Calvet, E.; Rouat, J.; Reulet, B. Excitatory/inhibitory balance emerges as a key factor for RBN performance, overriding attractor dynamics. Front. Comput. Neurosci. 2023, 17, 1223258. [Google Scholar] [CrossRef]
  13. Calvet, E.; Reulet, B.; Rouat, J. The connectivity degree controls the difficulty in reservoir design of random boolean networks. Front. Comput. Neurosci. 2024, 18, 1348138. [Google Scholar] [CrossRef]
  14. Aguiar, C.; Camps, M.; Dattani, N.; Camps, I. Functionalized boron–nitride nanotubes: First-principles calculations. Appl. Surf. Sci. 2023, 611, 155358. [Google Scholar] [CrossRef]
  15. Kauffman, L.H. Iterant Algebra. Entropy 2017, 19, 347. [Google Scholar] [CrossRef]
  16. Kauffman, L.H. Iterants, Majorana Fermions and the Majorana-Dirac Equation. Symmetry 2021, 13, 1373. [Google Scholar] [CrossRef]
  17. Wagstaff, S.S. Prime numbers with a fixed number of one bits or zero bits in their binary representation. Exp. Math. 2001, 10, 267–273. [Google Scholar] [CrossRef]
  18. Nathanson, M.B. Derivatives of binary sequences. SIAM J. Appl. Math. 1971, 21, 407–412. [Google Scholar] [CrossRef]
  19. Davies, N.; Dawson, E.; Gustafson, H.; Pettitt, A.N. Testing for randomness in stream ciphers using the binary derivative. Stat. Comput. 1995, 5, 307–310. [Google Scholar] [CrossRef]
  20. Nathanson, M.B. Integrals of binary sequences. SIAM J. Appl. Math. 1972, 23, 84–86. [Google Scholar] [CrossRef]
  21. von Koch, H. Ueber die Riemann’sche Primzahlfunction. Math. Ann. 1901, 55, 441–464. [Google Scholar] [CrossRef]
  22. OEIS Foundation Inc. Entry A333409 in the On-Line Encyclopedia of Integer Sequences. 2023. Available online: https://oeis.org/A333409 (accessed on 23 November 2025).
  23. Rabin, M.O. Probabilistic algorithm for testing primality. J. Number Theory 1980, 12, 128–138. [Google Scholar] [CrossRef]
  24. Agrawal, M.; Kayal, N.; Saxena, N. PRIMES is in P. Ann. Math. 2004, 160, 781–793. [Google Scholar] [CrossRef]
  25. Littlewood, J.E. Sur la distribution des nombres premiers. Comptes Rendus. Acad. Sci. 1914, 158, 1869–1872. [Google Scholar]
  26. Skewes, S. On the difference π(x) − Li(x). J. Lond. Math. Soc. 1933, 8, 277–283. [Google Scholar] [CrossRef]
  27. Chao, K.F.; Plymen, R. A new bound for the smallest x with π(x) > Li(x). Int. J. Number Theory 2010, 6, 681–690. [Google Scholar] [CrossRef]
  28. Csernoch, M.; Biró, P.; Máth, J.; Abari, K. Testing algorithmic skills in traditional and non-traditional programming environments. Inform. Educ. 2015, 14, 175–197. [Google Scholar] [CrossRef]
Figure 1. p(s′) values for s < 256.
Figure 2. Error e between p(s′) prime density and a quadratic curve fit.
Table 1. Calculating p(s′).

s  | s1–s8 (bits) | 1's | n | z | k | 2^(8−k) | z/2^(8−k)
23 | 00010111     | 4   | 8 | 1 | 0 | 256     | 0.0039
   | 0011100      | 3   | 7 | 1 | 1 | 128     | 0.0078
   | 010010       | 2   | 6 | 1 | 2 | 64      | 0.0156
   | 11011        | 4   | 5 | 1 | 3 | 32      | 0.0313
   | 0110         | 2   | 4 | 1 | 4 | 16      | 0.0625
   | 101          | 2   | 3 | 1 | 5 | 8       | 0.1250
   | 11           | 2   | 2 | 1 | 6 | 4       | 0.2500
   | 0            | 0   | 1 | 0 | 7 | 2       | 0.0000
   |              |     |   |   |   | Sum     | 0.4961
   |              |     |   |   |   | Mersenne| 0.0039
   |              |     |   |   |   | p(s′)   | 0.5000

Each row below the first is the binary derivative of the row above.
Table 2. Actual and Expected primes by p(s′) for s < 256.

Fraction | p(s′)  | Expected | Actual | PNT (Li(x))
256      | 0.0078 | 0.21     | 0      | 0.24
128      | 0.0156 | 0.42     | 0      | 0.47
64       | 0.0313 | 0.84     | 0      | 0.95
32       | 0.0625 | 1.89     | 1      | 1.89
16       | 0.1250 | 3.38     | 0      | 3.78
8        | 0.2500 | 6.75     | 1      | 7.56
4        | 0.5000 | 13.50    | 14     | 15.13
2        | 1.0000 | 27.00    | 38     | 30.26
Total    |        | 54.00    | 54     | 60.51
Table 3. Actual primes by p(s′) for various s < 65,536.

p(s′)       | 16 | 32 | 64 | 128 | 256 | 512 | 1024 | 2048 | 4096 | 8192 | 16,384 | 32,768 | 65,536
≤0.001953   |    |    |    |     |     |     |      |      |      |      |        |        |
0.003906    |    |    |    |     |     | 1   | 1    | 1    | 1    | 1    | 1      | 1      | 1
0.007813    |    |    |    |     |     |     |      |      |      |      |        |        |
0.015625    |    |    |    |     |     |     | 2    | 3    | 5    | 9    | 11     | 23     | 51
0.031250    |    |    |    |     | 0   | 2   | 2    | 5    | 8    | 14   | 28     | 53     | 95
0.062500    |    | 1  | 1  | 1   | 1   | 1   | 3    | 5    | 9    | 16   | 31     | 56     | 102
0.125000    |    |    |    |     |     | 3   | 8    | 17   | 30   | 62   | 108    | 211    | 403
0.250000    | 1  | 1  | 1  | 1   | 1   | 4   | 9    | 18   | 26   | 79   | 149    | 287    | 494
0.500000    | 1  | 3  | 5  | 10  | 14  | 28  | 41   | 79   | 144  | 264  | 484    | 958    | 1712
1.000000    | 4  | 6  | 11 | 19  | 36  | 58  | 106  | 181  | 341  | 583  | 1088   | 1923   | 3684
∑ = π(s)    | 6  | 11 | 18 | 31  | 54  | 97  | 172  | 309  | 564  | 1028 | 1900   | 3512   | 6542
Note: Li(s) | 9  | 14 | 22 | 36  | 61  | 104 | 181  | 321  | 577  | 1048 | 1920   | 3544   | 6584
Table 4. Expected primes (rounded) by p(s′) for various s < 65,536.

p(s′)       | 16 | 32 | 64 | 128 | 256 | 512 | 1024 | 2048 | 4096 | 8192 | 16,384 | 32,768 | 65,536
≤0.000244   |    |    |    |     |     |     |      |      |      |      |        |        | 1
0.000488    |    |    |    |     |     |     |      |      |      |      |        | 1      | 2
0.000977    |    |    |    |     |     |     |      |      |      | 1    | 1      | 2      | 3
0.001953    |    |    |    |     |     |     |      |      | 1    | 1    | 2      | 3      | 6
0.003906    |    |    |    |     |     |     |      | 1    | 1    | 2    | 4      | 7      | 13
0.007813    |    |    |    |     |     |     | 1    | 1    | 2    | 4    | 7      | 14     | 26
0.015625    |    |    |    |     |     | 1   | 1    | 2    | 4    | 8    | 15     | 27     | 51
0.031250    |    |    |    |     | 1   | 2   | 3    | 5    | 9    | 16   | 30     | 55     | 102
0.062500    |    |    | 1  | 1   | 2   | 3   | 5    | 10   | 18   | 32   | 59     | 110    | 204
0.125000    |    | 1  | 1  | 2   | 3   | 6   | 11   | 19   | 35   | 64   | 119    | 220    | 409
0.250000    | 1  | 1  | 2  | 4   | 7   | 12  | 22   | 39   | 71   | 129  | 238    | 439    | 818
0.500000    | 2  | 3  | 5  | 8   | 14  | 24  | 43   | 77   | 141  | 257  | 475    | 878    | 1636
1.000000    | 3  | 6  | 9  | 16  | 27  | 49  | 86   | 155  | 282  | 514  | 950    | 1756   | 3271
∑ = π(s)    | 6  | 11 | 18 | 31  | 54  | 97  | 172  | 309  | 564  | 1028 | 1900   | 3512   | 6542
Note: Li(s) | 9  | 14 | 22 | 36  | 61  | 104 | 181  | 321  | 577  | 1048 | 1920   | 3544   | 6584
Table 5. Expected minus Actual primes by p(s′) for various s < 65,536.

p(s′)         | 16 | 32 | 64 | 128 | 256 | 512 | 1024 | 2048 | 4096 | 8192 | 16,384 | 32,768 | 65,536
≤0.000244     |    |    |    |     |     |     |      |      |      |      |        |        | −1
0.000488      |    |    |    |     |     |     |      |      |      |      |        | −1     | −2
0.000977      |    |    |    |     |     |     |      |      |      | −1   | −1     | −2     | −3
0.001953      |    |    |    |     |     |     |      |      | −1   | −1   | −2     | −3     | −6
0.003906      |    |    |    |     |     | 1   | 1    |      |      | −1   | −3     | −6     | −12
0.007813      |    |    |    |     |     |     | −1   | −1   | −2   | −4   | −7     | −14    | −26
0.015625      |    |    |    |     |     | −1  | 1    | 1    | 1    | 1    | −4     | −4     | 0
0.031250      |    |    |    |     | −1  |     | −1   |      | −1   | −2   | −2     | −2     | −7
0.062500      |    | 1  |    |     | −1  | −2  | −2   | −5   | −9   | −16  | −28    | −54    | −102
0.125000      |    | −1 | −1 | −2  | −3  | −3  | −3   | −2   | −5   | −2   | −11    | −9     | −6
0.250000      |    |    | −1 | −3  | −6  | −8  | −13  | −21  | −45  | −50  | −89    | −152   | −324
0.500000      | −1 |    | 1  | 2   | 1   | 4   | −2   | 2    | 3    | 7    | 9      | 80     | 77
1.000000      | 1  | 1  | 2  | 4   | 11  | 10  | 20   | 27   | 59   | 69   | 136    | 167    | 413
∑ p(s′) < 0.5 | 0  | −1 | −2 | −6  | −11 | −13 | −18  | −28  | −62  | −76  | −147   | −247   | −489
∑ p(s′) ≥ 0.5 | 1  | 1  | 3  | 6   | 12  | 13  | 18   | 28   | 62   | 76   | 147    | 247    | 490
Table 6. The Entropic Structure of Number Space.

Set                       | Entropic Interpretation       | Observation
S (Periodic Numbers)      | Zero entropy, p(s′) ≈ 0       | Composite (*)
S′ (Non-Periodic Numbers) | Non-zero entropy, p(s′) > 0   | Potentially prime
P (Primes)                | All entropies                 | Prime
C (Composites)            | All entropies                 | Not prime
P ∩ S                     | —                             | Empty set (*)
S ∪ S′                    | Union of all entropies        | Natural numbers
S ∩ C                     | Lower entropy                 | Composite
P ∩ S′                    | Higher entropy                | Prime
X                         | High entropy, p(s′) = 1       | Potentially prime
Y                         | Low entropy, p(s′) < 1        | Probably composite
|X| = |Y|                 | High and low entropy          | Equality of set size
X ∩ S′                    | High entropy                  | Potentially prime
S ∩ Y                     | Low entropy                   | Composite (*)
|P ∩ X| ≈ |P ∩ Y|         | High and low entropy          | Equality of set size

* Exc. Fermat.
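The empty intersection P ∩ S in Table 6 (excepting the Fermat case) can be checked directly for the 8-bit integers. The sketch below assumes "periodic" means that the fixed-width n-bit representation is a whole-number repetition of a proper sub-block; this is an assumed reading for illustration, not the paper's formal definition.

```python
def is_periodic(bits: str) -> bool:
    # True when the string is a repetition of a proper sub-block,
    # e.g. "00010001" == "0001" * 2.
    n = len(bits)
    return any(bits == bits[:d] * (n // d)
               for d in range(1, n) if n % d == 0)

def is_prime(s: int) -> bool:
    # Simple trial division, adequate for this small range.
    if s < 2:
        return False
    return all(s % d for d in range(2, int(s ** 0.5) + 1))

# 8-bit integers whose representation is periodic yet which are prime:
exceptions = [s for s in range(1, 256)
              if is_periodic(format(s, "08b")) and is_prime(s)]
print(exceptions)  # → [17], the Fermat prime 2^4 + 1 = 00010001
```

Under this reading, every periodic 8-bit integer is a multiple of 17, and the single prime among them is the Fermat prime 17, consistent with the "Exc. Fermat" footnote.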
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
