1. Introduction
Combinatorial optimization problems (COPs) are gaining significant interest because they are tightly coupled with real-world applications such as RSA encryption [
1], RTL design [
2], and logistics [
3]. These problems aim to find an optimal solution from a solution space, where each solution can be evaluated with a cost function. Most COPs are classified as nondeterministic polynomial time (NP) hard problems because classical computers cannot efficiently search through the exponentially expanding discrete solution space. In the absence of algorithmic breakthroughs, classical computers have improved performance by adopting the latest processors, yet they are reaching their limits as the transistor density growth has slowed below Moore’s law [
4]. Consequently, obtaining solutions for massive COPs with classical computing presents significant challenges.
In 1983, S. Kirkpatrick et al. introduced a statistical-mechanics-based approach called simulated annealing [
5]. This method employs a collection of spins and their interaction network to represent COP instances using the Ising model. Simulated annealing aims to find approximate solutions by searching for the ground states in the solution space, which is achieved by progressively lowering the effective temperature of the system and applying the Metropolis algorithm [
6]. Like the Ising model in statistical mechanics, simulated annealing is an energy-based method. The energy landscape of the target COP is configured with the Ising Hamiltonian, and spin states yielding the lowest energy become the correct solution.
Beyond its algorithmic formulation, the annealing principle has driven a range of dedicated hardware. The most prominent quantum realization is D-Wave’s commercial annealer, which encodes spins in superconducting flux qubits [
7]. Although it has proven effective on real-world optimization tasks, the platform must be held at cryogenic temperatures, and the associated cooling apparatus imposes substantial energy and footprint penalties. Quantum error and decoherence further constrain how far such systems can scale.
In contrast, CMOS-based Ising hardware offers the distinct advantage of room-temperature operability [
8,
9,
10,
11,
12,
13,
14,
15]. SRAM-based configurations have successfully implemented the simulated annealing algorithm at the transistor level [
8,
9]. These configurations inherently require SRAM read operations to access spin states. Certain designs also utilize dual external random pulses distributed across the spin array to modulate spin states, targeting escape from local minima [
9]. This approach, however, may compromise the solution quality because of correlated spin flips and mark-space ratio deviation caused by the strength difference in pMOS and nMOS of inverter buffers. Also, the operating frequency of the system is restricted due to chip-level propagation delays. Register-based configurations have mitigated the overhead of SRAM read and write operations using D-flip-flops for storing spin states [
10,
11]. While this architectural choice improves operational efficiency, the challenges related to the use of shared external random pulses still need to be addressed, affecting both solution quality and operational frequency. More recent CMOS Ising hardware has also improved system scale, coefficient flexibility, density, and connectivity through multi-chip annealing systems, flexible spin processing elements, SRAM-based in-memory Ising macros, and all-to-all coupled-oscillator Ising chips [
12,
13,
14,
15].
Most Ising solvers, however, are tuned for COPs with a single optimal answer, as in
Figure 1a. Lowering the effective temperature during annealing naturally drives the spins toward one configuration, which is why their use is largely confined to problems such as max-cut. Many important COPs admit several energy-equivalent solutions instead. Boolean satisfiability (SAT) is a representative example, and its landscape is sketched in
Figure 1b. Handling such problems calls for a scheme that can sample multiple valid configurations rather than collapse onto a single one.
Probabilistic computing answers this requirement directly. It pivots on the probabilistic bit (p-bit), first introduced by Camsari et al. [
16]. A p-bit is a bipolar primitive whose state switches at random, and
Figure 2 places it conceptually between the deterministic classical bit and the superposed qubit. What distinguishes it from a classical bit is that the input strength biases this random switching, so the system can deliberately perturb p-bit states to climb out of suboptimal configurations and keep traversing the COP energy landscape. Crucially, unlike a qubit, it sustains this stochastic behavior at ordinary room temperature, removing the cooling overhead that quantum hardware imposes.
Reported p-bit hardware has been explored through several implementation routes. The first uses nanomagnetic devices: near-zero-barrier magnetic tunnel junctions (MTJs) act as compact, energy-efficient computing units whose intrinsic thermal fluctuations serve as the randomness source [
17]. A recent on-chip stochastic MTJ/2D-MoS2 p-bit core demonstrated voltage-controllable stochasticity by interconnecting a stochastic MTJ with a 2D-MoS2 FET in a 1T-1MTJ configuration [
18]. Their attractive density, however, is offset by the immature large-scale fabrication of such unstable junctions. The second category is fully CMOS and instead synthesizes randomness with pseudorandom number generators (PRNGs) [
19,
20]. A representative design distributes a single 64-bit xorshift+ stream across every p-bit [
19]. Because all bitstreams originate from one generator, inter-p-bit correlation becomes a concern. Allocating a private 32-bit LFSR to each p-bit on an FPGA [
20] removes this shared-source coupling, but in turn raises two scaling burdens: guaranteeing distinct seeds across many generators and lengthening each LFSR as the problem grows [
21]. Beyond MTJ- and PRNG-based implementations, recent physical p-bit demonstrations have also explored fully CMOS-compatible biristor p-bits and NbOx self-oscillatory p-bits as alternative stochastic devices [
22,
23]. These developments indicate continuing progress in physical random-source p-bits, but scalable CMOS implementation still benefits from compact local randomness that avoids shared PRNG state, seed assignment, and global random-distribution overhead.
Here, we present a CMOS p-bit for probabilistic computing applications. To address the challenges of random seed uniqueness and the scaling overhead inherent in PRNG-based systems during expansion, our CMOS p-bit leverages transistor device noise to generate the required randomness for probabilistic outputs. Because each p-bit derives randomness from its own device noise, the local random-source circuit does not require shared PRNG state or seed assignment, and its per-p-bit area does not increase as the probabilistic circuit (p-circuit) is scaled to more p-bits. The total p-circuit area, however, still scales with the number of p-bits, coefficient storage, weighted-sum logic, update control, and interconnects. The time-averaged output of the proposed p-bit, expressed as a percentage (representing the occurrences of logical “1” at the output), is controlled via a 5-bit digital input. A collection of these p-bits is connected bidirectionally, forming a p-circuit capable of solving various COPs. We configured a prototype p-circuit with chips fabricated using 180 nm CMOS technology and an FPGA to solve COPs involving both single-solution and multiple-solution scenarios.
The remainder of this article proceeds as follows.
Section 2 reviews the Ising-solver background, the probabilistic computing scheme, and the COP encoding used in this study. The proposed p-bit circuit and its p-circuit integration are described in
Section 3.
Section 4 reports the measurement results, and the work is summarized in
Section 5.
2. Background
2.1. Ising Solver
The Ising model offers a mathematical description of how magnetic spins behave collectively. A representative two-dimensional lattice configuration appears in
Figure 3a, where every spin takes one of two discrete states: up (+1) or down (−1). Each pair of spins is joined by a bidirectional weight that sets their coupling strength. A positive weight favors parallel alignment so that the connected spins prefer {+1, +1} or {−1, −1}, whereas a negative weight pushes them toward the antiparallel pairs {+1, −1} and {−1, +1}. An external magnetic field additionally tilts each spin individually. Under these interactions, the total system energy is described by the Ising Hamiltonian:
In Equation (1),
H denotes the Ising Hamiltonian that represents the total system energy,
Jij is the interaction coefficient,
σi is the bipolar spin state, and
hi is the external field. Statistical mechanics treats the spin assignment that minimizes this energy as the physically meaningful one. Prior studies have shown that a wide class of COPs can be reduced to such an Ising ground-state problem in polynomial time [
11,
24]. An Ising solver therefore tackles a COP by searching its solution space for the ground states defined by the Ising Hamiltonian [
25].
Figure 3.
(a) Spin configuration of an Ising model on a two-dimensional lattice graph. (b) Visualization of an Ising model escaping local minima. The green, blue, and yellow markers denote the initial state, intermediate spin-update states, and the ground state, respectively. The arrows indicate a trajectory in which the system escapes a local minimum through an energy-increasing update and then relaxes toward the ground state.
Figure 3.
(a) Spin configuration of an Ising model on a two-dimensional lattice graph. (b) Visualization of an Ising model escaping local minima. The green, blue, and yellow markers denote the initial state, intermediate spin-update states, and the ground state, respectively. The arrows indicate a trajectory in which the system escapes a local minimum through an energy-increasing update and then relaxes toward the ground state.
Figure 3b depicts the same process from an energy standpoint. The spins, serving as bipolar variables, span the solution space, and their energy is scored through Equation (1) as the solver adjusts the states toward lower energy. Because the landscape is riddled with local minima that correspond to suboptimal answers, the solver must occasionally force spin flips that raise the energy on purpose. Such deliberate uphill moves let the system leave a local minimum and eventually relax into a configuration of lower energy.
Building on this theoretical framework, previous works [
8,
9,
10,
11] have explored various hardware implementations to optimize local minima escape strategies. In [
8], SRAM error bits achieve the required randomness for updating the system state to higher energy levels. As fabricated SRAMs exhibit varying minimum operating voltages, this method deliberately lowers the supply voltage of SRAMs that store spin states, thereby inducing random error bits. While this approach allows for altering the system’s energy level without affecting interconnect values, it should be noted that the SRAM locations susceptible to such errors are spatially deterministic. Thus, the system may explore only a part of the energy landscape.
Another method for randomly flipping spin states involves applying external random pulses [
9]. A spin flips its state only when it receives logical “1” from random pulses propagated both horizontally and vertically. In any other case, it keeps its original state. One can control the flipping tendency by modifying the mark-space ratio of these pulses. This method, however, may degrade solution quality because of a correlated flipping tendency among adjacent spins. Scalability challenges arise from the corruption of the mark-space ratio caused by accumulated strength imbalances in lengthy inverter buffers. To address these issues, Su et al. [
10,
11] applied multiple dedicated random pulses to each row and column of the spin array. Their approach adds weighted sums of random pulse values and a global weight to the weighted sums of interaction coefficients and states of adjacent spins. Although this method distributes random pulses individually to each row and column, spins within the same row or column still share the same pulse, potentially leading to correlated flipping tendencies that could limit the thorough exploration of the solution space.
2.2. Probabilistic Computing
Figure 4a abstracts the internal organization of a single p-bit. It receives the states of the p-bits connected to it together with one bias value, and these terms are scaled by their respective weights to form the drive into the probability controller. The controller then resolves the p-bit into a bipolar output in a probabilistic manner that depends on how strong that drive is.
Figure 4b shows the resulting behavior, where a zero drive yields a 50% chance of logical “1”, and pushing the drive higher or lower raises or lowers that chance accordingly. Equations (2) and (3) formalize this state-update rule [
16,
26].
Equation (2) combines two parts: a tunable sigmoidal term written as tanh(
Ii.total(
t)) and a random value drawn uniformly over the interval from −1 to +1. In a real device, however, the randomness comes from transistor thermal noise, so the underlying distribution is Gaussian rather than uniform. The state decision of a practical p-bit therefore relies on a Gaussian probability density function (PDF) split by an adjustable threshold into two regions, as
Figure 5 shows. Sliding this threshold along the horizontal axis tunes the output probability.
In Equation (3), the controller input is built from a bias term hi together with the two-body interaction terms Jij. The Ising Hamiltonian in Equation (1) is quadratic by construction, so a COP whose natural cost function contains products of three or more variables does not map onto it directly. Prior CMOS Ising solvers were implemented with pairwise couplings only, so higher-order problems were handled by first reducing them to a quadratic form, for example, through order reduction or minor embedding. Either route introduces auxiliary spins whose count grows with the number and degree of the higher-order terms.
The p-bit update itself does not share this restriction. The only quantity each p-bit requires is its total input, which corresponds to the local field of the energy function at that node. For a higher-order cost function, this local field is a higher-order polynomial of the neighboring states, so a system that can evaluate this polynomial can incorporate terms such as
Jijk and
Jijkl with no auxiliary spins. Equation (4) extends Equation (3) to support interactions up to fourth order:
A second strength follows from how the controller input shapes the flipping tendency of each p-bit, a point already raised in the introduction. This makes the p-circuit naturally suited to COPs that possess several solutions of equal energy. Reaching all of them with an annealing Ising solver usually means repeatedly re-warming the schedule or running multiple replicas within one pass. The p-circuit instead visits the different solutions within a single continuous run while the effective temperature is held fixed, so no annealing restart is needed.
Figure 6 shows the end-to-end probabilistic computing flow. The COP is first cast as a collection of bipolar variables and the interactions linking them. For a more hardware-friendly form, the bipolar variables
mi ∈ {−1, +1} are recast as binary variables
si ∈ {0, 1} so that the problem fits the higher-order unconstrained binary optimization (HUBO) model, which generalizes quadratic unconstrained binary optimization. Substituting
m = 2
s − 1 keeps Equations (2)–(4) valid in the binary form, and every equation, state, and coefficient from here on follows the HUBO convention. A p-circuit is then assembled by mapping the binary variables and coefficients onto p-bits and their connections, after which it runs and samples system states over time. A final post-processing stage converts the raw samples into candidate solutions. When the weighted sum evaluations are fast enough and no two connected p-bits update at the same time, the collected samples approach the Boltzmann distribution [
16,
26,
27]. The expected occurrence of each solution then follows from the relations below:
Here, pk is the probability that the system occupies a particular state configuration, and Ek is the energy of that configuration. Because lower-energy configurations are favored, the candidates that appear most often are examined first when extracting the correct solutions.
Figure 6.
Workflow of probabilistic computing.
Figure 6.
Workflow of probabilistic computing.
2.3. Applications and Encoding Methods
In order to solve COPs with a probabilistic computing system, target problems must first be encoded into the HUBO model for system implementation. This encoding procedure determines the required number of p-bits and interactions. Among various COP instances, we focus on two tasks to validate the probabilistic computing system: one characterized by multiple ground states and the other by an essentially single ground state.
A representative COP with multiple ground states is the SAT problem, which asks whether some assignment of truth values makes a given Boolean formula evaluate to true. SAT carries an exponentially large search space and is historically notable as the first problem proven NP-complete. In probabilistic computing, Boolean constraints can be implemented by realizing their logic as p-bit gate networks [
16,
28]. Each primitive gate, such as NOT, AND, or OR, is encoded through a set of interaction coefficients defined over binary variables [
29], and
Figure 7 lists these gates together with their graphs and weight values. Larger circuits are then formed by fusing the p-bits that stand for variables shared between gates and adding their bias contributions.
SAT is a natural fit for probabilistic computing because a Boolean formula can admit several satisfying assignments depending on how it is written. The conjunctive-normal-form formula F = (A ∨ B) ∧ (C), for instance, is satisfied by three distinct input patterns. A conventional logic circuit would have to enumerate input combinations exhaustively to find them, since its inputs are driven deterministically. The p-circuit instead treats the formula as an energy function whose multiple ground states are all valid assignments, so one continuous run can surface them together. Holding the p-bits that represent the circuit output at a chosen logical value, a procedure referred to as clamping, drives the network in the inverted direction and makes it search for the input patterns consistent with that output. Under clamping, the usual separation between inputs and outputs no longer applies, since any p-bit can act as either.
Integer factorization is a contrasting example, a COP with essentially a single ground state. RSA security rests on an asymmetry in classical computing, where multiplying two integers is easy while factoring their product becomes rapidly harder as the number grows. A probabilistic computing system addresses this by driving the network toward the minimum-energy configuration, recovering the factors without enumerating all candidate pairs.
Encoding integer factorization in the HUBO model starts with a cost function [
30]:
where
E is the cost,
X and
Y are two integer factors in binary form, and
F is the product to be factored.
E becomes zero if and only if
X and
Y are the factors whose product equals
F. For odd integer factorization, the least significant bits of
X and
Y (
x0 and
y0) can be fixed to 1 so that the candidate factors are restricted to odd values. Interaction coefficients for the p-circuit network are obtained by computing the partial derivatives of the cost function with respect to each of the binary variables that represent the factors
X and
Y [
20]. The results are a group of equations containing interaction coefficients up to four-body terms. Each equation yields a weighted sum result of the corresponding p-bit, which determines the input strength of the probability controller.
4. Results
The proposed CMOS p-bit was fabricated in a 180 nm process, and its micrograph is shown in
Figure 15a. Each prototype chip integrates one p-bit occupying an active area of 6551 μm
2, and multiple such chips are interfaced through the FPGA to form the p-circuit. Driven by a 1.8 V supply and a 150 MHz clock, the p-bit updates its state at 50 MHz, and the energy dissipation at a 50% output probability is measured as 6.95 pJ/bit.
Figure 15b shows the power breakdown of the CMOS p-bit with the same operating conditions. The random source circuit, timing block, and comparator dominate the p-bit-core power because they perform dynamic precharge, noise integration, timing generation, and state decision in every update cycle. In contrast, the CDS circuit consumes only a small fraction because it operates as a switched-capacitor sampling path with small capacitive loading and no static current path. The probability controller also consumes little power in this measurement because the 5-bit code D
i is fixed at the 50% output-probability setting. The “Others” category includes global buffers and test options.
Table 1 shows measured per-p-bit energy dissipation with various probability controller inputs. Five D
i values were selected: 0, 10, 16, 22, and 31 to show the relationship between output probability and energy being dissipated. Considering that the output probability of 50% yields the maximum toggling frequency, energy consumption decreases as the input of the probability controller deviates from the center. Although the differences are small, operation at 50% output probability is the worst case in terms of energy dissipation.
Table 2 summarizes the update rate and energy dissipation of the prototype CMOS p-bit chip across temperature and supply-voltage variation. Likewise, the output probability was set to 50% to represent the worst-case energy condition. Across all combinations, the CMOS p-bit maintained a 50 MHz update rate. When the supply voltage was decreased by 10%, the measured energy dissipation decreased by approximately 20%.
The remainder of this section reports the output characteristics of the p-bit, the measured invertible Boolean operation of a p-circuit logic gate, and the measured result of an integer factorization instance.
4.1. Output Characteristics of the CMOS P-Bit
The measured output-probability curve of the CMOS p-bit is plotted in
Figure 16a. Every point on the curve is the time average of a 1 Mb output bitstream, given in percent. The prototype p-bits reproduce the full 32-level control range and follow the sigmoidal characteristic of
Figure 4b. P-bit #3 was chosen as a representative sample for plotting the measured output-probability curves across five temperature and supply-voltage variations, as shown in
Figure 16b. The curves showed the maximum difference of 2.209 percentage points at D
i = 14. Chip-to-chip variation, on the other hand, showed the maximum difference of 14.08 percentage points at D
i = 15.
Randomness of the bitstream is evaluated with the NIST Statistical Test Suite (STS), a standard battery of quantitative randomness tests. For each variation condition, a 10 Mb measured bitstream was applied to the suite, and the outcome is summarized in
Table 3. The minimum pass rates are 8/10 for the ten-partition inputs, 9/11 for the eleven-partition inputs, 10/12 for the twelve-partition inputs, and 18/20 for the twenty-partition inputs, taken at a significance level of 0.01. The proposed CMOS p-bit passed every NIST STS test under all evaluated variation conditions.
4.2. Measurement Results of Invertible Boolean Operation
An AND-OR-invert (AOI22) gate is mapped onto CMOS p-bits to demonstrate invertible Boolean operation through output clamping. The AOI22 gate is built from eight p-bits, as drawn in
Figure 17. The output p-bit E is clamped to a target logical value, and the p-circuit is run while configuration counts are collected. Repeating this measurement for the two output-clamping conditions recovers the input combinations that drive the AOI22 output to logical “0” and logical “1”.
4.3. Measurement Results of an Integer Factorization
Figure 18 shows the integer factorization measured on the proposed CMOS p-circuit for the integer 1539. With the connections configured up to 4-body terms, 100,000 samples are collected and shown as a 3D histogram. The correct factorization, 57 × 27, accounts for 840 of the 100,000 samples. Because configurations near the correct answer also receive high counts, the p-circuit effectively narrows the search space, and the correct factors are confirmed by checking the top candidates.
4.4. Comparison with Prior Works
Table 4 compares the proposed CMOS p-bit with a D-Wave quantum annealer [
7], an MTJ p-bit [
17], and an LFSR-based FPGA p-bit [
20]. Each prior approach gives up one of three properties: the quantum annealer needs a cryogenic environment, the MTJ p-bit is not compatible with standard CMOS technology, and the FPGA p-bit relies on pseudorandom LFSR bits. The proposed work instead pairs a transistor-noise random source in a standard 180 nm CMOS process with FPGA-based weighted-sum and connection logic, yielding a room-temperature, physically random p-bit whose random source stays fully CMOS while the network-level functions remain in programmable logic.
5. Discussion
5.1. Random-Source Scaling and System-Level Overheads
The scalability benefit of the proposed p-bit is limited to the local random-source circuit. Each p-bit uses transistor device noise as its own local random source, so it does not require a shared PRNG state, per-p-bit seed assignment, or global random-pulse distribution. In this sense, the random-source circuit size remains fixed per p-bit as the number of p-bits increases. This does not mean that the complete p-circuit area is independent of problem size. A complete p-circuit still requires additional p-bit cores, coefficient storage, weighted-sum logic, update-control logic, and interconnects as the mapped problem becomes larger.
A first-order estimate can be made from the measured p-bit core area and energy. The active area of one CMOS p-bit core is 6551 μm2. Therefore, the p-bit-core area scales approximately linearly with the number of cores. This corresponds to approximately 0.0655 mm2, 0.655 mm2, and 6.55 mm2 for 10, 100, and 1000 p-bit cores, respectively. These values are core-only estimates and do not include coefficient storage, weighted-sum circuits, routing, update scheduling, or peripheral circuits.
The measured worst-case energy dissipation is 6.95 pJ/bit at a 50 MHz update rate. This corresponds to an active p-bit-core power of approximately 347.5 μW per p-bit. Thus, 10, 100, and 1000 active p-bit cores would require approximately 3.48 mW, 34.8 mW, and 348 mW, respectively, excluding the network-level overhead. The FPGA power reported in this work belongs to the flexible proof-of-concept platform and should not be interpreted as the optimized power of a custom integrated p-circuit.
At the system level, the dominant overhead can shift from the p-bit core to the network implementation. For sparse p-circuits, coefficient storage, weighted-sum logic, and routing scale mainly with the number of programmed interactions. For fully connected pairwise networks, these terms scale approximately with the square of the number of p-bits. Higher-order HUBO mappings add further coefficients and local-field terms. Therefore, the proposed local random source removes PRNG-state, seed-assignment, and random-distribution overhead from the stochastic source itself, but it does not eliminate the area, power, latency, and synchronization overheads of the full weighted-sum network.
The present prototype uses one p-bit per chip and an FPGA for weighted-sum computation, so it does not demonstrate full-system scalability. Future multi-p-bit CMOS implementations should co-design p-bit placement, coefficient storage, local-field accumulation, update scheduling, and interconnects to preserve the intended probability distribution while reducing system-level overhead.
5.2. Model-Based Factorization-Size Analysis and Target Selection
For the integer-factorization demonstration, the maximum problem size should be distinguished from the simple variable-representation range. In the present setup, the two odd factors are represented as
and
where the least significant bits of both factors are fixed to one. Therefore, the largest representable odd factors are
Xmax = 63 and
Ymax = 31, giving a representational upper product of 63 × 31 = 1953. This value, however, only indicates the largest product that can be encoded by the available factor bits; it does not guarantee that the p-circuit can reliably generate the correct factor pair as a high-probability sample. The practical limit of the present architecture is also affected by the 5-bit local-field interface between the FPGA weighted-sum logic and the CMOS p-bit probability controller.
To estimate this limit, we used a simple finite-state model of the present 9-p-bit factorization setup. For each target integer
F, all 512 states were enumerated and assigned the cost
This enumeration and the following local-field calculation were repeated separately for each target
. Equivalently, the factorization coefficients and local fields were re-derived from each target-specific cost function rather than fixed to those used for the
F = 1539 experiment. For each p-bit
i, the local weighted sum was calculated from the energy difference between the two possible values of the target p-bit, while all other p-bit states were fixed:
A positive
Ai favors
si = 1, while a negative
Ai favors
si = 0. The implemented p-circuit scales this local weighted sum by the fixed inverse-temperature parameter
I0, shifts it by the center code 16 that produces approximately 50% output probability, and quantizes it into the 5-bit probability-control code. Accordingly, the model used
where the operator sat
[0, 31] denotes unsigned saturation to the range from 0 to 31. The measured 32-level output-probability curve was then used to assign the update probability
Since the measurement records the complete 9-p-bit state after every single p-bit update, rather than after every full nine-update cycle, the model used the serially recorded stationary distribution. Let
Ti be the transition matrix for updating p-bit
i. The cycle-boundary stationary distribution
π0 is the fixed point of this full-cycle transition:
Because the measurement records the full 9-p-bit state after every single p-bit update, the model uses the average distribution over the nine update phases:
Using this model, we swept the set of nontrivial odd products representable by the same 9-p-bit allocation. This set includes all products
F = XY, where
X is an odd integer from 3 to 63 and
Y is an odd integer from 3 to 31. For each
F, the correct-factor probability
Pcorrect(
F) was obtained from the probability assigned by
πserial to the correct factor-pair state, and the correct-factor rank was determined by comparing this probability with those of the other candidate factor pairs. Because each 9-p-bit state uniquely corresponds to one factor pair (
X,
Y) under the present bit allocation, this probability is directly read from the state probability assigned by
πserial. We then defined a top-K model-based supported range as
Here,
G denotes the set of nontrivial odd products representable by the current 9-p-bit allocation. The second condition requires the correct factor pair to appear more than 100 times in 100,000 serially recorded samples, so that it is not merely a vanishing-probability state. For the measurement condition used in
Figure 18,
I0 = 6.0 × 10
−6, the model estimated approximately 918 occurrences for the demonstrated target 1539 = 57 × 27, which is close to the measured count of 840. Under the same condition, the top-1 criterion gave
Fmax(1) = 1653, while the top-5 candidate-verification criterion gave
Fmax(5) = 1769. These numbers are not absolute mathematical limits of p-bit factorization. They are model-based estimates for the present 9-p-bit allocation, measured 32-level output-probability curve, 5-bit local-field quantization, fixed
I0, and serial-update sampling protocol. The analysis indicates that the present demonstration is constrained not only by the number of p-bits but also by local-field quantization and candidate ranking. Thus, the p-circuit should be interpreted as a stochastic candidate generator whose top-ranked candidates can be verified classically, rather than as a deterministic large-scale factorization solver.
Within this 9-p-bit factorization setup, 1539 = 57 × 27 was selected as a high-range but non-boundary validation target. The available bit allocation can represent odd factors up to 63 for X and 31 for Y, giving the endpoint product 1953 = 63 × 31. This endpoint case was not used because both factors correspond to all-one boundary patterns, which are less representative of a general factorization target within the available binary range. Instead, the selected factors 57 (111001(2)) and 27 (11011(2)) both activate their most significant available bits while avoiding the all-one pattern. The product 1539 is approximately 79% of the representational upper product 1953, and the finite-state model also places this target within the top-5 candidate criterion. Therefore, it is not a small low-bit example. It was not selected by screening for an unusually high sampling probability. The measured distribution remains broad, and nearby candidate factor pairs receive comparable counts. Thus, the 1539 result is best interpreted as a high-range proof-of-concept case showing that the p-circuit can generate the correct factor pair among top-ranked stochastic candidates, rather than as a sharply convergent or practically large-scale factorization benchmark.