This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Learning in Probabilistic Boolean Networks via Structural Policy Gradients
by
Pedro Juan Rivera Torres
Pedro Juan Rivera Torres 1,2
1
Departamento de Computación y Automatización, Universidad de Salamanca, CB3 0BN Salamanca, Spain
2
St. Edmund’s College, University of Cambridge, Cambridge CB2 1TS, UK
Entropy 2025, 27(11), 1150; https://doi.org/10.3390/e27111150 (registering DOI)
Submission received: 5 October 2025
/
Revised: 2 November 2025
/
Accepted: 6 November 2025
/
Published: 13 November 2025
Abstract
We revisit Probabilistic Boolean Networks as trainable function approximators. The key obstacle, non-differentiable structural choices (which predictors to read and which Boolean operators to apply), is addressed by casting the PBN’s structure as a stochastic policy whose parameters are optimized with score-function (REINFORCE) gradients. Continuous output heads (logistic/linear/softmax or policy logits) are trained with ordinary gradients. We call the resulting model a Learning PBN. We formalize the Learning Probabilistic Boolean Network, derive unbiased structural gradients with variance reduction, and prove a universal approximation property over discretized inputs. Empirically, Learning Probabilistic Boolean Networks approach ANN performance across classification (accuracy ↑), regression (RMSE ↓), representation quality via clustering (ARI ↑), and reinforcement learning (return ↑) while yielding interpretable, rule-like internal units. We analyze the effect of binning resolution, operator sets, and unit counts, and show how the learned logic stabilizes as training progresses. Our results indicate that PBNs can serve as general-purpose learners, competitive with ANNs in tabular/noisy regimes, without sacrificing interpretability.
Share and Cite
MDPI and ACS Style
Rivera Torres, P.J.
Learning in Probabilistic Boolean Networks via Structural Policy Gradients. Entropy 2025, 27, 1150.
https://doi.org/10.3390/e27111150
AMA Style
Rivera Torres PJ.
Learning in Probabilistic Boolean Networks via Structural Policy Gradients. Entropy. 2025; 27(11):1150.
https://doi.org/10.3390/e27111150
Chicago/Turabian Style
Rivera Torres, Pedro Juan.
2025. "Learning in Probabilistic Boolean Networks via Structural Policy Gradients" Entropy 27, no. 11: 1150.
https://doi.org/10.3390/e27111150
APA Style
Rivera Torres, P. J.
(2025). Learning in Probabilistic Boolean Networks via Structural Policy Gradients. Entropy, 27(11), 1150.
https://doi.org/10.3390/e27111150
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.