Learning in Probabilistic Boolean Networks via Structural Policy Gradients

Rivera Torres, Pedro Juan

doi:10.3390/e27111150

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Learning in Probabilistic Boolean Networks via Structural Policy Gradients

by

Pedro Juan Rivera Torres

^1,2

¹

Departamento de Computación y Automatización, Universidad de Salamanca, CB3 0BN Salamanca, Spain

²

St. Edmund’s College, University of Cambridge, Cambridge CB2 1TS, UK

Entropy 2025, 27(11), 1150; https://doi.org/10.3390/e27111150 (registering DOI)

Submission received: 5 October 2025 / Revised: 2 November 2025 / Accepted: 6 November 2025 / Published: 13 November 2025

Download Versions Notes

Abstract

We revisit Probabilistic Boolean Networks as trainable function approximators. The key obstacle, non-differentiable structural choices (which predictors to read and which Boolean operators to apply), is addressed by casting the PBN’s structure as a stochastic policy whose parameters are optimized with score-function (REINFORCE) gradients. Continuous output heads (logistic/linear/softmax or policy logits) are trained with ordinary gradients. We call the resulting model a Learning PBN. We formalize the Learning Probabilistic Boolean Network, derive unbiased structural gradients with variance reduction, and prove a universal approximation property over discretized inputs. Empirically, Learning Probabilistic Boolean Networks approach ANN performance across classification (accuracy ↑), regression (RMSE ↓), representation quality via clustering (ARI ↑), and reinforcement learning (return ↑) while yielding interpretable, rule-like internal units. We analyze the effect of binning resolution, operator sets, and unit counts, and show how the learned logic stabilizes as training progresses. Our results indicate that PBNs can serve as general-purpose learners, competitive with ANNs in tabular/noisy regimes, without sacrificing interpretability.

Keywords: Probabilistic Boolean Network; statistical learning; Learning Probabilistic Boolean Networks

Share and Cite

MDPI and ACS Style

Rivera Torres, P.J. Learning in Probabilistic Boolean Networks via Structural Policy Gradients. Entropy 2025, 27, 1150. https://doi.org/10.3390/e27111150

AMA Style

Rivera Torres PJ. Learning in Probabilistic Boolean Networks via Structural Policy Gradients. Entropy. 2025; 27(11):1150. https://doi.org/10.3390/e27111150

Chicago/Turabian Style

Rivera Torres, Pedro Juan. 2025. "Learning in Probabilistic Boolean Networks via Structural Policy Gradients" Entropy 27, no. 11: 1150. https://doi.org/10.3390/e27111150

APA Style

Rivera Torres, P. J. (2025). Learning in Probabilistic Boolean Networks via Structural Policy Gradients. Entropy, 27(11), 1150. https://doi.org/10.3390/e27111150

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Learning in Probabilistic Boolean Networks via Structural Policy Gradients

Abstract

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI