Article

TAN-FGBMLE: Tree-Augmented Naive Bayes Structure Learning Based on Fast Generative Bootstrap Maximum Likelihood Estimation for Continuous-Variable Classification

1 School of Computer Science, Hubei University of Technology, Wuhan 430068, China
2 Hubei Provincial Key Laboratory of Green Intelligent Computing Power Network, Hubei University of Technology, Wuhan 430068, China
* Author to whom correspondence should be addressed.
Entropy 2025, 27(12), 1216; https://doi.org/10.3390/e27121216
Submission received: 31 August 2025 / Revised: 25 November 2025 / Accepted: 26 November 2025 / Published: 28 November 2025

Abstract

Tree-Augmented Naive Bayes (TAN) is an interpretable graphical classification model. However, its structure learning for continuous attributes depends on class-conditional mutual information, which is sensitive to the quality of one- and two-dimensional density estimation. Accurate estimation is challenging under complex distributions such as multimodal, long-tailed, and heteroscedastic cases. To address this issue, we propose a structure learning method for TAN based on Fast Generative Bootstrap Maximum Likelihood Estimation (TAN-FGBMLE). FGBMLE consists of two stages. In the first stage, resampling weights and random noise are fed into a neural network generator to rapidly produce candidate parameters, efficiently covering the latent density space without repeated independent optimization. In the second stage, optimal mixture weights are estimated by maximum likelihood, assigning an appropriate contribution to each candidate component. This design enables fast and accurate estimation of complex single-attribute and joint densities, providing a reliable basis for computing class-conditional mutual information. The TAN structure is then constructed using Prim's maximum spanning tree algorithm. Experiments show that our estimator attains higher fitting accuracy and lower runtime than traditional nonparametric estimators. On open-source datasets, TAN-FGBMLE achieves superior accuracy and recall compared with classic methods, demonstrating good robustness and interpretability. On publicly available real air quality data, it achieves high classification accuracy and produces graph structures that more faithfully capture dependencies among continuous attributes.

1. Introduction

Bayesian approaches are crucial in statistics and machine learning, offering a cohesive theoretical structure for modeling and inferring uncertainty [1]. They have been widely applied in fields such as engineering monitoring [2], control theory [3], and medical diagnosis [4]. Their core concept is to update beliefs about underlying parameters or structures by combining prior knowledge with data evidence, thereby enabling systematic reasoning from data. The emergence of probabilistic graphical models has provided a powerful tool for discovering causal or dependent relationships in complex systems. By representing conditional independence among variables with graph structures, probabilistic graphical models allow complex high-dimensional distributions to be modeled and inferred efficiently.
The Naive Bayes (NB) model has achieved great success in practice due to its simplicity and efficiency [5]. However, NB relies on a strict conditional independence assumption, namely that features are independent of each other given the class label. While this assumption greatly simplifies computation, it rarely holds in real-world datasets, where dependencies between features are common. Ignoring these dependencies can lead to classifier bias, insufficient expressiveness, and poor predictive performance [6]. To address this limitation, researchers proposed the TAN model [7]. While preserving NB's simplicity, TAN encodes attribute interdependencies in a tree structure, thereby balancing computational feasibility with model accuracy. As a theoretical extension of NB, TAN exhibits notable benefits in areas such as text classification, genomics, and medical diagnosis.
The core of TAN lies in learning dependency structures, and the key step is estimating class-conditional mutual information (CMI) [8]. Mutual information quantifies the strength of the statistical dependencies between feature pairs and directly determines the selection of edges in the TAN structure. Large biases in CMI estimation can lead to incorrect representation of dependencies, compromising classification accuracy and interpretability. Therefore, achieving stable and accurate CMI estimation is a primary challenge, especially when attributes are continuous [9].
The calculation of mutual information for continuous variables relies on one- and two-dimensional class-conditional density estimation. However, real-world data distributions are often far from simple unimodal, symmetric cases; they may exhibit multimodal structure, long tails, heteroskedasticity, and severe skewness. These complexities present substantial obstacles to traditional density estimation methods [10]. Kernel density estimation (KDE) is nonparametric and consistent in low dimensions [11], but suffers from the curse of dimensionality and local overfitting when the data distribution is complex. Finite Gaussian Mixture Models (GMMs) can approximate complex distributions to some extent, but they are sensitive to the choice of the number of mixture components and adapt poorly to non-Gaussian distributions [12].
To overcome these shortcomings, hybrid approaches combining bootstrapping with generative modeling have gained increasing attention. Bootstrapping, a classic resampling technique, effectively captures sample uncertainty and improves estimation robustness [13]. Generative models excel at capturing the complex structure of the underlying distribution. Combining these two approaches could potentially achieve a balance between computational efficiency and estimation accuracy, overcoming the limitations of traditional methods. However, such approaches come with very high computational cost. Every resampling step requires a complete optimization of the likelihood function. In high-dimensional or large-sample problems, this repeated optimization becomes prohibitively expensive, making traditional bootstrap methods inefficient for practical applications.
To alleviate this computational burden, we propose a computational strategy based on the neural network generative process, called the Fast Generative Bootstrap Maximum Likelihood Estimator (FGBMLE). This method significantly reduces the computational cost and theoretically retains the expressive power and flexibility of bootstrap resampling. Our main contributions are as follows:
  • This method constructs an optimization framework based on a neural network generator, thereby avoiding the high computational cost of repeatedly optimizing weight combinations in the traditional bootstrap. Instead of relying solely on resampling weights, the generator also incorporates an additional source of randomness. The combination of resampling information and stochastic perturbations allows the model to capture the essential representation of the optimization problem. By leveraging the expressive capacity of neural networks, this strategy enables the generator to flexibly adapt to complex distributional characteristics, substantially improving both the efficiency and accuracy of density estimation.
  • Unlike traditional bootstrap methods, which require a complete optimization of the likelihood function at each resampling step, our approach condenses this repetitive and costly procedure into a single, efficient computation. This paper proposes a novel two-stage algorithm for the FGBMLE estimation process. In the first stage, the neural generator rapidly produces a set of candidate parameters that cover the potential distribution space by leveraging resampling information and additional randomness. In the second stage, maximum likelihood estimation is performed on this limited set to obtain the optimal configuration of mixture weights.
  • The proposed FGBMLE is applied to TAN structure learning. Compared with traditional KDE and GMMs, FGBMLE has greater adaptability and stability in univariate and bivariate density estimation, enabling more reliable calculation of class-conditional mutual information, and thus the optimized TAN obtains more reasonable dependency structures. It significantly improves the stability of mutual information estimation and the reliability of structure learning.
In this work, the FGBMLE is proposed as a general two-stage framework for flexible density estimation and latent mixture modeling. The TAN structure learning is then chosen as a representative and practically important application of this framework, because it requires accurate class-conditional marginals and bivariate densities as core building blocks. Throughout the paper, we first develop and validate FGBMLE as a general estimator on synthetic density estimation tasks, and then instantiate it within TAN to obtain the proposed TAN-FGBMLE classifier. In this sense, TAN is not the sole goal of the method, but rather a concrete probabilistic graphical model that allows us to demonstrate how FGBMLE can be embedded into structure learning and classification.
The paper is organized as follows. Section 2 reviews the necessary preliminaries and related work. Section 3 describes the complete structure learning process of the TAN-FGBMLE method. Section 4 presents a range of experiments evaluating our method. Finally, Section 5 concludes the paper by summarizing our work and outlining potential future directions.

2. Preliminaries

2.1. Latent Mixture Models and Semiparametric Estimation

Mixture models have long been a fundamental tool in statistics and machine learning for capturing heterogeneous or multimodal distributions. They provide a flexible framework for representing samples generated by latent variables and have been extensively studied in both parametric and nonparametric settings.
To illustrate this framework, we use a latent mixture model to describe the distribution of the observed data. Suppose the observations $Y = (y_1, y_2, \ldots, y_n) \in \mathbb{R}^n$ come from a latent mixture model: the generation of each $y_i$ is governed by an unobserved latent variable $\theta_i$, and these latent variables follow an unknown mixing density $\pi(\theta)$.
The latent variable $\theta_i$ is independently sampled from the mixing density $\pi(\theta)$. Then, given $\theta_i$, the observation $y_i$ is generated according to the known observation model $f(y_i \mid \theta_i)$. The generative process can be expressed as

$$\theta_i \sim \pi(\theta), \qquad y_i \mid \theta_i \sim f(y_i \mid \theta_i), \qquad i \in \{1, 2, \ldots, n\}. \tag{1}$$

Here, the observation model $f(y_i \mid \theta_i)$ is a known conditional probability distribution [14], such as a normal, Poisson, or Gamma distribution, and $\pi(\theta)$ is the target function we want to estimate. Integrating out the latent variable $\theta_i$ yields the marginal distribution of $y_i$:

$$y_i \sim \int f(y_i \mid \theta_i)\, d\pi(\theta_i), \qquad i \in \{1, 2, \ldots, n\}. \tag{2}$$

This formula describes a typical mixture model structure. Each observation $y_i$ is generated not by a fixed parameter, but by a family of candidate models $f(y_i \mid \theta_i)$ mixed according to $\pi(\theta)$. Thus $\pi(\theta)$ describes the diversity of the latent structure; it is the core object of our subsequent statistical inference and modeling.
To estimate the mixture density π ( θ ) , we use semiparametric maximum likelihood estimation. Unlike classical mixture models, which assume a parameterized form with a finite number of components, we do not make a priori assumptions about the specific structure of π . Instead, we directly search for the optimal solution among the set of all probability density functions. We estimate π by maximizing the log-marginal likelihood function of the observed samples.
$$\hat{\pi} = \arg\max_{\pi \in \Pi} \log p(y_1, y_2, \ldots, y_n; \pi) = \arg\max_{\pi \in \Pi} \sum_{i=1}^{n} \log \int_{\Theta} f(y_i \mid \theta_i)\, d\pi(\theta_i). \tag{3}$$
The symbol $\Pi$ denotes the set of all probability measures defined on the parameter space $\Theta$. This optimization does not rely on any specific parameterization and therefore provides a high degree of model flexibility. Under reasonable constraints, the solution $\hat{\pi}$ of (3) exists and is unique, whether it is discrete or continuous. Moreover, under finite sample conditions, maximum likelihood estimation tends to construct a solution with finite support that maximizes the log-likelihood. Although the true mixing density $\pi$ may be continuous, the solution $\hat{\pi}$ is almost always a discrete distribution supported on at most n points, which adapts well to the log-likelihood optimization problem for finite samples. Many numerical optimization algorithms can be used to solve this maximization problem, such as the EM algorithm [15,16] and variational inference [17,18].
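To make this concrete, the following minimal sketch computes the grid-constrained NPMLE of Equation (3) with EM fixed-point updates, assuming a Gaussian observation model; the grid, bandwidth, and data below are illustrative assumptions rather than the paper's setup:

```python
import numpy as np
from scipy.stats import norm

def npmle_em(y, grid, sigma=1.0, n_iter=200):
    # Discrete NPMLE of pi(theta) on a fixed support grid, assuming
    # f(y | theta) = N(theta, sigma^2). Illustrative sketch only.
    L = norm.pdf(y[:, None], loc=grid[None, :], scale=sigma)  # L[i,k] = f(y_i | theta_k)
    pi = np.full(len(grid), 1.0 / len(grid))    # uniform initialization
    for _ in range(n_iter):
        mix = L @ pi                            # marginal density at each y_i
        resp = (L * pi) / mix[:, None]          # posterior responsibilities
        pi = resp.mean(axis=0)                  # EM update of the mixture weights
    return pi                                   # typically sparse: at most n atoms

y = np.concatenate([np.random.normal(-3, 1, 500), np.random.normal(3, 1, 500)])
pi_hat = npmle_em(y, np.linspace(-8, 8, 200))
```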
When the underlying distribution is reasonably believed to be continuous, the discrete estimator is still consistent and enjoys good theoretical optimality, but its discrete nature can substantially degrade practical estimation quality. Because the discrete solution places all probability mass on a finite number of support points, it cannot capture the smooth transitions and local detail inherent in a continuous distribution.
Researchers have proposed related smoothing variants [19], which attempt to bring the estimate closer to the true continuous distribution by introducing additional smoothing mechanisms. Techniques such as bandwidth adjustment [20], roughness penalties [21], and spline fitting [22] are widely used. However, these smoothing techniques usually require careful parameter tuning, and the quality of parameter selection directly affects the accuracy and stability of the estimate. A bandwidth that is too small cannot remove the spiky, discrete characteristics, while a bandwidth that is too large causes oversmoothing. In addition, the parameter dependence and computational complexity of different methods pose challenges on large-scale datasets.
In statistical inference and model estimation, bootstrapping has proven to be an effective tool for simulating continuous densities from prior and posterior distributions [23]. The bootstrap repeatedly resamples the data to generate multiple subsets and performs statistical inference on each subset, introducing perturbations to the observed data. Repeated estimates of $\pi$ based on these datasets eventually converge to a more stable and smoother approximation of $\pi$. A natural alternative is to replace resampling with random weights $w = (w_1, w_2, \ldots, w_n) \in \mathbb{R}^n$, forming a weighted bootstrap in which the importance of each sample point $y_i$ is determined by the weight $w_i$. The corresponding bootstrap version of the log-likelihood maximization is
$$\hat{\pi}_w = \arg\max_{\pi \in \Pi} \sum_{i=1}^{n} w_i \cdot \log \int_{\Theta} f(y_i \mid \theta_i)\, d\pi(\theta_i). \tag{4}$$
By repeatedly sampling different weight vectors $w^{(1)}, \ldots, w^{(B)}$ and estimating, we obtain mixture density estimators $\hat{\pi}_{w^{(1)}}, \ldots, \hat{\pi}_{w^{(B)}}$, which can be smoothly integrated into an approximation of a continuous mixing density:
$$\bar{\pi}(\theta) = \frac{1}{B} \sum_{b=1}^{B} \hat{\pi}_{w^{(b)}}(\theta). \tag{5}$$
With bootstrap weights introduced, the choice of weight distribution has a significant impact on estimation performance and computational characteristics. There are two classic choices. One is the multinomial distribution, $w \sim \mathrm{Multinomial}\big(n, (\tfrac{1}{n}, \ldots, \tfrac{1}{n})\big)$, which corresponds to the traditional nonparametric bootstrap. The other is the Dirichlet distribution, $w \sim n \times \mathrm{Dirichlet}(I_n)$, which corresponds to the weighted likelihood bootstrap.
Under the multinomial scheme, the weight $w_i$ is a non-negative integer counting how many times the i-th observation is drawn in resampling with replacement, and the total weight is exactly n. This approach is intuitive and fully consistent with the classic bootstrap. However, because the weights are discrete integers, some observations in a given sample easily receive zero weight, and the discrete weights make the subsequent objective function less smooth, affecting the efficiency and stability of gradient-based optimization.
Under the Dirichlet scheme, each $w_i$ is a continuous positive real number with $\sum_{i=1}^{n} w_i = n$. Every sample receives a distinct weight, which improves the diversity and consistency of bootstrap samples, and the continuous weights make optimization of the weighted likelihood smoother, facilitating stable training of deep learning components such as neural network generators. We therefore generate weights from the Dirichlet distribution. This choice not only enhances training stability but also yields more efficient and scalable mixture estimation in practice; a minimal sketch contrasting the two weighting schemes is given below.
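The contrast between the two weighting schemes can be seen in a few lines of NumPy (an illustrative sketch; variable names are ours):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 1000

# Multinomial weights (classical nonparametric bootstrap): non-negative
# integers summing to n; a large fraction of entries is exactly zero.
w_multi = rng.multinomial(n, np.full(n, 1.0 / n))

# Dirichlet weights (weighted likelihood bootstrap): continuous positive
# reals rescaled so that sum(w) = n, as used throughout this paper.
w_dir = n * rng.dirichlet(np.ones(n))

print((w_multi == 0).mean())      # roughly exp(-1) ~ 0.37 zero weights
print(w_dir.min(), w_dir.sum())   # strictly positive, total exactly n
```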
A cost issue always exists in the computation [24]. The optimization process often requires high-dimensional integrals that cannot be solved analytically and must be approximated numerically. If the data must be resampled many times and the sample size n is large, the computational and memory costs grow rapidly, making the bootstrap difficult to apply to large-scale datasets. Moreover, bootstrapping introduces weight randomness to approximate a smooth distribution, so the results may still not be smooth enough [25]. By the Kiefer–Wolfowitz theorem [26], each bootstrap estimate is a distribution with at most n support points, so even after multiple resampling rounds the final estimate remains a combination of discrete distributions [27]. Although this combination can approximate the true distribution in some cases, it usually cannot provide sufficient smoothness. For high-dimensional data or complex distributions, the discrete estimate often exhibits local 'spike' artifacts and lacks the smooth transitions of the true distribution, causing the model estimate to deviate from the continuity of the truth.
Latent mixture models and their semiparametric maximum likelihood estimators provide a flexible foundation for density estimation. Bootstrap-based variants further introduce smoothing effects and enhance robustness, yet they inevitably suffer from high computational costs and a lack of smoothness due to their discrete nature. These limitations highlight the need for an efficient and expressive approach. In this work, we address these challenges by introducing a fast generative bootstrap framework that retains the flexibility of MLE while overcoming the computational and smoothness issues.

2.2. Bayesian Networks and TAN Model

Bayesian networks (BNs) are probabilistic graphical models that combine probability theory with graph theory [28]. Their structure is represented by a directed acyclic graph (DAG), where nodes denote random variables and edges capture conditional dependencies. The key idea of BNs is to decompose the joint probability distribution according to the graphical structure, enabling efficient modeling and inference in high-dimensional scenarios. If the network consists of d nodes $X_1, \ldots, X_d$, the joint distribution factorizes as
$$p(y_1, \ldots, y_d) = \prod_{i=1}^{d} p\big(y_i \mid \mathrm{Pa}(X_i)\big), \tag{6}$$
where $\mathrm{Pa}(X_i)$ denotes the parent set of node $X_i$. This factorization relies on conditional independence assumptions, which allow a complex global distribution to be expressed through local conditional distributions. Learning a Bayesian network typically involves two tasks: structure learning and parameter learning. Structure learning aims to determine the network structure that reflects the dependency relations among variables, while parameter learning estimates the conditional probability distributions $p(y_i \mid \mathrm{Pa}(X_i))$ given the structure [29]. The search space grows super-exponentially with the number of variables. To tackle this, researchers have proposed a range of methods, including score-based search, constraint-based independence tests, and hybrid approaches [30]. These methods balance expressive power and computational feasibility, making BNs widely applicable in practice.
NB assumes conditional independence among features, which ensures high efficiency but may degrade performance because the assumption is unrealistic. To overcome this, the TAN model was proposed. It relaxes the independence assumption by allowing each feature to depend not only on the class label c but also on at most one other feature, forming a tree-structured dependency graph. The class-conditional factorization is
$$p(y_1, \ldots, y_d \mid c) = \prod_{i=1}^{d} p\big(y_i \mid \mathrm{Pa}(X_i), c\big). \tag{7}$$
The critical challenge in TAN learning lies in selecting an appropriate tree structure [31]. A common strategy is to use CMI to measure the dependency between features.
$$I(X_i; X_j \mid C) = \sum_{c} p(c) \iint p(y_i, y_j \mid c) \log \frac{p(y_i, y_j \mid c)}{p(y_i \mid c)\, p(y_j \mid c)}\, dy_i\, dy_j. \tag{8}$$
The above equation defines the CMI. The larger the CMI, the stronger the conditional dependency between two features, and thus the higher the priority for establishing an edge in the TAN tree. A maximum spanning tree (MST) algorithm is applied to construct the dependency graph, which is then combined with the class node to form the complete TAN model [32]. This procedure enables TAN to retain the computational simplicity of NB while capturing essential feature dependencies, improving predictive performance and interpretability. Nevertheless, the accuracy of TAN heavily relies on the reliability of CMI estimation, especially for continuous variables, where poor density estimation can degrade the learned structure. To address this issue, the proposed FGBMLE framework provides high-quality density estimates, ensuring robust and accurate TAN structure learning.

3. Structural Learning Process Based on the TAN-FGBMLE

3.1. FGBMLE Two-Stage Algorithm

Generative learning has made significant progress with models such as variational autoencoders (VAEs) [33] and generative adversarial networks (GANs) [34], which have demonstrated powerful capabilities in unsupervised learning, image generation, and high-dimensional data modeling. As shown in Figure 1, VAEs learn the data generation mechanism in a latent variable space z by jointly training an encoder $q(z \mid x)$ and a decoder $p(x \mid z)$. The optimization goal is to maximize the log-marginal likelihood $\log p(x)$ of the observed data by optimizing the evidence lower bound:
$$\log p(x) \ge \mathbb{E}_{q(z \mid x)}\big[\log p(x \mid z)\big] - \mathrm{KL}\big(q(z \mid x)\, \|\, p(z)\big). \tag{9}$$
Here, $p(z)$ is a preset prior distribution of the latent variable, typically a standard normal. The encoder learns to encode the input sample x into a latent representation z; the decoder reconstructs a sample $\hat{x}$ from z, and the reconstruction error is jointly optimized with the Kullback–Leibler (KL) divergence to capture the underlying structure of the data. Through this latent-variable learning and sampling mechanism, VAEs can effectively approximate complex distributions of observed data in the latent space.
This variational lower bound (ELBO) serves as a conceptual foundation for our proposed FGBMLE framework. While we do not directly optimize this bound, it inspires the design of our generator-based likelihood estimation: instead of explicitly computing the variational posterior $q(z \mid x)$, the generator $G(w, z)$ implicitly learns to produce latent representations $\theta$ that maximize the weighted likelihood under bootstrap perturbations. Hence, the ELBO highlights the connection between traditional variational inference and our implicit generative estimation approach.
Inspired by the general philosophy of deep generative models such as VAEs, we rethink the estimation of the latent mixture distribution $\pi(\theta)$. The central idea is to construct a neural mapping from an easy-to-sample distribution to a complex target distribution, without requiring an explicit parametric form for $\pi(\theta)$. This implicit generative formulation provides high flexibility and strong expressive power in high-dimensional latent spaces. A generator can learn structural patterns of the latent distribution directly from data-driven signals, thus enabling mixture modeling without explicit density assumptions.
In the traditional MLE framework, the latent distribution $\pi(\theta)$ is typically estimated by maximizing the log-likelihood, which relies on density estimation of the sample data, especially when multiple rounds of bootstrap sampling are involved. We introduce a parameterized neural network generator $G(w, z)$, where w represents the bootstrap weights and z is random noise sampled from a base distribution; the network outputs a latent variable $\theta = G(w, z)$, implementing a nonlinear generative map from bootstrap weights to the space of the mixing distribution. The method essentially mimics the sample-generation behavior of the bootstrap under different weight configurations, but instead of solving a separate optimization problem each time, a unified estimation function is obtained through end-to-end training. For a given weight w, we want the latent variables output by the generator to achieve as high a weighted marginal likelihood as possible on the observed data. Because the generator output is random, the noise z must be averaged out [35] to obtain the expected likelihood of the observations under the generated latent variables, and an expectation over the weights w captures the overall performance under different weight perturbations. The training objective of the generator is therefore
$$\hat{G} = \arg\max_{G}\; \mathbb{E}_{w}\left[\sum_{i=1}^{n} w_i \cdot \log \mathbb{E}_{z}\, f\big(y_i \mid G(w, z)\big)\right]. \tag{10}$$
The outer expectation is taken over the bootstrap weights w, while the inner expectation represents the likelihood of the generator's output samples under the observation model $f(y_i \mid \theta_i)$ for given weights. In this objective, the inner expectation $\mathbb{E}_z$ integrates out the uncertainty in latent variable generation caused by the noise, ensuring that the generator's output has good average likelihood on the observed data across noise draws. The outer expectation $\mathbb{E}_w$ enforces robustness over different bootstrap weightings of the dataset. By jointly optimizing these two layers of expectations, the generator G learns to quickly produce latent parameter samples, avoiding the computational overhead of re-solving the problem for each bootstrap draw as in traditional methods.
Although the initial training phase of the generator introduces a higher computational load due to the end-to-end optimization process, this cost is incurred only once. Once trained, the generator can directly produce latent samples θ = G ( w , z ) without solving separate optimization problems for each bootstrap weight configuration, resulting in a substantial reduction in inference time. In practice, the initialization stage typically takes about 1.2–1.5 times the cost of a single bootstrap optimization run, but subsequent inference is accelerated by more than an order of magnitude. Thus, the overall computational trade-off favors the FGBMLE approach, as the amortized inference efficiency significantly outweighs the one-time training overhead.
With this strategy, we transform the repeated-optimization bootstrap process into a one-time generator training, significantly reducing the overall computational burden. We use a feedforward neural network (FNN) to construct G [36]. FNNs have universal approximation properties for a broad class of functions [37], and each layer can automatically learn important features and patterns in the data. Used as a generator, the network itself imposes no distributional assumptions and can approximate the true distribution of the data. To ensure that the generator receives different noise input z at each iteration, we sample $z \sim \mathrm{Unif}(0, 1)$ from a uniform distribution. The bootstrap weight w encodes the importance assigned to each sample during training. Because sample distributions may be unbalanced in some datasets, we sample $w \sim n \times \mathrm{Dirichlet}(I_n)$, where $I_n$ is the n-dimensional all-ones vector, producing normalized non-negative weights $w_1, \ldots, w_n$. Appropriate weighting mitigates this imbalance and prevents the model from focusing only on certain samples during training [38]. A minimal sketch of such a generator network is shown below.
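The following PyTorch sketch shows one way such a generator could be structured; the interface, the hidden sizes (256–128, as reported later in this subsection), and the candidate-matrix output introduced below in Equation (11) are our reading of the text, not the exact implementation:

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    """Sketch of the FNN generator G(w, z): bootstrap weights w and
    uniform noise z in, a d x l candidate parameter matrix out
    (Equation (11)). A hedged illustration, not the authors' code."""

    def __init__(self, n, d, l, hidden=(256, 128)):
        super().__init__()
        self.d, self.l = d, l
        dims = [n + 1, *hidden, d * l]   # input: weights w (n) + noise z (1)
        layers = []
        for h_in, h_out in zip(dims[:-1], dims[1:]):
            layers += [nn.Linear(h_in, h_out), nn.ReLU()]
        self.net = nn.Sequential(*layers[:-1])   # no activation on the output

    def forward(self, w, z):
        x = torch.cat([w, z], dim=-1)            # concatenate the two inputs
        return self.net(x).view(-1, self.d, self.l)
```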
Although the generator alone can already approximate the target density, the subsequent candidate generation and two-stage optimization further refine both the component weights and the mixture structure. Empirically, we observed that using only the generator (without Stages I and II) results in a relatively small improvement in the average negative log-likelihood (NLL), typically less than 1–2% after convergence. In contrast, when the candidate generation and bootstrap refinement stages are included, the improvement in NLL usually reaches 10–15% for one-dimensional data and around 8–12% for two-dimensional cases. Furthermore, the final model produces noticeably smoother estimated densities and exhibits better generalization on unseen samples. Therefore, although the initial generator provides a strong baseline, the two-stage refinement significantly enhances both the accuracy and stability of the overall density estimation process.
The latent variables $\theta$ in such models often live in spaces with tens or even hundreds of dimensions ($d \gg 1$). While this high-dimensional representation can capture complex data structures such as multimodal distributions and nonlinear dependencies, it also imposes a significant computational burden and complicates statistical inference [39].
Traditional maximum likelihood estimation methods encounter significant bottlenecks in this area. As the dimension increases, the computational complexity of the optimization process increases exponentially, making standard algorithms impractical for d > 10 . When using bootstrap methods to quantify uncertainty, each resampling requires resolving the high-dimensional optimization problem, quickly becoming computationally prohibitive. This “curse of dimensionality” severely restricts the practical application of high-dimensional latent variable models.
To improve the model's expressiveness and training stability for high-dimensional nonparametric mixture densities, we expand the output of the generator G(w, z) from a single parameter vector $\theta \in \mathbb{R}^d$ to a set of candidate parameters $\Theta = \{\theta^{(j)}\}_{j=1}^{l}$, where each $\theta^{(j)} \in \mathbb{R}^d$ is a selectable mixture parameter vector. This removes the restriction of the generator output to a single point estimate and instead creates a high-dimensional support set, allowing it to approximate more complex density structures. The primary purpose of this design is to reduce the variance of the Monte Carlo expectation estimate by introducing multiple candidate samples, thereby stabilizing gradient estimates in stochastic gradient descent [40].
Statistical correlations often exist among the generated samples, and directly averaging them may introduce estimation bias. We therefore introduce a mixture probability vector $\tau = (\tau_1, \ldots, \tau_l)$ to perform weighted sampling over the candidates $\theta^{(j)}$, which yields nearly independent bootstrap samples while maintaining expressiveness. At each draw, an index j is randomly selected from $\{1, \ldots, l\}$ according to the distribution $\tau$, and the corresponding candidate $\theta^{(j)}$ is used; this process is repeated many times. To realize this process, we design the following two-stage training framework, illustrated in Figure 2.
Given a dataset $Y = \{y_i\}_{i=1}^{n}$, Stage-I training feeds two types of random variables to the generator: the bootstrap weight vector $w \sim n \times \mathrm{Dirichlet}(I_n)$ sampled from the Dirichlet distribution, and the latent noise $z \sim \mathrm{Unif}(0, 1)$ drawn from a uniform distribution. Given these two inputs, the generator G(w, z) outputs a candidate parameter matrix:
$$\Theta = G(w, z) = \big[\theta^{(1)}, \theta^{(2)}, \ldots, \theta^{(l)}\big] \in \mathbb{R}^{d \times l}. \tag{11}$$
Each column $\theta^{(j)} \in \mathbb{R}^d$ is a candidate density parameter, and l is the number of candidates output by the generator. The generator thus simultaneously provides multiple candidate solutions in different regions of the latent distribution, offering a richer structure for subsequent modeling. Initially, $\tau$ is set to the uniform distribution $\tau_0 = (1/l, \ldots, 1/l)$. During training, an index variable $r \sim \mathrm{Multinomial}(1, \tau_0)$ indicates the selected candidate parameter in a one-hot fashion. At each iteration, r randomly selects a parameter $\theta^{(r)}$ from the candidate set $\{\theta^{(1)}, \ldots, \theta^{(l)}\}$ for use in the likelihood calculation. This random sampling strategy lets the model traverse all candidates in expectation, while each single update depends on only one of them, avoiding the dilution effect caused by averaging. The Stage-I optimization objective is therefore
$$L_1(G) = \mathbb{E}_{w, z, r}\left[\sum_{i=1}^{n} w_i \cdot \log f\big(y_i \mid \theta^{(r)}\big)\right], \qquad r \sim \mathrm{Multinomial}(1, \tau_0). \tag{12}$$
Here, $f(y_i \mid \theta^{(r)})$ denotes the probability density of sample $y_i$ under the parameter $\theta^{(r)}$. This objective combines bootstrap weights and candidate sampling to control the estimation variance while ensuring that the generator covers a wide range of latent distribution structures. Theoretically, it satisfies the inequality
$$\mathrm{Var}_z\big[f(y_i \mid \theta_i)\big] \ge \mathrm{Var}_z\big[\mathbb{E}_r\, f(y_i \mid \theta^{(r)})\big]. \tag{13}$$
By introducing candidate structures and random sampling, the variance of the training process can be effectively suppressed, thereby improving the stability of the estimation.
During optimization, we use stochastic gradient descent (SGD) to update the generator parameters and continuously adjust the expressive power of the candidate structure. The result of Stage-I can be expressed as
$$G^* = \arg\max_{G} L_1(G). \tag{14}$$
Here, $G^*$ is the generator that is optimal under the objective $L_1(G)$. This stage, summarized in Algorithm 1, lays the foundation for the more refined modeling that follows.
Algorithm 1 FGBMLE Stage-I
Input: Dataset $Y = \{y_i\}_{i=1}^{n}$; epochs T; candidate number l; initial generator $G(\cdot\,; \theta_0)$; uniform prior $\tau_0 = (1/l, \ldots, 1/l)$; learning rate $\eta$.
Output: Generator $G^*$ producing the candidate set $\{\theta^{(1)}, \ldots, \theta^{(l)}\}$.
  1: for t = 1 to T do
  2:       Sample bootstrap weights: $w \sim n \times \mathrm{Dirichlet}(I_n)$
  3:       Sample latent noise: $z \sim \mathrm{Unif}(0, 1)$
  4:       Generate candidate parameters: $\Theta = G(w, z) = [\theta^{(1)}, \ldots, \theta^{(l)}]$
  5:       Sample index variable: $r \sim \mathrm{Multinomial}(1, \tau_0)$
  6:       Compute objective $L_1(G)$ using Equation (12)
  7:       Update generator parameters via SGD with learning rate $\eta$
  8: end for
  9: return $G^*$
For all experiments, the proposed FGBMLE model was implemented using a fully connected generator network with three layers (dimensions 256–128–d) and ReLU activations. The model was optimized with the Adam optimizer at a fixed learning rate of 1 × 10 3 across all tasks, ensuring comparable convergence behavior. Training was performed with a batch size of 128 and Xavier initialization for all weights. Early stopping was applied when the validation negative log-likelihood (NLL) improvement dropped below 10 4 for ten consecutive epochs, effectively preventing overfitting. The number of training epochs T was set to 500 for one-dimensional data, 800 for two-dimensional data, and 1000 for the TAN experiments. The number of candidate components l was fixed at 50 in most cases, and adjusted within the range of 20–80 depending on data complexity to balance expressiveness and computational cost.
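As a hedged illustration of Algorithm 1, the sketch below trains the Generator class from the earlier listing on a one-dimensional Gaussian observation model $f(y \mid \theta) = \mathcal{N}(\theta, 1)$; the unit variance, the single noise draw per step, and the use of Adam (as reported above) are our simplifying assumptions:

```python
import torch

def train_stage1(G, y, n_epochs, l, lr=1e-3):
    # Stage-I sketch for f(y | theta) = N(theta, 1); `G` is the Generator
    # sketch above with d = 1. Hypothetical simplification of Algorithm 1.
    n = y.shape[0]
    opt = torch.optim.Adam(G.parameters(), lr=lr)   # Adam, lr = 1e-3 (see above)
    alpha = torch.ones(n)
    log_2pi = torch.log(torch.tensor(2 * torch.pi))
    for _ in range(n_epochs):
        w = torch.distributions.Dirichlet(alpha).sample() * n  # bootstrap weights
        z = torch.rand(1)                                      # latent noise
        theta = G(w.unsqueeze(0), z.unsqueeze(0)).squeeze(0)   # (1, l) candidates
        r = torch.randint(l, (1,)).item()     # index r ~ Multinomial(1, tau_0)
        log_f = -0.5 * (y - theta[0, r]) ** 2 - 0.5 * log_2pi  # log N(y_i | theta^(r), 1)
        loss = -(w * log_f).sum()             # negative weighted log-likelihood
        opt.zero_grad()
        loss.backward()
        opt.step()
    return G
```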
In Stage-II, we refine the density estimate by optimizing the mixture weights associated with the candidate parameters generated in the first stage. We introduce a mixture weight vector $\tau = (\tau_1, \ldots, \tau_l)$ on the l-dimensional probability simplex, i.e., $\tau_j \ge 0$ and $\sum_{j=1}^{l} \tau_j = 1$, which combines the candidate parameters into a mixture model. At this stage, the estimated density of a data sample $y_i$ is
$$\hat{p}(y_i) = \sum_{j=1}^{l} \tau_j \cdot \mathbb{E}_{w,z}\big[f(y_i \mid \theta^{(j)})\big]. \tag{15}$$
Here, $G^*$ is the generator trained in Stage-I, $\theta^{(j)}$ is the jth candidate parameter generated by $G^*(w, z)$, and $\mathbb{E}_{w,z}[\cdot]$ denotes the expectation over the bootstrap weights $w \sim n \times \mathrm{Dirichlet}(I_n)$ and the latent noise $z \sim \mathrm{Unif}(0, 1)$. This formulation allows the model to aggregate information from multiple candidate parameters while retaining representational flexibility.
To determine the optimal mixture weights, we minimize the negative log-likelihood.
$$L_2(\tau) = -\sum_{i=1}^{n} \log \sum_{j=1}^{l} \tau_j \cdot \mathbb{E}_{w,z}\big[f(y_i \mid \theta^{(j)})\big]. \tag{16}$$
This makes the mixture weights consistent with the empirical distribution of the dataset. The optimization of $\tau$ is performed with the Monte Carlo Expectation Maximization (MCEM) algorithm [41]. In each iteration t, the jth component is updated as
$$\tau_j^{(t+1)} = \frac{1}{n} \sum_{i=1}^{n} \frac{\tau_j^{(t)} \cdot \mathbb{E}_{w,z}\big[f(y_i \mid \theta^{(j)})\big]}{\sum_{j'=1}^{l} \tau_{j'}^{(t)} \cdot \mathbb{E}_{w,z}\big[f(y_i \mid \theta^{(j')})\big]}. \tag{17}$$
The denominator ensures that the updated weights remain normalized on the probability simplex. The iteration continues until convergence, determined by the tolerance criterion $\|\tau^{\mathrm{new}} - \tau^{\mathrm{old}}\| \le tol$.
This two-stage approach ensures that the generator provides diverse candidate structures in high-dimensional latent spaces, while the mixture weights refine the density estimate through adaptive reweighting. Together, these two steps enable the model to capture complex high-dimensional distributions with improved accuracy and stability, as summarized in Algorithm 2.
Algorithm 2 FGBMLE Stage-II
Input: Trained generator $G^*$; dataset $Y = \{y_i\}_{i=1}^{n}$; tolerance $tol$; number of candidates l.
Output: Optimized mixture weights $\tau^{\mathrm{new}} = (\tau_1, \ldots, \tau_l)$.
  1: Initialize: $\tau^{\mathrm{new}} = (1/l, \ldots, 1/l)$, $\tau^{\mathrm{old}} = (0, \ldots, 0)$
  2: while $\min_j |\tau_j^{\mathrm{old}} - \tau_j^{\mathrm{new}}| \ge tol$ do
  3:      Set $\tau^{\mathrm{old}} = \tau^{\mathrm{new}}$
  4:      for j = 1 to l do
  5:            Sample bootstrap weights: $w \sim n \times \mathrm{Dirichlet}(I_n)$
  6:            Sample latent noise: $z \sim \mathrm{Unif}(0, 1)$
  7:            Update $\tau_j^{\mathrm{new}}$ using Equation (17)
  8:      end for
  9: end while
10: return $\tau^{\mathrm{new}}$
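A compact NumPy sketch of the Stage-II update in Equation (17) might look as follows; here F is an n × l matrix of Monte Carlo estimates of $\mathbb{E}_{w,z}[f(y_i \mid \theta^{(j)})]$, and the max-norm stopping rule is our own simplification of the tolerance criterion:

```python
import numpy as np

def stage2_mcem(F, tol=1e-6, max_iter=500):
    # F[i, j] approximates E_{w,z}[f(y_i | theta^(j))], e.g., averaged
    # over generator draws. Returns mixture weights tau (Equation (17)).
    n, l = F.shape
    tau = np.full(l, 1.0 / l)                   # uniform initialization
    for _ in range(max_iter):
        weighted = F * tau                      # tau_j * E[f(y_i | theta^(j))]
        resp = weighted / weighted.sum(axis=1, keepdims=True)
        tau_new = resp.mean(axis=0)             # MCEM fixed-point update
        if np.abs(tau_new - tau).max() < tol:   # convergence on the simplex
            return tau_new
        tau = tau_new
    return tau
```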

3.2. TAN-FGBMLE Framework

It is worth noting that the proposed FGBMLE framework is not limited to TAN models. The two-stage mechanism described above is a general estimation scheme that can be embedded into various probabilistic models, mixture estimators, and graph structures as long as the likelihood function $f(y \mid \theta)$ is available. In this section, we instantiate FGBMLE using the TAN model as an example. TAN was chosen because its structure learning heavily relies on accurate estimates of class-conditional marginal densities and joint densities, making it an ideal testbed to demonstrate how FGBMLE improves density estimation and conditional dependency inference [42].
This section applies the two-stage estimation method to TAN structure learning. The core idea is to leverage the rich candidate parameter space provided by the generator, achieve robust distribution estimates through flexible weighting, and then embed these estimates into the measure of conditional dependencies between features used for TAN structure learning.
The marginal and joint distributions of the features must be estimated for each class c. Random variables are denoted by capital letters $X_i$; accordingly, a data sample $y = (y_1, \ldots, y_d)$ is regarded as a realization of the random vector $X = (X_1, \ldots, X_d)$.
Directly adopting empirical frequencies or kernel density methods usually introduces bias at small sample sizes, resulting in unstable mutual information estimates. We extend the two-stage FGBMLE estimator to the class-conditional setting and obtain the following distribution estimate:
$$\hat{p}(y_i \mid c) = \sum_{j=1}^{l} \tau_{i,c,j}\, \mathbb{E}_{w,z}\big[f(y_i \mid \theta_{i,c}^{(j)})\big]. \tag{18}$$
Here, $\theta_{i,c}^{(j)}$ denotes candidate parameters generated by the Stage-I generator $G^*(w, z)$, and $\tau_{i,c,j}$ the mixture weights optimized in Stage-II; the subscript (i, c, j) indicates the weight of the jth of the l candidate parameters for the ith feature and class c. $\mathbb{E}_{w,z}[\cdot]$ denotes the expectation with respect to the bootstrap weights $w \sim n \times \mathrm{Dirichlet}(I_n)$ and the latent noise $z \sim \mathrm{Unif}(0, 1)$. This design ensures that the estimated probability distributions retain smoothness and generalization ability even with finite samples.
After obtaining the class-conditional probability estimates, we can further define the conditional mutual information, which measures the dependency strength between features $X_i$ and $X_j$ given the class c:
$$I(X_i; X_j \mid C) = \sum_{c} \hat{p}(c) \iint \hat{p}(y_i, y_j \mid c) \log \frac{\hat{p}(y_i, y_j \mid c)}{\hat{p}(y_i \mid c)\, \hat{p}(y_j \mid c)}\, dy_i\, dy_j. \tag{19}$$
In this equation, $\hat{p}(c)$ denotes the empirical prior probability of class c, the denominator $\hat{p}(y_i \mid c)\, \hat{p}(y_j \mid c)$ is the product of the class-conditional marginals, and the integral is taken over the joint feature space of $(y_i, y_j)$. The integral can be approximated by sampling from the estimated distribution and applying the Monte Carlo method, as sketched below. Compared with traditional mutual information computation, the FGBMLE smoothing terms in Equations (18) and (19) effectively reduce the bias caused by noisy samples, leading to more robust edge weights and more reliable structure learning.
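For instance, the Monte Carlo approximation of Equation (19) can be sketched as follows, assuming vectorized density evaluators for the fitted FGBMLE models and class-conditional samples drawn from the estimated joint; all function names here are hypothetical:

```python
import numpy as np

def cmi_monte_carlo(samples_by_class, p_joint, p_i, p_j, p_c):
    # samples_by_class[c]: (m x 2) draws of (y_i, y_j) from p_hat(y_i, y_j | c);
    # p_i, p_j: estimated class-conditional marginals; p_c: class priors.
    cmi = 0.0
    for c, s in samples_by_class.items():
        joint = p_joint(s[:, 0], s[:, 1], c)        # p_hat(y_i, y_j | c)
        marg = p_i(s[:, 0], c) * p_j(s[:, 1], c)    # p_hat(y_i | c) p_hat(y_j | c)
        # Sampling from the joint turns the double integral into a sample mean
        cmi += p_c[c] * np.mean(np.log(joint / marg))
    return cmi
```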
Based on the values of conditional mutual information, a weighted complete graph can be constructed, and the maximum spanning tree algorithm can be used to extract the main dependencies among features. By orienting the spanning tree and adding the class node c, the complete TAN structure can be obtained. The conditional probability of each feature node given its parent node and class can be expressed as the following equation.
$$\hat{p}(y_i \mid y_{\mathrm{pa}}, c) = \frac{\hat{p}(y_i, y_{\mathrm{pa}} \mid c)}{\hat{p}(y_{\mathrm{pa}} \mid c)}, \qquad \hat{p}(y \mid c) = \prod_{i=1}^{d} \hat{p}\big(y_i \mid \mathrm{Pa}(X_i), c\big). \tag{20}$$
Here, $y_{\mathrm{pa}}$ denotes the observed value of the parent of $X_i$ in the TAN structure, and $\mathrm{Pa}(X_i)$ denotes the parent set of feature $X_i$. Combining with the class prior $\hat{p}(c)$, the classification rule is
$$\hat{c}(y) = \arg\max_{c}\; \hat{p}(c)\, \hat{p}(y \mid c). \tag{21}$$
This modeling approach remains consistent with the original definition of TAN, but incorporates the two-stage mechanism of FGBMLE into the estimation process, significantly enhancing the stability of mutual information calculation and the accuracy of edge structure learning [43].
Since the TAN structure relies on precise class-conditional mutual information, we estimate the class-conditional mutual information matrix between continuous attributes with the FGBMLE method and use it as the edge weights of the graph. The class-conditional mutual information is calculated according to Equation (8); in the air quality application, $y_i$ and $y_j$ represent pollutants or environmental factors, and C denotes the air quality level. This metric quantifies the dependency strength between two variables under the given classification conditions.
Prim's algorithm is then applied to construct a maximum spanning tree, which yields the optimal TAN dependency structure. The algorithm starts from an arbitrary node and iteratively adds the maximum-weight edge connecting a new node to the existing tree until all nodes are included. Mathematically,
$$T^* = \arg\max_{T \subseteq G} \sum_{(Y_i, Y_j) \in T} I_c(Y_i; Y_j), \tag{22}$$
where $I_c(Y_i; Y_j)$ denotes the class-conditional mutual information and $T^*$ is the resulting optimal dependency tree. A minimal sketch of this step is given below.
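The sketch uses a naive O(d³) variant of Prim's algorithm, adequate for the small attribute counts considered here:

```python
import numpy as np

def prim_max_spanning_tree(W):
    # W: symmetric d x d matrix of class-conditional mutual information.
    # Greedily grows the tree from node 0 by the heaviest crossing edge,
    # realizing the maximization in Equation (22).
    d = W.shape[0]
    in_tree = {0}
    edges = []
    while len(in_tree) < d:
        best = (-np.inf, None, None)
        for u in in_tree:
            for v in range(d):
                if v not in in_tree and W[u, v] > best[0]:
                    best = (W[u, v], u, v)
        _, u, v = best
        edges.append((u, v))        # edge oriented away from the tree root
        in_tree.add(v)
    return edges
```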
Overall, this integration shows how the general FGBMLE estimator can serve as a plug-and-play density estimation engine within the TAN framework in Algorithm 3. While the present work focuses on TAN as a demonstrative application, the same mechanism can be naturally extended to more general Bayesian network structures, mixture-based models, or continuous graphical learning tasks where accurate conditional density estimation is required. This highlights that FGBMLE is a broadly applicable framework, with TAN structure learning being only one of its practical instantiations.
Algorithm 3 TAN-FGBMLE
Input: Dataset $Y = \{y_i\}_{i=1}^{n}$; trained generator $G^*$ from Algorithm 1; optimized mixture weights $\tau$ from Algorithm 2; number of features d; number of candidate parameters l.
Output: TAN structure and classification rule $\hat{c}(y)$.
  1: for each class c do
  2:       Estimate marginal distributions $\hat{p}(y_i \mid c)$ for all $i = 1, \ldots, d$ using Equation (18)
  3:       Compute conditional mutual information $I(X_i; X_j \mid C)$ using Equation (19)
  4: end for
  5: Construct a weighted complete graph with edge weights $I(X_i; X_j \mid C)$
  6: Extract the maximum spanning tree to determine feature dependencies
  7: Orient the spanning tree and add the class node c to obtain the TAN structure
  8: for each feature $X_i$ do
  9:       Compute the conditional probability $\hat{p}(y_i \mid y_{\mathrm{pa}}, c)$ using Equation (20)
10: end for
11: Define the classification rule $\hat{c}(y) = \arg\max_c \hat{p}(c)\, \hat{p}(y \mid c)$
12: return TAN structure and $\hat{c}(y)$

4. Experiment Results

4.1. Simulation Experiments

In order to systematically evaluate the performance of TAN-FGBMLE, we designed a series of simulation experiments covering typical latent distribution characteristics. They aim to test the ability of our method to capture complex distribution structures, handle different sample sizes, and compute efficiently. Specifically, we define two simulation scenarios, summarized in Table 1, featuring bimodal and constrained unimodal distributions.
In the GMM scenario, the latent distribution has a bimodal structure, suitable for testing multimodal fitting. In the GaMM scenario, the latent variables are restricted to the hypercube $[0, 1]^d$; this spatially constrained latent distribution evaluates the adaptability of our method to constrained unimodal distributions. We set the data size to n = 1000 and further expanded it to n = 10,000 and n = 100,000 to evaluate computational efficiency. The FGBMLE generator uses a two-layer fully connected neural network with 600 neurons per layer, ReLU activations, and Xavier initialization, a commonly used weight initialization scheme designed to avoid vanishing or exploding gradients. Training uses the Adam optimizer with an initial learning rate of 0.001, and bootstrap weights w are generated from a Dirichlet distribution. All experiments are based on the PyTorch 2.2.2 framework and run on an NVIDIA GeForce RTX 4060 Ti GPU to ensure efficient computation.
Figure 3 demonstrates the ability of FGBMLE and the bootstrap to fit the underlying distribution $\pi(\theta)$ in the two simulation scenarios, GMM and GaMM. In the GMM scenario, the true distribution exhibits two distinct peaks, located near $\theta = -3$ and $\theta = 3$. FGBMLE almost perfectly reproduces the height and position of both peaks, demonstrating its strong ability to capture multimodal distributions. The bootstrap also performs well, but its fit to the peak heights is slightly insufficient. In the GaMM scenario, the true distribution is unimodal and supported on the interval (0, 1). FGBMLE accurately captures this unimodal shape, and its smooth curve closely matches the true distribution, especially in peak position and overall shape, whereas the bootstrap shows slight deviations, especially near the boundaries.
The results in Figure 4 clearly show that FGBMLE exhibits excellent fitting capabilities in multimodal and unimodal restricted scenarios. The smooth distribution it generates can not only accurately capture the detailed characteristics of the real distribution, but also avoid the shortcomings of traditional methods caused by oversmoothing, verifying its superiority in complex distribution modeling.
Table 2 compares FGBMLE, bootstrap, and KDE using four complementary metrics. The 1-Wasserstein distance, $W_1(\pi, \hat{\pi}) = \int |F_{\pi}(x) - F_{\hat{\pi}}(x)|\, dx$, where F denotes the cumulative distribution function, reflects global distributional discrepancies. The integrated squared error, $ISE = \int_{\mathbb{R}^d} \{\hat{\pi}(x) - \pi(x)\}^2\, dx$, measures local density fidelity. In addition, we report the mean squared error (MSE) between the estimated and true density values, as well as the Kullback–Leibler (KL) divergence [44], which evaluates the probabilistic discrepancy between $\pi$ and $\hat{\pi}$. Table 2 summarizes averages over 50 independent simulation runs for both scenarios. The results show that FGBMLE performs comparably to or better than traditional bootstrap methods across all metrics, particularly in the GaMM scenario, while the KDE baseline serves as a nonparametric reference; a grid-based sketch of these metrics follows.
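For reference, the four metrics can be approximated on a uniform evaluation grid as in the following one-dimensional sketch; the discretization and the numerical guard for the KL term are our own choices:

```python
import numpy as np

def density_metrics(p_true, p_est, x):
    # p_true, p_est: density values evaluated on the uniform grid x.
    dx = x[1] - x[0]
    F_true = np.cumsum(p_true) * dx              # CDFs for the W1 distance
    F_est = np.cumsum(p_est) * dx
    w1 = np.sum(np.abs(F_true - F_est)) * dx     # 1-Wasserstein distance
    ise = np.sum((p_est - p_true) ** 2) * dx     # integrated squared error
    mse = np.mean((p_est - p_true) ** 2)         # pointwise mean squared error
    eps = 1e-12                                  # guard against log(0)
    kl = np.sum(p_true * np.log((p_true + eps) / (p_est + eps))) * dx
    return w1, ise, mse, kl
```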
We also compared the average computation time of FGBMLE and bootstrap in simulations with sample sizes $n \in \{1000;\ 10{,}000;\ 100{,}000\}$. The results are plotted on a logarithmic scale in Figure 5, which shows that FGBMLE is more scalable: at n = 100,000, FGBMLE completes in only about 5 min, while the bootstrap takes nearly three times as long.
To validate the applicability of the proposed TAN-FGBMLE method in complex distribution scenarios, we designed a probability density estimation experiment on artificial data. The experimental data consist of three Gaussian components, $\mathcal{N}(-3, 0.5^2)$, $\mathcal{N}(0, 1^2)$, and $\mathcal{N}(3, 0.7^2)$, forming a typical trimodal, long-tailed, and heteroskedastic distribution. We compared the proposed method with traditional KDE and GMM, using the true distribution as a benchmark.
To keep the evaluation framework consistent with Table 2, we also assess KDE, GMM, and TAN-FGBMLE under the trimodal long-tailed distribution using the same four quantitative metrics: W1, ISE, MSE, and KL divergence.
Table 3 reports the quantitative density estimation results for the trimodal distribution experiment using four evaluation metrics. As shown in the table, TAN-FGBMLE achieves the lowest values across all four metrics, indicating that its estimated density is consistently closer to the true distribution than those produced by KDE and GMM. In particular, the improvements in W1 and KL suggest that TAN-FGBMLE captures both the global distributional structure and the probabilistic discrepancies more effectively. The reduction in ISE and MSE further demonstrates its accuracy in recovering the local shape of the trimodal density. Overall, these results highlight the advantage of TAN-FGBMLE in modeling complex multimodal distributions.
Figure 6 compares the density estimation results of the proposed FGBMLE with the true GMM and the bootstrap NPMLE under a 2D trimodal distribution. FGBMLE reconstructs the three modal regions accurately and maintains sharp, well-aligned contour structures. In contrast, the bootstrap estimator produces overly dispersed contours and fails to preserve the multimodal shape. These results show that FGBMLE achieves clearly superior performance in capturing multimodal and heteroskedastic density structures in higher-dimensional settings.

4.2. Structure Recovery Experiment

To further evaluate the accuracy of the learned structures, a simulation-based experiment was conducted using data generated from a known continuous Tree-Augmented Naive Bayes model with one binary class variable $C \in \{0, 1\}$ and eight continuous attributes $X_1, X_2, \ldots, X_8$. The underlying TAN structure over the attributes was predefined as a fixed chain, $X_1 \to X_2 \to \cdots \to X_8$, and remained constant throughout all trials.
For each class c, the joint distribution $P(X_1, \ldots, X_8 \mid C = c)$ followed a linear Gaussian model, where the root node $X_1$ was generated according to

$$X_1 \mid C = c \sim \mathcal{N}\big(\mu_1^{(c)}, \sigma_1^2\big), \tag{23}$$

and each non-root node $X_j$ was generated from its attribute parent $\mathrm{pa}(j)$ as

$$X_j \mid X_{\mathrm{pa}(j)}, C = c \sim \mathcal{N}\big(a_j^{(c)} X_{\mathrm{pa}(j)} + b_j^{(c)},\; \sigma_j^2\big), \tag{24}$$

where all parameters were randomly initialized and then fixed across experiments to ensure reproducibility. The class prior was set to $P(C = 1) = 0.5$. A hedged sketch of this synthetic generator is given below.
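The sketch below uses illustrative placeholder parameter ranges in place of the fixed values used in the experiments:

```python
import numpy as np

def sample_chain_tan(n, d=8, seed=0):
    # Chain X1 -> X2 -> ... -> Xd with class-dependent linear-Gaussian
    # CPDs, Eqs. (23)-(24). Parameter ranges below are placeholders.
    rng = np.random.default_rng(seed)
    mu = np.array([-1.0, 1.0])                  # class-dependent root means
    a = rng.uniform(0.5, 1.5, size=(2, d))      # slopes a_j^(c), fixed per run
    b = rng.uniform(-1.0, 1.0, size=(2, d))     # intercepts b_j^(c)
    C = rng.integers(0, 2, size=n)              # class prior P(C = 1) = 0.5
    X = np.zeros((n, d))
    X[:, 0] = rng.normal(mu[C], 1.0)            # root node X1 | C = c
    for j in range(1, d):                       # X_j | X_pa(j), C = c
        X[:, j] = a[C, j] * X[:, j - 1] + b[C, j] + rng.normal(0.0, 1.0, n)
    return X, C
```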
Two synthetic datasets were generated with sample sizes n = 1000 and n = 2000 , and each configuration was repeated for twenty independent trials. For each dataset, three methods, TAN-KDE, TAN-GMM, and the proposed TAN-FGBMLE were used to estimate the TAN structure by calculating conditional mutual information between attributes given the class variable and constructing a maximum-weight spanning tree. In all methods, the structure learning process was identical except for the underlying conditional density estimator used for CMI computation.
To measure the accuracy of the recovered structures, we adopted the Structural Hamming Distance (SHD) [45] between the estimated and the true attribute tree. Both trees were represented as undirected adjacency matrices $A^{\mathrm{true}}$ and $A^{\mathrm{est}}$ of dimension $d \times d$, and the SHD was defined as

$$\mathrm{SHD}\big(A^{\mathrm{true}}, A^{\mathrm{est}}\big) = \sum_{1 \le i < j \le d} \mathbb{I}\big[A_{ij}^{\mathrm{true}} \ne A_{ij}^{\mathrm{est}}\big], \tag{25}$$
where $\mathbb{I}[\cdot]$ denotes the indicator function. SHD quantifies the number of edge additions or deletions required to transform the estimated tree into the true one. Only attribute–attribute edges were considered, excluding those connected to the class node C. For each method and sample size, we report the mean and standard deviation of SHD over twenty repetitions; a minimal implementation sketch follows.
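Computing the SHD between two undirected adjacency matrices is straightforward:

```python
import numpy as np

def shd_undirected(A_true, A_est):
    # Counts mismatched attribute-attribute edges once each (Equation (25)),
    # using only the upper triangle so that i < j.
    diff = A_true != A_est
    return int(np.triu(diff, k=1).sum())
```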
As shown in Table 4, SHD values decrease as the sample size increases, indicating that all methods benefit from larger datasets in structure learning. Among the three methods, TAN-FGBMLE consistently achieves competitive or superior SHD performance compared with TAN-KDE and TAN-GMM. In particular, at n = 2000, TAN-FGBMLE obtains the smallest SHD (0.2 ± 0.2), suggesting that the proposed method more accurately recovers the underlying TAN structure given sufficient data.
Although the SHD of TAN-FGBMLE is only marginally lower than that of TAN-GMM, it demonstrates higher stability (lower variance) across repeated trials. This result implies that the FGBMLE-based conditional density estimation provides a smoother and more robust approximation of the joint distribution, which effectively improves the reliability of structure learning. Consequently, the proposed TAN-FGBMLE achieves a better balance between structural accuracy and distributional modeling capability.

4.3. Comparative Experiments with Extended Naive Bayes and Discriminative Models

To further evaluate the effectiveness of the proposed TAN-FGBMLE model, we conducted comparative experiments against several state-of-the-art Naive Bayes extensions that explicitly model attribute dependencies, as well as strong discriminative classifiers. The compared Naive Bayes-type models include TAN, Averaged One-Dependence Estimators (AODEs) [46], Weighted AODE (WAODE) [47], Hidden Naive Bayes (HNB) [48], and Correlation-based Feature-Weighted Naive Bayes (CFWNB) [49]. These models were selected because they relax the independence assumption of traditional NB by incorporating single or multiple parent dependencies among features. All baseline implementations are available in the WEKA platform, ensuring fair and reproducible comparisons.
In addition, three discriminative models, Logistic Regression, Random Forest (RF), and RBF-SVM, were included to assess whether the generative optimization of TAN-FGBMLE generalizes beyond the Naive Bayes family. All models were trained under identical data partitions and preprocessing procedures. Two primary metrics were used for evaluation: classification accuracy (%) and average log-likelihood (LL). Each result represents the mean and standard deviation over five independent runs.
As shown in Table 5, TAN-FGBMLE consistently achieves superior results in both classification accuracy and log-likelihood compared with all competing models. Relative to the standard TAN, TAN-FGBMLE improves accuracy by 4.3% and significantly increases the log-likelihood, indicating that the proposed generative Bayesian optimization effectively captures the conditional dependencies among features while maintaining a coherent probabilistic structure.
To assess the robustness of these improvements, we performed a Wilcoxon signed-rank test [50] across all datasets for pairwise comparisons between TAN-FGBMLE and each baseline model. The test statistic is defined as
$$W = \min(W^{+}, W^{-}), \qquad W^{+} = \sum_{d_i > 0} R_i, \qquad W^{-} = \sum_{d_i < 0} R_i,$$
where $d_i = x_i - y_i$ denotes the performance difference on the $i$-th dataset and $R_i$ is the rank of $|d_i|$. Under the null hypothesis of no systematic difference between models, TAN-FGBMLE was found to be significantly better than all other NB-type baselines at the 5% significance level ($p < 0.05$); see Table 5.
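As an illustration of the test (not the authors' script), the sketch below compares the per-dataset accuracies of TAN-FGBMLE and TAN-KDE from the first eight rows of Table 7; SciPy's two-sided statistic is $\min(W^{+}, W^{-})$, matching the definition above.

```python
# Sketch: Wilcoxon signed-rank test on paired per-dataset accuracies.
from scipy.stats import wilcoxon

acc_fgbmle = [0.612, 0.889, 0.934, 0.856, 0.951, 0.862, 0.712, 0.764]
acc_kde    = [0.512, 0.861, 0.902, 0.823, 0.931, 0.825, 0.673, 0.738]

stat, p = wilcoxon(acc_fgbmle, acc_kde)   # statistic = min(W+, W-)
print(f"W = {stat:.1f}, p = {p:.4f}, significant at 5%: {p < 0.05}")
```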
Although discriminative models such as Random Forest and RBF-SVM also achieve competitive accuracy, TAN-FGBMLE slightly outperforms them in both metrics. This demonstrates that a well-trained generative structure can rival, and even surpass, discriminative models when handling complex or partially observed distributions, while also maintaining the interpretability advantages inherent to probabilistic graphical models.

4.4. Classification Performance on UCI Benchmark Datasets

To further evaluate the effectiveness of the proposed TAN-FGBMLE method on real-world classification tasks, we conducted experiments on 20 widely used UCI benchmark datasets. These datasets cover a wide range of scales, feature dimensions, and class complexities, providing a comprehensive testbed beyond controlled simulation environments. Missing values were imputed with the mean of the corresponding feature. Descriptive information on the datasets is given in Table 6. We compared classification performance against multiple classifiers, including Tree-Augmented Kernel Density Bayes (TAN-KDE), the Naive Bayes classifier (NBC), the Flexible Bayes Classifier (FBC), k-Nearest Neighbors (KNN), the C4.5 decision tree, a Neural Network (NN), and a Support Vector Machine (SVM), using ten-fold cross-validation. The average classification accuracies are shown in Table 7.
The baseline classifiers were configured as follows. The SVM uses an RBF kernel whose parameters are selected by grid search over $c \in [-8, 8]$ and $g \in [-8, 8]$ with a step size of 0.1. The decision tree uses the C4.5 model with post-pruning. The neural network is set with hidden_layer_sizes = 100, activation = relu, learning rate $\alpha$ = 0.001, random_state = 42, and max_iter = 1000. The nearest-neighbor size is 5, and the Gaussian smoothing parameter is 0.001. From Table 7, we observe clear differences in classification performance across classifiers. For most datasets, TAN-FGBMLE achieves superior results compared to the baselines, demonstrating its ability to capture complex dependency structures more effectively.
Ten-fold cross-validation yields the mean and standard deviation of the classification accuracy reported in Table 7. When the sample size is small or the feature dimension is limited, traditional classifiers such as NBC or SVM can still maintain reasonable accuracy due to their simple assumptions. However, as the number of samples and attributes increases, the advantage of TAN-FGBMLE becomes more evident, as it stabilizes mutual information estimation and reduces noise in high-dimensional spaces. Overall, TAN-FGBMLE improves the average classification accuracy over TAN-KDE, NBC, FBC, KNN, C4.5, NN, and SVM by 3.1%, 2.4%, 3.6%, 2.7%, 4.2%, 1.5%, and 1.2%, respectively. These results confirm that TAN-FGBMLE not only outperforms existing methods but also adapts well to datasets of varying scales and complexities, making it a robust and generalizable classifier.
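A sketch of this evaluation protocol with the stated baseline settings is shown below, using scikit-learn equivalents. It is illustrative only: scikit-learn has no exact C4.5 implementation, so an entropy-criterion tree stands in, the SVM hyperparameters shown are placeholders for the grid-searched values, and the Iris data is used just to make the snippet self-contained.

```python
# Sketch: ten-fold cross-validation of the baseline classifiers.
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)   # placeholder dataset

models = {
    "KNN":  KNeighborsClassifier(n_neighbors=5),
    "C4.5": DecisionTreeClassifier(criterion="entropy"),   # C4.5 stand-in
    "NN":   MLPClassifier(hidden_layer_sizes=(100,), activation="relu",
                          learning_rate_init=0.001, max_iter=1000,
                          random_state=42),
    "SVM":  SVC(kernel="rbf", C=2.0**3, gamma=2.0**-2),    # grid-searched in practice
}
for name, clf in models.items():
    scores = cross_val_score(clf, X, y, cv=10)
    print(f"{name}: {scores.mean():.3f} ± {scores.std():.3f}")
```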

4.5. TAN-FGBMLE for Graph Structure Learning on Air Quality Data

After validating TAN-FGBMLE on benchmark classification datasets, we further explore its applicability to domain-specific real-world data through a graph structure learning experiment on an open-source air quality dataset. This dataset contains daily records of multiple continuous environmental variables such as temperature, humidity, and pollutant concentrations including NO2, SO2, CO, PM2.5, and PM10, providing a representative test case for evaluating TAN structure learning.
The dependency tree is shown in Figure 7. The figure shows a direct connection between PM2.5 and PM10, indicating a high correlation between the two. The edge relationship between temperature and NO2 reveals the influence of meteorological conditions on nitrogen oxide concentrations. The connection between industrial proximity and SO2 and CO demonstrates the dominant role of emission sources. The relationship between CO and population density reflects the contribution of human activities to air quality. These findings are consistent with the generation mechanism of atmospheric pollutants and demonstrate that the Prim algorithm can extract a reasonable and interpretable dependency structure.
After constructing the TAN structure, we further compared edge stability under different density estimation methods. We used bootstrap sampling to repeatedly estimate the mutual information and counted how often each key dependency edge appeared across the repeated experiments, denoting this as the Bootstrap Edge Consistency (BEC) metric [50]. It is defined as
$$\mathrm{BEC}(e_{i,j}) = \frac{1}{B} \sum_{b=1}^{B} \delta_{i,j}^{(b)},$$
where $\delta_{i,j}^{(b)} = 1$ if edge $e_{i,j}$ appears in the $b$-th bootstrap result and 0 otherwise. A higher BEC value indicates that the edge is stably identified under different sampling conditions, implying a more robust and interpretable dependency structure.
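A minimal sketch of this procedure is given below; `learn_tree` is a hypothetical placeholder for any of the CMI-based structure learners compared above.

```python
# Sketch: Bootstrap Edge Consistency. Re-learn the attribute tree on B
# bootstrap resamples and record how often each undirected edge reappears.
import numpy as np

def bootstrap_edge_consistency(X, y, learn_tree, B=100, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    counts = np.zeros((d, d))
    for _ in range(B):
        idx = rng.integers(0, n, n)            # resample with replacement
        counts += learn_tree(X[idx], y[idx])   # undirected 0/1 adjacency
    return counts / B                          # BEC(e_ij) for every edge
```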
Table 8 shows the BEC comparison results for several key edges. The traditional KDE is unstable when estimating mutual information on multimodal and heteroskedastic data, resulting in low edge frequencies. Our method, TAN-FGBMLE, exhibits higher BEC values for all key dependency edges. For example, the PM2.5–PM10 relationship improves from 0.71 to 0.92, and the temperature–NO2 relationship improves from 0.64 to 0.88. These results demonstrate that TAN-FGBMLE not only improves the stability of structure learning on real-world air quality data but also provides interpretable insights into environmental and pollution-related dependencies, highlighting its potential for practical applications beyond benchmark datasets.

4.6. Summary of Strengths and Limitations

The experimental analyses above show that the proposed FGBMLE framework performs particularly well in scenarios involving complex, multimodal, or high-dimensional data distributions. By decoupling the candidate generation and mixture reweighting processes, FGBMLE effectively captures intricate latent structures while maintaining stability under limited-sample conditions. Compared with classical EM-based or kernel density methods, it consistently achieves smoother likelihood surfaces, lower estimation bias, and improved robustness in conditional mutual information computation and TAN structure learning.
Nevertheless, the method may underperform when the underlying data distribution is simple and unimodal (such as Gaussian-like datasets), where the flexibility of the generator offers limited additional benefit while increasing computational cost. Furthermore, in extremely high-dimensional cases with very small sample sizes, the second-stage reweighting process may exhibit instability due to insufficient density support. In such cases, conventional parametric models or regularized variants of FGBMLE may yield more stable performance. Future work will explore adaptive regularization and model selection strategies to mitigate these limitations and extend the framework's scalability to ultra-high-dimensional domains.

5. Conclusions

This paper addresses the difficulty of accurately estimating class-conditional mutual information in the TAN model with continuous attributes. We propose an improved method, TAN-FGBMLE, based on the FGBMLE framework. The method efficiently models complex mixture densities by introducing a generative network and a two-stage optimization framework, significantly improving computational efficiency while maintaining estimation accuracy. Experimental results show that TAN-FGBMLE outperforms traditional nonparametric methods in classification on multiple public datasets and real-world air quality data. The learned graph structure reflects the dependencies between continuous attributes, demonstrating strong interpretability. Current applications focus primarily on classification tasks, and the method's applicability to other fields requires further verification. Future research could focus on optimizing the network structure, enhancing its ability to characterize complex distributions, and extending its application to scenarios such as time series analysis and anomaly detection.

Author Contributions

Conceptualization, C.W. and C.L.; Methodology, C.L.; Software, T.Z.; Validation, T.Z.; Investigation, P.W.; Writing—original draft, C.W.; Writing—review & editing, C.W.; Supervision, C.W. and C.L.; Project administration, Z.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (General Project 62376089) and supported by the Yellow Crane Talents Program and research funding from Hubei University of Technology (HBUT 4301/00550).

Data Availability Statement

The data that support the findings of this study are available in the UCI dataset https://archive.ics.uci.edu/datasets (accessed on 24 October 2025). The code of the paper is in the following link: https://github.com/ZTY0516/TAN-FGBMLE (accessed on 24 October 2025).

Acknowledgments

The authors would like to thank the editors and anonymous reviewers who carefully read the paper and provided valuable suggestions that considerably improved the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gelman, A.; Carlin, J.B.; Stern, H.S.; Rubin, D.B. Bayesian Data Analysis; Chapman and Hall: New York, NY, USA, 1995. [Google Scholar] [CrossRef]
  2. Zhang, J.; Shi, M.H.; Lang, X.S.; You, Q.J.; Jing, Y.L.; Huang, D.Y.; Dai, H.Y.; Kang, J. Dynamic risk evaluation of hydrogen station leakage based on fuzzy dynamic Bayesian network. Int. J. Hydrogen Energy 2024, 50, 1131–1145. [Google Scholar] [CrossRef]
  3. Zhang, J.F.; Jin, M.; Wan, C.P.; Dong, Z.J.; Wu, X.H. A Bayesian network-based model for risk modeling and scenario deduction of collision accidents of inland intelligent ships. Reliab. Eng. Syst. Saf. 2024, 243, 109816. [Google Scholar] [CrossRef]
  4. Muñoz-Valencia, C.S.; Quesada, J.A.; Orozco, D.; Barber, X. Employing Bayesian Networks for the Diagnosis and Prognosis of Diseases: A Comprehensive Review. arXiv 2023, arXiv:2304.06400. [Google Scholar] [CrossRef]
  5. Lewis, D.D. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval. In Proceedings of the 10th European Conference on Machine Learning (ECML), Chemnitz, Germany, 21–23 April 1998. [Google Scholar] [CrossRef]
  6. Domingos, P.; Pazzani, M. On the Optimality of the Simple Bayesian Classifier under Zero-One Loss. Mach. Learn. 1997, 29, 103–130. [Google Scholar] [CrossRef]
  7. Friedman, N.; Geiger, D.; Goldszmidt, M. Bayesian Network Classifiers. Mach. Learn. 1997, 29, 131–163. [Google Scholar] [CrossRef]
  8. Bielza, C.; Larranaga, P. Discrete Bayesian Network Classifiers: A Survey. ACM Comput. Surv. 2014, 47, 1–43. [Google Scholar] [CrossRef]
  9. Kraskov, A.; Stogbauer, H.; Grassberger, P. Estimating Mutual Information. Phys. Rev. E 2004, 69, 066138. [Google Scholar] [CrossRef]
  10. Silverman, B.W. Density Estimation for Statistics and Data Analysis; Routledge: New York, NY, USA, 2018. [Google Scholar] [CrossRef]
  11. Wei, C.H.; Peng, B.; Li, C.; Liu, Y.Y.; Ye, Z.W.; Zuo, Z.Q. A Two-Stage Optimized Robust Kernel Density Estimation for Bayesian Classification with Outliers. Int. J. Mach. Learn. Cyber. 2025, 1–25. [Google Scholar] [CrossRef]
  12. Peel, D.; McLachlan, G.J. Robust mixture modelling using the t distribution. Stat. Comput. 2000, 10, 339–348. [Google Scholar] [CrossRef]
  13. Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; Chapman and Hall: New York, NY, USA, 1993. [Google Scholar] [CrossRef]
  14. Blei, D.M.; Kucukelbir, A.; McAuliffe, J.D. Variational Inference: A Review for Statisticians. J. Am. Stat. Assoc. 2017, 112, 859–877. [Google Scholar] [CrossRef]
  15. Laird, N.M. Nonparametric Maximum Likelihood Estimation of a Mixing Distribution. J. Am. Stat. Assoc. 1978, 73, 805–811. [Google Scholar] [CrossRef]
  16. Zhang, C.H. Compound Decision Theory and Empirical Bayes Methods. Ann. Stat. 2003, 31, 379–390. [Google Scholar] [CrossRef]
  17. Koenker, R.; Mizera, I. Convex Optimization, Shape Constraints, Compound Decisions, and Empirical Bayes Rules. J. Am. Stat. Assoc. 2014, 109, 674–685. [Google Scholar] [CrossRef]
  18. Feng, L.; Dicker, L.H. Approximate Nonparametric Maximum Likelihood for Mixture Models: A Convex Optimization Approach to Fitting Arbitrary Multivariate Mixing Distributions. Comput. Stat. Data Anal. 2018, 122, 80–91. [Google Scholar] [CrossRef]
  19. Ronn, B.B.; Skovgaard, I.M. Nonparametric Maximum Likelihood Estimation of Randomly Time-Transformed Curves. Ann. Stat. 2009, 37, 1–17. [Google Scholar] [CrossRef]
  20. Li, Y.; Ye, Z. Boosting in Univariate Nonparametric Maximum Likelihood Estimation. IEEE Signal Process. Lett. 2021, 28, 623–627. [Google Scholar] [CrossRef]
  21. Efron, B. Empirical Bayes Deconvolution Estimates. Biometrika 2016, 103, 1–20. [Google Scholar] [CrossRef]
  22. Shao, H.J.; Yao, S.C.; Sun, D.C.; Zhang, A.; Liu, S.Z.; Liu, D.X.; Wang, J.; Abdelzaher, T. ControlVAE: Controllable Variational Autoencoder. In Proceedings of the 37th International Conference on Machine Learning (PMLR), Virtual, 13–18 July 2020; Volume 119, pp. 8655–8664. Available online: https://dl.acm.org/doi/10.5555/3524938.3525741 (accessed on 24 October 2025).
  23. Rubin, D.B. The Bayesian Bootstrap. Ann. Stat. 1981, 9, 130–134. [Google Scholar] [CrossRef]
  24. Lam, H.; Liu, Z.Y. Bootstrap in High Dimension with Low Computation. In Proceedings of the 40th International Conference on Machine Learning (PMLR), Honolulu, HI, USA, 23–29 July 2023; Volume 202, pp. 18419–18453. Available online: https://proceedings.mlr.press/v202/lam23a.html (accessed on 24 October 2025).
  25. Kagerer, K. A Hat Matrix for Monotonicity Constrained B-Spline and P-Spline Regression; Technical Report; University of Regensburg: Regensburg, Germany, 2015; Available online: https://epub.uni-regensburg.de/31450/ (accessed on 24 October 2025).
  26. Nadaraya, E.A.; Kotz, S. Nonparametric Estimation of Probability Densities and Regression Curves; Springer: Dordrecht, The Netherlands, 1989; Available online: https://link.springer.com/book/10.1007/978-94-009-2583-0 (accessed on 24 October 2025).
  27. Efron, B. Bootstrap Methods: Another Look at the Jackknife. In Breakthroughs in Statistics: Methodology and Distribution; Springer: New York, NY, USA, 1992. [Google Scholar] [CrossRef]
  28. Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Morgan Kaufmann: San Francisco, CA, USA, 2014. [Google Scholar] [CrossRef]
  29. Wei, C.; Li, C.; Liu, Y.; Chen, S.; Zuo, Z.; Wang, P.; Ye, Z. Causal Discovery and Reasoning for Continuous Variables with an Improved Bayesian Network Constructed by Locality Sensitive Hashing and Kernel Density Estimation. Entropy 2025, 27, 123. [Google Scholar] [CrossRef]
  30. Spirtes, P.; Glymour, C.N.; Scheines, R. Causation, Prediction, and Search; MIT Press: Cambridge, MA, USA, 2000. [Google Scholar]
  31. Cover, T.M.; Thomas, J.A. Elements of Information Theory; John Wiley and Sons: New York, NY, USA, 1999. [Google Scholar] [CrossRef]
  32. Chow, C.K.; Liu, C. Approximating Discrete Probability Distributions with Dependence Trees. IEEE Trans. Inf. Theory 1968, 14, 462–467. [Google Scholar] [CrossRef]
  33. Kingma, D.P.; Welling, M. Auto-Encoding Variational Bayes. arXiv 2013, arXiv:1312.6114. [Google Scholar] [CrossRef]
  34. Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. In Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS), Montreal, QC, Canada, 8–13 December 2014; Available online: https://dl.acm.org/doi/proceedings/10.5555/2969033 (accessed on 24 October 2025).
  35. Noh, H.; You, T.; Mun, J.; Han, B. Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization. In Proceedings of the 31st International Conference on Neural Information Processing System (NIPS), Long Beach, CA, USA, 4–9 December 2017; Available online: https://dl.acm.org/doi/10.5555/3295222.3295264 (accessed on 24 October 2025).
  36. Hornik, K.; Stinchcombe, M.; White, H. Multilayer Feedforward Networks Are Universal Approximators. Neural Netw. 1989, 2, 359–366. [Google Scholar] [CrossRef]
  37. Hornik, K. Approximation Capabilities of Multilayer Feedforward Networks. Neural Netw. 1991, 4, 251–257. [Google Scholar] [CrossRef]
  38. Wei, H.; Xie, R.; Yang, L.; Xu, Z.; Li, Z. MetaInfoNet: Learning Task-Guided Information for Sample Reweighting. arXiv 2020, arXiv:2012.05273. [Google Scholar] [CrossRef]
  39. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: New York, NY, USA, 2009. [Google Scholar] [CrossRef]
  40. Bottou, L.; Curtis, F.E.; Nocedal, J. Optimization Methods for Large-Scale Machine Learning. SIAM Rev. 2018, 60, 223–311. [Google Scholar] [CrossRef]
  41. Wei, G.C.; Tanner, M.A. A Monte Carlo Implementation of the EM Algorithm for the Mixture of Experts Model. J. Am. Stat. Assoc. 1990, 85, 699–704. [Google Scholar] [CrossRef]
  42. Mohamed, S.; Lakshminarayanan, B. Learning in Implicit Generative Models. arXiv 2016, arXiv:1610.03483. [Google Scholar] [CrossRef]
  43. Kalisch, M.; Buhlmann, P. Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm. J. Mach. Learn. Res. 2007, 8, 613–636. Available online: https://dl.acm.org/doi/10.5555/1314498.1314520 (accessed on 24 October 2025).
  44. Arjovsky, M.; Chintala, S.; Bottou, L. Wasserstein Generative Adversarial Networks. In Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, Australia, 6–11 August 2017; Available online: https://dl.acm.org/doi/abs/10.5555/3305381.3305404 (accessed on 24 October 2025).
  45. Tsamardinos, I.; Brown, L.E.; Aliferis, C.F. The max-min hill-climbing Bayesian network structure learning algorithm. Mach Learn 2006, 65, 31–78. [Google Scholar] [CrossRef]
  46. Zhang, C.; Chen, S.; Ke, H. Research on Model Selection-Based Weighted Averaged One-Dependence Estimators. Mathematics 2024, 12, 2306. [Google Scholar] [CrossRef]
  47. Wang, L.; Zhang, S.; Mammadov, M.; Li, K.; Zhang, X. Semi-supervised Weighting for Averaged One-Dependence Estimators. Appl. Intell. 2022, 52, 4057–4073. [Google Scholar] [CrossRef]
  48. Yu, L.; Gan, S.; Chen, Y.; Luo, D. A Novel Hybrid Approach: Instance Weighted Hidden Naive Bayes. Mathematics 2021, 9, 2982. [Google Scholar] [CrossRef]
  49. Jiang, L.; Zhang, L.; Li, C.; Wu, J. A Correlation-Based Feature Weighting Filter for Naive Bayes. IEEE Trans. Knowl. Data Eng. 2019, 31, 201–213. [Google Scholar] [CrossRef]
  50. Friedman, N.; Goldszmidt, M.; Wyner, A. Data Analysis with Bayesian Networks: A Bootstrap Approach. arXiv 2013, arXiv:1301.6695. [Google Scholar] [CrossRef]
Figure 1. Graph network using autoencoder.
Figure 2. Two-stage algorithm framework of FGBMLE.
Figure 3. Estimating the probability density using two-dimensional data.
Figure 4. Sampling performance comparison using GMM.
Figure 5. Average computation time for the different sample sizes.
Figure 6. Density estimation results of the proposed TAN-FGBMLE method under a 2D trimodal distribution.
Figure 7. Dependency structure diagram of the AirQuality dataset based on Prim’s algorithm.
Table 1. Simulation scenarios of the FGBMLE method.

| Distribution | Gaussian Mixture Model (GMM) | Gamma Mixture Model (GaMM) |
|---|---|---|
| $d$-dimension | $\pi(\theta) = 0.5\,\mathcal{N}(\mu_1, \Sigma_1) + 0.5\,\mathcal{N}(\mu_2, \Sigma_2)$ | $\pi(\theta) = \prod_{j=1}^{d} \mathrm{Beta}(10, 5)$ |
| | $x \mid \theta \sim \mathcal{N}(\theta, I_d)$ | $x \mid \theta \sim \prod_{j=1}^{d} \mathrm{Gamma}(10, \theta_j)$ |
| | $\mu_1 = (3, 3, \ldots, 3) \in \mathbb{R}^d$, $\mu_2 = (-3, -3, \ldots, -3) \in \mathbb{R}^d$ | |
| | $\Sigma_1 = 2 I_d$, $\Sigma_2 = I_d$ | |
Table 2. Performance comparison of different methods under two-dimensional simulation scenarios.

| Model | Method | W1 | ISE | MSE | KL |
|---|---|---|---|---|---|
| GMM | FGBMLE | 0.335 | 0.009 | 0.006 | 0.045 |
| GMM | Bootstrap | 0.310 | 0.010 | 0.008 | 0.067 |
| GMM | KDE | 0.482 | 0.026 | 0.014 | 0.112 |
| GaMM | FGBMLE | 0.035 | 0.270 | 0.011 | 0.083 |
| GaMM | Bootstrap | 0.038 | 0.510 | 0.014 | 0.094 |
| GaMM | KDE | 0.072 | 0.693 | 0.023 | 0.141 |
Table 3. Density estimation results of different methods under a trimodal distribution.

| Method | W1 | ISE | MSE | KL |
|---|---|---|---|---|
| KDE | 0.154 | 0.0314 | 0.0125 | 0.0847 |
| GMM ($K = 2$) | 0.102 | 0.0246 | 0.0097 | 0.0632 |
| GMM ($K = 3$) | 0.068 | 0.0179 | 0.0064 | 0.0415 |
| TAN-FGBMLE | 0.041 | 0.0103 | 0.0048 | 0.0286 |
Table 4. Average SHD values of different methods under two sample sizes.

| Method | SHD (n = 1000) | SHD (n = 2000) |
|---|---|---|
| TAN-KDE | 0.4 ± 0.5 | 0.2 ± 0.4 |
| TAN-GMM | 0.5 ± 0.4 | 0.3 ± 0.6 |
| TAN-FGBMLE | 0.3 ± 0.4 | 0.2 ± 0.2 |
Table 5. Performance comparison of TAN-FGBMLE and competing models on the simulated dataset ($n = 2000$). Statistical significance is based on the Wilcoxon signed-rank test at $\alpha = 0.05$.

| Model | Accuracy (%) | Log-Likelihood |
|---|---|---|
| NB | 83.5 ± 1.2 | −2.37 ± 0.08 |
| TAN | 86.8 ± 0.9 | −2.13 ± 0.07 |
| AODE | 87.4 ± 0.8 | −2.09 ± 0.05 |
| WAODE | 87.9 ± 0.8 | −2.07 ± 0.05 |
| HNB | 88.1 ± 0.7 | −2.02 ± 0.05 |
| CFWNB | 88.5 ± 0.6 | −1.98 ± 0.04 |
| KDB-2 | 88.6 ± 0.6 | −1.97 ± 0.04 |
| Logistic Regression | 88.9 ± 0.6 | −1.95 ± 0.04 |
| Random Forest | 90.2 ± 0.5 | −1.89 ± 0.03 |
| RBF-SVM | 89.8 ± 0.5 | −1.90 ± 0.04 |
| TAN-FGBMLE | 91.1 ± 0.4 | −1.82 ± 0.03 |
Table 6. Description of the datasets used in experiments.

| No. | Dataset | Instances | Attributes | Classes |
|---|---|---|---|---|
| 1 | Abalone | 4177 | 8 | 3 |
| 2 | Breast Cancer | 569 | 30 | 2 |
| 3 | Car Evaluation | 1728 | 6 | 4 |
| 4 | Credit Approval | 690 | 15 | 2 |
| 5 | Dermatology | 366 | 34 | 6 |
| 6 | E. coli | 336 | 7 | 8 |
| 7 | Glass | 214 | 9 | 6 |
| 8 | Haberman | 306 | 3 | 2 |
| 9 | Heart Disease | 303 | 13 | 2 |
| 10 | ILPD | 583 | 9 | 2 |
| 11 | Ionosphere | 351 | 34 | 2 |
| 12 | Iris | 150 | 4 | 3 |
| 13 | Landsat Satellite | 2000 | 36 | 6 |
| 14 | Parkinsons | 195 | 22 | 2 |
| 15 | Pima Indians Diabetes | 768 | 8 | 2 |
| 16 | Student Performance | 649 | 33 | 2 |
| 17 | Vehicle | 846 | 18 | 4 |
| 18 | Wine | 178 | 13 | 3 |
| 19 | Wine Quality | 1599 | 11 | 10 |
| 20 | Yeast | 1484 | 8 | 10 |
Table 7. Classification accuracy comparison of the proposed method against existing classifiers.

| Dataset Name | TAN-KDE | NBC | FBC | KNN | C4.5 | NN | SVM | TAN-FGBMLE |
|---|---|---|---|---|---|---|---|---|
| Abalone | 0.512 ± 0.067 | 0.498 ± 0.058 | 0.505 ± 0.061 | 0.528 ± 0.064 | 0.490 ± 0.055 | 0.545 ± 0.060 | 0.551 ± 0.066 | 0.612 ± 0.059 |
| Breast Cancer | 0.861 ± 0.042 | 0.845 ± 0.046 | 0.850 ± 0.039 | 0.872 ± 0.040 | 0.858 ± 0.044 | 0.875 ± 0.036 | 0.868 ± 0.042 | 0.889 ± 0.038 |
| Car Evaluation | 0.902 ± 0.035 | 0.890 ± 0.033 | 0.896 ± 0.034 | 0.918 ± 0.037 | 0.910 ± 0.038 | 0.922 ± 0.032 | 0.916 ± 0.036 | 0.934 ± 0.031 |
| Credit Approval | 0.823 ± 0.047 | 0.812 ± 0.042 | 0.818 ± 0.043 | 0.834 ± 0.041 | 0.826 ± 0.048 | 0.842 ± 0.039 | 0.838 ± 0.044 | 0.856 ± 0.037 |
| Dermatology | 0.931 ± 0.028 | 0.920 ± 0.030 | 0.925 ± 0.027 | 0.936 ± 0.031 | 0.929 ± 0.029 | 0.942 ± 0.025 | 0.938 ± 0.028 | 0.951 ± 0.023 |
| E. coli | 0.825 ± 0.055 | 0.810 ± 0.052 | 0.814 ± 0.050 | 0.838 ± 0.048 | 0.822 ± 0.053 | 0.846 ± 0.047 | 0.840 ± 0.051 | 0.862 ± 0.046 |
| Glass | 0.673 ± 0.071 | 0.652 ± 0.074 | 0.660 ± 0.070 | 0.684 ± 0.072 | 0.676 ± 0.068 | 0.691 ± 0.066 | 0.687 ± 0.071 | 0.712 ± 0.065 |
| Haberman | 0.738 ± 0.058 | 0.720 ± 0.061 | 0.728 ± 0.057 | 0.741 ± 0.060 | 0.732 ± 0.062 | 0.752 ± 0.054 | 0.746 ± 0.059 | 0.764 ± 0.053 |
| Heart Disease | 0.836 ± 0.046 | 0.820 ± 0.048 | 0.828 ± 0.044 | 0.844 ± 0.047 | 0.832 ± 0.045 | 0.850 ± 0.041 | 0.846 ± 0.046 | 0.868 ± 0.040 |
| ILPD | 0.752 ± 0.062 | 0.740 ± 0.059 | 0.746 ± 0.061 | 0.758 ± 0.057 | 0.749 ± 0.060 | 0.763 ± 0.055 | 0.760 ± 0.058 | 0.778 ± 0.053 |
| Ionosphere | 0.887 ± 0.036 | 0.872 ± 0.039 | 0.880 ± 0.035 | 0.894 ± 0.038 | 0.885 ± 0.037 | 0.901 ± 0.033 | 0.896 ± 0.036 | 0.912 ± 0.032 |
| Iris | 0.955 ± 0.028 | 0.940 ± 0.032 | 0.948 ± 0.029 | 0.958 ± 0.027 | 0.950 ± 0.030 | 0.962 ± 0.026 | 0.959 ± 0.028 | 0.970 ± 0.025 |
| Landsat Satellite | 0.704 ± 0.065 | 0.688 ± 0.068 | 0.694 ± 0.064 | 0.716 ± 0.066 | 0.707 ± 0.067 | 0.724 ± 0.061 | 0.719 ± 0.065 | 0.738 ± 0.060 |
| Parkinsons | 0.823 ± 0.051 | 0.810 ± 0.054 | 0.816 ± 0.050 | 0.829 ± 0.052 | 0.820 ± 0.053 | 0.834 ± 0.049 | 0.828 ± 0.052 | 0.846 ± 0.047 |
| Pima Indians Diabetes | 0.775 ± 0.059 | 0.762 ± 0.061 | 0.770 ± 0.058 | 0.782 ± 0.056 | 0.774 ± 0.060 | 0.788 ± 0.055 | 0.784 ± 0.058 | 0.802 ± 0.054 |
| Student Performance | 0.741 ± 0.064 | 0.728 ± 0.067 | 0.734 ± 0.062 | 0.748 ± 0.066 | 0.739 ± 0.065 | 0.755 ± 0.060 | 0.750 ± 0.063 | 0.768 ± 0.058 |
| Vehicle | 0.746 ± 0.057 | 0.730 ± 0.060 | 0.738 ± 0.056 | 0.751 ± 0.059 | 0.742 ± 0.058 | 0.756 ± 0.054 | 0.752 ± 0.057 | 0.770 ± 0.052 |
| Wine | 0.944 ± 0.030 | 0.930 ± 0.033 | 0.938 ± 0.029 | 0.948 ± 0.031 | 0.940 ± 0.032 | 0.952 ± 0.027 | 0.949 ± 0.030 | 0.960 ± 0.026 |
| Wine Quality | 0.706 ± 0.062 | 0.691 ± 0.065 | 0.698 ± 0.061 | 0.711 ± 0.064 | 0.703 ± 0.063 | 0.718 ± 0.059 | 0.714 ± 0.062 | 0.732 ± 0.058 |
| Yeast | 0.602 ± 0.071 | 0.586 ± 0.074 | 0.592 ± 0.069 | 0.608 ± 0.072 | 0.598 ± 0.070 | 0.614 ± 0.066 | 0.610 ± 0.071 | 0.628 ± 0.065 |
| Average | 0.782 ± 0.054 | 0.768 ± 0.057 | 0.774 ± 0.053 | 0.789 ± 0.056 | 0.780 ± 0.055 | 0.795 ± 0.051 | 0.791 ± 0.054 | 0.812 ± 0.049 |
Table 8. Comparison of BEC of critical edges.

| Edge (Undirected) | TAN-KDE | TAN-FGBMLE |
|---|---|---|
| PM2.5–PM10 | 0.71 | 0.92 |
| temperature–NO2 | 0.64 | 0.88 |
| Proximity to industrial areas–SO2 | 0.62 | 0.85 |
| Proximity to industrial areas–CO | 0.58 | 0.81 |
| humidity–PM2.5 | 0.49 | 0.74 |
| CO–population density | 0.56 | 0.79 |