TAN-FGBMLE: Tree-Augmented Naive Bayes Structure Learning Based on Fast Generative Bootstrap Maximum Likelihood Estimation for Continuous-Variable Classification

TAN-FGBMLE:基于快速生成式自助法最大似然估计的树增强朴素贝叶斯结构学习,用于连续变量分类

阅读:1

Abstract

Tree-Augmented Naive Bayes (TAN) is an interpretable graphical structure model. However, its structure learning for continuous attributes depends on the class-conditional mutual information, which is sensitive to one-dimensional or two-dimensional density estimation. Accurate estimation is challenging under complex distributions such as multi-peak, long-tailed and heteroscedastic cases. To address this issue, we propose a structure learning method for TAN based on Fast Generative Bootstrap Maximum Likelihood Estimation (TAN-FGBMLE). FGBMLE consists of two stages of work. In the first stage, resampling weights and random noise are input into a network generator to rapidly produce candidate parameters, efficiently covering the latent density space without repeated independent optimization. In the second stage, optimal mixture weights are estimated by maximum likelihood estimation, assigning appropriate contributions to each candidate component. This design enables fast and accurate complex density estimation for both single and joint attributes, providing reliable computation of class-conditional mutual information. The TAN structure is then constructed using Prim's maximum spanning tree algorithm. Experiments show that our estimation method attains higher fitting accuracy and lower runtime compared with traditional nonparametric estimators. By using open-source datasets, the TAN-FGBMLE achieves superior accuracy and recall compared to classic methods, demonstrating good robustness and interpretability. On publicly available real air quality data, it has a high classification result and produces graph structures that more accurately capture dependencies among continuous attributes.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。