作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

一种贝叶斯网络结构学习的混合随机抽样算法?

胡春玲1,胡学钢2,吕 刚1   

  1. (1. 合肥学院网络与智能信息处理重点实验室,合肥 230602;2. 合肥工业大学计算机与信息学院,合肥 230009)
  • 收稿日期:2013-02-20 出版日期:2014-05-15 发布日期:2014-05-14
  • 作者简介:胡春玲(1970-),女,副教授、博士,主研方向:贝叶斯网络,数据挖掘;胡学钢,教授、博士、博士生导师;吕 刚,副教授、硕士。
  • 基金资助:
    国家自然科学基金资助项目“基于协同训练策略的不完全标记数据流分类问题研究”(61273292);安徽省自然科学基金资助项目“面向若干复杂场景的水平集图像分割关键技术研究”(1308085MF84);合肥学院人才基金资助项目“基于多数据源和概率图模型的基因调控网络建模与分析研究”(1308085MF84)。

A Hybrid Stochastic Sampling Algorithm for Bayesian Network Structure Learning

HU Chun-ling  1, HU Xue-gang  2, LV Gang  1   

  1. (1. Key Laboratory of Network and Intelligent Information Processing, Hefei University, Hefei 230602, China; 2. School of Computer and Information, Hefei University of Technology, Hefei 230009, China)
  • Received:2013-02-20 Online:2014-05-15 Published:2014-05-14

摘要: 贝叶斯网络结构学习的随机抽样算法存在收敛速度慢的问题,为此,结合均匀抽样和独立抽样,从初始样本、抽样方式和建议分布3个方面对抽样过程进行改进,提出一种混合型马尔可夫链蒙特卡罗抽样算法(HSMHS)。基于节点之间的互信息生成网络结构的初始样本,在迭代抽样阶段,按一定的概率随机选择均匀抽样和独立抽样,并根据当前抽样的样本总体计算独立抽样的建议分布,以改善抽样过程的融合性,加快收敛速度。对算法进行正确性分析,证明其抽样过程收敛于网络结构的后验概率分布,可保持较高的学习精度。在标准数据集上的实验结果表明,HSMHS算法的学习效率和精度均高于同类算法MHS、PopMCMC和Order-MCMC。

关键词: 贝叶斯网络, 结构学习, 随机抽样, 混合抽样, 子结构抽样, 建议分布

Abstract: According to slow convergence speed of stochastic sampling algorithm, based on uniform sampler and independent sampler, by improving convergence speed from the initial sample, sampling method and proposal distribution, a hybrid markov chain Monte Carlo sampling algorithm(HSMHS) is put forward in this paper. Based on mutual information between network nodes, it generates initial samples of network structure. In iteration sampling phase, according to certain probability distribution, it randomly selects uniform sampler or independent sampler, and computs proposal distribution of independent sampler based on the current samples to improve the mixing of sampling process. It can be proved that sampling process of HSMHS converges to the posterior probability of network structure, and the algorithm has a good learning accuracy. Experimental results on standard data set also verify that both learning efficiency and precision of HSMHS outperform classical algorithms MHS, PopMCMC and Order-MCMC.

Key words: Bayesian network, structure learning, stochastic sampling, hybrid sampling, sub-structure sampling, proposal distribution

中图分类号: