基于级联加性噪声模型的因果结构学习算法

doi:10.19678/j.issn.1000-3428.0060176

计算机工程 ›› 2022, Vol. 48 ›› Issue (1): 93-98. doi: 10.19678/j.issn.1000-3428.0060176

基于级联加性噪声模型的因果结构学习算法

乔杰¹, 蔡瑞初¹, 郝志峰²

1. 广东工业大学计算机学院, 广州 510006;
2. 佛山科学技术学院数学与大数据学院, 广东佛山 528000

收稿日期:2020-12-03 修回日期:2021-01-23 发布日期:2021-01-12
作者简介:乔杰(1993-),男,博士研究生,主研方向为数据挖掘、机器学习;蔡瑞初(通信作者)、郝志峰,教授、博士生导师。
基金资助:
国家自然科学基金（61876043，61976052）。

Causal Structure Learning Algorithm Based on Cascade Additive Noise Model

QIAO Jie¹, CAI Ruichu¹, HAO Zhifeng²

1. School of Computer, Guangdong University of Technology, Guangzhou 510006, China;
2. School of Mathematics and Big Data, Foshan University, Foshan, Guangdong 528000, China

Received:2020-12-03 Revised:2021-01-23 Published:2021-01-12

摘要/Abstract

摘要： 现有级联非线性加性噪声模型可解决隐藏中间变量的因果方向推断问题，然而对于包含隐变量和级联传递因果关系的因果网络学习存在全局结构搜索、等价类无法识别等问题。设计一种面向非时序观测数据的两阶段因果结构学习算法，第一阶段根据观测数据变量间的条件独立性，构建基本的因果网络骨架，第二阶段基于级联非线性加性噪声模型，通过比较骨架中每个相邻因果对在不同因果方向假设下的边缘似然度进行因果方向推断。实验结果表明，该算法在虚拟因果结构数据集的不同隐变量数量、平均入度、结构维度、样本数量下均表现突出，且在真实因果结构数据集中的F1值相比主流因果结构学习算法平均提升了51%，具有更高的准确率和更强的鲁棒性。

关键词: 因果结构学习, 加性噪声模型, 级联加性噪声模型, 因果发现, 函数式因果模型

Abstract: The existing cascade nonlinear Additive Noise Model(ANM) can infer the causal direction of hidden intermediate variables, but fail to deal with global structure search and equivalence class recognition in the case of causal network learning that includes hidden variables and cascade causality transferring.This paper presents a two-stage causal structure learning algorithm for non-chronological observation data.In the first stage, a basic causal network skeleton is constructed based on the conditional independence between the observation data variables. In the second stage, by using a cascaded nonlinear ANM, the causal direction of the edge likelihood under the assumptions of different causal directions is inferred by comparing each adjacent causality in the skeleton.The experimental results show that the algorithm has outstanding performance on the virtual causal structure dataset for a varying number of hidden variables, average in-degree, structural dimension, and number of samples.Furthermore, the F1 value of this algorithm on the real causal structure dataset improved by 51% on average compared with mainstream causal structure learning algorithms, displaying a higher accuracy and robustness.

Key words: causal structure learning, Additive Noise Model(ANM), Cascade Additive Noise Model(CANM), causal discovery, functional causal model

中图分类号:

TP301.6

乔杰, 蔡瑞初, 郝志峰. 基于级联加性噪声模型的因果结构学习算法[J]. 计算机工程, 2022, 48(1): 93-98.

QIAO Jie, CAI Ruichu, HAO Zhifeng. Causal Structure Learning Algorithm Based on Cascade Additive Noise Model[J]. Computer Engineering, 2022, 48(1): 93-98.

https://www.ecice06.com/CN/Y2022/V48/I1/93

图/表 9

20220108121914

20220108121917

20220108121922

20220108121927

20220108121930

20220108121934

20220108121938

20220108121943

20220108121947

参考文献

[1] SPIRTES P, GLYMOUR C, SCHEINES R.Causation, prediction, and search[M].Cambridge, USA:MIT Press, 2001.
[2] PEARL J.Causality:models, reasoning and inference[M].Cambridge, UK:Cambridge University Press, 2009.
[3] 蔡瑞初, 陈薇, 张坤, 等.基于非时序观察数据的因果关系发现综述[J].计算机学报, 2017, 40(6):1470-1490. CAI R C, CHEN W, ZHANG K, et al.A survey on non-temporal series observational data based causal discovery[J].Chinese Journal of Computers, 2017, 40(6):1470-1490.(in Chinese)
[4] CAI R C, ZHANG Z J, HAO Z F, et al.Understanding social causalities behind human action sequences[J].IEEE Transactions on Neural Networks and Learning Systems, 2017, 28(8):1801-1813.
[5] RUNGE J, BATHIANY S, BOLLT E, et al.Inferring causation from time series in earth system sciences[J].Nature Communications, 2019, 10:2553-2567.
[6] CAI R C, ZHANG Z J, HAO Z F.Causal gene identification using combinatorial v-structure search[J].Neural Networks, 2013, 43:63-71.
[7] HERNÁN M A, ROBINS J M.Causal inference[M].Boca Raton, USA:CRC Press, 2010.
[8] MOOIJ J M, PETERS J, JANZING D, et al.Distinguishing cause from effect using observational data:methods and benchmarks[J].Journal of Machine Learning Research, 2016, 17(1):1103-1204.
[9] COLOMBO D, MAATHUIS M H, KALISCH M, et al.Learning high-dimensional directed acyclic graphs with latent and selection variables[J].The Annals of Statistics, 2012, 40(1):294-321.
[10] OGARRIO J M, SPIRTES P, RAMSEY J.A hybrid causal search algorithm for latent variable models[C]//Proceedings of the 8th International Conference on Probabilistic Graphical Models.Lugano, Switzerland:[s.n.], 2016:368-379.
[11] CAI R C, QIAO J, ZHANG K, et al.Causal discovery with cascade nonlinear additive noise model[EB/OL].[2020-10-08].https://arxiv.org/abs/1905.09442v2.
[12] BRESSLER S L, SETH A K.Wiener-Granger causality:a well established methodology[J].Neuro Image, 2011, 58(2):323-329.
[13] 张义杰, 李培峰, 朱巧明.面向事件时序与因果关系的联合识别方法[J].计算机工程, 2020, 46(7):65-71. ZHANG Y J, LI P F, ZHU Q M.Joint identification method for temporal and causal relations of events[J].Computer Engineering, 2020, 46(7):65-71.(in Chinese)
[14] LAM W, BACCHUS F.Learning Bayesian belief networks:an approach based on the MDL principle[J].Computational Intelligence, 1994, 10(3):269-293.
[15] ANDERSSON S A, MADIGAN D, PERLMAN M D.A characterization of Markov equivalence classes for acyclic digraphs[J].The Annals of Statistics, 1997, 25(2):505-541.
[16] SHIMIZU S, HOYER P O, HYVÄRINEN A, et al.A linear non-Gaussian acyclic model for causal discovery[J].Journal of Machine Learning Research, 2006, 7:2003-2030.
[17] 姜枫, 朱辉生, 汪卫.含隐变量非高斯无环因果模型的估计算法[J].计算机工程, 2010, 36(9):178-180. JIANG F, ZHU H S, WANG W.Estimation algorithm for non-Gaussian acyclic causal model with latent variables[J].Computer Engineering, 2010, 36(9):178-180.(in Chinese)
[18] HOYER P, JANZING D, MOOIJ J M, et al.Nonlinear causal discovery with additive noise models[C]//Proceedings of the 22nd Annual Conference on Neural Information Processing Systems.New York, USA:ACM Press, 2008:689-696.
[19] ZHANG K, HYVÄRINEN A.On the identifiability of the post-nonlinear causal model[C]//Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence.[S.l.]:AUAI Press, 2009:647-655.
[20] HOYER P O, SHIMIZU S, KERMINEN A J, et al.Estimation of causal effects using linear non-Gaussian causal models with hidden variables[J].International Journal of Approximate Reasoning, 2008, 49(2):362-378.
[21] KINGMA D P, WELLING M.Auto-encoding variational Bayes[EB/OL].[2020-11-04].https://dare.uva.nl/search?identifier=cf65ba0f-d88f-4a49-8ebd-3a7fce86edd7.
[22] HORNIK K, STINCHCOMBE M, WHITE H.Multilayer feedforward networks are universal approximators[J].Neural Networks, 1989, 2(5):359-366.
[23] GÁMEZ J A, MATEO J L, PUERTA J M.Learning Bayesian networks by hill climbing:efficient methods based on progressive restriction of the neighborhood[J].Data Mining and Knowledge Discovery, 2011, 22(1/2):106-148.
[24] TSAMARDINOS I, BROWN L E, ALIFERIS C F.The max-min hill-climbing Bayesian network structure learning algorithm[J].Machine Learning, 2006, 65(1):31-78.

选择文件类型/文献管理软件名称

选择包含的内容

基于级联加性噪声模型的因果结构学习算法

Causal Structure Learning Algorithm Based on Cascade Additive Noise Model

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 9

参考文献

相关文章 5

编辑推荐

Metrics

本文评价

[1]	郝志峰, 丁凯培, 蔡瑞初, 陈薇. 基于非稳态加性噪声模型的因果发现算法[J]. 计算机工程, 2024, 50(4): 78-86.
[2]	卢小金, 陈薇, 郝志峰, 蔡瑞初. 基于因果自回归流模型的因果结构学习算法[J]. 计算机工程, 2024, 50(3): 131-136.
[3]	蔡瑞初, 伍运金, 陈薇, 郝志峰. 面向多元时间序列的群体因果关系发现算法[J]. 计算机工程, 2023, 49(2): 127-135.
[4]	郝志峰, 喻建华, 乔杰, 蔡瑞初. 基于结构方程似然框架的缺失值因果学习算法[J]. 计算机工程, 2023, 49(12): 63-70.
[5]	郝志峰, 陈正鸣, 谢峰, 陈薇, 蔡瑞初. 一种任意分布下的隐变量因果结构学习算法[J]. 计算机工程, 2022, 48(9): 121-129.

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于级联加性噪声模型的因果结构学习算法

Causal Structure Learning Algorithm Based on Cascade Additive Noise Model

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 9

参考文献

相关文章 5

编辑推荐

Metrics

本文评价