Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2023, Vol. 49 ›› Issue (2): 127-135. doi: 10.19678/j.issn.1000-3428.0063674

• Artificial Intelligence and Pattern Recognition • Previous Articles     Next Articles

Collective Causal Relations Discovery Algorithm for Multivariate Time-Series

CAI Ruichu1, WU Yunjin1, CHEN Wei1, HAO Zhifeng1,2   

  1. 1. School of Computer, Guangdong University of Technology, Guangzhou 510006, China;
    2. College of Science, Shantou University, Shantou 515063, Guangdong, China
  • Received:2021-12-31 Revised:2022-02-28 Published:2022-07-05

面向多元时间序列的群体因果关系发现算法

蔡瑞初1, 伍运金1, 陈薇1, 郝志峰1,2   

  1. 1. 广东工业大学 计算机学院, 广州 510006;
    2. 汕头大学 理学院, 广东 汕头 515063
  • 作者简介:蔡瑞初(1983-),男,教授、博士生导师,主研方向为机器学习、数据挖掘;伍运金,硕士研究生;陈薇,博士;郝志峰,教授、博士生导师。
  • 基金资助:
    国家自然科学基金(61876043,61976052);中国博士后科学基金(2020M680225)。

Abstract: Causal discovery from multivariate time-series is a significant and fundamental problem in numerous disciplines.The existing multivariate time-series causal discovery methods learn the causal relations for each individual while some individuals may share the same causal relations;therefore, they may exploit data insufficiently.To this end, this study proposes a collective causal discovery algorithm for multivariate time-series, which is a two-stage algorithm.The first stage measures the similarity of individuals from the perspective of causal relations and clusters the individuals into different groups based on similarity without assigning the number of groups.The second stage involves learning the collective causal relations for each group using variational inference, which sufficiently utilizes the data of individuals in the same group.The experimental result shows that the proposed method outperforms existing methods on simulated data, and the AUC scores are improved by 5%-20%.On real data, the proposed algorithm can separate groups with different causal relations and determine the difference in causal relations for each group, which illustrates the capability of the proposed algorithm in causal discovery and multivariate time-series clustering.

Key words: collective causal discovery, multivariate time-series, causal relations, clustering, variational inference

摘要: 从多元时间序列观测数据中学习多个变量之间的因果关系是许多专业领域中的重要基本问题。现有的多元时间序列因果关系发现方法通常从每个个体的观测数据中学习个体因果关系,没有考虑部分个体之间可能存在相同的因果关系,导致样本利用不足。提出一种面向多元时间序列的群体因果关系发现算法。该算法分为2个阶段:第一阶段基于因果关系对个体之间的相似性进行度量,并把多个个体划分成多个群体,且无须指定群体的个数;第二阶段基于变分推断方法充分利用每个群体内的所有个体数据,从而学习群体因果关系。实验结果表明,该算法在多组不同参数生成的仿真数据上均具有较好的表现,与对比算法相比,AUC评分提升了5%~20%。在真实数据集中,该算法能够较好地区分具有不同因果关系的群体,并且能够学习到不同群体之间不同的因果关系,表明算法不仅具有因果关系发现能力,而且还具有多元时间序列聚类能力。

关键词: 群体因果发现, 多元时间序列, 因果关系, 聚类, 变分推断

CLC Number: