Process Model Mining Method Based on Decomposed Cycle Structure

Computer Engineering ›› 2023, Vol. 49 ›› Issue (11): 94-105, 114. doi: 10.19678/j.issn.1000-3428.0066258

Kang WANG¹, Cong LIU¹,², Lu WANG³,*, Qingtian ZENG¹

1. School of Electronic Information Engineering, Shandong University of Science and Technology, Qingdao 266590, Shandong, China
2. School of Computer Science and Technology, Shandong University of Technology, Zibo 255000, Shandong, China
3. College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, Shandong, China

Received: 2022-11-14 Online: 2023-11-15 Published: 2023-11-08
Contact: Lu WANG
About the authors:
Kang WANG (born 1998), male, master's student; main research interest: process mining
Cong LIU, professor, Ph.D.
Qingtian ZENG, professor, Ph.D.
Supported by:
National Natural Science Foundation of China (61902222); Taishan Scholars Program of Shandong Province (ts20190936); Taishan Scholars Program of Shandong Province (tsqn201909109); Excellent Youth Fund of the Natural Science Foundation of Shandong Province (ZR2021YQ45); Science and Technology Program for Young Innovative Teams of Higher Education Institutions of Shandong Province (QC2021948080); Youth Fund for Humanities and Social Sciences Research of the Ministry of Education (20YJCZH159); Youth Fund of the Natural Science Foundation of Shandong Province (ZR2022QF020)

Abstract:

Model mining, one of the most active areas of process mining, aims to generate models that describe business processes from event logs. Event logs may contain activities with decomposable cyclic dependencies. Such activities can neither be removed by filtering infrequent activities nor be treated as chaotic activities, and they lower the precision of the discovered process models. Existing methods cannot divide a noisy event log according to the presence or absence of cyclic structures, so they cannot correctly identify activities with decomposable cyclic dependencies in the sub-log without cyclic structures; they also depend on activity attributes. To overcome these shortcomings and improve the quality of mined models, a decomposed process model mining framework that separates cyclic structures from decomposable cyclic dependencies is proposed. First, a heuristic method divides the event log into two parts according to whether a cyclic structure is present, and activities with decomposable cyclic dependencies are identified in the part without cyclic structures from the frequencies of the reachability relations and directly-follows relations between activities. Then, these activities are filtered out of the part with cyclic structures so that the cyclic structure of the event log can be identified and the log projected into a set of sub-logs. Finally, existing process model mining techniques are used to mine sub-models, and the sub-models are merged according to the branch structure relationships of their boundary activities. The method is implemented on the ProM platform and compared quantitatively, on public event logs, with the direct use of Inductive Miner, the maximal decomposition framework, and stage-based discovery of business process models. Experiments indicate that, compared with these methods, the precision of the proposed method is 0.08-0.42 higher and the complexity is 3.86-45.92 lower.

Key words: decomposed process mining, model mining, heuristic mining, decomposable cyclic dependencies, model quality
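The abstract names three log-level operations: splitting the event log by presence of cyclic structure, counting directly-follows and reachability relations, and projecting the log onto sub-logs. The following Python sketch illustrates these primitives on a toy log. It is not the authors' algorithm: the function names are hypothetical, and "a trace repeats an activity" is used here only as a crude stand-in for "has a cyclic structure", since the abstract does not state the actual heuristic or thresholds.

```python
from collections import Counter


def directly_follows(log):
    """Count how often activity a is immediately followed by activity b."""
    counts = Counter()
    for trace in log:
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts


def reachability(log):
    """Count the traces in which b occurs somewhere after a (once per trace)."""
    counts = Counter()
    for trace in log:
        pairs = set()
        for i, a in enumerate(trace):
            for b in trace[i + 1:]:
                pairs.add((a, b))
        counts.update(pairs)
    return counts


def split_by_repetition(log):
    """Assumed proxy: a trace is 'cyclic' if some activity occurs more than once."""
    acyclic = [t for t in log if len(set(t)) == len(t)]
    cyclic = [t for t in log if len(set(t)) != len(t)]
    return acyclic, cyclic


def project(log, activities):
    """Keep only events whose activity is in `activities`; drop empty traces."""
    projected = [[a for a in trace if a in activities] for trace in log]
    return [t for t in projected if t]


# Toy event log: each trace is a list of activity names.
log = [
    ["a", "b", "c", "d"],
    ["a", "c", "b", "d"],
    ["a", "b", "c", "b", "c", "d"],
]

acyclic, cyclic = split_by_repetition(log)
df_counts = directly_follows(log)
reach_counts = reachability(acyclic)
sub_log = project(log, {"b", "c"})
```

In this toy log, `b` and `c` are each reachable from the other in the repetition-free traces, which is the kind of frequency evidence the abstract says is used to flag candidate activities; how the two frequencies are actually combined is not specified on this page.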

Kang WANG, Cong LIU, Lu WANG, Qingtian ZENG. Process Model Mining Method Based on Decomposed Cycle Structure[J]. Computer Engineering, 2023, 49(11): 94-105, 114.

http://www.ecice06.com/CN/Y2023/V49/I11/94

Figures/Tables 9

Fig. 1 Causal matrix and causal dependency graph

Fig. 2 Framework of the decomposed process mining method based on cyclic structure

Fig. 3 Cyclic structure

Fig. 4 Precision comparison experiment results

Fig. 5 Fitness comparison experiment results

Fig. 6 F1 value comparison experiment results
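
For orientation when reading Figs. 4 to 6: the F1 value is presumably the harmonic mean of fitness and precision, as is common in process discovery evaluations. This page does not give the formula, so the expression below is an assumption rather than the authors' stated definition.

```latex
F_1 = \frac{2 \cdot \mathrm{fitness} \cdot \mathrm{precision}}{\mathrm{fitness} + \mathrm{precision}}
```

Under this reading, a model with fitness 0.9 and precision 0.6 would score F1 = 1.08 / 1.5 = 0.72, so a gain in precision raises F1 only while fitness holds up.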

