基于FP-T的多层关联规则并发挖掘

doi:10.3969/j.issn.1000-3428.2006.15.031

计算机工程 ›› 2006, Vol. 32 ›› Issue (15): 87-89. doi: 10.3969/j.issn.1000-3428.2006.15.031

基于FP-T的多层关联规则并发挖掘

何友全

重庆交通学院计算机与信息工程学院，重庆 400074

收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2006-08-05 发布日期:2006-08-05

Parallel Mining of Morelevel Association Based on FP-T

HE Youquan

School of Computer & Information Engineering, Chonqing Jiaotong University, Chongqing 400074

Received:1900-01-01 Revised:1900-01-01 Online:2006-08-05 Published:2006-08-05

摘要/Abstract

摘要： 现有的数据挖掘方法大致有两类：有候选项集和无候选项集，有候选项集的挖掘以Apriori算法为代表，其特点是产生大量的候选项集，重复多次扫描数据库，挖掘效率低，不适合大型数据库的挖掘。无候选项集的挖掘以FP-T方法为代表，但它不能同时挖掘多概念层的关联规则，对具有超大项ID的大型数据库，无法生成“树”结构，使用也受到限制。该文将FP-T原理引入多层关联规则的并发挖掘，通过构建一个特殊节点链的指针表，可实现超大规模数据库的并发、多层挖掘。对实现物流系统信息自动化及其它数据挖掘应用领域都具有极其重要的指导意义。

关键词: 数据挖掘, 并发挖掘, 关联规则, 物流

Abstract: Present data mining method have two kinds: one has candidate itemset generation and the other without. The former, for example Apriori algorithm, has follow some disadvantages: produce large candidate itemsets, scan database repeat, mining data inefficiently, is not suit to large database mining. The latter, for example FP-T algorithm, has follow some disadvantages: do not mine association rule of more concept level parallel, do not produce tree structure of large database. Principle of FP-T is introduced into association rule of more concept level parallel mining, by building a special note-link point table, can realize parallel mining of large database. It is of important meaning about realizing modernization of interflow of commodities and some other application respect of data mining.

Key words: Data mining, Parallel mining, Association rule, Commodities interflow

中图分类号:

TP311.12

何友全. 基于FP-T的多层关联规则并发挖掘[J]. 计算机工程, 2006, 32(15): 87-89.

HE Youquan. Parallel Mining of Morelevel Association Based on FP-T[J]. Computer Engineering, 2006, 32(15): 87-89.

http://www.ecice06.com/CN/Y2006/V32/I15/87

[1]	席荣康, 蔡满春, 芦天亮. 基于数据增强与流数据处理的Tor流量分析模型[J]. 计算机工程, 2023, 49(3): 177-184.
[2]	谷青竹, 董红斌. PPDM中面向k-匿名的MI Loss评估模型[J]. 计算机工程, 2022, 48(4): 143-147.
[3]	王璐, 刘晓清, 何震瀛. 连续时间区间内的频繁词序列挖掘算法[J]. 计算机工程, 2022, 48(2): 79-85,91.
[4]	张攀, 高丰, 周逸, 饶涵宇, 毛冬, 李静. 一种在线实时微服务调用链异常检测方法[J]. 计算机工程, 2022, 48(11): 161-169.
[5]	吴军, 欧阳艾嘉, 张琳. 面向置换检验的冗余对比模式过滤算法[J]. 计算机工程, 2022, 48(1): 75-84.
[6]	吴军, 欧阳艾嘉, 张琳. 面向对比序列模式发现的独立精确置换检验算法[J]. 计算机工程, 2021, 47(8): 45-53,61.
[7]	杜诗晴, 王鹏, 汪卫. 一种基于MDL的日志序列模式挖掘算法[J]. 计算机工程, 2021, 47(2): 118-125.
[8]	刘治国, 蔡文珠, 李运琪, 潘成胜. 基于序列统计的未知无线协议特征提取方法[J]. 计算机工程, 2021, 47(11): 192-197.
[9]	魏文浩, 唐泽坤, 刘刚. 基于距离和密度的PBK-means算法[J]. 计算机工程, 2020, 46(9): 68-75.
[10]	史明阳, 王鹏, 汪卫. 有监督时间序列分割与状态识别算法[J]. 计算机工程, 2020, 46(5): 131-138.
[11]	王玉奇, 高建华. 一种基于关联规则的Web应用统计测试方法[J]. 计算机工程, 2020, 46(3): 206-213.
[12]	李洁, 朱洪亮, 陈玉玲, 辛阳. 基于哈希存储与事务加权的并行Apriori改进算法[J]. 计算机工程, 2020, 46(11): 109-116.
[13]	张潘, 卢光跃, 吕少卿, 赵雪莉. 基于矩阵分解的属性网络表示学习[J]. 计算机工程, 2020, 46(10): 67-73.
[14]	王慧健, 刘峥, 李云, 李涛. 基于神经网络语言模型的时间序列趋势预测方法[J]. 计算机工程, 2019, 45(7): 13-19,25.
[15]	张玺君, 袁占亭, 张红, 高玮军, 张恩展. 交通轨迹大数据预处理方法研究[J]. 计算机工程, 2019, 45(6): 26-31.

选择文件类型/文献管理软件名称

选择包含的内容

基于FP-T的多层关联规则并发挖掘

Parallel Mining of Morelevel Association Based on FP-T

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于FP-T的多层关联规则并发挖掘

Parallel Mining of Morelevel Association Based on FP-T

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价