Classification Rules Learning Algorithm Based on Selectivity

doi:10.3969/j.issn.1000-3428.2014.08.034

Abstract

Many rule-based classifiers use a single measure to select attribute values. As a result, many attribute-value pairs receive the same score, and it is difficult to tell which pair is best. In addition, rule-based classifiers usually extract only rules with 100% confidence, so rule extraction is slow and the resulting rules have very low support. To address these problems, this paper proposes a new measure called selectivity. Selectivity is a composite of three measures, so it can better discriminate among candidate attribute values. Based on selectivity, the paper develops a new algorithm, LRSM, for extracting rules. When the number of negative instances covered by a rule falls below a threshold, LRSM stops refining that rule and begins extracting the next one. Experimental results show that LRSM achieves high accuracy and reduces running time.

Key words: data mining, classification, FOIL algorithm, LRSM algorithm, deviation, selectivity

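Abstract (translated from the Chinese version): Rule-based classifiers usually use a single measure to select attribute values. However, a single measure gives many attribute values the same score, so the "good" attribute values cannot be singled out. In addition, rule-based classifiers usually extract rules with 100% confidence, which makes rule extraction time-consuming and yields rules with low support. To address these shortcomings, a new attribute-value measure, selectivity, is proposed. Selectivity combines three measures (information entropy, class support, and deviation) and better distinguishes good attribute values from poor ones. On this basis, LRSM, a classification rule learning algorithm based on selectivity, is proposed. In LRSM, when the number of negative instances covered by a rule is less than a given threshold, the rule is extracted, the instances it covers are removed, and the next rule is extracted. Experimental results show that, compared with the FOIL algorithm, LRSM improves classification accuracy while markedly reducing the time spent on classification.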
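The rule-extraction loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract names the three components of selectivity (information entropy, class support, deviation) but does not give their formulas or how they are combined, so the definitions in `selectivity`, the lexicographic tuple ordering, the `"class"` key, and all function names below are assumptions made for the example.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def selectivity(rows, pair, target):
    """Hypothetical multi-measure score for an attribute-value pair.

    Returns a tuple compared lexicographically (larger is better), so ties
    on one measure are broken by the next. The paper's actual definition
    may differ.
    """
    attr, value = pair
    covered = [r for r in rows if r[attr] == value]
    pos = sum(r["class"] == target for r in covered)
    if pos == 0:
        return None  # the pair covers no positive instance
    total_pos = sum(r["class"] == target for r in rows)
    class_support = pos / total_pos                          # assumed definition
    deviation = pos / len(covered) - total_pos / len(rows)   # assumed definition
    return (-entropy([r["class"] for r in covered]), class_support, deviation)

def covers(rule, row):
    return all(row[a] == v for a, v in rule.items())

def learn_rule(rows, attrs, target, max_neg):
    """Grow one rule until it covers fewer than max_neg negative instances."""
    rule, covered = {}, rows
    while sum(r["class"] != target for r in covered) >= max_neg:
        candidates = sorted({(a, r[a]) for r in covered
                             for a in attrs if a not in rule})
        scored = [(selectivity(covered, p, target), p) for p in candidates]
        scored = [(s, p) for s, p in scored if s is not None]
        if not scored:
            break  # nothing left to refine with
        _, (attr, value) = max(scored, key=lambda sp: sp[0])
        rule[attr] = value
        covered = [r for r in covered if r[attr] == value]
    return rule

def learn_rules(rows, attrs, target, max_neg=1):
    """Sequential covering: extract a rule, drop what it covers, repeat."""
    rules, remaining = [], list(rows)
    while any(r["class"] == target for r in remaining):
        rule = learn_rule(remaining, attrs, target, max_neg)
        rules.append(rule)
        remaining = [r for r in remaining if not covers(rule, r)]
    return rules

# Toy data, invented for illustration only.
ROWS = [
    {"outlook": "sunny", "windy": "no",  "class": "play"},
    {"outlook": "sunny", "windy": "yes", "class": "play"},
    {"outlook": "rain",  "windy": "yes", "class": "stay"},
    {"outlook": "rain",  "windy": "no",  "class": "play"},
    {"outlook": "rain",  "windy": "yes", "class": "stay"},
    {"outlook": "sunny", "windy": "yes", "class": "play"},
]

print(learn_rules(ROWS, ["outlook", "windy"], "play", max_neg=1))
# → [{'outlook': 'sunny'}, {'windy': 'no'}]
```

In the first iteration, `outlook=sunny` and `windy=no` both cover only positive instances, so an entropy-only score would tie; the class-support component breaks the tie in favour of `outlook=sunny`. Raising `max_neg` lets a rule stop growing while it still covers a few negatives, which is the trade-off the abstract describes between 100% confidence and extraction time.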


CLC Number: TP18

HE Tian-zhong, ZHOU Zhong-mei, HUANG Zai-xiang. Classification Rules Learning Algorithm Based on Selectivity[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2014.08.034.



URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2014.08.034

http://www.ecice06.com/EN/Y2014/V40/I8/179


