作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (21): 284-封三. doi: 10.3969/j.issn.1000-3428.2008.21.101

• 开发研究与设计技术 • 上一篇    

中药特性信息数据挖掘系统中的预处理设计

胡建军1,2   

  1. (1. 华南理工大学计算机科学与工程学院,广州 510640;2. 广东商学院信息学院,广州 510320)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-11-05 发布日期:2008-11-05

Design of Preprocessing in Data Mining System for Traditional Chinese Medicine Prescriptions Information

HU Jian-jun1,2   

  1. (1. School of Computer Science and Engineering, South China University of Technology, Guangzhou 510640; 2. Information Science School, Guangdong University of Business Studies, Guangzhou 510320)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-11-05 Published:2008-11-05

摘要: 中药数据的不规范,使预处理成为数据挖掘系统中的一个重要过程。该文开发中药特性信息数据挖掘系统,介绍系统结构与挖掘流程,分析中药数据的特征,对数据进行预处理,包括过滤噪声数据、中医药术语规范化、缺损数据处理、剂量单位规范化、作用度规一化、功效量化等。

关键词: 数据挖掘, 中药, 方剂, 数据预处理

Abstract: The description of Traditional Chinese Medicine(TCM) information is not uniform, so data preprocessing is a key process in data mining system. This paper develops data mining system for TCM prescription information, introduces the system architecture and the mining process are described, analyzes the characteristic of data in TCM, and preprocess data such as filtering noisy data, jargon of TCM uniform, absent data processing, measurement units of dose uniform, measurement of effect for medicine, quantity of effect and so on.

Key words: data mining, Traditional Chinese Medicine(TCM), prescription, data preprocessing

中图分类号: