Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2007, Vol. 33 ›› Issue (11): 190-192. doi: 10.3969/j.issn.1000-3428.2007.11.069

• Artificial Intelligence and Recognition Technology • Previous Articles     Next Articles

Semantic Analyzing Technology of Web Document

GUO Yong1,2   

  1. (1. Information System and Management College, National University of Defense Technology, Changsha 410073; 2. Beijing Institute of System Engineering, Beijing 100101)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-06-05 Published:2007-06-05

基于语义的Web文本分析技术

郭 勇1,2   

  1. (1. 国防科技大学信息系统与管理学院,长沙 410073;2. 北京系统工程研究所,北京 100101)

Abstract:

Semantic technology can improve the analyzing accuracy of the Web documents. Two semantic technologies are introduced in this paper: concept semantic technology and formalization semantic technology. The concept semanteme based on non-negative matrix factorization is a novel method, which can consider the concept semantic accuracy and the complexity of algorithm. Ontology is a current formalization semantic technology. Ontology-based information systems suffer from the problem of ontology heterogeneity. It introduces the definitions of simplified multielement bounds to find the best approximations of the concepts, and presents the idea for searching the simplified multielement bounds.

Key words: Concept semanteme, Formalization semanteme, Ontology, Approximation

摘要: 语义技术能够提高Web文本分析的精度。该文介绍了两种语义技术:概念语义技术和形式化语义技术。非负矩阵分解方法获取的概念语义技术同时满足概念语义的准确性和算法复杂性要求。本体是一种流行的形式化语义技术,基于本体的信息系统中通常存在本体的异构问题。引入概念的最简多元界定义来寻找概念的最佳近似,提供了寻找概念最简多元界的算法思想。

关键词: 概念语义, 形式化语义, 本体, 近似

CLC Number: