作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (2): 55-64. doi: 10.19678/j.issn.1000-3428.0060350

• 人工智能与模式识别 • 上一篇    下一篇

基于改进多嵌入空间的实时语义数据流推理

高峰1,2,3,4, 姚光涛1,2,3,4, 顾进广1,2,3,4   

  1. 1. 武汉科技大学 计算机科学与技术学院, 武汉 430065;
    2. 武汉科技大学 大数据科学与工程研究院, 武汉 430065;
    3. 智能信息处理与实时工业系统湖北省重点实验室, 武汉 430065;
    4. 富媒体数字出版内容组织与知识服务重点实验室, 武汉 430065
  • 收稿日期:2020-12-22 修回日期:2021-02-03 发布日期:2021-02-08
  • 作者简介:高峰(1986-),男,博士,主研方向为知识图谱、语义数据流处理;姚光涛,硕士研究生;顾进广,教授、博士、博士生导师。
  • 基金资助:
    国家自然科学基金(U1836118);国家社会科学基金重大计划(11&ZD189);湖北省自然科学基金(2018CFB194)。

Real-Time Semantic Data Flow Reasoning Based on Improved Multi-Embedding Space

GAO Feng1,2,3,4, YAO Guangtao1,2,3,4, GU Jinguang1,2,3,4   

  1. 1. School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China;
    2. Big Data Science and Engineering Research Institute, Wuhan University of Science and Technology, Wuhan 430065, China;
    3. Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan 430065, China;
    4. Key Laboratory of Content Organization and Knowledge Service of Rich Media Digital Publishing, Wuhan 430065, China
  • Received:2020-12-22 Revised:2021-02-03 Published:2021-02-08

摘要: 将语义数据流处理引擎与知识图谱嵌入表示学习相结合,可以有效提高实时数据流推理查询性能,但是现有的知识表示学习模型更多关注静态知识图谱嵌入,忽略了知识图谱的动态特性,导致难以应用于实时动态语义数据流推理任务。为了使知识表示学习模型适应知识图谱的在线更新并能够应用于语义数据流引擎,建立一种基于改进多嵌入空间的动态知识图谱嵌入模型PUKALE。针对传递闭包等复杂推理场景,提出3种嵌入空间生成算法。为了在进行增量更新时更合理地选择嵌入空间,设计2种嵌入空间选择算法。基于上述算法实现PUKALE模型,并将其嵌入数据流推理引擎CSPARQL-engine中,以实现实时语义数据流推理查询。实验结果表明,与传统的CSPARQL和KALE推理相比,PUKALE模型的推理查询时间分别约降低85%和93%,其在支持动态图谱嵌入的同时能够提升实时语义数据流推理准确率。

关键词: 语义数据流, 数据流引擎, 推理, 知识表示学习, 知识图谱

Abstract: The joint use of semantic data flow processing engine and knowledge graph embedding representation learning can effectively improve the performance of real-time data stream reasoning and query.The existing knowledge representation learning models pay more attention to static knowledge graph embedding, but ignore the dynamic features of knowledge graphs, so they are not well suited for real-time dynamic semantic data flow reasoning.In order to make the knowledge representation models adaptable to online update of knowledge graphs and applicable to semantic data flow engine, this paper proposes a dynamic knowledge graph embedding model named PUKALE based on improved multi-embedding space.For complex reasoning scenarios such as transitive closure, three algorithms are proposed for embedding space generation.Then two algorithms are proposed to optimize the selection of embedding space during incremental update.On the basis of these algorithms, the PUKALE model is realized and embedded into the CSPARQL-engine for real-time semantic data flow reasoning and query.The experimental results show that compared with traditional reasoning engines such as CSPARQL and KALE, the proposed PUKALE engine reduces the reasoning and query time by about 85% and 93%.It supports dynamic graph embedding, and can improve the accuracy of real-time semantic data flow reasoning.

Key words: semantic data flow, data flow engine, reasoning, knowledge representation learning, knowledge graph

中图分类号: