
Computer Engineering

• Advanced Computing and Data Processing

Precise Keywords Query for Relational Database in CMS Experiment

LIANG Dong 1, ZANG Dong-song 1, SUN Gong-xing 1, Valentin Kuznetsov 2

  1. Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China; 2. Cornell University, Ithaca 14850, USA
  • Received: 2012-11-15 Online: 2014-01-15 Published: 2014-01-13
  • About the authors: LIANG Dong (born 1984), male, Ph.D. candidate, main research interests: data discovery and distributed systems; ZANG Dong-song, Ph.D. candidate; SUN Gong-xing, researcher and doctoral supervisor; Valentin Kuznetsov, associate researcher
  • Supported by:
    National Natural Science Foundation of China (6116140454)

Abstract: In the complex data environment of the Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider (LHC), several relational data sources record how the CMS data are organized and distributed. To give the data query system a precise keyword query capability, this paper designs and implements a keyword query system that works across database platforms. By analyzing the database schema graph, the system dynamically translates the keyword Query Language (QL) into Structured Query Language (SQL). The key issue in this translation is ambiguity, and two algorithms are proposed to resolve it: a schema graph analysis algorithm based on query entities, and a dynamic join algorithm based on a minimum-weight tree search. Experimental results show that the dynamic join algorithm correctly generates the joins between the database tables that a keyword query requires, and that the resulting system is efficient enough to meet users' needs for real-time, precise queries.

Key words: keywords query, Query Language (QL), relational database, Structured Query Language (SQL), ambiguity problem

CLC Number:
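
The abstract describes translating a keyword query into SQL by finding a low-weight tree in the database schema graph that connects the tables the query needs. The sketch below illustrates that general idea only and is not the authors' implementation. The table names, join conditions, edge weights, the `ENTITY_COLUMNS` mapping, and the functions `join_tree` and `translate` are all hypothetical, and the tree search is a simple greedy shortest-path heuristic that may differ from the algorithm in the paper.

```python
import heapq
from collections import defaultdict

# Hypothetical schema graph: (table_a, table_b, join condition, weight).
SCHEMA_EDGES = [
    ("dataset", "block", "dataset.id = block.dataset_id", 1),
    ("block", "file", "block.id = file.block_id", 1),
    ("file", "file_lumi", "file.id = file_lumi.file_id", 1),
    ("file_lumi", "run", "file_lumi.run_id = run.id", 1),
    ("dataset", "run", "dataset.id = run.dataset_id", 5),
]

# Hypothetical mapping from query entities (keywords) to table columns.
ENTITY_COLUMNS = {
    "dataset": "dataset.name",
    "file": "file.name",
    "run": "run.number",
}

GRAPH = defaultdict(list)
for a, b, cond, w in SCHEMA_EDGES:
    GRAPH[a].append((b, cond, w))
    GRAPH[b].append((a, cond, w))


def join_tree(graph, tables):
    """Greedy heuristic: grow a tree from the first required table and
    repeatedly attach the nearest missing table via its cheapest path."""
    tables = list(dict.fromkeys(tables))  # drop duplicates, keep order
    in_tree = {tables[0]}
    missing = set(tables[1:])
    joins = []  # (table, join condition) in the order they are joined
    while missing:
        # Multi-source Dijkstra starting from every table already in the tree.
        dist = {t: 0 for t in in_tree}
        prev = {}
        heap = [(0, t) for t in in_tree]
        heapq.heapify(heap)
        target = None
        while heap:
            d, node = heapq.heappop(heap)
            if d > dist.get(node, float("inf")):
                continue  # stale heap entry
            if node in missing:
                target = node
                break
            for nxt, cond, w in graph[node]:
                nd = d + w
                if nd < dist.get(nxt, float("inf")):
                    dist[nxt] = nd
                    prev[nxt] = (node, cond)
                    heapq.heappush(heap, (nd, nxt))
        if target is None:
            raise ValueError("required tables are not connected in the schema graph")
        # Walk back from the target to the tree, then join outward from the tree.
        path = []
        node = target
        while node not in in_tree:
            parent, cond = prev[node]
            path.append((node, cond))
            node = parent
        for tbl, cond in reversed(path):
            joins.append((tbl, cond))
            in_tree.add(tbl)
        missing -= in_tree
    return tables[0], joins


def translate(find, filters):
    """Translate 'find <entity> where <entity>=<value>' into parameterized SQL."""
    columns = [ENTITY_COLUMNS[find]] + [ENTITY_COLUMNS[k] for k in filters]
    tables = [c.split(".")[0] for c in columns]
    root, joins = join_tree(GRAPH, tables)
    sql = f"SELECT {ENTITY_COLUMNS[find]} FROM {root}"
    for tbl, cond in joins:
        sql += f" JOIN {tbl} ON {cond}"
    where = " AND ".join(f"{ENTITY_COLUMNS[k]} = ?" for k in filters)
    if where:
        sql += f" WHERE {where}"
    return sql, list(filters.values())


sql, params = translate("file", {"dataset": "/a/b/c"})
print(sql)
print(params)
```

With this toy schema, asking for files of a dataset joins `file` to `dataset` through `block`, because that path has the lowest total weight. Asking for runs of a dataset avoids the direct `dataset`–`run` edge (weight 5) in favor of the four-edge path of total weight 4, which shows how edge weights can steer the choice between ambiguous join paths.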