作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2026, Vol. 52 ›› Issue (5): 281-292. doi: 10.19678/j.issn.1000-3428.0070356

• 网络空间安全 • 上一篇    下一篇

非平衡场景下的模糊隐私集合交集基数协议

刘铭辉1, 张恩1,2,*(), 王梦涛1, 黄昱晨1   

  1. 1. 河南师范大学计算机与信息工程学院, 河南 新乡 453007
    2. 智慧商务与物联网技术河南省工程实验室, 河南 新乡 453007
  • 收稿日期:2024-09-11 修回日期:2024-10-27 出版日期:2026-05-15 发布日期:2024-12-13
  • 通讯作者: 张恩
  • 作者简介:

    刘铭辉(CCF学生会员), 男, 硕士研究生, 主研方向为密码学

    张恩(通信作者), 教授

    王梦涛, 硕士研究生

    黄昱晨, 硕士研究生

  • 基金资助:
    国家自然科学基金(62372157)

Fuzzy Private Set Intersection Cardinality Protocol in Unbalanced Scenarios

LIU Minghui1, ZHANG En1,2,*(), WANG Mengtao1, HUANG Yuchen1   

  1. 1. College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, Henan, China
    2. Engineering Lab of Intelligence Business and Internet of Things of Henan Province, Xinxiang 453007, Henan, China
  • Received:2024-09-11 Revised:2024-10-27 Online:2026-05-15 Published:2024-12-13
  • Contact: ZHANG En

摘要:

在接触者追踪应用场景中, 查询者与"敏感"人群的历史轨迹不完全相同, 导致隐私集合交集基数(PSI-CA)协议无法直接应用到接触者追踪这一场景。针对该问题, 提出一种非平衡模糊隐私集合交集基数(uFPSI-CA)协议。uFPSI-CA协议是隐私集合交集基数(PSI-CA)协议的一种变体, 该协议允许发送方和接收方通过交互共同计算其私有集合的交集的大小, 而不泄露其他任何信息。与PSI-CA协议不同的是, 双方的交集元素不完全相等, 而是有一定的相似性, 且发送方的集合大小远远大于接收方。为了将PSI-CA协议扩展到uFPSI-CA协议, 提出可分类洗牌的Diffie-Hellman不经意伪随机函数(CS-DH-OPRF)算法, 该算法在计算接收方的OPRF值时, 可以为同组数据添加一个常数的加密作为标签, 在后续计算过程中接收方按照标签对数据进行分类。uFPSI-CA协议将大量的计算交给作为服务器的发送方, 接收方仅做简单的元素加解密操作, 并且协议的总体通信开销与接收方的数据元素集合大小正相关。最后, 实现了uFPSI-CA协议, 在发送方集合大小为218、接收方集合大小为26时, 仅需4 s的在线时间及15 MB的通信开销, 证明了协议是高效的。

关键词: 非平衡, 模糊隐私集合交集基数, 全同态加密, 哈希技术, 多项式插值

Abstract:

In the contact tracing application scenario, the historical trajectories of the querier and the "sensitive" population are not completely identical, which makes the Private Set Intersection CArdinality (PSI-CA) protocol unable to be directly applied to this contact tracing scenario. To address this issue, an unbalanced fuzzy PSI-CA (uFPSI-CA) protocol is proposed. The uFPSI-CA protocol is a variant of the privacy-set intersection-cardinality protocol. This protocol allows the sender and receiver to jointly calculate the intersection size of their private sets via interactions, without revealing any other information. This protocol assigns many computations to the sender and behaves as a server. The receiver performs only simple element encryption and decryption. Additionally, the overall communication overhead of the protocol is positively correlated with the size of the receiver dataset. To extend PSI-CA to uFPSI-CA, a Classified Shuffle Diffie-Hellman oblivious Pseudo-Random Function (CS-DH-OPRF) algorithm is proposed. When computing the OPRF value of the receiver, the algorithm encrypts the same value for the same group of data as a label, and the receiver classifies the data according to the label in the subsequent calculation. Finally, the uFPSI-CA protocol is implemented. When the sender set size is 218 and the receiver set size is 26, only needs 4 s online time and 15 MB of communication overhead are required, which proves that the protocol is efficient.

Key words: unbalance, fuzzy Private Set Intersection CArdinality (PSI-CA), Fully Homomorphic Encryption (FHE), hash technology, polynomial interpolation