Computer Engineering ›› 2024, Vol. 50 ›› Issue (11): 350-359. doi: 10.19678/j.issn.1000-3428.0068379

• Development Research and Engineering Application •

Method of Noise Robust Learning Based Chest X-ray Discrimination of Pneumoconiosis

CUI Jinying1, LIANG Lihe1, REN Xueting1, QIANG Yan1,2, ZHAO Juanjuan3,4, KONG Xiaomei5,*, YU Xiao5, ZHANG Hua6

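  1. College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan 030000, Shanxi, China
    2. School of Software, North University of China, Taiyuan 030000, Shanxi, China
    3. School of Software, Taiyuan University of Technology, Taiyuan 030000, Shanxi, China
    4. School of Data Science and Information Engineering, Jinzhong College of Information, Jinzhong 030600, Shanxi, China
    5. NHC Key Laboratory of Pneumoconiosis/Shanxi Key Laboratory of Respiratory Diseases/Department of Respiratory and Critical Care Medicine, The First Hospital of Shanxi Medical University, Taiyuan 030000, Shanxi, China
    6. Department of Radiology, The First Hospital of Shanxi Medical University, Taiyuan 030000, Shanxi, China
  • Received: 2023-09-12 Published: 2024-11-15 Released online: 2024-02-22
  • Corresponding author: KONG Xiaomei
  • Funding:
    Key Program of the National Natural Science Foundation of China (U21A20469); Special Fund for Basic Scientific Research of Central Public Welfare Research Institutes (N2020-PT320-005); Open Project of the NHC Key Laboratory of Pneumoconiosis (YKFKT004)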

Method of Noise Robust Learning Based Chest X-ray Discrimination of Pneumoconiosis

CUI Jinying1, LIANG Lihe1, REN Xueting1, QIANG Yan1,2, ZHAO Juanjuan3,4, KONG Xiaomei5,*, YU Xiao5, ZHANG Hua6

  1. College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan 030000, Shanxi, China
    2. School of Software, North University of China, Taiyuan 030000, Shanxi, China
    3. School of Software, Taiyuan University of Technology, Taiyuan 030000, Shanxi, China
    4. School of Data Science and Information Engineering, Jinzhong College of Information, Jinzhong 030600, Shanxi, China
    5. NHC Key Laboratory of Pneumoconiosis/Shanxi Key Laboratory of Respiratory Diseases/Department of Respiratory and Critical Care Medicine, The First Hospital of Shanxi Medical University, Taiyuan 030000, Shanxi, China
    6. Department of Radiology, The First Hospital of Shanxi Medical University, Taiyuan 030000, Shanxi, China
  • Received: 2023-09-12 Published: 2024-11-15 Released online: 2024-02-22
  • Contact: KONG Xiaomei

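Abstract:

When training on medical images with noisy annotations, the common approach is to partition the noisily labeled dataset by training loss so as to filter out samples with noisy labels. This approach, however, leaves two problems open: how to screen out noisy samples while retaining as many as possible of the hard samples whose loss distribution resembles theirs, and how to raise sample utilization by mining the useful information hidden in noisy samples, thereby mitigating model overfitting. To address these problems, a sample-distribution-guided noise-robust learning strategy (SGRL) is proposed, consisting of sample partitioning and semi-supervised contrastive classification. To separate informative hard samples from harmful noisy samples more reliably, a noise-filter sample selection method is introduced. In addition, an enhanced matching contrastive network is proposed that trains on all samples, yielding a noise-robust classification model. On this basis, contrastive learning is used as a complement to further counter the memorization of noisy labels and improve screening accuracy. Experimental results show that the method achieves significant performance gains on pneumoconiosis chest X-ray datasets with noise ratios of 5%, 10%, 20%, and 40%. Compared with existing state-of-the-art methods, its screening accuracy improves on average by 5.88, 7.05, 7.59, and 6.19 percentage points, respectively, which verifies the effectiveness of the improved method.

Keywords: noisy labels, pneumoconiosis screening, hard-sample awareness, weakly supervised learning, medical image classification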

Abstract:

When training on medical images with noisy annotations, the prevailing approach partitions the noisily labeled dataset by training loss in order to filter out mislabeled samples. However, this approach leaves two issues unresolved: first, how to filter out noisy samples while retaining, as far as possible, the hard samples whose loss distribution is similar; and second, how to improve sample utilization by recovering the useful information hidden in noisy samples, thereby alleviating model overfitting. To address these challenges, this study proposes a Sample Distribution Guided Noise Robust Learning strategy (SGRL) comprising sample partitioning and semi-supervised contrastive classification. A straightforward yet effective sample selection method, the noise filter, is introduced to distinguish informative hard samples from detrimental noisy samples more reliably. In addition, an enhanced matching contrastive network is proposed that trains on all samples, yielding a noise-robust classification model. Contrastive learning is used as a complement to further counter the memorization of noisy labels and improve screening accuracy. The experimental results show significant performance improvements on pneumoconiosis chest X-ray datasets with noise ratios of 5%, 10%, 20%, and 40%. Compared with existing state-of-the-art methods, the screening accuracy of the proposed method increases by an average of 5.88, 7.05, 7.59, and 6.19 percentage points, respectively, validating the effectiveness of the proposed method.

Keywords: noisy labels, pneumoconiosis screening, hard-sample awareness, weakly supervised learning, medical image classification
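
The loss-based partitioning that the abstract describes as the prevailing approach can be illustrated with a minimal sketch. It is not the SGRL noise filter, whose details the abstract does not give. The function name `split_by_loss`, the 1-D two-means clustering (a simple stand-in for the mixture models often fitted to per-sample losses), and the example loss values are all assumptions made for illustration.

```python
from statistics import mean


def split_by_loss(losses, iters=50):
    """Split sample indices into a low-loss and a high-loss group.

    Hypothetical helper: runs 1-D two-means clustering on per-sample
    training losses and returns (clean_idx, suspect_idx, threshold).
    """
    if not losses:
        raise ValueError("losses must be non-empty")
    lo, hi = min(losses), max(losses)  # initial centroids
    for _ in range(iters):
        threshold = (lo + hi) / 2.0
        low = [x for x in losses if x <= threshold]
        high = [x for x in losses if x > threshold]
        if not high:  # every loss is identical: nothing to separate
            break
        new_lo, new_hi = mean(low), mean(high)
        if new_lo == lo and new_hi == hi:  # centroids stopped moving
            break
        lo, hi = new_lo, new_hi
    threshold = (lo + hi) / 2.0
    clean = [i for i, x in enumerate(losses) if x <= threshold]
    suspect = [i for i, x in enumerate(losses) if x > threshold]
    return clean, suspect, threshold


# Made-up per-sample losses; indices 3 and 5 have much larger loss.
example_losses = [0.05, 0.10, 0.08, 2.3, 0.12, 1.9]
clean_idx, suspect_idx, cut = split_by_loss(example_losses)
print(clean_idx, suspect_idx)
```

A split of this kind cannot tell a correctly labeled hard sample from a mislabeled one, since both tend to have high loss. That is the first of the two problems the abstract raises, and it is why the high-loss group is called "suspect" here rather than "noisy".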