作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (12): 224-231. doi: 10.19678/j.issn.1000-3428.0063741

• 图形图像处理 • 上一篇    下一篇

基于类别不平衡数据集的图像实例分割方法

范馨月1,2, 鲍泓1,2, 潘卫国1,2   

  1. 1. 北京联合大学 北京市信息服务工程重点实验室, 北京 100101;
    2. 北京联合大学 机器人学院, 北京 100027
  • 收稿日期:2022-01-11 修回日期:2022-03-14 发布日期:2022-12-07
  • 作者简介:范馨月(1998—),女,硕士研究生,主研方向为计算机视觉、智能驾驶;鲍泓,教授、博士;潘卫国(通信作者),讲师、博士。
  • 基金资助:
    国家自然科学基金(61932012,62102033)。

Image Instance Segmentation Method Based on Class-imbalanced Dataset

FAN Xinyue1,2, BAO Hong1,2, PAN Weiguo1,2   

  1. 1. Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, China;
    2. College of Robotics, Beijing Union University, Beijing 100027, China
  • Received:2022-01-11 Revised:2022-03-14 Published:2022-12-07

摘要: 随着深度学习在计算机视觉领域取得重大进展,包含多种类别的数据集不断被提出,但由于自然采集的数据集往往存在类别不平衡并呈现长尾分布的情况,导致稀有类的数据特征被频繁类的数据特征所抑制,从而严重影响模型的检测性能。为解决上述问题,提出一种新的图像实例分割方法。采用长尾实例分割数据集进行研究实验,使用基于目标尺度的数据增广方法对数据集进行处理,以达到扩充训练样本的目的,增加稀有类的目标数量,同时对稀有类数据进行重采样,解决稀有类的类别数据量过小的问题,提升模型在长尾数据集的鲁棒性。在此基础上,将均等化损失函数融入Mask R-CNN实例分割网络,以降低频繁类的数据特征对稀有类数据特征的抑制性。实验结果表明,该方法在LVIS实例分割数据集上的检测精度提升了4.9%,达到了25.7%,同时APr、APc、APf分别达到了16.2%、26.1%、30.4%,相比Baseline方法均有明显提升,在消融实验上的结果也表明该方法能有效解决长尾类问题。

关键词: 长尾分布, 实例分割, 数据增强, 损失函数, 深度学习

Abstract: With the significant progress made by deep learning in the field of computer vision, datasets containing various categories are constantly being proposed.However, in naturally collected datasets, there are often class-imbalances and long-tailed distributions.As a result, the data features of rare classes are suppressed by the data features of frequent classes, seriously affecting the performance of the model.Therefore, the algorithm cannot be implemented well.To solve this problem, this paper proposes a method for image instance segmentation in which the long-tailed instance segmentation dataset is used for experiments.First, the dataset is processed by the data augmentation method based on the target scale to expand the training samples and increase the target number of rare classes.Rare class data are resampled to solve the problem of the excessively small number of rare class data, thereby improving the robustness of the model in long-tailed datasets.Finally, the equalization loss function is integrated into the Mask Region-based Convolutional Neural Network(R-CNN) instance segmentation network to reduce the inhibition of the data features of the frequent class to the data features of the rare class.In an experimental verification on the Large Vocabulary Instance Segmentation (LVIS) dataset, the method proposed improves the detection accuracy by 4.9%, reaching 25.7%.In addition, APr, APc, and APf reach 16.2%, 26.1%, and 30.4%, respectively, which are significantly improved compared with the Baseline method.The results of ablation experiments performed using the proposed method show that it can solve the long-tailed-distribution problem.

Key words: long tail distribution, instance segmentation, data augmentation, loss function, deep learning

中图分类号: