作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2023, Vol. 49 ›› Issue (10): 202-211, 221. doi: 10.19678/j.issn.1000-3428.0065821

• 图形图像处理 • 上一篇    下一篇

基于渐进式训练的多判别器域适应目标检测

李惠森1,2, 侯进1,2,*, 党辉1,2, 周宇航1,2   

  1. 1. 西南交通大学 信息科学与技术学院 智能感知智慧运维实验室, 成都 611756
    2. 西南交通大学 综合交通大数据应用技术国家工程实验室, 成都 611756
  • 收稿日期:2022-09-22 出版日期:2023-10-15 发布日期:2023-01-03
  • 通讯作者: 侯进
  • 作者简介:

    李惠森(1998—),男,硕士研究生,主研方向为计算机视觉、域适应、深度学习

    党辉,硕士研究生

    周宇航,硕士研究生

  • 基金资助:
    国家重点研发计划(2020YFB1711902)

Domain Adaptive Multi-Discriminator Object Detection Based on Progressive Training

Huisen LI1,2, Jin HOU1,2,*, Hui DANG1,2, Yuhang ZHOU1,2   

  1. 1. Laboratory of Intelligent Perception and Smart Operation and Maintenance, School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, China
    2. National Engineering Laboratory of Integrated Transportation Big Data Application Technology, Southwest Jiaotong University, Chengdu 611756, China
  • Received:2022-09-22 Online:2023-10-15 Published:2023-01-03
  • Contact: Jin HOU

摘要:

基于对抗训练的域适应目标检测的研究旨在不对新数据集进行额外标注的情况下, 将检测模型应用于不同的数据集。但现有算法存在目标检测和域对齐任务难以平衡的问题, 且一般的单判别器结构容易局限于数据的单个模式, 导致域对齐的质量下降。提出一种基于渐进式训练的多判别器域适应目标检测算法, 针对传统的单判别器结构对复杂结构数据进行域对齐时的局限性, 在实例级的域适应头中引入多判别器结构, 使其在学习域不变信息时考虑数据的多模结构, 实现质量更高、更全面的域对齐。同时, 为降低引入多判别器结构而增加的模型复杂度, 设计基于Dropout技术的多判别器结构, 对单个判别器参数进行重复利用, 并创新性地引入渐进式训练策略, 即随着训练的推进逐步增大域对齐任务的比重和难度, 动态平衡目标检测和域对齐任务的权重。实验结果表明, 所提算法在Cityscapes到Foggy Cityscapes的域适应场景下的平均检测精度为42.9%, 相比近几年该领域的新算法提高了至少0.5个百分点。

关键词: 目标检测, 域适应, 对抗训练, 多判别器, 渐进式训练策略

Abstract:

The research on domain adaptive object detection based on adversarial training aims the deployment of detection models for use with different data sets without labeling new data sets. However, existing algorithms have difficulty in balancing the tasks of object detection and domain alignment. The general single discriminator structure is limited to single mode of data, resulting in degradation of domain alignment quality. This paper proposes a multi-discriminator domain adaptive object detection algorithm based on progressive training. Considering the limitations of the traditional single-discriminator structure in the domain alignment of complex structural data, a multi-discriminator structure is introduced into the instance-level domain-adaptive head to force it to consider multiple modes of data while learning the domain invariant information, which contributes to achieving higher quality and more comprehensive domain alignment. Meanwhile, to reduce the increased model complexity, a multi-discriminator structure designed to reuse the single discriminator parameters is introduced based on dropout technology. In this paper, an innovative progressive training strategy is introduced, whereby the proportion and difficulty of domain alignment are gradually increased with the progress in training, to dynamically balance the weight of object detection and domain alignment tasks. The experimental results indicate that the mean average precision of the algorithm in domain adaptation from Cityscapes to Foggy Cityscapes was 42.9%, which is an improvement of at least 0.5 percentage points compared to algorithms of recent years.

Key words: object detection, domain adaptive, adversarial training, multi-discriminator, progressive training strategy