Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2022, Vol. 48 ›› Issue (4): 89-98. doi: 10.19678/j.issn.1000-3428.0061188

• Artificial Intelligence and Pattern Recognition • Previous Articles     Next Articles

Research on Optimization of Convolution Pretraining Model Based on Causal Intervention and Invariance

HU Xuan1,2, XING Kai1,2, LI Yaming1,2, WANG Zhiyong2,3, DENG Hongwu1,2   

  1. 1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China;
    2. Suzhou Research Institute for Advanced Research, University of Science and Technology of China, Suzhou, Jiangsu 215123, China;
    3. School of Cyberspace Security, University of Science and Technology of China, Hefei 230027, China
  • Received:2021-03-18 Revised:2021-05-22 Published:2022-04-14

基于因果干预与不变性的卷积预训练模型优化研究

胡璇1,2, 邢凯1,2, 李亚鸣1,2, 王志勇2,3, 邓洪武1,2   

  1. 1. 中国科学技术大学 计算机科学与技术学院, 合肥 230027;
    2. 中国科学技术大学 苏州高等研究院, 江苏 苏州 215123;
    3. 中国科学技术大学 网络空间安全学院, 合肥 230027
  • 作者简介:胡璇(1995—),女,硕士研究生,主研方向为深度学习;邢凯,副教授、博士;李亚鸣、王志勇、邓洪武,硕士研究生。
  • 基金资助:
    国家自然科学基金(61332004)。

Abstract: The deep learning model based on Convolutional Neural Network(CNN) has been widely used in image recognition and classification.However, the model still has some shortcomings in the overall grasp of global features, the effective extraction of feature invariance at concept level, and determining the clear causal relationship between variables.This makes deep models less flexible, adaptable and generalizable.Based on causal intervention and invariance, this study proposes a directional pruning and network structure optimization method based on a CNN deep model.The optimization method performs invariant-based intervention modulation on the model input, then analyzes the output distribution of the pretrained network convolutional substructure according to the generated modulation picture sequence, and filters and directionally prunes the noise-sensitive substructure.On this basis, the objective function based on inter-class differentiation and the inter-layer connection of the network are constructed with the help of the Capital Asset Pricing Model(CAPM) used in the field of economics.The network topology that can increase the inter-class differentiation under a single classification task is generated, and the stable characteristics of the concept level are optimized layer by layer.The experimental results on the ImageNet-2012 dataset show that the optimized deep model improves the classification accuracy of the ResNet50 baseline pre-training model by about 5 percentage points, and greatly reduces the size of the training set.

Key words: image recognition and classification, Convolutional Neural Network(CNN), causal intervention, invariance, Capital Asset Pricing Model(CAPM)

摘要: 基于卷积神经网络(CNN)的深度模型在图像识别与分类领域应用广泛,但在全局特征控制、概念层次特征不变性提取和变量之间的因果关系确定方面仍存在不足,使得深度模型缺乏灵活性、适应性及泛化性。基于因果干预和不变性,提出一种基于CNN深度模型的定向修剪和网络结构优化方法。通过对模型输入进行基于不变性的干预调制,根据生成的调制图片序列分析预训练网络卷积子结构的输出分布,筛选和定向修剪噪声敏感子结构。构建基于类间区分度的目标函数,借助经济学领域中的资本资产定价模型构建网络的层间连接,生成在单分类任务下能增大类间区分度的网络拓扑结构,逐层优化构建概念层次的稳定特征。在ImageNet-2012数据集上的实验结果表明,优化后的深度模型相比于ResNet50基线预训练模型的分类准确率约提升了5个百分点,并大幅降低了训练集规模。

关键词: 图像识别与分类, 卷积神经网络, 因果干预, 不变性, 资本资产定价模型

CLC Number: