作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• •    

一种新的基于基底-门控机制的激活函数——自适应参数化Softplus-Sigmoid

  • 发布日期:2026-03-27

A Novel Activation Function Regarding Substrate-Gate Mechanism--Adaptive Parameterized Softplus-Sigmoid Function

  • Published:2026-03-27

摘要: 近年来深度学习在计算机视觉等研究领域取得越来越多的成果,其中,激活函数对于增强深度神经网络的非线性拟合能力具有重要的影响。但随着研究的深入,现有的激活函数,如ReLU和SiLU等,暴露出越来越多的问题,比如存在梯度消失/死亡现象,对负值区域不具有自适应调节性等。论文针对常见目标检测识别任务中显著性特征的去留问题,提出了一种新的激活函数--自适应参数化Softplus-Sigmoid函数 (Adaptive Parametric Softplus-Sigmoid,APSS),旨在从复杂背景中精准地提取和学习目标的多尺度融合特征。该激活函数基于生物神经科学中的基底-门控组合机制。其中,基底项确保基础特征的可学习性与梯度稳定性,门控项则通过动态调节负值区域的响应强度,实现无效特征的抑制,通过两者的有机结合,实现网络模型保留和抑制特征能力的平衡。为了验证该激活函数的优势,论文在SoccerNet、UA-DETRAC和BEEF24等三组实验数据集上,与几种典型的目标检测识别网络原型进行了对比实验。研究结果表明,论文提出的APSS激活函数显著优于原始网络模型中的激活函数,具有更好的目标特征提取和拟合能力。

Abstract: In recent years, deep learning has achieved increasing success across various research fields such as computer vision, in which activation functions play an important role in enhancing the nonlinear fitting capability of deep neural networks. However, existing activation functions such as ReLU, SiLU, etc., have revealed more and more issues as research progresses, such as the problems of gradient vanishing/dead and the lack of adaptive regulation capability in the negative region, etc. This paper proposes a new activation function—Adaptive Parametric Softplus-Sigmoid (APSS)—for the salient feature preservation and dropping in common object detection and recognition tasks. It aims to extract and learn the multi-scale collaborative features from complex backgrounds. This activation function is based on the base-gate combination mechanism in biological neuroscience. The base unit ensures the learnability of basic features and gradient stability. The gate unit achieves the suppression of invalid features by dynamically adjusting the response intensity in the negative value region. The combination of two units can promote the network model's balance of retaining or suppressing features. To verify the advantages of this activation function, this paper conducts comparative experiments with several typical object detection and recognition network prototypes on three experimental datasets: SoccerNet, UA-DETRAC, and BEEF24. The research results show that the proposed APSS activation function is significantly superior to the activation functions in the original network models. It has better target feature extraction and fitting capabilities.