Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 264-275. doi: 10.19678/j.issn.1000-3428.0252879

• Computer Vision and Image Processing • Previous Articles     Next Articles

3D Small Object Detection Algorithm Based on Dynamic Feature Enhancement

LI Luyang1,2,3, YAN Jinlong1,2,3, FANG Zeru1,2,3, JIN Qiqi1,2,3, XUE Hongxin1,2,3,*()   

  1. 1. School of Computer Science and Technology, North University of China, Taiyuan 030051, Shanxi, China
    2. Shanxi Key Laboratory of Machine Vision & Virtual Reality, Taiyuan 030051, Shanxi, China
    3. Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, Shanxi, China
  • Received:2025-08-11 Revised:2025-10-07 Online:2026-04-15 Published:2025-12-12
  • Contact: XUE Hongxin

基于动态特征增强的三维小目标检测算法

李潞洋1,2,3, 闫锦龙1,2,3, 方泽儒1,2,3, 金旗旗1,2,3, 薛红新1,2,3,*()   

  1. 1. 中北大学计算机科学与技术学院, 山西 太原 030051
    2. 机器视觉与虚拟现实山西省重点实验室, 山西 太原 030051
    3. 山西省视觉信息处理及智能机器人工程研究中心, 山西 太原 030051
  • 通讯作者: 薛红新
  • 作者简介:

    李潞洋(CCF专业会员), 男, 讲师, 主研方向为计算机视觉

    闫锦龙, 硕士研究生

    方泽儒, 硕士研究生

    金旗旗, 硕士研究生

    薛红新(通信作者), 副教授

  • 基金资助:
    国家自然科学基金(62272426); 山西省基础研究计划项目(202303021212372); 机器视觉与虚拟现实山西省重点实验室研究基金(447-110103)

Abstract:

In 3D object detection from point clouds, the inherent sparsity of Light Detection And Ranging (LiDAR) data poses pronounced challenges for small objects. A small number of effective points lead to weak structural cues and blurry boundaries; limited contextual awareness hinders spatial reasoning and semantic completion, causing localization bias; and the difficulty of precise spatial localization, weak channel expressiveness, and background dominance constrain accuracy. To mitigate the impact of the aforementioned issues on detection accuracy, a dynamic-aware 3D detector is proposed that integrates dynamic feature extraction with feature-enhancement mapping, targeting two critical stages of small-object detection: feature extraction and candidate generation. Specifically, a Dynamic Point Feature Prediction Network (DPFPN) that adaptively predicts and supplements sampling points to strengthen structural perception of small objects is introduced. Subsequently, a Feature Enhancement Mapping Network (FEMN) is built that deeply fuses the original features with those produced by the dynamic module to yield context-rich 2D feature maps, thereby compensating for contextual deficiency and improving localization. Finally, a Point Cloud Feature Enhancement Network (PCFEN) module is designed to sharpen focus on key small-object regions along both channel and spatial dimensions. Experiments on the nuScenes dataset demonstrate that the proposed approach performs better than mainstream detectors. Relative to the CenterPoint baseline, the mean Average Precision (mAP) increases from 56.1% to 59.4% and the Nuscenes Detection Score (NDS) rises from 64.4 to 67.4.

Key words: Light Detection And Ranging (LiDAR), dynamic feature extraction, feature enhancement, multi-scale features, object detection

摘要:

在点云三维目标检测任务中, 点云数据的稀疏性在客观上对小目标检测构成显著挑战。小目标自身有效点数稀少导致结构信息缺失与边界模糊, 而上下文感知能力不足阻碍模型有效利用周围环境信息进行空间推理与语义补全, 进而引发定位偏差, 以及其固有的空间定位困难、通道表达弱和特征易被背景淹没等问题, 共同制约了检测性能的提升。为缓解上述问题对检测精度造成的影响, 提出一种融合动态特征提取与特征增强映射的动态感知三维检测算法。该算法聚焦特征提取与候选框生成两大关键阶段对小目标检测进行优化。首先, 引入动态点特征预测网络(DPFPN), 通过自适应预测补充采样点以强化对小目标的结构感知能力; 其次, 构建特征增强映射网络(FEMN), 对原始特征及动态预测网络生成的特征进行深度融合, 输出富含上下文信息的二维特征图, 以有效弥补上下文缺失并提升小目标的定位精度; 最后, 设计点云特征增强网络(PCFEN), 在通道与空间双维度提升网络对小目标关键区域的聚焦能力。在nuScenes数据集上的实验结果表明, 所提算法性能优于当前主流目标检测算法, 与基准模型CenterPoint相比, 均值平均精度(mAP)由56.1%提升至59.4%, Nuscenes检测分数(NDS)由64.4提升至67.4。

关键词: 激光雷达, 动态特征提取, 特征增强, 多尺度特征, 目标检测