改进的 SSD-ResNet 算法

孟婧，江平，王凯，蒋鑫宇

（合肥工业大学数学学院，安徽合肥230601）

摘要

单次多边界框检测器（single shot multibox detector, SSD）算法因其性能优良已被应用于许多场景中，但该算法对小目标物体的检测精度偏低，主要原因是高层的语义信息没有被充分利用。为解决该问题，文章将其基础网络替换为残差网络（residual network, ResNet），同时融合深浅层的特征信息来增强浅层特征图的语义信息，此外还引入注意力模块，保留更多的目标特征信息，抑制无关信息，进一步提升对小目标物体的检测效果。在 PASCAL VOC2007 数据集上进行实验测试，平均精度均值为 80.2%，优于其他 SSD 改进算法。由于增加了特征融合和注意力模块，所提算法检测速度有所下降，但相比于 SSD 改进算法，检测速度仍有明显的优势。

关键词

目标检测；单次多边界框检测器(SSD)；残差网络(ResNet)；特征融合；注意力机制

中图分类号：TP391.413

文献标志码：A

文章编号：1003-5060（2023）03-0326-07

An improved SSD-ResNet algorithm

MENG Jing, JIANG Ping, WANG Kai, JIANG Xinyu

(School of Mathematics, Hefei University of Technology, Hefei 230601, China)

Abstract

As an algorithm with better detection accuracy and speed, single shot multibox detector (SSD) has made great progress in many aspects. However, it cannot achieve a good detection effect for small objects because it does not make full use of high-level semantic information. Aiming at this problem, VGG is replaced with residual network (ResNet) as the backbone network, the feature fusion method is used to enhance the semantic information of the shallow feature map, at the same time, the attention module is introduced, which retains more object feature information and suppresses irrelevant information. Through the above methods, the detection effect of small objects can be enhanced. By experimenting on the PASCAL VOC2007 datasets, the validity of the proposed algorithm is proven, the mean average precision value of the algorithm is 80.2%, which is better than those of other improved SSD algorithms. Although the addition of feature fusion and attention modules to the algorithm can cause a decrease in the detection speed, it is still better than deconvolutional single shot detector (DSSD) algorithm.

Keywords

object detection; single shot multibox detector (SSD); residual network (ResNet); feature fusion; attention mechanism

收稿日期：2021-03-26

修回日期：2021-04-16

基金项目：国家自然科学基金资助项目（12172115）