变电站环境下多目标多姿态行人语义分割检测

Semantic Segmentation-based Multi-target and Multi-posture Pedestrian Detection in Substation Environment

扫码查看

原文链接

维普
万方数据

中文摘要：通过跳跃连接方式构建 U-Net++深度学习模型,增加二进制交叉标记和骰子系数的组合作为损失函数,然后改进激活函数,并将 Res2Net多尺度骨干架构作为特征提取网络,以获得更强的多尺度特征提取能力,进而对变电站环境下多目标多姿态行人进行语义分割检测实验.实验结果表明,对于变电站内行人目标极小、行人密集、行人多姿态非直立、行人多尺度并存等不同情况,所提算法都可准确检测并分割出行人目标,F1-Score指标可达 0.978,检测效率可达 0.035 s/张,优于其他两种经典方法.

外文摘要：Through the jump connection method,the U-Net++deep learning model is constructed.The combination of binary cross-marking and dice coefficients is added as the loss function,then the activation function is improved,and the Res2Net multi-scale backbone architecture is used as the feature extraction network to obtain stronger multi-scale feature extraction ability.Then the semantic segmentation-based detection experiment is carried out on multi-target and multi-posture pedestrians in substation environment.The experimental results show that the pedestrian targets can be detected and segmented accurately under the conditions of very small pedestrian targets,dense pedestrians,non-upright pedestrian gestures and the coexistence of pedestrian multi-scales,with F1-Score index reaching 0.978 and detection efficiency of 0.035 s per image,which are superior to the two selected classical methods.

外文关键词：

substationsemantic segmentation-based pedestrian detectionmulti-target and multi-postureU-Net++deep learning model

作者：

王迪、胡耀蓉、冯钰玮、余容、赵健

展开 >

作者单位：

中国电建集团贵州电力设计研究院有限公司,贵州贵阳 550000

关键词：

变电站行人语义分割检测多目标多姿态 U-Net++深度学习模型

出版年：

2024

DOI：