电工技术2024,Issue(21) :62-64.DOI:10.19768/j.cnki.dgjs.2024.21.015

变电站环境下多目标多姿态行人语义分割检测

Semantic Segmentation-based Multi-target and Multi-posture Pedestrian Detection in Substation Environment

王迪 胡耀蓉 冯钰玮 余容 赵健
电工技术2024,Issue(21) :62-64.DOI:10.19768/j.cnki.dgjs.2024.21.015

变电站环境下多目标多姿态行人语义分割检测

Semantic Segmentation-based Multi-target and Multi-posture Pedestrian Detection in Substation Environment

王迪 1胡耀蓉 1冯钰玮 1余容 1赵健1
扫码查看

作者信息

  • 1. 中国电建集团贵州电力设计研究院有限公司,贵州 贵阳 550000
  • 折叠

摘要

通过跳跃连接方式构建 U-Net++深度学习模型,增加二进制交叉标记和骰子系数的组合作为损失函数,然后改进激活函数,并将 Res2Net多尺度骨干架构作为特征提取网络,以获得更强的多尺度特征提取能力,进而对变电站环境下多目标多姿态行人进行语义分割检测实验.实验结果表明,对于变电站内行人目标极小、行人密集、行人多姿态非直立、行人多尺度并存等不同情况,所提算法都可准确检测并分割出行人目标,F1-Score指标可达 0.978,检测效率可达 0.035 s/张,优于其他两种经典方法.

Abstract

Through the jump connection method,the U-Net++deep learning model is constructed.The combination of binary cross-marking and dice coefficients is added as the loss function,then the activation function is improved,and the Res2Net multi-scale backbone architecture is used as the feature extraction network to obtain stronger multi-scale feature extraction ability.Then the semantic segmentation-based detection experiment is carried out on multi-target and multi-posture pedestrians in substation environment.The experimental results show that the pedestrian targets can be detected and segmented accurately under the conditions of very small pedestrian targets,dense pedestrians,non-upright pedestrian gestures and the coexistence of pedestrian multi-scales,with F1-Score index reaching 0.978 and detection efficiency of 0.035 s per image,which are superior to the two selected classical methods.

关键词

变电站/行人语义分割检测/多目标多姿态/U-Net++深度学习模型

Key words

substation/semantic segmentation-based pedestrian detection/multi-target and multi-posture/U-Net++deep learning model

引用本文复制引用

出版年

2024
电工技术
重庆西南信息有限公司(原科技部西南信息中心)

电工技术

影响因子:0.177
ISSN:1002-1388
段落导航相关论文