Skeleton-based Graph Convolution with Residual Combined with Mixed Attention Mechanism for Multi-Person Posture Recognition

Research on multi-person posture recognition started late, is still immature, and is highly complex. Networks have therefore grown deeper, which aggravates the vanishing-gradient problem and degrades network performance, leading to the common problems of poor recognition accuracy and low recognition efficiency. To address these problems, this paper proposes a multi-person posture recognition model that combines residual blocks and a mixed attention mechanism with skeleton-based graph convolution. Following a top-down approach, a preprocessing stage detects the people in a multi-person image, localizes each person's coordinates and marks a bounding box, and generates a skeletal key point diagram. Residual blocks are added to the network structure to suppress gradient vanishing, and a mixed attention mechanism is added to improve the model's performance. The proposed model is validated on two datasets, MPII and MSCOCO2017. The results show that it recognizes multi-person postures well and behaves stably across the two datasets, with only small differences between them. The model is also compared with models reported in important literature in the field; it improves to some degree on each fine-grained metric, with good stability and a fairly even distribution of results. Across datasets, the proposed model shows good recognition accuracy and efficiency, adding impetus to research on multi-person posture recognition.
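The abstract names three ingredients (graph convolution over skeletal key points, residual blocks, and a mixed attention mechanism) without giving layer details. The following is a minimal NumPy sketch of how such a block might be wired, not the authors' implementation. The 5-joint skeleton, the symmetric adjacency normalization, the ReLU activation, and the reading of "mixed" attention as a per-channel gate multiplied by a per-joint gate are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical 5-joint skeleton (head, neck, hip, left hand, right hand).
# The joint set and edges are illustrative, not taken from the paper.
EDGES = [(0, 1), (1, 2), (1, 3), (1, 4)]
NUM_JOINTS = 5


def normalized_adjacency(edges, num_joints):
    """Return D^{-1/2} (A + I) D^{-1/2} for an undirected skeleton graph."""
    a = np.eye(num_joints)
    for i, j in edges:
        a[i, j] = a[j, i] = 1.0
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    return a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def mixed_attention(h):
    """Assumed form of mixed attention: a channel gate times a joint gate."""
    channel_gate = sigmoid(h.mean(axis=0, keepdims=True))  # shape (1, C)
    joint_gate = sigmoid(h.mean(axis=1, keepdims=True))    # shape (J, 1)
    return h * channel_gate * joint_gate


def residual_gcn_block(x, a_hat, w):
    """Graph convolution, ReLU, attention, then an identity skip connection."""
    h = np.maximum(a_hat @ x @ w, 0.0)
    return x + mixed_attention(h)


rng = np.random.default_rng(0)
a_hat = normalized_adjacency(EDGES, NUM_JOINTS)
x = rng.normal(size=(NUM_JOINTS, 8))  # 8 feature channels per joint
w = rng.normal(size=(8, 8)) * 0.1     # square weight so the skip shapes match
y = residual_gcn_block(x, a_hat, w)
print(y.shape)  # (5, 8)
```

The skip connection `x + ...` is what lets gradients bypass the convolution: with a zero weight matrix the block reduces exactly to the identity, which is the property residual blocks rely on to keep deep stacks trainable.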

multi-person posture recognition; residual; mixed attention mechanism; skeletal key point diagram; graph convolution

Chen Bin (陈斌), Fan Feiyan (樊飞燕), Lu Tianyi (陆天易)


Office of Informatization Construction and Management, Nanjing Normal University, Nanjing, Jiangsu 210023


2024

Journal of Nanjing Normal University (Natural Science Edition)
Nanjing Normal University


Indexed in: CSTPCD; Peking University Core Journals (北大核心)
Impact factor: 0.427
ISSN: 1001-4616
Year, volume (issue): 2024, 47(4)