一种具有尺度不变性的人体姿态估计算法

A Human Pose Estimation Algorithm with Scale Invariant

孙瑞阳 ¹杨慧馨 ¹赵蓝飞²

扫码查看

作者信息

1. 哈尔滨体育学院民族传统体育学院,哈尔滨 150008
2. 哈尔滨理工大学测控技术与仪器黑龙江省高校重点实验室,哈尔滨 150080
折叠

摘要

针对现有的人体姿态估计算法无法准确检测小尺寸、大尺寸人体关键点和精确度较低的问题,提出一种具有尺度不变性的卷积神经网络用于估计人体姿态.首先设计图像缩放网络将输入图像缩放到标准尺寸.该网络能够抑制由插值引起的特征丢失.其次引入非局部卷积,增加网络的感受野.再次为多分辨率特征融合引入分辨率注意力机制,提高网络的尺度不变性.最后设计优化网络,抑制由采样引起的量化误差.在COCO数据集进行实验的结果表明,所提算法的平均精度均值达到79.2%高于其他算法,因此所提算法的尺度不变性和准确度优于现有人体姿态估计算法.

Abstract

Due to the existing issues in current human pose estimation algorithms,which struggle with accurately detecting small and large-sized human keypoints as well as having lower precision,this paper proposes a scale invariant convolution neural network to estimate human pose.First,resizing network is constructed for resizing the input image to the standard resolution.This network would reduce feature loss caused by interpolation.Second,the receptive field of network is increased by introducing non-local convolution.Thirdly,resolution attention mechanism is introduced into multi-resolution feature fusion,leading to enhance invariance in scales.Finally,optimized network is designed to reduce quantisation error caused by sampling.Experimental results conducted on the COCO dataset indicate that the proposed algorithm achieves an average accuracy of 79.2%,which is higher than other algorithms.Therefore,the proposed algorithm exhibits better scale invariance and accuracy than existing human pose estimation algorithms.

关键词

人体姿态估计/卷积神经网络/尺度不变性/人体关键点检测/非局部卷积/量化误差

Key words

human pose estimation/convolutional neural network/scale invariance/human keypoints detection/non-local convolution/quantisation error

引用本文复制引用

出版年

2024

哈尔滨理工大学学报

哈尔滨理工大学

哈尔滨理工大学学报

CSTPCD北大核心

影响因子：0.508

ISSN：1007-2683

段落导航