首页|多尺度特征融合的移动端单目深度估计研究

多尺度特征融合的移动端单目深度估计研究

扫码查看
目前基于深度学习的深度估计模型参数量大,难以适应移动端设备.针对此问题,提出一种可以部署在移动端的多尺度特征融合轻量级深度估计方法.首先,以MobileNetV2为主干,提取出4个尺度的特征.然后,通过构建编码器到解码器的跳跃连接路径,将4个尺度的特征进行融合,充分利用融合低层的位置信息和高层的语义信息.最后,融合后的特征通过卷积层得出高精度的深度图像.在NYU Depth Dataset V2数据集上进行了训练和测试,结果表明,该模型的参数量在仅有1.6×106的情况下,评估指标δ1高达0.812,在移动端的麒麟980 CPU上推理一幅图像仅需要0.094 s,具有实际应用价值.
Mobile monocular depth estimation based on multi-scale feature fusion
The current depth estimation model based on depth learning has a large number of param-eters,which is difficult to adapt to mobile devices.To address this issue,a lightweight depth estimation method with multi-scale feature fusion that can be deployed on mobile devices is proposed.Firstly,Mo-bileNetV2 is used as the backbone to extract features of four scales.Then,by constructing skip connec-tion paths from the encoder to the decoder,the features of the four scales are fused,fully utilizing the combined positional information from lower layers and semantic information from higher layers.Final-ly,the fused features are processed through convolutional layers to produce high-precision depth images.After training and testing on NYU Depth Dataset V2,the experimental results show that the proposed model achieves advanced performance with an evaluation index of δ1 up to 0.812 while only having 1.6×106 parameters numbers.Additionally,it only takes 0.094 seconds to infer a single image on the Kirin 980 CPU of a mobile device,demonstrating its practical application value.

deep learningdepth estimationmulti-scale featurelightweight networkmobile terminal model

陈磊、梁正友、孙宇、蔡俊民

展开 >

广西大学计算机与电子信息学院,广西南宁 530004

广西多媒体通信与网络技术重点实验室,广西南宁 530004

深度学习 深度估计 多尺度特征 轻量级网络 移动端模型

2024

计算机工程与科学
国防科学技术大学计算机学院

计算机工程与科学

CSTPCD北大核心
影响因子:0.787
ISSN:1007-130X
年,卷(期):2024.46(9)