Mobile monocular depth estimation based on multi-scale feature fusion
Current depth estimation models based on deep learning have large numbers of parameters, which makes them difficult to deploy on mobile devices. To address this issue, a lightweight depth estimation method with multi-scale feature fusion that can run on mobile devices is proposed. First, MobileNetV2 is used as the backbone to extract features at four scales. Then, skip-connection paths from the encoder to the decoder fuse the features of the four scales, fully exploiting both the positional information of the lower layers and the semantic information of the higher layers. Finally, the fused features are passed through convolutional layers to produce high-precision depth maps. After training and testing on the NYU Depth Dataset V2, the experimental results show that the proposed model achieves competitive performance, reaching an evaluation index of δ1 = 0.812 with only 1.6×10^6 parameters. In addition, it takes only 0.094 s to infer a single image on the Kirin 980 CPU of a mobile device, demonstrating its practical value.
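The encoder-decoder design described above can be sketched in PyTorch. This is a minimal illustrative sketch, not the authors' implementation: the class names (`TinyEncoder`, `FusionDecoder`), channel widths, and the stand-in encoder (used here in place of the actual MobileNetV2 backbone) are all assumptions; only the overall pattern, four feature scales fused top-down through skip connections and reduced to a one-channel depth map, follows the abstract.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyEncoder(nn.Module):
    """Stand-in for the MobileNetV2 backbone: emits features at four scales.

    Channel widths (16, 24, 32, 64) are illustrative assumptions.
    """
    def __init__(self, chs=(16, 24, 32, 64)):
        super().__init__()
        self.stages = nn.ModuleList()
        in_ch = 3
        for ch in chs:
            # Each stage halves spatial resolution (stride 2).
            self.stages.append(nn.Sequential(
                nn.Conv2d(in_ch, ch, 3, stride=2, padding=1),
                nn.ReLU(inplace=True)))
            in_ch = ch

    def forward(self, x):
        feats = []
        for stage in self.stages:
            x = stage(x)
            feats.append(x)
        return feats  # resolutions 1/2, 1/4, 1/8, 1/16 of the input

class FusionDecoder(nn.Module):
    """Upsamples the deepest features and fuses each scale via skip connections."""
    def __init__(self, chs=(16, 24, 32, 64)):
        super().__init__()
        # One fusion conv per skip connection: concat(skip, upsampled) -> skip width.
        self.fuse = nn.ModuleList([
            nn.Conv2d(chs[i] + chs[i + 1], chs[i], 3, padding=1)
            for i in range(3)])
        self.head = nn.Conv2d(chs[0], 1, 3, padding=1)  # one-channel depth map

    def forward(self, feats):
        x = feats[-1]
        for i in reversed(range(3)):
            # Upsample to the skip feature's resolution, concatenate, fuse.
            x = F.interpolate(x, size=feats[i].shape[-2:],
                              mode="bilinear", align_corners=False)
            x = torch.relu(self.fuse[i](torch.cat([feats[i], x], dim=1)))
        return self.head(x)  # depth at 1/2 input resolution

encoder, decoder = TinyEncoder(), FusionDecoder()
depth = decoder(encoder(torch.randn(1, 3, 64, 64)))
print(tuple(depth.shape))  # (1, 1, 32, 32)
```

Fusing at every scale is what lets the cheap backbone recover sharp depth boundaries: the low-level skips carry positional detail that the 1/16-resolution semantic features alone have lost.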
Keywords: deep learning; depth estimation; multi-scale features; lightweight network; mobile terminal model