融合注意力机制和多层动态形变卷积的多视图立体视觉重建方法

扫码查看

原文链接

万方数据
维普

中文摘要：针对现有多视图立体视觉(Multi-View Stereo,MVS)技术提取弱纹理区域和非郎伯体曲面特征信息不充分及重建效果不理想问题,提出一种融合注意力机制和多层动态形变卷积的AM-DC-PatchmatchNet方法.构建一种融合坐标注意力的特征提取网络,能更准确地捕捉重建对象的边缘形状和纹理特征,同时融合一种基于动态形变卷积的自适应感受野模块,根据不同尺度的特征自适应调整感受野的大小和形状,获得兼具全局和细节的特征表示.在DTU数据集上的测试结果表明,所提方法相较于主流MVS方法,点云重建整体性指标提高2.8％,并且在航空影像数据集上验证了模型的泛化能力.

外文标题：Multi-view Stereo Vision Reconstruction Network with Fusion Attention Mechanism and Multi-layer Dynamic Deformable Convolution

外文摘要：The existing multi-view stereo vision technology is not enough to extract the feature information of weak texture region and non-Lambert surface,and its reconstruction effect is not ideal.An AMDC-PatchmatchNet method with fusion attention mechanism and multi-layer dynamic deformable convolution is proposed for the problems above.In this method,a feature extraction network integrating the coordinate attention is constructed,which can capture the edge shape and texture features of reconstructed objects more accurately.At the same time,an adaptive receptive field module based on dynamic deformable convolution is integrated in the feature extraction network,and the size and shape of receptive field can be adjusted adaptively according to different scales of features to obtain both global and detailed feature representation.The generalization ability of the AMDC-PatchmatchNet method is verified on the aerial image data sets.The test results on DTU data sets show that the overall index of point cloud reconstruction of the proposed method is improved by 2.8％compared with those of mainstream MVS methods.

外文关键词：

multi-view stereo visionattention mechanismdynamic deformable convolutiondeep learn-ing

作者：

孙凯、张成、詹天、苏迪

展开 >

作者单位：

北京理工大学宇航学院飞行器动力学与控制教育部重点实验室,北京 100081

杭州极弱磁场国家重大科技基础设施研究院,浙江杭州 310051

关键词：

多视图立体视觉注意力机制动态形变卷积深度学习

出版年：

2024

DOI：

10.12382/bgxb.2023.0740

兵工学报

中国兵工学会

兵工学报

CSTPCD北大核心

影响因子：0.735

ISSN：1000-1093

年,卷(期)：2024.45(10)