增量式尺度估计下的相机位置解算

扫码查看

原文链接

万方数据
维普

中文摘要：目的全局式从运动恢复结构(structure from motion,SfM)通过运动平均一次性恢复所有相机的绝对位姿,效率相对较高.运动平均中的平移平均主要负责解算相机在世界坐标系下的绝对位置,其求解过程因尺度歧义性、估计敏感性和求解不确定性的影响而较为困难.本文提出了一种基于增量尺度估计的平移平均方法,在消除尺度歧义性的同时提升了求解鲁棒性与准确性.方法本文将平移平均问题解耦为3个子问题:1)局部绝对尺度的增量式估计;2)全局绝对尺度的增量式估计;3)基于L1优化的尺度已知的绝对位置估计.结果在1DSfM数据集上进行对比实验,基线解算精度明显提升,解算相机百分比的均值达到96％.当引入两种不同的绝对旋转进行计算时,其绝对位置中值误差仅略差于 BATA(bilinear angle-based translation averaging)与 CReTA(correspondence reweighted translation averaging),排名第3,均值误差改善更为明显,分别排名第1和第2.相较于原始方法,本文方法在相机解算数量与位置解算精度上均有较大提升.结论本文方法综合了尺度分离思想与增量式参数估计思想,既消除了尺度歧义性,又保证了鲁棒性与高效性,求解所得的相机绝对位置稳定可靠.

外文标题：Incremental scale estimation-based camera location recovery

外文摘要：Objective The structure from motion(SfM)technique serves as the fundamental step in the sparse reconstruc-tion process,finding extensive applications in remote sensing mapping,indoor modeling,augmented reality,and ancient architecture preservation.SfM technology retrieves camera poses from images,encompassing two main categories:incre-mental and global approaches.The global SfM,in contrast to the iterative nature of incremental SfM,simultaneously esti-mates the absolute poses of all cameras through motion averaging,resulting in relatively high efficiency.However,it still encounters challenges regarding robustness and accuracy.Rotation averaging and translation averaging constitute crucial components within the motion averaging.Compared with rotation averaging,translation averaging is more difficult due to the following three reasons:1)Only relative translation directions could be recovered by essential matrix estimation and decomposition,i.e.,the produced relative translations are scale ambiguous.2)Only cameras in the same parallel rigid component could their absolute locations be uniquely determined by translation averaging,while for rotation averaging,the requirement simply degenerates to the connected component.3)Compared with relative rotation,the estimation accuracy of relative translation is more vulnerable to the feature point mismatches and more likely to be outlier contaminated.In tra-ditional approaches,the translation averaging method based on scale separation(L1SE-L1TA)calculates the relative base-line length between cameras before estimating the absolute locations,eliminates the scale ambiguity,and the solving range is no longer constrained by the camera triplet,but its robustness and accuracy still need to be improved.Incremental trans-lation averaging(ITA)introduces the idea of incremental parameter estimation into the translation averaging process for the first time,which has good robustness and high accuracy.However,its solving process depends on camera triplets and may suffer from degeneracy during collinear camera motion.To solve the above problems,this study proposes a translation aver-aging method based on incremental scale estimation(ISE-L1TA),which eliminates the scale ambiguity and enhances the method's robustness and result accuracy.Method Incremental SfM has been proven to be highly accurate and robust,mak-ing it a preferred choice for many applications.It has shown to be particularly effective in handling large datasets and over-coming the challenges posed by complex real-world scenarios.Recognizing its potential,researchers have sought to trans-fer the incremental parameter estimation ideology to other related tasks,such as incremental rotation averaging(IRA)and ITA.In particular,IRA is designed to estimate the camera absolute rotations incrementally and efficiently.Meanwhile,ITA is performed for the camera absolute locations,enabling it to handle outliers effectively and avoid error propagation.Overall,the adoption of incremental parameter estimation ideology for motion averaging tasks demonstrates the versatility and effectiveness of this approach.With its ability to handle complex datasets and overcome a range of challenges,the incremental parameter estimation ideology holds great promise for future research in the field of 3D reconstruction and beyond.In this study,ISE-L1TA is proposed by incorporating the scale separation strategy and incremental parameter esti-mation ideology.Specifically,the translation averaging problem is decomposed into three sub-ones and sequentially solved:1)incremental estimation of local absolute scale,2)incremental estimation of global absolute scale,3)scale-aware absolute location estimation based on L,optimization.The input of our proposed method is the pairwise scale invari-ant feature transform point matches,and its output is the absolute camera locations.First,the relative motion between cam-eras is obtained by estimating and decomposing the essential matrix.Next,the two-view triangulation is performed to calcu-late the relative depths in the local coordinate system.On the basis of depth ratios,incremental estimations are conducted for the local and global absolute scales.Subsequently,the relative baseline length between cameras is computed,and rota-tion averaging is performed for absolute rotation estimation,enabling the final scale-aware absolute location estimation.Result We performed experimental tests to evaluate the selection of scale distance function and scale distance threshold.The experimental results confirmed that the normalized perfect square deviation function effectively eliminates the impact of scaling effects.Furthermore,the incremental scale estimation method shows good robustness and insensitivity to scale dis-tance threshold and achieves remarkably higher baseline accuracy compared with L1SE.The experiments were conducted on the 1DSfM dataset.In comparison with various state-of-the-art methods including bilinear angle-based translation aver-aging(BATA),correspondence reweighted translation averaging(CReTA),ITA,and L1SE-L1TA,our proposed method exhibited the following performance:1)In terms of the number of cameras solved,the average percentage of successfully solved cameras using the proposed method is 96％.2)The median error of absolute location estimation is slightly worse than that of BATA and CReTA and ranks third overall under different absolute rotations.3)In terms of the mean error in absolute location estimation,the proposed method has remarkable advantages,ranking first and second respectively.Com-pared with the original L1SE-L1TA,the method in this study has a great improvement in the number of cameras solved and the accuracy of locations estimated.Conclusion The proposed method combines the concept of scale separation with incre-mental parameter estimation.By integrating these two ideas,our method effectively eliminates scale ambiguity while ensur-ing the effectiveness of outlier rejection and maintaining a concise solving process.As a result,the obtained absolute cam-era locations are stable and reliable.

外文关键词：

global structure from motiontranslation averagingscale separationbaseline length computationincremen-tal parameter estimation

作者：

李梦晗、高翔、解则晓、申抒含

展开 >

作者单位：

中国海洋大学工程学院,青岛 266100

中国科学院自动化研究所,北京 100190

关键词：

全局式从运动恢复结构平移平均尺度分离基线长度求解增量式参数估计

基金：

国家自然科学基金项目国家自然科学基金项目国家自然科学基金项目

项目编号：

62373349U22B205542076192

出版年：

2024

DOI：

10.11834/jig.230745

中国图象图形学报

中国科学院遥感应用研究所,中国图象图形学学会 ,北京应用物理与计算数学研究所

中国图象图形学报

CSTPCD北大核心

影响因子：1.111

ISSN：1006-8961

年,卷(期)：2024.29(10)