首页|基于反向残差注意力的光流估计

基于反向残差注意力的光流估计

扫码查看
光流估计是视频理解和分析的一项基本任务.现有的许多方法直接将遮挡作为异常点剔除,从而提高模型计算光流的能力,但这也容易引起图像灰度不连续,导致光流估计失败.此外,物体高速运动造成的大位移问题一直是光流估计的难点.为了解决上述问题,本文提出一种用于光流估计的基于反向残差注意力的生成对抗学习框架(FlowTran-GAN,FTGAN).该框架通过设计一个反向残差注意力模块增强特征的空间信息,提高像素之间的匹配程度;并且利用基于U-Net的鉴别器来约束生成器,减少光流估计的误差和不连续性,提高模型的泛化能力.通过在KITTI-2015数据集和MPI-Sintel数据集上进行的实验,实验结果表明本文所提出FTGAN的有效性和优越性.
Optical Flow Estimation Based on Inverse Residual Attention
Optical flow estimation is a basic task of video understanding and analysis.Many existing methods directly take occlu-sion as the outer point and eliminate it,so as to improve the ability of the model to calculate the optical flow,but it is also easy to cause the image gray discontinuity,leading to the failure of optical flow estimation.In addition,the problem of large displace-ment caused by high speed motion of objects has always been a difficulty in optical flow estimation.In order to solve the above problems,this paper proposes a generative adversarial learning framework based on reverse residual attention(FlowTranGAN,FTGAN)for optical flow estimation.The proposed framework enhances the spatial information of features by designing a reverse residual attention module to improve the matching degree between pixels.Besides,we use a discriminator based on U-Net to con-strain the generator to reduce the error and discontinuity of optical flow estimation,and improve the generalization ability of the model.Experiment results on the KITTI-2015 dataset and MPI-Sintel dataset demonstrate the effectiveness and superiority of the proposed FTGAN.

optical flow estimationreverse residual attentiongenerative adversarial learningsupervised learning

梁建业、陈俊洪、方桂标、吴兴财、刘文印

展开 >

广东工业大学计算机学院,广东 广州 510006

贵州大学省部共建公共大数据国家重点实验室,贵州 贵阳 550025

光流估计 反向残差注意力 生成对抗学习 有监督学习

国家自然科学基金资助项目国家自然科学基金资助项目国家自然科学基金资助项目广东省引进创新科研团队计划项目广东省基础与应用基础研究基金资助项目广东省科技创新战略专项资金资助项目

9174810762076073619020772014ZT05G1572020A1515010616pdjh2020a0173

2024

计算机与现代化
江西省计算机学会 江西省计算技术研究所

计算机与现代化

CSTPCD
影响因子:0.472
ISSN:1006-2475
年,卷(期):2024.(2)
  • 27