Abstract: Although deep learning methods have been widely applied to SLAM visual odometry over the past decade with impressive improvements, their accuracy remains limited in complex dynamic environments. In this paper, we use a composite mask-based generative adversarial network to predict camera motion and binocular depth maps. Specifically, a perceptual generator is first designed to obtain the corresponding disparity map and optical flow between two neighboring frames. Then, an iterative pose refinement strategy is proposed to improve the accuracy of pose estimation. Finally, a composite mask is embedded in the discriminator to sense structural deformations in the synthesized virtual image, thus encouraging the generator to learn additional structure-level information and further improve pose estimation accuracy. Detailed quantitative and qualitative evaluations on the KITTI dataset show that the proposed framework outperforms existing conventional, supervised-learning, and unsupervised deep VO methods, yielding better results in both pose estimation and depth estimation.