Abstract (EN):
A dense and accurate disparity map is relevant for a large number of applications, ranging from autonomous driving to robotic grasping. Recent developments in machine learning techniques enable us to bypass sensor limitations, such as low resolution, by using deep regression models to complete otherwise sparse representations of the 3D space. This article proposes two main approaches that use a single RGB image and sparse depth information gathered from a variety of sensors/techniques (stereo, LiDAR and Light Stripe Ranging (LSR)): a Convolutional Neural Network (CNN) and a cascade architecture, that aims to improve the results of the first. Ablation studies were conducted to infer the impact of these depth cues on the performance of each model. The models trained with LiDAR sparse information are the most reliable, achieving an average Root Mean Squared Error (RMSE) of 11.8 cm on our own Inhouse dataset; while the LSR proved to be too sparse of an input to compute accurate predictions on its own. © Springer Nature Switzerland AG 2020.
Language:
English
Type (Professor's evaluation):
Scientific