Design a NN able to take sparse data as input, which is difficult in general
Performing an adaptation of an existing network, designed to perform detection on a dense input
Notes
Actually this has been performed with a pre-processing step to convert a the Point Cloud which is a Sparse Data Structure into an Image which is a Dense Data Structure
This could possibly add some extra computational cost and be sensitive to the non trainable pre-processing related hyperparams like Grid Cell Size
Prior on the space with a certain regular geometry
Each cell is 8cm square
Detected Points get projected on the grid according to sensor extrinsic calibration and this defines a Lidar point to cell association
According to this association it is possible to compute cell specific statistics defining the values for the 3 channels: average height, average intensity and density as number of points in the cell
Overview
Complex-YOLO: Real-time 3D Object Detection on Point Clouds
Arxiv: https://arxiv.org/abs/1803.06199