Figure 8: Accuracy of yaw angle prediction of 3 classes.
pedestrians have significantly precision improvement
on object detection with the help of Normal-map.
5 CONCLUSION
In this work we propose a 3D object detection method
that combines Normal-map (the surface normal esti-
mated from point cloud) with other hand-crafted im-
ages. The proposed method makes the input informa-
tion have more enhanced spatial shape information.
The object detection results show competitive perfor-
mance on KITTI benchmarks. This method has better
accuracy in object detection than conventional meth-
ods, and is less affected by sparse point clouds. In
addition, it brings better yaw angle prediction. It also
has excellent anti-interference ability for object sur-
faces with unreliable reflection intensity data. Our
method has the potential to be used for the virtual-
world dataset, enables further research in autonomous
driving. In the future, we would like to use a modern
normal estimation technique in our pipeline for the
accuracy of Normal-map go further. We also plan to
improve it to detect more classes of objects and faster.
REFERENCES
Beltr
´
an, J., Guindel, C., Moreno, F. M., Cruzado, D.,
Garc
´
ıa, F., and Escalera, A. D. L. (2018). Birdnet: A
3d object detection framework from lidar information.
In 2018 21st International Conference on Intelligent
Transportation Systems (ITSC), pages 3517–3523.
Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y. M. (2020).
Yolov4: Optimal speed and accuracy of object detec-
tion. arXiv preprint arXiv:2004.10934.
Chen, X., Ma, H., Wan, J., Li, B., and Xia, T. (2017). Multi-
view 3d object detection network for autonomous
driving. In The IEEE Conference on Computer Vision
and Pattern Recognition (CVPR).
Geiger, A., Lenz, P., and Urtasun, R. (2012). Are we ready
for autonomous driving? the kitti vision benchmark
suite. In Conference on Computer Vision and Pattern
Recognition (CVPR).
Girshick, R. (2015). Fast r-cnn. In Proceedings of the IEEE
international conference on computer vision, pages
1440–1448.
Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014).
Rich feature hierarchies for accurate object detection
and semantic segmentation. In The IEEE Conference
on Computer Vision and Pattern Recognition (CVPR).
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.,
Fu, C.-Y., and Berg, A. C. (2016). Ssd: Single shot
multibox detector. In European conference on com-
puter vision, pages 21–37. Springer.
Merry, B., Gain, J., and Marais, P. (2013). Accelerating kd-
tree searches for all k-nearest neighbours. Technical
report, University of Cape Town.
Ran, L., Ximin, Z., Yiyuan, Z., Wanggen, W., and Lib-
ing, L. (2013). Normal estimation algorithm for point
cloud using kd-tree. IET International Conference on
Smart and Sustainable City 2013 (ICSSC 2013).
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.
(2016). You only look once: Unified, real-time ob-
ject detection. 2016 IEEE Conference on Computer
Vision and Pattern Recognition (CVPR).
Redmon, J. and Farhadi, A. (2017). Yolo9000: Better,
faster, stronger. In The IEEE Conference on Computer
Vision and Pattern Recognition (CVPR).
Redmon, J. and Farhadi, A. (2018). Yolov3: An incremental
improvement. arXiv preprint arXiv:1804.02767.
Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster
r-cnn: Towards real-time object detection with region
proposal networks. In Advances in neural information
processing systems, pages 91–99.
Shi, S., Wang, X., and Li, H. (2019). Pointrcnn: 3d object
proposal generation and detection from point cloud.
In Proceedings of the IEEE Conference on Computer
Vision and Pattern Recognition, pages 770–779.
Simon, M., Amende, K., Kraus, A., Honer, J., Samann,
T., Kaulbersch, H., Milz, S., and Michael Gross, H.
(2019). Complexer-yolo: Real-time 3d object detec-
tion and tracking on semantic point clouds. In Pro-
ceedings of the IEEE Conference on Computer Vision
and Pattern Recognition Workshops, pages 0–0.
Simony, M., Milzy, S., Amendey, K., and Gross, H.-M.
(2018). Complex-yolo: An euler-region-proposal for
real-time 3d object detection on point clouds. In Pro-
ceedings of the European Conference on Computer Vi-
sion (ECCV), pages 0–0.
Wang, C. and Siddiqi, K. (2016). Differential geometry
boosts convolutional neural networks for object detec-
tion. In The IEEE Conference on Computer Vision and
Pattern Recognition (CVPR) Workshops.
Zhou, Q.-Y., Park, J., and Koltun, V. (2018). Open3d: A
modern library for 3d data processing. arXiv preprint
arXiv:1801.09847.
VISAPP 2021 - 16th International Conference on Computer Vision Theory and Applications
576