Table 6: PointNet++ tested with real world point clouds: To train the network, w e either used point or face normals obtained
from 90% of 3D models as in Table 5. For testing, we used real point clouds equipped with point normals. Results were
compared with classical threshold segmentation as in Table 5.
training variant
Wall North East South West Flat Roof
mean
point normals 65.2% 64.3% 52.0 % 70.7% 56.8% 55.7% 60.8%
face normals
53.3% 5 6.2% 57.0% 73.2% 57.1% 47.1% 57.3%
classical segmentation 52.1% 60.3% 59.5% 71.6% 59.4% 44.2% 57.9 %
Figure 11: Stable segmentation results on cloud subsets.
ACKNOWLEDGEMENT
This work was supported by a generous hardware
grant from NVIDIA.
REFERENCES
Ben-Shabat, Y., Lindenbaum, M., and Fischer, A.
(2017). 3D point cloud classification and segmenta-
tion using 3D modified fisher vector representation
for convolutional neural networks. arXiv preprint
arXiv:1711.08241.
Boulch, A., Le Saux, B., and Audebert, N. ( 2017). Un-
structured point cloud semantic labeling using deep
segmentation networks. In 3DOR.
Duchi, J., H azan, E., and Singer, Y. (2011). Adaptive
subgradient methods for online learning and stochas-
tic optimization. Journal of Machine Learning Rese-
arch, 12:12:2121–2159.
Goebbels, S. and Pohle-Fr¨ohlich, R. (2016). Roof recon-
struction from airborne laser scanning data based on
image processing methods. ISPRS Ann. Photogramm.
Remote Sens. and Spatial Inf. Sci., III-3:407–414.
Hu, X. and Yuan, Y. (2016). Deep-learning-based classifi-
cation for dtm extraction from als point cloud. Remote
sensing, 8(9):730.
Hua, B.-S., Tran, M.-K., and Yeung, S.-K. (2018). Point-
wise convolutional neural networks. In Proceedings of
the IEEE Conference on Computer Vision and Pattern
Recognition, pages 984–993.
Kingma, D. and A dam, J. B. (2015). A method for stochas-
tic optimization. In International Conference on Le-
arning Representations, pages 1–15, San Diego, CA.
Klokov, R. and Lempitsky, V. S. (2017). Escape from cells:
Deep kd-networks f or the recognition of 3D point
cloud models. 2017 IEEE International Conference
on Computer Vision (ICCV), pages 863–872.
Li, X., Chen, S., Hu, X., and Yang, J. (2018). Understan-
ding the disharmony between dropout and batch nor-
malization by variance shift. CoRR, abs/1801.05134.
Maturana, D. and Scherer, S. (2015). Voxnet: A 3D convo-
lutional neural network for real-time object recogni-
tion. In Intelligent Robots and Systems (IROS), 2015
IEEE/RSJ International Conference on, pages 922–
928. IEEE.
Minto, L., Zanuttigh, P., and Pagnutti, G. (2018). Deep
learning for 3d shape classification based on volume-
tric density and surface approximation clues. In VISI-
GRAPP (5: VISAPP), pages 317–324.
Qi, C. R., S u, H., Mo, K., and Guibas, L. J. (2017a). Point-
net: Deep learning on point sets for 3D classification
and segmentation. In Proceedings of the IEEE Con-
ference on Computer Vision and Pattern Recognition
(CVPR), pages 77–85.
Qi, C. R., Yi, L., Su, H., and Guibas, L. (2017b). Point-
net++: Deep hierarchical f eature learning on point
sets in a metric space. In Proceedings of the 31st
Conference on Neural Information Processing Sys-
tems (NIPS).
Rethage, D., Wald, J., Sturm, J., Navab, N., and Tombari, F.
(2018). Fully-convolutional point networks for large-
scale point clouds. arXiv preprint arXiv:1808.06840.
Ronneberger, O., Fi scher, P., and Brox, T. (2015). U-net:
Convolutional networks for biomedical image seg-
mentation. CoRR, abs/1505.04597.
Wang, P., Gan, Y., Shui, P., Yu, F., Zhang, Y., Chen, S., and
Sun, Z. (2018a). 3D shape segmentation via shape
fully convolutional networks. Computers & Graphics,
70:128–139.
Wang, P.-S., Liu, Y., Guo, Y.-X., Sun, C .-Y., and Tong, X.
(2017). O-CNN: Octree-based convolutional neural
networks for 3D shape analysis. ACM Transactions
on Graphics (TOG), 36(4):72:1–72:11.
Wang, R., Peethambaran, J., and Dong, C. (2018b). Li-
DAR point clouds to 3D urban models: A review.
IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens.,
11(2):606–627.
Yang, Z., Jiang, W., Xu, B., Zhu, Q., Jiang, S., and Huang,
W. (2017). A convolutional neural network-based 3D
semantic labeling method for als point clouds. Remote
Sensing, 9(9):936.