coder network to compensate the missing texture in-
formation. To tackled the lack of large trainable
dataset in this field and to evaluate our HFPNet quan-
titatively, we presented a synthetic CSN dataset and
a real-world NightCampus dataset. We demonstrated
that HFPNet, which is trained on synthetic dataset,
yielding top performances on real-world scenes.
ACKNOWLEDGEMENTS
This work was supported in part by the NSFC under
Grants 62073215, 61873166, and 61673275.
REFERENCES
Adelson, E. H., Anderson, C. H., Bergen, J. R., Burt, P. J.,
and Ogden, J. M. (1984). Pyramid methods in image
processing. RCA engineer, 29(6):33–41.
Badrinarayanan, V., Kendall, A., and Cipolla, R. (2017).
Segnet: A deep convolutional encoder-decoder ar-
chitecture for image segmentation. IEEE transac-
tions on pattern analysis and machine intelligence,
39(12):2481–2495.
Burt, P. and Adelson, E. (1983). The laplacian pyramid as a
compact image code. IEEE Transactions on commu-
nications, 31(4):532–540.
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and
Yuille, A. L. (2017). Deeplab: Semantic image seg-
mentation with deep convolutional nets, atrous convo-
lution, and fully connected crfs. IEEE transactions on
pattern analysis and machine intelligence, 40(4):834–
848.
Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler,
M., Benenson, R., Franke, U., Roth, S., and Schiele,
B. (2016). The cityscapes dataset for semantic urban
scene understanding. In Proceedings of the IEEE con-
ference on computer vision and pattern recognition,
pages 3213–3223.
Ge, J., Luo, Y., and Tei, G. (2009). Real-time pedes-
trian detection and tracking at nighttime for driver-
assistance systems. IEEE Transactions on Intelligent
Transportation Systems, 10(2):p.283–298.
Ghiasi, G. and Fowlkes, C. C. (2016). Laplacian pyramid
reconstruction and refinement for semantic segmenta-
tion. In European Conference on Computer Vision,
pages 519–534. Springer.
Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., and
Liu, W. (2018). Ccnet: Criss-cross attention for se-
mantic segmentation.
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Im-
agenet classification with deep convolutional neural
networks. In Advances in neural information process-
ing systems, pages 1097–1105.
Lin, T. Y., Dollar, P., Girshick, R., He, K., Hariharan, B.,
and Belongie, S. (2017). Feature pyramid networks
for object detection. In 2017 IEEE Conference on
Computer Vision and Pattern Recognition (CVPR).
Long, J., Shelhamer, E., and Darrell, T. (2015). Fully con-
volutional networks for semantic segmentation. In
Proceedings of the IEEE conference on computer vi-
sion and pattern recognition, pages 3431–3440.
Mazzini, D. (2018). Guided upsampling network for
real-time semantic segmentation. arXiv preprint
arXiv:1807.07466.
Nekrasov, V., Shen, C., and Reid, I. (2018). Light-weight
refinenet for real-time semantic segmentation. arXiv
preprint arXiv:1810.03272.
Or
ˇ
si
´
c, M. and Segvic, S. (2020). Efficient semantic seg-
mentation with pyramidal fusion. Pattern Recogni-
tion, page 107611.
Sakaridis, C., Dai, D., and Van Gool, L. (2019). Guided cur-
riculum model adaptation and uncertainty-aware eval-
uation for semantic nighttime image segmentation.
Sun, L., Wang, K., Yang, K., and Xiang, K. (2019). See
clearer at night: Towards robust nighttime semantic
segmentation through day-night image conversion.
Wang, C. Research on Infrared Image Semantic Segmenta-
tion Technology Based on Deep Learning. PhD thesis,
University of Chinese Academy of Sciences (Shang-
hai Institute of Technical Physics, Chinese Academy
of Sciences).
Xu, F. and Fujimura, K. (2002). Pedestrian detection and
tracking with night vision. In Intelligent Vehicle Sym-
posium, 2002. IEEE.
Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang,
N. (2018). Bisenet: Bilateral segmentation network
for real-time semantic segmentation. In Proceed-
ings of the European Conference on Computer Vision
(ECCV), pages 325–341.
Yu, F. and Koltun, V. (2015). Multi-scale context ag-
gregation by dilated convolutions. arXiv preprint
arXiv:1511.07122.
Zhao, H., Qi, X., Shen, X., Shi, J., and Jia, J. (2018).
Icnet for real-time semantic segmentation on high-
resolution images. In Proceedings of the European
Conference on Computer Vision (ECCV), pages 405–
420.
Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017).
Pyramid scene parsing network. In Proceedings of
the IEEE conference on computer vision and pattern
recognition, pages 2881–2890.
Zhiyi Liu, Shaoyuan Sun, Z. R. (2020). Infrared image se-
mantic segmentation of unmanned vehicles at night
based on improved deeplabv3+. Journal of Applied
Optics, 41(1).
VISAPP 2021 - 16th International Conference on Computer Vision Theory and Applications
328