case of “Grid-Grid” sampling, the DCNN is not that
high since it is trained with patches that do not cover
the whole spatial range of the original image. The best
global accuracy 99.1% is achieved for the “Grid-
Random” case for both patch sizes 128×128 and
224×224.
4 CONCLUSIONS
In this work a deep learning approach is presented
that tackles the problem of extracting buildings from
historical topographic maps. For this purpose, a
DCNN based on the U-Net architecture is trained in a
deep image-to-image regression mode. Experiments
on a historical topographic map demonstrate that the
proposed method efficiently extracts the buildings
from the map even when they are densely surrounded
or even overlapped by text or other geospatial
features. Evaluation under several sampling and patch
size scenarios gives promising results in terms of
building detection accuracy, especially when large
patch sizes are involved and when training the
network is based on randomly generated patches.
ACKNOWLEDGMENTS
Article processing charges are covered by the
University of West Attica.
REFERENCES
Dornaika, F., & Hammoudi, K. (2009, May). Extracting 3D
Polyhedral Building Models from Aerial Images Using
a Featureless and Direct Approach. In MVA (pp. 378-
381).
Fischer, A., Kolbe, T. H., Lang, F., Cremers, A. B.,
Förstner, W., Plümer, L., & Steinhage, V. (1998).
Extracting buildings from aerial images using
hierarchical aggregation in 2D and 3D. Computer
Vision and Image Understanding, 72(2), 185-203.
Hecht, R., Meinel, G., & Buchroithner, M. (2015).
Automatic identification of building types based on
topographic databases–a comparison of different data
sources. International Journal of Cartography, 1(1),
18-31.
Heitzler, M., & Hurni, L. (2020). Cartographic
reconstruction of building footprints from historical
maps: A study on the Swiss Siegfried map.
Transactions in GIS, 24(2), 442-461.
Kingma, D. P., & Ba, J. (2014). Adam: A method for
stochastic optimization. arXiv preprint
arXiv:1412.6980.
Laycock, S. D., Brown, P. G., Laycock, R. G., & Day, A.
M. (2011). Aligning archive maps and extracting
footprints for analysis of historic urban environments.
Computers & Graphics, 35(2), 242-249.
LeCun, Y., Bottou, L., Bengio, Y. & Haffner, P. (1998).
Gradient-based learning applied to document
recognition. Proceedings of the IEEE. 86(11): 2278 –
2324.
Li, B. H., Hou, B. C., Yu, W. T., Lu, X. B., & Yang, C. W.
(2017). Applications of artificial intelligence in
intelligent manufacturing: a review. Frontiers of
Information Technology & Electronic Engineering,
18(1), 86-96.
Peng, J., Zhang, D., & Liu, Y. (2005). An improved snake
model for building detection from urban aerial images.
Pattern Recognition Letters, 26(5), 587-595.
Ronneberger, O., Fischer, P., & Brox, T. (2015, October).
U-net: Convolutional networks for biomedical image
segmentation. In International Conference on Medical
image computing and computer-assisted intervention
(pp. 234-241). Springer, Cham.
Suzuki, S., & Chikatsu, H. (2003). Recreating the past city
model of historical town Kawagoe from antique map.
International Archives of Photogrammetry and Remote
Sensing, 34, 5.
Uhl, J. H., Leyk, S., Chiang, Y. Y., Duan, W., & Knoblock,
C. A. (2017, July). Extracting human settlement
footprint from historical topographic map series using
context-based machine learning. In 8th International
Conference of Pattern Recognition Systems (ICPRS
2017) (pp. 1-6).
Uhl, J. H., Leyk, S., Chiang, Y. Y., Duan, W., & Knoblock,
C. A. (2019). Automated extraction of human
settlement patterns from historical topographic map
series using weakly supervised convolutional neural
networks. IEEE Access, 8, 6978-6996.
Voulodimos, A., Doulamis, N., Doulamis, A., &
Protopapadakis, E. (2018). Deep learning for computer
vision: A brief review. Computational intelligence and
neuroscience, 2018.