ing human interaction in the inspection of structures
through automation. Although ideally its perfor-
mance could be improved by increasing the amount
of training data, the relatively high detection rate and
reduced number of false positives achieved are ex-
pected to guarantee a relevant speedup with respect
to human-only inspection. As a matter of fact, the
good segmentation results make it possible to deter-
mine the area of damaged components in many cases.
In an experiment, given the dimensions of a reference
component we calculated the damaged area, as shown
in Figure 10.
5 CONCLUSIONS
In this paper, we have proposed a system for auto-
mated detection of damages in power transmission
towers using drone images. We have successfully
overcome the limitations of the scarcity of training
data and the inherent ambiguity of part of it. These
challenges have been tackled using an instance seg-
mentation network and diverse techniques to optimize
the training suited to the particular conditions of the
available data. Furthermore, we have evaluated the
results obtained by the proposed methods, both quan-
titatively and qualitatively, and compared them to the
baseline. In doing so, we reached the conclusion that,
not only do these techniques improve the results, but
the resulting system shows a promising performance
that would make it suitable for our goal of automa-
tion. Thanks to our sophisticated augmentation, dam-
ages of different types are detected accurately even if
they are underrepresented or subjectively labeled in
the training data. We expect that the usage of this
system will help reduce the human input needed and
substantially speed up the whole process.
REFERENCES
Chao, P., Kao, C.-Y., Ruan, Y.-S., Huang, C.-H., and Lin,
Y.-L. (2019). Hardnet: A low memory traffic network.
In Proceedings of the IEEE/CVF International Con-
ference on Computer Vision, pages 3552–3561.
Dutta, A. and Zisserman, A. (2019). The VIA annotation
software for images, audio and video. In Proceedings
of the 27th ACM International Conference on Multi-
media, MM ’19, New York, NY, USA. ACM.
Girshick, R. (2015). Fast r-cnn. In Proceedings of the IEEE
international conference on computer vision, pages
1440–1448.
Gopalakrishnan, K., Gholami, H., Vidyadharan, A., Choud-
hary, A., and Agrawal, A. (2018). Crack damage
detection in unmanned aerial vehicle images of civil
infrastructure using pre-trained deep learning model.
Int. J. Traffic Transp. Eng, 8(1):1–14.
Lee, Y. and Park, J. (2020). Centermask: Real-time anchor-
free instance segmentation. In Proceedings of the
IEEE/CVF conference on computer vision and pattern
recognition, pages 13906–13915.
Lin, T.-Y., Doll
´
ar, P., Girshick, R., He, K., Hariharan, B.,
and Belongie, S. (2017). Feature pyramid networks
for object detection. In Proceedings of the IEEE con-
ference on computer vision and pattern recognition,
pages 2117–2125.
Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P.,
Ramanan, D., Doll
´
ar, P., and Zitnick, C. L. (2014).
Microsoft coco: Common objects in context. In Euro-
pean conference on computer vision, pages 740–755.
Springer.
Morgenthal, G. and Hallermann, N. (2014). Quality assess-
ment of unmanned aerial vehicle (uav) based visual
inspection of structures. Advances in Structural Engi-
neering, 17(3):289–302.
P
´
erez, P., Gangnet, M., and Blake, A. (2003). Poisson im-
age editing. ACM Trans. Graph., 22(3):313–318.
Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net:
Convolutional networks for biomedical image seg-
mentation. In International Conference on Medical
image computing and computer-assisted intervention,
pages 234–241. Springer.
Shah, M. P. (2017). Semantic segmenta-
tion architectures implemented in pytorch.
https://github.com/meetshah1995/pytorch-semseg.
Shihavuddin, A., Chen, X., Fedorov, V., Nymark Chris-
tensen, A., Andre Brogaard Riis, N., Branner, K.,
Bjorholm Dahl, A., and Reinhold Paulsen, R. (2019).
Wind turbine surface damage detection by deep
learning aided drone inspection analysis. Energies,
12(4):676.
Sony, S., Laventure, S., and Sadhu, A. (2019). A literature
review of next-generation smart sensing technology in
structural health monitoring. Structural Control and
Health Monitoring, 26(3):e2321.
Tian, Z., Shen, C., Chen, H., and He, T. (2019). Fcos:
Fully convolutional one-stage object detection. In
Proceedings of the IEEE/CVF international confer-
ence on computer vision, pages 9627–9636.
Varghese, A., Gubbi, J., Sharma, H., and Balamuralidhar,
P. (2017). Power infrastructure monitoring and dam-
age detection using drone captured images. In 2017
International Joint Conference on Neural Networks
(IJCNN), pages 1681–1687. IEEE.
Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017).
Pyramid scene parsing network. In Proceedings of
the IEEE conference on computer vision and pattern
recognition, pages 2881–2890.
Automated Damage Inspection of Power Transmission Towers from UAV Images
389