measurement of vessels over a large-distance trajec-
tory. Despite the obtained high performance of the
system, the current process of law enforcement still
requires the intervention of a human operator. How-
ever, the performance of our automated system is
approaching the level of directly supporting law en-
forcement.
ACKNOWLEDGEMENTS
We want to thank the Dutch Ministry of Infrastruc-
ture and Water Management for supporting the exper-
iments and providing access to the test location. Parts
of this work were funded by the European ITEA PS-
Crimson project on surveillance.
REFERENCES
Ahmed, E., Jones, M., and Marks, T. K. (2015). An
improved deep learning architecture for person re-
identification. In CVPR, pages 3908–3916.
Bibby, C. and Reid, I. (2005). Visual tracking at sea.
In Proceedings of the 2005 IEEE Int. Conference on
Robotics and Automation, pages 1841–1846. IEEE.
Bloisi, D. and Iocchi, L. (2009). Argos—a video surveil-
lance system for boat traffic monitoring in venice. In-
ternational Journal of Pattern Recognition and Artifi-
cial Intelligence, 23(07):1477–1502.
Chen, H., Lagadec, B., and Bremond, F. (2019). Partition
and reunion: A two-branch neural network for vehicle
re-identification. In CVPR Workshops.
Everingham, M., Van Gool, L., Williams, C. K. I., Winn,
J., and Zisserman, A. (2012). The PASCAL Visual
Object Classes Challenge 2012 (VOC2012) Results.
Girshick, R. (2015). Fast R-CNN. In Proceedings of the
International Conference on Computer Vision.
Groot, H., Bondarau, E., et al. (2019). Improving person re-
identification performance by customized dataset and
person detection. In IS&T International Symposium
on Electronic Imaging 2019, Image Processing: Al-
gorithms and Systems XVII.
He, K., Gkioxari, G., Doll
´
ar, P., and Girshick, R. (2017).
Mask R-CNN. In Proceedings of the International
Conference on Computer Vision.
Hermans*, A., Beyer*, L., and Leibe, B. (2017). In Defense
of the Triplet Loss for Person Re-Identification. arXiv
preprint arXiv:1703.07737.
Karanam, S., Gou, M., Wu, Z., Rates-Borras, A., Camps,
O., and Radke, R. J. (2019). A systematic evaluation
and benchmark for person re-identification: Features,
metrics, and datasets. IEEE Transactions on Pattern
Analysis and Machine Intelligence, 41(3):523–536.
Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P.,
Ramanan, D., Doll
´
ar, P., and Zitnick, C. L. (2014).
Microsoft coco: Common objects in context. In Euro-
pean Conference on Computer Vision, pages 740–755.
Springer.
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.,
Fu, C.-Y., and Berg, A. C. (2016). Ssd: Single shot
multibox detector. In ECCV, pages 21–37. Springer.
Qiao, D., Liu, G., Zhang, J., Zhang, Q., Wu, G., and
Dong, F. (2019). M3c: Multimodel-and-multicue-
based tracking by detection of surrounding vessels in
maritime environment for usv. Electronics, 8(7):723.
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.
(2016). You only look once: Unified, real-time ob-
ject detection. In IEEE CVPR, pages 779–788.
Redmon, J. and Farhadi, A. (2017). Yolo9000: Better,
faster, stronger. In IEEE CVPR, pages 6517–6525.
Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster R-
CNN: Towards real-time object detection with region
proposal networks. In NIPS.
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh,
S., Ma, S., Huang, Z., Karpathy, A., Khosla, A.,
Bernstein, M., Berg, A. C., and Fei-Fei, L. (2015).
ImageNet Large Scale Visual Recognition Challenge.
International Journal of Computer Vision (IJCV),
115(3):211–252.
Shi, J. and Tomasi, C. (1994). Good features to track. In
1994 Proceedings of IEEE Conference on Computer
Vision and Pattern Recognition, pages 593–600.
Wang, G., Yuan, Y., Chen, X., Li, J., and Zhou, X. (2018).
Learning discriminative features with multiple granu-
larities for person re-identification. In 2018 ACM Mul-
timedia Conference on Multimedia Conference, pages
274–282. ACM.
Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., and Tian,
Q. (2015). Scalable person re-identification: A bench-
mark. In Proceedings of the IEEE International Con-
ference on Computer Vision, pages 1116–1124.
Zheng, Z., Zheng, L., and Yang, Y. (2017). Unlabeled
samples generated by gan improve the person re-
identification baseline in vitro. In Proceedings of the
IEEE International Conference on Computer Vision,
pages 3754–3762.
Zwemer, M. H., Wijnhoven, R. G., and de With, P. H. N.
(2018). Ship detection in harbour surveillance based
on large-scale data and cnns. In Proceedings of the
13th International Joint Conference on Computer Vi-
sion, Imaging and Computer Graphics Theory and
Applications - Volume 5: VISAPP,, Funchal, Madeira,
Portugal. INSTICC, INSTICC.
Vessel-speed Enforcement System by Multi-camera Detection and Re-identification
277