
Future work involves studying the behavior of
YOLOv5 using different partial decodings, not only
using a 30% of the PNG size but exploring a larger
range. Also, even though YOLOv5 is mature and
widely spread, testing more recent YOLO versions
could provide additional insight into the advantages
of performing fish detection using partially decoded
data.
Finally, we would like to emphasize that we have
made all the source code used in this paper publicly
available. The PHIE is available at https://github.com/
aburguera/PHCENCODER. A YOLOv5 wrapper as
well as different tools to compute different metrics
and perform grid search on different parameters is
available at https://github.com/aburguera/ODUTILS.
ACKNOWLEDGEMENTS
This work is partially supported by
Grant PID2020-115332RB-C33 funded by
MCIN/AEI/10.13039/501100011033 by ”ERDF
A way of making Europe” and by Grant PLEC2021-
007525/AEI/10.13039/501100011033 funded by the
Agencia Estatal de Investigacion, under NextGenera-
tion EU/PRTR
REFERENCES
Anwar, S. and Li, C. (2020). Diving deeper into underwa-
ter image enhancement: A survey. Signal Processing:
Image Communication, 89:115978.
Bonin-Font, F. and Burguera, A. (2020). Towards multi-
robot visual graph-SLAM for autonomous marine ve-
hicles. Journal of Marine Science and Engineering,
8(6).
Burguera, A. (2023). Hierarchical color encoding for
progressive image transmission in underwater envi-
ronments. IEEE Robotics and Automation Letters,
8(5):2970–2975.
Burguera, A. and Bonin-Font, F. (2022). Progressive hierar-
chical encoding for image transmission in underwater
environments. In OCEANS 2022 - Hampton Roads,
pages 1–9.
Burguera, A., Bonin-Font, F., Lisani, J. L., Petro, A. B.,
and Oliver, G. (2016). Towards automatic visual sea
grass detection in underwater areas of ecological in-
terest. In 2016 IEEE 21st International Conference
on Emerging Technologies and Factory Automation
(ETFA), pages 1–4.
Ditria, E. M., Connolly, R. M., Jinks, E. L., and Lopez-
Marcano, S. (2021). Annotated video footage for au-
tomated identification and counting of fish in uncon-
strained seagrass habitats. Frontiers in Marine Sci-
ence, 8.
Do, H., Hong, S., and Kim, J. (2020). Robust Loop Closure
Method for Multi-Robot Map Fusion by Integration of
Consistency and Data Similarity. IEEE Robotics and
Automation Letters, 5(4):5701–5708.
Feng, W., Hu, C., Wang, Y., Zhang, J., and Yan, H. (2019).
A novel hierarchical coding progressive transmission
method for wmsn wildlife images. Sensors, 19(4).
Guo, J.-M. (2010). High efficiency ordered dither block
truncation coding with dither array lut and its scal-
able coding application. Digital Signal Processing,
20(1):97–110.
He, K., Gkioxari, G., Doll
´
ar, P., and Girshick, R. (2017).
Mask R-CNN.
Jaafar, A. N., Ja’afar, H., Pasya, I., Abdullah, R., and Ya-
mada, Y. (2022). Overview of underwater commu-
nication technology. In Proceedings of the 12th Na-
tional Technical Seminar on Unmanned System Tech-
nology 2020, pages 93–104, Singapore. Springer Sin-
gapore.
K
¨
oser, K. and Frese, U. (2020). Challenges in Underwater
Visual Navigation and SLAM. In Intelligent Systems,
Control and Automation: Science and Engineering,
volume 96, pages 125–135.
Latif, Y., Cadena, C., and Neira, J. (2013). Robust loop
closing over time for pose graph SLAM. International
Journal of Robotics Research, 32(14):1611–1626.
Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll
´
ar, P.
(2020). Focal loss for dense object detection. IEEE
Transactions on Pattern Analysis and Machine Intel-
ligence, 42(2):318–327.
Monika, R., Samiappan, D., and Kumar, R. (2021). Under-
water image compression using energy based adaptive
block compressive sensing for IoUT applications. The
Visual Computer, (37):1499–1515.
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.
(2016). You only look once: Unified, real-time ob-
ject detection. In 2016 IEEE Conference on Com-
puter Vision and Pattern Recognition (CVPR), pages
779–788, Los Alamitos, CA, USA. IEEE Computer
Society.
Rubino, E. M., Centelles, D., Sales, J., Marti, J. V., Marin,
R., Sanz, P. J., and Alvares, A. J. (2017). Progressive
image compression and transmission with region of
interest in underwater robotics. In OCEANS 2017 -
Aberdeen, pages 1–9.
Said, A. and Pearlman, W. (1996). A new, fast, and efficient
image codec based on set partitioning in hierarchical
trees. IEEE Transactions on Circuits and Systems for
Video Technology, 6(3):243–250.
Tan, M., Pang, R., and Le, Q. V. (2020). EfficientDet: Scal-
able and efficient object detection. In 2020 IEEE/CVF
Conference on Computer Vision and Pattern Recogni-
tion (CVPR), pages 10778–10787.
Ultralytics. YOLOv5.
https://github.com/ultralytics/yolov5. Accessed:
2023-11-14.
Viola, P. and Jones, M. (2001). Rapid object detection us-
ing a boosted cascade of simple features. In Proceed-
ings of the 2001 IEEE Computer Society Conference
on Computer Vision and Pattern Recognition. CVPR
2001, volume 1, pages I–I.
VISAPP 2024 - 19th International Conference on Computer Vision Theory and Applications
340