6 COMMENTS/DISCUSSION
This paper introduces indexed convolution and pool-
ing operators for images presenting pixels arranged
in non-Cartesian lattices. These operators have been
validated on standard images as well as on the special
case of hexagonal lattice images, exhibiting similar
performances as standard convolutions and therefore
showing that the indexed convolution works as ex-
pected. However, the indexed method is much more
general and can be applied to any grid of data, en-
abling unconventional image representation to be ad-
dressed without any pre-processing. This differs from
other approaches such as image re-sampling com-
bined with masked convolutions (Hoogeboom et al.,
2018) or oversampling to square lattice (Holch et al.,
2017) that actually require additional pre-processing.
Moreover, both methods increase the size of the trans-
formed image (adding useless pixels of padding value
for the resampled image to be rectangular and / or
multiplying the number of existing pixels) and are re-
stricted to regular grids. On the other hand, they make
use of out the box operators already available in cur-
rent deep learning frameworks.
The approach proposed in this paper is not lim-
ited to hexagonal lattice and only needs the index
matrices to be built prior the training and inference
processes, one for each convolution of different in-
put size. No additional pre-processing of the image
is then required to apply convolution and pooling ker-
nels. However, the current implementation in Python
shows a decrease in computing performances com-
pared to the convolution method implemented in Py-
torch. We have observed an increase of RAM usage of
factors varying between 1 and 3 and training times of
factors varying between 4 and 8 on GPU (depending
on the GPU model), of factor 1.5 on CPU (but slightly
faster than masked convolutions on CPU) depending
on the network used.
These drawbacks are actually related to the use of
un-optimized codes and work is carried out to fix this
by the use of optimized CUDA and C++ implementa-
tions.
As a future work, we will use the indexed oper-
ations for the analysis of hexagonal grid images of
CTA. We also plan to experiment with arbitrary ker-
nels, which are another benefit of the indexed opera-
tions, for the convolution (e.g. retina like kernel with
more density in the center, see the example in the
github repository).
ACKNOWLEDGEMENTS
This project has received funding from the European
Union’s Horizon 2020 research and innovation pro-
gram under grant agreement No 653477.
This work has been done thanks to the facilities of-
fered by the Université Savoie Mont Blanc MUST
computing center.
REFERENCES
Ambrosi, G., Awane, Y., Baba, H., et al. (2013). The
cherenkov telescope array large size telescope. arXiv
preprint arXiv:1307.4565.
Asharindavida, F., Hundewale, N., and Aljahdali, S. (2012).
Study on hexagonal grid in image processing. 45:282–
288.
Bolmont, J., Corona, P., Gauron, P., et al. (2014). The
camera of the fifth h.e.s.s. telescope. part i: System
description. Nuclear Instruments and Methods in
Physics Research Section A: Accelerators, Spectrom-
eters, Detectors and Associated Equipment, 761:46 –
57.
Cho, D., Lee, M., Kim, S., and Tai, Y.-W. (2013). Modeling
the calibration pipeline of the lytro camera for high
quality light-field image reconstruction. In Proceed-
ings of the IEEE International Conference on Com-
puter Vision, pages 3280–3287.
Glicenstein, J., Barcelo, M., Barrio, J., et al. (2013). The
NectarCAM camera project. ArXiv e-prints.
He, K., Zhang, X., Ren, S., and Sun, J. (2015). Deep
residual learning for image recognition. CoRR,
abs/1512.03385.
He, X. and Jia, W. (2005). Hexagonal structure for intelli-
gent vision. In Information and Communication Tech-
nologies, 2005. ICICT 2005. First International Con-
ference on, pages 52–64. IEEE.
Holch, T. L., Shilon, I., Büchele, M., et al. (2017). Probing
convolutional neural networks for event reconstruc-
tion in gamma-ray astronomy with cherenkov tele-
scopes. arXiv preprint arXiv:1711.06298.
Holschneider, M., Kronland-Martinet, R., Morlet, J., and
Tchamitchian, P. (1990). A real-time algorithm for
signal analysis with the help of the wavelet transform.
In Combes, J.-M., Grossmann, A., and Tchamitchian,
P., editors, Wavelets, pages 286–297, Berlin, Heidel-
berg. Springer Berlin Heidelberg.
Hoogeboom, E., Peters, J. W., Cohen, T. S., and Welling,
M. (2018). Hexaconv. In International Conference on
Learning Representations.
Katz, U. (2012). A neutrino telescope deep in the Mediter-
ranean Sea. CERN Cour., 52N6:31–33.
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Im-
agenet classification with deep convolutional neural
networks. In Advances in neural information process-
ing systems, pages 1097–1105.
VISAPP 2019 - 14th International Conference on Computer Vision Theory and Applications
370