lution networks. IEEE Trans. on Pattern Analysis and
Machine Intelligence (PAMI), 35(8):1872–1886.
Cohen, T. S. and Welling, M. (2016). Group equivariant
convolutional networks. In Proc. of the International
Conf. on Machine Learning (ICML).
Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., and
Wei, Y. (2017). Deformable convolutional networks.
arXiv tech report.
Doersch, C., Gupta, A., and Efros, A. A. (2015).
Unsupervised visual representation learning by context
prediction. In Proc. of the IEEE International Conf. on
Computer Vision (ICCV).
Haeusser, P., Mordvintsev, A., and Cremers, D. (2017).
Learning by association - a versatile semi-supervised
training method for neural networks. In Proc. IEEE
Conf. on Computer Vision and Pattern Recognition
(CVPR).
Ioffe, S. and Szegedy, C. (2015). Batch normalization:
Accelerating deep network training by reducing internal
covariate shift. In Proc. of the International Conf.
on Machine Learning (ICML).
Jaderberg, M., Simonyan, K., Zisserman, A., and Kavukcuoglu,
K. (2015). Spatial transformer networks. In
Advances in Neural Information Processing Systems
(NIPS).
Kingma, D. P. and Ba, J. (2015). Adam: A method for
stochastic optimization. In Proc. of the International
Conf. on Learning Representations (ICLR).
Kivinen, J. J. and Williams, C. K. I. (2011). Transformation
equivariant Boltzmann machines.
Krizhevsky, A. (2009). Learning multiple layers of features
from tiny images. Master’s thesis, Department of
Computer Science, University of Toronto.
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012).
ImageNet classification with deep convolutional neural
networks. In Advances in Neural Information
Processing Systems (NIPS).
Laptev, D., Savinov, N., Buhmann, J. M., and Pollefeys, M.
(2016). TI-POOLING: transformation-invariant pooling
for feature learning in convolutional neural networks.
In Proc. IEEE Conf. on Computer Vision and
Pattern Recognition (CVPR).
Larochelle, H., Erhan, D., Courville, A., Bergstra, J., and
Bengio, Y. (2007). An empirical evaluation of deep
architectures on problems with many factors of variation.
In Proc. of the International Conf. on Machine
Learning (ICML).
LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. (1998).
Gradient-based learning applied to document recognition.
Proc. of the IEEE, 86(11):2278–2324.
Miyato, T., Maeda, S., Koyama, M., Nakae, K., and Ishii,
S. (2016). Distributional smoothing by virtual adversarial
examples. In Proc. of the International Conf. on
Learning Representations (ICLR).
Noroozi, M. and Favaro, P. (2016). Unsupervised learning
of visual representations by solving jigsaw puzzles.
In Proc. of the European Conf. on Computer Vision
(ECCV).
Oyallon, E. and Mallat, S. (2015). Deep roto-translation
scattering for object classification. In Proc. IEEE
Conf. on Computer Vision and Pattern Recognition
(CVPR).
Rasmus, A., Valpola, H., Honkala, M., Berglund, M., and
Raiko, T. (2015). Semi-supervised learning with ladder
networks. In Advances in Neural Information
Processing Systems (NIPS).
Sajjadi, M., Javanmardi, M., and Tasdizen, T. (2016).
Regularization with stochastic transformations and
perturbations for deep semi-supervised learning. In
Advances in Neural Information Processing Systems (NIPS).
Sifre, L. and Mallat, S. (2013). Rotation, scaling and
deformation invariant scattering for texture discrimination.
In Proc. IEEE Conf. on Computer Vision and Pattern
Recognition (CVPR).
Simard, P. Y., Steinkraus, D., and Platt, J. C. (2003). Best
practices for convolutional neural networks applied to
visual document analysis.
Sohn, K. and Lee, H. (2012). Learning invariant
representations with local transformations. In Proc. of the
International Conf. on Machine Learning (ICML).
Springenberg, J. T., Dosovitskiy, A., Brox, T., and
Riedmiller, M. A. (2015). Striving for simplicity: The all
convolutional net. In International Conf. on Learning
Representations (ICLR) (workshop track).
Stallkamp, J., Schlipsing, M., Salmen, J., and Igel, C.
(2012). Man vs. computer: Benchmarking machine
learning algorithms for traffic sign recognition.
Neural Networks, 32:323–332.
Worrall, D. E., Garbin, S. J., Turmukhambetov, D., and
Brostow, G. J. (2017). Harmonic networks: Deep
translation and rotation equivariance. In Proc. IEEE
Conf. on Computer Vision and Pattern Recognition
(CVPR).
Zheng, S., Song, Y., Leung, T., and Goodfellow, I. (2016).
Improving the robustness of deep neural networks via
stability training. In Proc. IEEE Conf. on Computer
Vision and Pattern Recognition (CVPR).
Zhou, Y., Ye, Q., Qiu, Q., and Jiao, J. (2017). Oriented
response networks. In Proc. IEEE Conf. on Computer
Vision and Pattern Recognition (CVPR).
VISAPP 2018 - International Conference on Computer Vision Theory and Applications