Figure 6: Output of binarization layer: the 1st and 3rd
columns are original images and the 2nd and 4th are out-
put images.
formed asynchronously to create a package including
augmented samples in order to reduce the processing
time. The effective samples are selected from that
package and used to update the parameters. The ex-
haustless sample generation and effective sample se-
lection methods are used in an efficient combination
in order to train the network using a huge amount of
samples, all the while keeping the sample quality. Af-
ter comparison, our proposed approach is not only
useful for the recognition task, but also for the seg-
mentation task. While this is still in its primitive state
for sample generation and selection, we will continue
our research by applying this proposed approach to
other datasets and more challenging recognition tasks.
REFERENCES
Boureau, Y., Bach, F., LeCun, Y., and j. Ponce (2010).
Learning mid-level features for recognition. In IEEE
Conference on Computer Vision and Pattern Recogni-
tion(CVPR2010).
Ciresan, D., Meier, U., and Schmidhuber, J. (2012). Multi-
column deep neural networks for image classification.
In IEEE Conference on Computer Vision and Pattern
Recognition(CVPR2012).
Delakisand, M. and Garcia, C. (2008). Text detection with
convolutional neural networks. In InternationalCon-
ference on Computer Vision Theory and Applications
(VISAPP 2008).
Duffer, S. and Garcia, C. (2007). An online backpropa-
gation algorithm with validation error-based adaptive
learning rate. In In International Conference on Artifi-
cial Neural Networks (ICANN), volume 1, pages 958–
962.
Goodfellow, I., Warde-Farley, D., Mirza, M., Courville, A.,
and Bengio, Y. (2013). Maxout networks. In arXiv
preprint arXiv:1302.4389.
Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I.,
and Salakhutdinov, R. R. (2012). Improving neural
networks by preventing co-adaptation of feature de-
tectors. In arXiv preprint arXiv:1207.0580.
Hubel, D. and Wiesel, T. (1962). Receptive fields, binocu-
lar interaction and functional architecture in the cat’s
visual cortex. Journal of Physiology, 160:106–154.
Jain, V., Murray, J., Roth, F., and Turaga, S. (2007). Super-
vised learning of image restoration with convolutional
networks. In IEEE International Conference on Com-
puter Vision (ICCV2007).
Osadchy, M., LeCun, Y., and Mille, M. (2007). Synergistic
face detection and pose estimation with energy-based
models. In Journal of Machine Learning Research,
number 1197-1215, page 8.
Ouyang, W. and Wang, X. (2013a). Joint deep learning for
pedestrian detection. In IEEE International Confer-
ence on Computer Vision (ICCV2013).
Ouyang, W. and Wang, X. (2013b). Single-pedestrian de-
tection aided by multi-pedestrian detection. In IEEE
Conference on Computer Vision and Pattern Recogni-
tion(CVPR2013).
Rumelhart, D. E., Hinton, G. E., and Williams, R. J. (1986).
Learning internal representations by error propaga-
tion. Parallel Distributed Processing:Explorations in
the Microstructures of Cognition, 1:318–362.
Scherer, D., Muller, A., and S.Behnke (2010). Evaluation of
pooling operations in convolutional architectures for
object recognition. In International Conference on Ar-
tificial Neural Networks(ICANN2010).
Sermanet, P., Chintala, S., and LeCun, Y. (2012). Convolu-
tional neural networks applied to house numbers digit
classification. In International Conference on Pattern
Recognition (ICPR 2012).
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus,
R., and LeCun, Y. (2013). Overfeat: Integrated recog-
nition, localization and detection using convolutional
networks. In arXiv preprint arXiv:1312.6229.
Simard, P., Steinkraus, D., and Platt, J. (2003). Best prac-
tices for convolutional neural networks applied to vi-
sual document analysis. In In International Con-
ference on Document Analysis and Recognition, vol-
ume 2, pages 958–962.
Sutskever, A. K. I. and Hinton, G. (2012). magenet classi-
fication with deep convolutional neural networks. In
Advances in Neural Information Processing Systems
25(NIPS2012).
Vaillant, R., Monrocq, C., and LeCun, Y. (1994). Origi-
nal approach for the localisation of objects in images.
volume 4, pages 245–250.
Wan, L., Zeiler, M., Zhang, S., LeCun, Y., and Fergus, R.
(2013). Regularization of neural networks using drop-
connect. In In International Conference on Machine
Learning (ICML2013).
Y. LeCun ad L. Bottou, Y. B. and P.Haffner (1998).
Gradient-basedlearning applied to document recogni-
tion. In Proceedings of the IEEE, 86(11):2278–2324.
Zeiler, M. D. and Fergus, R. (2013). Visualizing and un-
derstanding convolutional networks. In arXiv preprint
arXiv:1311.2901.
ImprovingQualityofTrainingSamplesThroughExhaustlessGenerationandEffectiveSelectionforDeepConvolutional
NeuralNetworks
235