Figure 4: Confusion matrix regarding the model perfor-
mance on the real test set. The starting point of the graph is
11 since the minimum amount of pedestrians in the real test
set is 11.
should mention that Chan et.al experiment in (Chan
et al., 2008) was done by hand-crafting highly spe-
cialized features and exhaustive labeling. This results
approve the suitability of synthetic data as a surrogate
for the small real data when using DCNN.
5 CONCLUSIONS
In this paper we explore the benefits of synthetic data
generation for the application of deep convolutional
neural networks for a crowd counting problem with
small training set. We propose an algorithm for cre-
ating a highly realistic synthetic dataset of pedestri-
ans in a walkway to train the proposed DCNN with.
Moreover, we provide a system trained with synthetic
images capable of predicting the number of pedestri-
ans in an image to a satisfactory extent. The obtained
results suggest the incorporation of synthetic data as
a well-suited surrogate for the missing real along with
alleviating required exhaustive labeling.
There are still many open questions to be ad-
dressed such as, when and to what extent synthetic
images are applicable as a substitute to solve real
world problems. which is the best network architec-
ture for counting the crowd?
ACKNOWLEDGEMENTS
This work has been partially funded by the Spanish
MINECO Grants TIN2013-43478-P and TIN2012-
38187- C03. We gratefully acknowledge the support
of NVIDIA Corporation with the donation of a Tesla
K40 GPU used for this research.
REFERENCES
Cappelli, R., Erol, A., Maio, D., and Maltoni, D. (2000).
Synthetic fingerprint-image generation. In Pattern
Recognition, 2000. Proceedings. 15th International
Conference on. IEEE.
Chan, A. B., Liang, Z.-S. J., and Vasconcelos, N. (2008).
Privacy preserving crowd monitoring: Counting peo-
ple without people models or tracking. In Computer
Vision and Pattern Recognition, 2008. CVPR 2008.
IEEE Conference on. IEEE.
Chan, A. B., Morrow, M., and Vasconcelos, N. (2009).
Analysis of crowded scenes using holistic properties.
In Performance Evaluation of Tracking and Surveil-
lance workshop at CVPR.
Ciregan, D., Meier, U., and Schmidhuber, J. (2012). Multi-
column deep neural networks for image classification.
In Computer Vision and Pattern Recognition (CVPR),
2012 IEEE Conference on. IEEE.
Eggert, C., Winschel, A., and Lienhart, R. (2015). On the
benefit of synthetic data for company logo detection.
In Proceedings of the 23rd ACM international confer-
ence on Multimedia. ACM.
Griffin, G., Holub, A., and Perona, P. (2007). Caltech-256
object category dataset. California Institute of Tech-
nology.
Kong, D., Gray, D., and Tao, H. (2005). Counting pedes-
trians in crowds using viewpoint invariant training. In
BMVC. Citeseer.
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Im-
agenet classification with deep convolutional neural
networks. In Advances in neural information process-
ing systems.
LeCun, Y. and Bengio, Y. (2005). Convolutional networks
for images, speech, and time series. In BMVC. Cite-
seer.
Leibe, B., Schindler, K., and Van Gool, L. (2007). Coupled
detection and trajectory estimation for multi-object
tracking. In 2007 IEEE 11th International Conference
on Computer Vision. IEEE.
Mahadevan, V., Li, W., Bhalodia, V., and Vasconcelos, N.
(2010). Anomaly detection in crowded scenes. In
CVPR.
Marana, A., Costa, L. d. F., Lotufo, R., and Velastin, S.
(1998). On the efficacy of texture analysis for crowd
monitoring. In Computer Graphics, Image Process-
ing, and Vision, 1998. Proceedings. SIBGRAPI’98. In-
ternational Symposium on. IEEE.
Phua, C., Lee, V., Smith, K., and Gayler, R. (2010). A
comprehensive survey of data mining-based fraud de-
tection research. In arXiv preprint arXiv:1009.6119.
Rabaud, V. and Belongie (2006). Counting crowded moving
objects. In 2006 IEEE Computer Society Conference
on Computer Vision and Pattern Recognition. IEEE.
Segu
´
ı, S., Pujol, O., and Vitria, J. (2015). Learning to count
with deep object features. In Proceedings of the IEEE
Conference on Computer Vision and Pattern Recogni-
tion Workshops.
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods
322