Experimental results showed that there are quantita-
tive, temporal, and density relationship between train-
ing data and performance, i.e., with more records for
training, the supervised learning techniques can better
adjust the learning bias, providing better predictions.
As strengths of this article, we highlight: i) we
used heterogeneous data sources, official and unoffi-
cial crime records, to predict crimes; ii) we provided
a complementarity analysis showing the feasibility of
using different data sources by combining them into
a single and bigger dataset; iii) we proposed a predic-
tive approach capable of predicting the tendency and
occurrence of different types of crimes in different ge-
ographic regions; iv) we evaluated four different ma-
chine learning techniques used by our crime predic-
tion approach; v) the authorities can use our approach
to plan crime preventive actions, such deciding where
to perform street lighting and police patrol.
Contrasting the predictive techniques presented
in this article and other recent crime prediction ap-
proaches reported in the literature, we observed that
there is still room for further improvements. There-
fore, for future work we plan to evaluate other learn-
ing techniques, such as latent factor models and neu-
ral networks for crime prediction. Also, we want to
exploit different geographic properties of crimes as
features within our predictive approach, and extend
the datasets to cover more types of crimes. Finally,
we also intent to use more sources of heterogeneous
data, specially crime records with judgment data.
ACKNOWLEDGEMENTS
The present work was carried out with the support
of the Coordenac¸
˜
ao de Aperfeic¸oamento de Pessoal
de N
´
ııvel Superior - Brazil (CAPES) - Financing
Code 001. The authors thank CNPq, FAPEMIG,
PUC Minas and SESP-MG (Secretaria de Estado de
Seguranc¸a P
´
ublica de Minas Gerais) for the partial
support in the execution of this work.
REFERENCES
Baculo, M. J. C., Marzan, C. S., de Dios Bulos, R., and
Ruiz, C. (2017). Geospatial-temporal analysis and
classification of criminal data in manila. In Interna-
tional Conference on Computational Intelligence and
Applications, ICCIA’17, pages 6–11. IEEE.
Belesiotis, A., Papadakis, G., and Skoutas, D. (2018). An-
alyzing and predicting spatial crime distribution using
crowdsourced and open data. ACM Transactions on
Spatial Algorithms and Systems, 3(4):12:1–12:31.
Bogomolov, A., Lepri, B., Staiano, J., Oliver, N., Pianesi,
F., and Pentland, A. (2014). Once upon a crime: To-
wards crime prediction from demographics and mo-
bile data. In International Conference on Multimodal
Interaction, ICMI’14, pages 427–434. ACM.
Breiman, L. (2001). Random forests. Machine Learning,
45(1):5–32.
B
¨
uhlmann, P. and Yu, B. (2010). Boosting. Wiley Interdis-
cipl. Reviews: Computational Statistics, 2(1):69–74.
Celisse, A. and Mary-Huard, T. (2018). Theoretical analy-
sis of cross-validation for estimating the risk of the k-
nearest neighbor classifier. Journal of Machine Learn-
ing Research, 18:1–54.
Chen, T. and Guestrin, C. (2016). Xgboost: A scalable
tree boosting system. In ACM SIGKDD International
Conference on Knowledge Discovery and Data Min-
ing, KDD’16, pages 785–794. ACM.
Constitution, B. (1988). Constitution of the Federative Re-
public of Brazil.
Duda, R. O., Hart, P. E., and Stork, D. G. (2001). Pattern
Classification. Wiley-Interscience, 2ed edition.
Friedman, J. H. (2001). Greedy function approximation: A
gradient boosting machine. The Annals of Statistics,
29(5):1189–1232.
Ho, T. K. (1995). Random decision forests. In International
Conference on Document Analysis and Recognition,
ICDAR’95, pages 278–282. IEEE.
Hsu, C.-W., Chang, C.-C., and Lin, C.-J. (2008). A practical
guide to support vector classication. Technical report,
National Taiwan University, Taipei, Taiwan.
Huang, C., Zhang, J., Zheng, Y., and Chawla, N. V. (2018).
Deepcrime: Attentive hierarchical recurrent networks
for crime prediction. In ACM International Confer-
ence on Information and Knowledge Management,
CIKM’18, pages 1423–1432. ACM.
Jain, R. (1991). The art of computer systems performance
analysis. Wiley-Interscience.
Kohavi, R. and John, G. H. (1997). Wrappers for feature
subset selection. Artificial Intelligence, 97(1):273–
324.
Lin, Y.-L., Yen, M.-F., and Yu, L.-C. (2018). Grid-based
crime prediction using geographical features. Inter-
national Journal of Geo-Information, 7(8).
Liu, T.-Y. (2009). Learning to rank for information re-
trieval. Foundations and Trends in Information Re-
trieval, 3(3):225–331.
Oberwittler, D., Hummelsheim, D., Hirtenlehner, H., and
Jackson, J. (2010). Social insecurities and fear of
crime: A cross-national study on the impact of wel-
fare state policies on crime-related anxieties. Euro-
pean Sociological Review, 27(3):327–345.
Pradhan, I., Potika, K., Eirinaki, M., and Potikas, P. (2019).
Exploratory data analysis and crime prediction for
smart cities. In International Database Applications
& Engineering Symposium, IDEAS’19, pages 4:1–
4:9. ACM.
Syarif, I., Prugel-Bennett, A., and Wills, G. (2016). Svm
parameter optimization using grid search and ge-
netic algorithm to improve classification performance.
Telecommunication Computing Electronics and Con-
trol, 14:1502–1509.
United Nations (2015). Universal declaration of human
rights.
Predicting Crime by Exploiting Supervised Learning on Heterogeneous Data
531