tecedent part. In this regards, we suggest the use of
class association rule mining wherein each transac-
tion is labelled with a class s. Let S be the set of
all class items corresponding to the attribute S, I be
the set of all items in database corresponding to the
attributes SPF and N, and S ∩ I=
/
0. A class associa-
tion rule is an implication in the form: a−→b, where
a ⊆ I and b ⊆ S. In CARs candidats are denoted as
ruleitems and are in the following form: (Condset,s).
condset represents a set of items (condset ⊆ I) and
s a class (s ⊆ S). Each ruleitem represents the rule
condset −→ s. The support count of a condset (SCC)
represents the number of transactions in the database
which contain the condset. The support count of the
generated rules (SCR) is the number of transactions of
the database containing the condset and having s as
a class. Like in the traditional association rule min-
ing, apriori in CARM generates all frequent ruleitems
whose support is above a minsup threshold, then, use
them to generate class association rules with a con-
fidence value (SCR / SCC) above the user-specified
minconf threshold. The distinct particularity in CAR
candidate generation function is that, the joining step
is done by joining condsets of ruleitems having the
same class.
As mentioned above this approach (CARM) aims
at only producing rules in a specific form which is
assumed to be adequate for our prediction task. Al-
though it may seem logical to proceed, simply, to a
post-selection of interesting rules, this solution is, in
practice, very difficult and sometimes impossible due
to the combinatorial explosion, in other words, the
huge number of rules that could be generated.
4 CONCLUSION
Association rule mining is a fundamental datamin-
ing task which has been basically presented as an ex-
ploratory tool rather than a predictive tool. In this
research we attempted at reviewing the most impor-
tant advances, in this datamining tool, ranging from
proposing scalable algorithms and efficient method-
ologies for mining frequent itemsets to handling a
diversity of data types and structures and extended
mining tasks. Our overall goal is to show how all
this progress has contributed in making AR a pow-
erful prediction tool. Indeed, we proposed an associ-
ation rule-based approach for predicting the evolution
of geographical areas as an attempt to show how the
progress, done so far, can, practically, be harnessed
for this example of prediction problem.
Our proposal consists in an apriori-based ap-
proach for mining rules predicting the evolevolution
of geographical areas. This approach proposes to ad-
dress issues related to handling spatial and temporal
relationships in the learning dataset, producing rules
involving rare patterns, and making sure to only gen-
erate rules in an adequate form for our prediction
problem.
REFERENCES
Aggarwal, C. C., Bhuiyan, M. A., and Hasan, M. A. (2014).
Frequent pattern mining algorithms: A survey. In Ag-
garwal, C. C. and Han, J., editors, Frequent Pattern
Mining, pages 19–64. Springer International Publish-
ing.
Agrawal, R., Imieli
´
nski, T., and Swami, A. (1993). Min-
ing association rules between sets of items in large
databases. In ACM SIGMOD Record, volume 22,
pages 207–216. ACM.
Chen, J., Miller, C., and Dagher, G. (2014). Product rec-
ommendation system for small online retailers using
association rules mining. In Innovative Design and
Manufacturing (ICIDM), Proceedings of the 2014 In-
ternational Conference on, pages 71–77.
Farzanyar, Z. and Kangavari, M. R. (2012). Efficient min-
ing of fuzzy association rules from the pre-processed
dataset. Computing and Informatics, 31(2):331–347.
Gharbi, A., de Runz, C., Faiz, S., and Akdag, H. (2014). An
association rules based approach to predict semantic
land use evolution in the french city of saint-denis. In-
ternational Journal of Data Warehousing and Mining,
10:1–17.
Ilayaraja, M. and Meyyappan, T. (2015). Efficient data
mining method to predict the risk of heart diseases
through frequent itemsets. Procedia Computer Sci-
ence, 70:586 – 592. Proceedings of the 4th Inter-
national Conference on Eco-friendly Computing and
Communication Systems.
Lin, J. and Li, X. (2015). Knowledge transfer for large-
scale urban growth modeling based on formal concept
analysis. Transactions in GIS, pages n/a–n/a.
Park, H. A., Kim, T., Li, M., Shon, H. S., Park, J. S., and
Ryu, K. H. (2015). Application of gap-constraints
given sequential frequent pattern mining for protein
function prediction. Osong Public Health and Re-
search Perspectives, 6(2):112 – 120.
Rudin, C., Letham, B., and Madigan, D. (2013). Learn-
ing theory analysis for association rules and sequen-
tial event prediction. Journal of Machine Learning
Research, 14:3441–3492.
Ryan, C. and Brown, K. (2013). Predicting occupant lo-
cations using association rule mining. In Bramer, M.
and Petridis, M., editors, Research and Development
in Intelligent Systems XXX, pages 63–77. Springer In-
ternational Publishing.
Thabtah, F., Cowling, P., and Peng, Y. (2004). Mmac: a
new multi-class, multi-label associative classification
approach. In Data Mining, 2004. ICDM ’04. Fourth
IEEE International Conference on, pages 217–224.
GISTAM 2016 - 2nd International Conference on Geographical Information Systems Theory, Applications and Management
206