Table 3: Number of inferences that each algorithm can
make in a second and percentage of time reduction obtained
to process the examples compared to Algorithm 1.
Data Alg1 Alg2 % Alg3 %
cens 32.5 34.9 6.76 9502.0 99.63
covt 251.3 456.7 49.97 17810.4 97.44
fars 69.8 71.0 1.68 24852.0 99.71
hepm 0.3* 0.4* 6.31 611.2 99.93
higg 2.3* 2.4* 2.48 991.2 99.76
kddc 6316.4 6396.1 1.25 67363.0 90.51
poke 7.0 21.7 67.80 242051.3 99.99
susy 242.6 365.8 33.68 25706.9 98.58
mean 865.3 918.6 20.62 48611.0 98.19
It can also be observed that while Algorithm 2
produces an improvement in time of 20.62% over the
Algorithm 1, the Algorithm 3 produces a reduction
of 98.19%, going from being able to make 865 infer-
ences per second to more than 48000. Algorithm 3, as
expected, presents its best results in those cases where
a high number of rules are used, such as “hepm”,
“higg” and “poke” databases. It also shows good
results when the number of continuous attributes is
small in relation to the total number of attributes.
These results show that the process of evaluation
of membership functions and pruning used in the pro-
posal makes it more efficient than the other two algo-
rithms.
In summary, the results presented show that the
proposed algorithm provides a substantial improve-
ment in the process of inference in problems where
the description of knowledge contains a high number
of rules and/or the number of continuous attributes
(discretized as fuzzy domains) is small in relation to
the total number of attributes.
5 CONCLUSIONS
The use of a large number of examples generates
problems in obtaining knowledge from these exam-
ples, but also in using them in a reasoning model.
Some of the algorithms proposed in the field of fuzzy
logic to deal with big data problems have the disad-
vantage of generating a very high number of rules.
The standard inference algorithm of the winning rule
used in fuzzy logic has problems when confronted
with knowledge bases with many rules.
The modified version of the algorithm that opti-
mizes the calculation through adaptation thresholds
is not sufficient to significantly improve the response
time.
We have analyzed the problem and have proposed
a model for reasoning that is more efficient than the
standard one in cases where there are a large number
of fuzzy rules. Thus, we have presented a calculation
of the inference algorithm that uses a different strat-
egy to obtain the winning rule. Although according
to its order of complexity is a worse approximation
than the original algorithm, in the experimental part
we show that it presents a significantly better behav-
ior applied to databases than the field of classifiers
based on fuzzy logic are being applied.
In future work, it seems interesting to combine
both algorithms using a parallel model to ensure a bet-
ter time response to the inference process.
ACKNOWLEDGEMENTS
This work has been partially funded by the Spanish
MEC Projects TIN2015-71618-R, DPI2015-69585-R
and co-financed by FEDER funds (European Union).
REFERENCES
Bache, K. and Lichman, M. (2013). Uci machine learning
repository.
Chi, Z., Yan, H., and Pham, T. (1996). Fuzzy algorithms:
with applications to image processing and pattern
recognition, volume 10. World Scientific.
Cord
´
on, O., del Jesus, M. J., and Herrera, F. (1999). A pro-
posal on reasoning methods in fuzzy rule-based clas-
sification systems. International Journal of Approxi-
mate Reasoning, 20:21–45.
Dean, J. and Ghemawat, S. (2008). Mapreduce: simpli-
fied data processing on large clusters. Commun ACM
51(1), 107-113.
Dean, J. and Ghemawat, S. (2010). Mapreduce: a flexible
data processing tool. Communications of the ACM,
53(1):72–77.
del R
´
ıo, S., L
´
opez, V., Ben
´
ıtez, J. M., and Herrera, F. (2015).
A mapreduce approach to address big data classifica-
tion problems based on the fusion of linguistic fuzzy
rules. International Journal of Computational Intelli-
gence Systems, 8(3):422–437.
Elkano, M., Galar, M., Sanz, J., and Bustince, H. (2018).
Chi-bd: A fuzzy rule-based classification system for
big data classification problems. Fuzzy Sets and Sys-
tems, 348(1):75–101.
G
´
amez, J. C., Garcia, D., Gonz
´
alez, A., and P
´
erez, R.
(2016). On the use of an incremental approach to
learn fuzzy classification rules for big data problems.
In 2016 IEEE International Conference on Fuzzy Sys-
tems, FUZZ-IEEE 2016, Vancouver, BC, Canada, July
24-29, 2016, pages 1413–1420.
Ishibuchi, H. and Yamamoto, T. (2005). Rule weight spec-
ification in fuzzy rule-based classification systems.
IEEE Transactions on Fuzzy Systems, 13(4):428–435.
IoTBDS 2019 - 4th International Conference on Internet of Things, Big Data and Security
260