neighborhood size between 3 and 10, and a default
Lambda parameter equals to 1.0. According to Table
5, C4.5 have presented the best accuracy rates
comparing to K-NN and Naïve Bayes that did not
exceed 97.56% and 93.18% respectively for training
sets and 92.73% and 89.79% respectively for test
sets. These classifiers have achieved good
performance but still lower comparing to the
performance of C4.5 algorithm.
Table 5: Comparison of accuracy rates obtained using
C4.5, K-NN and Naive Bayes classifiers.
Classification
techniques
Training sets Test sets
Min
(%)
Max
(%)
Min
(%)
Max
(%)
C4.5 96.01 100 91.04 100
K-NN 95.12 97.56 83.33 92.73
Naïve Bayes 85.25 93.18 83.82 89.79
By applying C4.5 decision tree algorithm in this
study, a promising and satisfying accuracy rates
were achieved. In fact, the deep analysis of the
initial data set enabled to identify the input attributes
for each decision tree. This procedure allowed to
simplify the model generation phase and produce
decision trees achieving low error rates which
contributes to the production of accurate and
efficient preliminary conclusions.
6 CONCLUSIONS
In this paper, a case study about the application of
C4.5 decision tree algorithm was conducted using a
data set extracted from the ANS unit of university
hospital Avicenne in Morocco. The objective of this
study was to produce a decision support system to
automate the analysis procedure of the ANS's test
results and make it easier for specialists. Thereby, as
a first step, C4.5 algorithm was used to generate a
set of classifiers that enable to generate the
preliminary conclusions needed to produce the
appropriate diagnosis. The classifiers were evaluated
and the results obtained achieved high accuracy rates
which were very promising. However, as a
limitation of this study, we may mention the small
size of the data set used. Thus, more validation tests
over bigger data sets should be conducted.
As mentioned in Section 3, The ANS unit is
specialized on conducting the ANS tests in order to
analyze the preliminary conclusions deducted from
the classifiers. These conclusions are analyzed by
the specialists to provide a global synthesis,
diagnosis of the patient’s state and prescribe the
appropriate treatment. In this study, we worked on
the first phase of the procedure and using the C4.5
algorithm, we were able to define a set of rules
helping to generate the preliminary conclusions. For
future work, a validation of the generated classifiers
by cardiologists on new patients needs to be carried
out. Besides, classification and association
techniques will be used to produce a complete
decision support system that provide a diagnosis for
patients and suggest the appropriate treatment.
REFERENCES
Aparna, R.. Bincy, G., Mathu, T., 2012. Survey on
common data mining classification Technique. In
International Journal of Wisdom Based Computing, 2.
Apté, C., Weiss, S., 1997. Data mining with decision trees
and decision rules. In Future Generation Computer
Systems.
Benarroch, E., 1993. The central autonomic network:
Functional organization, dysfunction and perspective.
In Mayo Clinic Proceedings.
Breiman, L., Friedman, J., Olshen, R. A., Stone, C. J.
(1984). Classification and regression trees, Chapman
and Hall/CRC, 1
st
edition.
Coghlan, H. C., 1996. Orthostatic intolerance: mitral valve
prolapse. In Primer on the Autonomic Nervous System,
D. Robertson, P. A. Low, and J. Polinsky, Eds.,
Academic Press, San Diego, Calif, USA.
Esfandiari, N., Babavalian, M. R., Moghadam, A. E.,
Tabar, V. 2014. Knowledge discovery in medicine:
Current issue and future trend. In Expert Systems with
Applications.
Familia, A., Shenb, W. M., Weberc, R., Simoudis, E.,
(1997). Data preprocessing and intelligent data
analysis. In Intelligent Data Analysis.
Grubb, B. P., Karas, B., 1999. Clinical Disorders of the
Autonomic Nervous System Associated With
Orthostatic Intolerance: An Overview of
Classification, Clinical Evaluation and Management.
In Pacing and Clinical Electrophysiology.
Han, J., Kamber, M. 2001. Data Mining, Concepts and
Techniques, Morgan Kaufmann publisher.
Han, J., Kamber, M., Pei, P., 2011. Data preprocessing. In
The Morgan Kaufmann Series in “Data Management
Systems”, Morgan Kaufmann Publishers.
Johansen, T. L., Kambskar, G., Mehlsen, J., 1997. Heart
rate variability in evaluation of the autonomic nervous
system. In Ugeskr Laeger.
Karaolis, M. A., Moutiris, J. A., Hadjipanayi, D., Pattichis,
C. S., 2010. Assessment of the Risk Factors of
Coronary Heart Events Based on Data Mining With
Decision Trees. In IEEE Transactions on Information
Technology in Biomedicine.
Kreibig, S. D., 2010. Autonomic nervous system activity
in emotion: A review. In Biological Psychology.
HeartDiseaseDiagnosisUsingC4.5Algorithms-ACaseStudy
403