Table 2: The obtained results for all considered classifiers when applied on Dataset B. The highest rate per figure of merit is
emboldened.
Classifier Precision Recall f1-score
k-NN 0.64 0.69 0.60
Random forest 0.62 0.73 0.66
SVM 0.61 0.69 0.60
ANN 0.59 0.61 0.60
CNN+MFCC 0.59 0.72 0.64
CNN+log-mel spectrogram 0.58 0.72 0.64
CNN+constant-Q transform 0.60 0.68 0.57
6 CONCLUSIONS
In this paper, a great variety of machine learning
methods has been extensively evaluated on heartbeat
sound classification, with the aim being the detec-
tion of abnormalities such as extrasystole, extra heart
sound and murmurs. To this end, several tempo-
ral and spectral audio features have been exploited.
Such an automatic framework aims at supporting de-
cisions made by healthcare professionals, as well as
early diagnosis, e.g. using a smartphone, so as to
quickly check for any existing heartbeat abnormali-
ties and contact an expert physician. It was shown
that the recognition rates reached by such audio pat-
tern recognition methods differ significantly between
dataset A (smartphone) and dataset B (stethoscope).
In the first datasets, the methods achieved quite good
results in distinguishing artifacts and murmurs, while
in the second the results were worse, especially for the
extrasystole class where no model was able to classify
correctly.
The results of the present experiments could be
primarily improved by expanding the datasets. More
specifically, it would be especially useful to have
available more heartbeat samples representing the ex-
tra heart sound classes, i.e. extrasystole and mur-
mur. Further improvements could be obtained by cor-
rectly extracting and labeling the S1 and S2 sounds
of the heartbeat, and use them as an additional input
feature for the different classifiers. From a machine
learning perspective, it would be interesting to exper-
iment with a) data augmentation methods, including
transfer learning (Ntalampiras and Potamitis, 2018),
b) modeling temporal properties of heartbeats, us-
ing e.g. temporal convolutional networks (Yan et al.,
2020), and c) employed one-shot learning techniques
(Lake et al., 2015) accommodating scarce data avail-
ability.
REFERENCES
Ari, S. and Saha, G. (2009). In search of an optimiza-
tion technique for artificial neural network to clas-
sify abnormal heart sounds. Applied Soft Computing,
9(1):330–340.
Bentley, P., Nordehn, G., Coimbra, M., and Mannor, S.
(2011). The PASCAL Classifying Heart Sounds Chal-
lenge 2011 (CHSC2011) Results. http://www.peterj
bentley.com/heartchallenge/index.html.
Brown, J. C. (1991). Calculation of a constant q spectral
transform. The Journal of the Acoustical Society of
America, 89(1):425–434.
Chao, A., Ng, S., and Wang, L. (2018). Listen to your heart:
Feature extraction and classification methods for heart
sounds.
Davey, P., Sprigings, D., El-Kadri, M., and Hart, G.
(2018a). Extrasystoles. Oxford University Press, Ox-
ford, UK.
Davey, P., Sprigings, D., Timperley, J., and Hothi, S.
(2018b). Murmur. Oxford University Press, Oxford,
UK.
Giannakopoulos, T. and Pikrakis, A. (2014). Chapter 4 -
audio features. In Giannakopoulos, T. and Pikrakis,
A., editors, Introduction to Audio Analysis, pages 59
– 103. Academic Press, Oxford.
Kruse, R., Borgelt, C., Braune, C., Mostaghim, S., Stein-
brecher, M., Klawonn, F., and Moewes, C. (2016).
Computational Intelligence: A Methodological Intro-
duction. Springer Publishing Company, Incorporated,
2nd edition.
Lake, B. M., Salakhutdinov, R., and Tenenbaum, J. B.
(2015). Human-level concept learning through proba-
bilistic program induction. Science, 350(6266):1332–
1338.
Lu, L., Jiang, H., and Zhang, H. (2001). A robust audio clas-
sification and segmentation method. In Proceedings of
the ninth ACM international conference on Multime-
dia, pages 203–211.
Lu, L., Zhang, H.-J., and Li, S. Z. (2003). Content-based au-
dio classification and segmentation by using support
vector machines. Multimedia systems, 8(6):482–492.
McFee, B., Raffel, C., Liang, D., Ellis, D., Mcvicar, M.,
Battenberg, E., and Nieto, O. (2015). librosa: Audio
and music signal analysis in python. pages 18–24.
Automatic Acoustic Diagnosis of Heartbeats
55