models for individual tasks. We observed a precision
of 94.2% and sensitivity of 92.4% for the 54 cough
events and a precision of 89.6% and sensitivity of
91.6% for breath event detection.
5 CONCLUSIONS
An extensive study on methods for using speech as a
modality for respiratory sensing and cough
monitoring is presented in this paper. These strategies
are essential for patients suffering from respiratory
conditions, especially in remote monitoring services.
Our results evaluated on joint datasets of 10 healthy
volunteers conclude that joint sensing of coughs and
respiratory parameters is possible by training deep
learning models on separate datasets specific to
respiratory sensing and cough detection. However,
evaluation of this strategy on speech recordings of
patients suffering from respiratory conditions is
essential and is the future scope of our work.
ACKNOWLEDGMENT
This work was partially supported by the Horizon
H2020 Marie Skłodowska-Curie Actions Initial
Training Network
European Training Network project under grant
agreement No. 766287 (TAPAS) and Data Science
Department, Philips Research, Eindhoven.
REFERENCES
Alireza A. Dibazar, S. Narayanan, and Theodore W.
Berger, “Feature analysis for automatic detection of
pathological speech,” in Proceedings of the Second
Joint 24th Annual Conference and the Annual Fall
Meeting of the Biomedical Engineering Society,
Engineering in Medicine and Biology. IEEE, 2002, vol.
1, pp. 182–183.
Marcin Włodarczak, Mattias Heldner, and Jens Edlund,
“Breathing in conversation: An unwritten history,” in
Proceedings of the 2nd European and the 5th Nordic
Symposium on Multimodal Communication: 2015,
number 110 in Linkoping Electronic Conference
Proceedings,¨ pp. 107–112.
Venkata Srikanth Nallanthighal, Aki Harm¨ a, and Helmer
Strik, “Deep¨ Sensing of Breathing Signal During
Conversational Speech,” in Proc. Interspeech 2019,
2019, pp. 4110–4114.
V. S. Nallanthighal, A. Härmä, and H. Strik, “Speech
breathing estimation using deep learning methods,” in
ICASSP 2020 - 2020 IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP),
2020, pp. 1140–1144.
A. C. den Brinker, M. Coman, O. Ouweltjes, M. G. Crooks,
S. Thackray-Nocera, and A. H. Morice, “Performance
requirements for cough classifiers in real-world
applications,” in 2020 28th European Signal
Processing Conference (EUSIPCO). IEEE, 2021, pp.
96–100.
Justice Amoh and Kofi Odame, “Deep neural networks for
identifying cough sounds,” IEEE transactions on
biomedical circuits and systems, vol. 10, no. 5, pp.
1003–1011, 2016.
Yusuf A. Amrulloh, Udantha R. Abeyratne, Vinayak
Swarnkar, Rina Triasih, and Amalia Setyati,
“Automatic cough segmentation from non-contact
sound recordings in pediatric wards,” Biomedical
Signal Processing and Control, vol. 21, pp. 126–136,
2015.
Hui-Hui Wang, Jia-Ming Liu, Mingyu You, and Guo-
Zheng Li, “Audio signals encoding for cough
classification using convolutional neural networks: A
comparative study,” in 2015 IEEE International
Conference on Bioinformatics and Biomedicine
(BIBM). IEEE, 2015, pp. 442–445.
Sander Boelders, Venkata Srikanth Nallanthighal, Vlado
Menkovski, and Aki Harm¨ a,¨ “Detection of mild
dyspnea from pairs of speech recordings,” in ICASSP
2020-2020 IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP).
IEEE, 2020, pp. 4102–4106.
Thomas J. Hixon, Jere Mead, and Michael D. Goldman,
“Dynamics of the chest wall during speech production:
Function of the thorax, rib cage, diaphragm, and
abdomen,” Journal of speech and hearing research,
vol. 19, no. 2, pp. 297–356, 1976.
K. Konno and J. Mead, “Measurement of the separate
volume changes of rib cage and abdomen during
breathing,” Journal of Applied Physiology, vol. 22, no.
3, pp. 407–422, 1967, PMID: 4225383.
G. Fairbanks, “The rainbow passage,” Voice and
articulation drillbook, vol. 2, 1960.
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer,
James Bradbury, Gregory Chanan, Trevor Killeen,
Zeming Lin, Natalia
Gimelshein, Luca Antiga, Alban Desmaison, Andreas
Kopf, Edward Yang, Zachary DeVito, Martin Raison,
Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner,
Lu Fang, Junjie Bai, and Soumith Chintala, “Pytorch:
An imperative style, high-performance deep learning
library,” in
Advances in Neural Information Processing
Systems 32, H. Wallach, H. Larochelle, A.
Beygelzimer, F. d'Alche-Buc, E. Fox, and R. Garnett,´
Eds., pp. 8024–8035. Curran Associates, Inc., 2019.
Jurgen Schmidhuber, “Deep learning in neural networks:
An overview,”¨ Neural Networks, vol. 61, pp. 85 – 117,
2015.
Diederik P. Kingma and Jimmy Ba, “Adam: A method for
stochastic optimization,” Computing Research
Repository (CoRR), vol. abs/1412.6980, 2015.