Singing Voice Detection Based on a Deeper Convolutional Neural Network
Wenming Gui, Zeyu Xia, Rubin Gong, Gui Wang, Bingxu Chen, Donghui Zhang
2022
Abstract
Singing voice detection is a fundamental task in music information retrieval, which benefits other tasks such as singing voice separation. We propose a new algorithm based on a deeper convolution neural network, fed with the logarithmic and mel-scaled spectrogram, to exact and integrate the features of the different layers of the network and to discriminate the singing voice finally. We demonstrate that this deeper network can produce good performances and be designed efficiently to some extent. The experiments are based on the public datasets: Jamendo, Mir1k, RWC pop, and their combined dataset. We also studied what depth of the network is suitable for this task. The experiments show that the optimal depth on the four public datasets is 152.
DownloadPaper Citation
in Harvard Style
Gui W., Xia Z., Gong R., Wang G., Chen B. and Zhang D. (2022). Singing Voice Detection Based on a Deeper Convolutional Neural Network. In Proceedings of the 3rd International Symposium on Automation, Information and Computing - Volume 1: ISAIC; ISBN 978-989-758-622-4, SciTePress, pages 336-341. DOI: 10.5220/0011924600003612
in Bibtex Style
@conference{isaic22,
author={Wenming Gui and Zeyu Xia and Rubin Gong and Gui Wang and Bingxu Chen and Donghui Zhang},
title={Singing Voice Detection Based on a Deeper Convolutional Neural Network},
booktitle={Proceedings of the 3rd International Symposium on Automation, Information and Computing - Volume 1: ISAIC},
year={2022},
pages={336-341},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011924600003612},
isbn={978-989-758-622-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 3rd International Symposium on Automation, Information and Computing - Volume 1: ISAIC
TI - Singing Voice Detection Based on a Deeper Convolutional Neural Network
SN - 978-989-758-622-4
AU - Gui W.
AU - Xia Z.
AU - Gong R.
AU - Wang G.
AU - Chen B.
AU - Zhang D.
PY - 2022
SP - 336
EP - 341
DO - 10.5220/0011924600003612
PB - SciTePress