Singing Voice Detection Based on a Deeper Convolutional Neural Network

Wenming Gui, Zeyu Xia, Rubin Gong, Gui Wang, Bingxu Chen, Donghui Zhang

2022

Abstract

Singing voice detection is a fundamental task in music information retrieval and benefits other tasks such as singing voice separation. We propose a new algorithm based on a deeper convolutional neural network, fed with the logarithmic mel-scaled spectrogram, that extracts and integrates features from the different layers of the network and finally discriminates the singing voice. We demonstrate that this deeper network can achieve good performance and, to some extent, can be designed efficiently. The experiments are based on the public datasets Jamendo, MIR-1K, and RWC Pop, and on their combined dataset. We also study what network depth is suitable for this task. The experiments show that the optimal depth on the four public datasets is 152.
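
As an illustration of the input representation named in the abstract, the following is a minimal sketch of a logarithmic mel-scaled spectrogram computed with NumPy. It is not the authors' implementation: the sample rate, FFT size, hop size, number of mel bands, the mel formula variant, and all function names are assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np

def hz_to_mel(f):
    # One common mel-scale formula (assumed; the abstract does not name a variant)
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    # Triangular filters whose centers are equally spaced on the mel scale
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    hz_pts = mel_to_hz(mel_pts)
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, ctr, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (fft_freqs - lo) / (ctr - lo)
        falling = (hi - fft_freqs) / (hi - ctr)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr=22050, n_fft=1024, hop=315, n_mels=80):
    # All default parameter values here are illustrative assumptions
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[t * hop: t * hop + n_fft] * window
                       for t in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2      # (frames, fft bins)
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T     # (frames, mel bands)
    return np.log(mel + 1e-7)                             # log compression
```

Calling `log_mel_spectrogram(x)` on a one-dimensional audio array `x` returns a two-dimensional array of shape (frames, mel bands), the kind of time-frequency input a convolutional network could be fed.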


Paper Citation


in Harvard Style

Gui W., Xia Z., Gong R., Wang G., Chen B. and Zhang D. (2022). Singing Voice Detection Based on a Deeper Convolutional Neural Network. In Proceedings of the 3rd International Symposium on Automation, Information and Computing - Volume 1: ISAIC; ISBN 978-989-758-622-4, SciTePress, pages 336-341. DOI: 10.5220/0011924600003612


in Bibtex Style

@conference{isaic22,
author={Wenming Gui and Zeyu Xia and Rubin Gong and Gui Wang and Bingxu Chen and Donghui Zhang},
title={Singing Voice Detection Based on a Deeper Convolutional Neural Network},
booktitle={Proceedings of the 3rd International Symposium on Automation, Information and Computing - Volume 1: ISAIC},
year={2022},
pages={336-341},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011924600003612},
isbn={978-989-758-622-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 3rd International Symposium on Automation, Information and Computing - Volume 1: ISAIC
TI - Singing Voice Detection Based on a Deeper Convolutional Neural Network
SN - 978-989-758-622-4
AU - Gui W.
AU - Xia Z.
AU - Gong R.
AU - Wang G.
AU - Chen B.
AU - Zhang D.
PY - 2022
SP - 336
EP - 341
DO - 10.5220/0011924600003612
PB - SciTePress