Semi-supervised Audio Source Separation based on the Iterative Estimation and Extraction of Note Events

Alejandro Delgado Castro; John E. Szymanski

doi:10.5220/0007828002730279

Semi-supervised Audio Source Separation based on the Iterative Estimation and Extraction of Note Events

Alejandro Delgado Castro, John E. Szymanski

2019

Abstract

In this paper, we present an iterative semi-automatic audio source separation process for single-channel polyphonic recordings, where the underlying sources are isolated by clustering a set of note events, which are considered to be single notes or groups of consecutive notes coming from the same source. In every iteration, an automatic process detects the pitch trajectory of the predominant note event in the mixture, and separates its spectral content from the mixed spectrogram. The predominant note event is then transformed back to the time-domain and subtracted from the input mixture. The process repeats using the residual as the new input mixture, until a predefined number of iterations is reached. When the iterative stage is complete, note events are clustered by the end-user to form individual sources. Evaluation is conducted on mixtures of real instruments and compared with a similar approach, revealing an improvement in separation quality.

Download

Paper Citation

in Harvard Style

Delgado Castro A. and Szymanski J. (2019). Semi-supervised Audio Source Separation based on the Iterative Estimation and Extraction of Note Events.In Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP, ISBN 978-989-758-378-0, pages 273-279. DOI: 10.5220/0007828002730279

in Bibtex Style

@conference{sigmap19,
author={Alejandro Delgado Castro and John Szymanski},
title={Semi-supervised Audio Source Separation based on the Iterative Estimation and Extraction of Note Events},
booktitle={Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP,},
year={2019},
pages={273-279},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007828002730279},
isbn={978-989-758-378-0},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP,
TI - Semi-supervised Audio Source Separation based on the Iterative Estimation and Extraction of Note Events
SN - 978-989-758-378-0
AU - Delgado Castro A.
AU - Szymanski J.
PY - 2019
SP - 273
EP - 279
DO - 10.5220/0007828002730279