A SIMILARITY MEASURE FOR MUSIC SIGNALS
Gonc¸alo Marques
Instituto Superior de Engenharia de Lisboa, Portugal
Thibault Langlois
Universidade de Lisboa, Faculdade de Ciˆencias, Departamento de Informatica, Portugal
Keywords:
Music Information Retrieval, Music Similarity Measure, Audio Signal Processing, Feature Extraction.
Abstract:
One of the goals in the field of Music Information Retrieval is to obtain a measure of similarity between two
musical recordings. Such a measure is at the core of automatic classification, query, and retrieval systems,
which have become a necessity due to the ever increasing availability and size of musical databases. This
paper proposes a method for calculating a similarity distance between two music signals. The method extracts
a set of features from the audio recordings, models the features, and determines the distance between models.
While further work is needed, preliminary results show that the proposed method has the potential to be used
as a similarity measure for musical signals.
1 INTRODUCTION
Nowadays there is an enormous amount of digital mu-
sic available on-line, and users can search through
vast databases to select their favorite albums, artists,
songs, and create their own databases or playlists.
Even at a personal level, one can create fairly large
music collections by transferring ones CDs to a com-
puter or an iPod. Nevertheless, with the rapidly in-
creasing amount of digital data it is necessary to have
some means of indexing,searching and retrieving mu-
sic contents. Theses tasks are aided by including
some information along with a song (metadata), typ-
ically annotated manually by an expert or by the user.
Nevertheless, metadata is not always provided or in
some cases is erroneous, and with the every increas-
ing number of new songs, the required manual work
becomes prohibitive.
Similarity is the core of classification and ranking
algorithms, thus, having an automatic way of mea-
suring similarities between two songs would be a
valuable tool in the field of Music Information Re-
trieval. Such a tool would have many applications
such as making database queries by user-provided ex-
amples (Spevak and Favreau, 2002; Heln and Virta-
nen, 2007), automatically organizing and classifying
digital audio collections (Neumayer et al., 2005), au-
tomatic playlist generation (Aucouturier and Pachet,
2002b; Logan and Salomon, 2001), providing per-
sonal musical recommendations, etc.
In order to measure the similarity between songs
it is necessary to characterize each song with a set
of features and to determine a distance between sets.
There is an extensive number of features that can be
used for this purpose, since the question of similarity
can be answered from multiple perspectives. For in-
stance, one could include features that are not directly
related to the audio signals, such as lyrical contents,
geographical origins, historical periods, artists infor-
mation, reviews, etc. This type of information is well
suited for Web-based methods, and a various works
exist on this subject - for example (Whitman and Ellis,
2004; Baumann et al., 2004; Pampalk et al., 2005a).
In this paper we are interested in deriving a mea-
sure of similarity solely based on the music signal,
without any additional information. There are several
features that can be extracted directly from the audio
signal, and there are many ways of using them to ob-
tain a similarity measure between songs. The most
common approach for obtaining the features is to di-
vide the signal into short overlapping frames (typi-
cally 10ms to 40ms long) and use each frame to ex-
tract time domain information such as the zero cross-
ing rate, or some spectral domain information such
as the fast Fourier transform (FFT), or the Mel fre-
quency cepstrum coefficients (MFCCs). These can be
directly used as features vectors, or one can be incor-
porated some additional statistics of each audio seg-
ment such as the spectral centroid, spectral flux, his-
tograms, etc. Once the features are extracted, there
308
Marques G. and Langlois T. (2008).
A SIMILARITY MEASURE FOR MUSIC SIGNALS.
In Proceedings of the Tenth International Conference on Enterprise Information Systems - AIDSS, pages 308-312
DOI: 10.5220/0001707903080312
Copyright
c
SciTePress