6 RESULTS
To compare the various similarity measures with
each other and with human judgments, the
dissimilarity between every pair of rhythms in the
data was first calculated, obtaining a distance matrix.
Then a statistical procedure called the Mantel test
was used for calculating the correlation coefficients
between pairs of these distance matrices. The results
of the Mantel tests for the 16-pulse and 8-pulse
rhythms are listed in Tables 3 and 4, respectively,
where the statistically significant results are shown
in boldface type, and the asterisk indicates results
that were obtained in previous studies. Space
limitations do not permit the duplication here of the
description of the human listening tests performed
(for this see Toussaint et al., 2011).
Table 3: Mantel test results for the 16-pulse rhythms.
Human
Judgment
Edit Distance
Statistical Features
r = -0.07
p = 0.47
r = -0.14
p = 0.28
Stat. Features and
nPVI
r = 0.02
p = 0.44
r = -0.09
p = 0.42
nPVI only
r = 0.24
p = 0.21
r = 0.07
p = 0.43
Normalized
Mallows Distance
r = 0.70
p = 0.02
r = 0.35
p = 0.2
Edit Distance*
r = 0.76
p = 0.02
−
Of all the experiments performed with the 16-pulse
rhythms, only the Mallows distance gave statistically
significant results, correlating highly with human
judgments (r = 0.70, p = 0.02). This is almost as
high as the previous result obtained with the edit
distance (r = 0.76, p = 0.02) calculated directly on
the rhythms themselves (Toussaint et al., 2011).
Note that in this corpus all the rhythms have the
same number of onsets, and therefore corpus
normalization is equivalent to pairwise
normalization.
By contrast with the 16-pulse rhythms, all the
experiments with the 8-pulse rhythms, yielded mild
but statistically significant correlations with human
judgments. The nPVI, a successful measure of
rhythm complexity (Toussaint, 2012), gave the
lowest correlation (r = 0.25, p = 0.04) when used in
isolation, and all the other models yielded
correlation coefficients ranging between 0.48 and
0.43. This represents a significant drop from the
previously obtained result with the edit distance (r =
0.59, p = 0.0002) when calculated directly on the
Table 4: Mantel test results for the 8-pulse rhythms.
Human
Judgment
Edit Distance
Statistical Features
r = 0.43
p = 0.006
r = 0.57
p = 0.003
Stat. Features and
nPVI
r = 0.46
p = 0.003
r = 0.57
p = 0.003
nPVI only
r = 0.25
p = 0.04
r = -0.05
p = 0.44
Corpus-Normalized
Gen. Mallows Dist.
r = 0.45
p = 0.003
r = 0.41
p = 0.03
Pairwise-Normalized
Gen. Mallows Dist.
r = 0.48
p = 0.001
r = 0.21
p = 0.1
Edit Distance*
r = 0.59
p = 0.0002
−
rhythm sequences (Toussaint et al., 2012). In this
corpus the number of onsets in the rhythms varies
considerably, and therefore the results with the
corpus and pairwise normalizations differ a little.
Surprisingly, the statistical features calculated from
the IOI histograms correlate quite highly with the
edit distance.
7 CONCLUSIONS
One of the main conclusions we can draw from this
study is that the statistical features calculated from
the inter-onset interval histograms, used by Gouyon
et al. (2004) in the context of music information
retrieval, are much better than the music-theoretical
structural features investigated previously by
Toussaint et al., 2012), for predicting human
judgments of rhythm similarity. The Mallows
distance computed from the IOI histograms gave the
best results obtained here, providing further
evidence to support the hypothesis that
transformation methods are superior to feature-based
methods as tools for predicting human judgments of
similarity.
REFERENCES
Duda, R. O., Hart, P. E., & Stork, D. G., 2000. Pattern
Classification, Wiley-Interscience, 2
nd
Edition.
Gouyon, F., Dixon, S., Pampalk, E., & Widmer, G., 2004.
Evaluating rhythmic descriptors for musical genre
classification. Proc. 25th Int. AES Conference.
Hahn, U., Chater, N., & L. B. Richardson, L. B., 2003.
Similarity as transformation. Cognition, 87, 1-32.
Levina, E. & Bickel, P. (2001). The earth mover’s distance
is the Mallows distance: Some insights from statistics.
Proceedings Eighth IEEE International Conference on
MeasuringMusicalRhythmSimilarity-StatisticalFeaturesversusTransformationMethods
597