
 
6 RESULTS 
To compare the various similarity measures with 
each other and with human judgments, the 
dissimilarity between every pair of rhythms in the 
data was first calculated, obtaining a distance matrix. 
Then a statistical procedure called the Mantel test 
was used for calculating the correlation coefficients 
between pairs of these distance matrices. The results 
of the Mantel tests for the 16-pulse and 8-pulse 
rhythms are listed in Tables 3 and 4, respectively, 
where the statistically significant results are shown 
in boldface type, and the asterisk indicates results 
that were obtained in previous studies. Space 
limitations do not permit the duplication here of the 
description of the human listening tests performed 
(for this see Toussaint et al., 2011).  
Table 3: Mantel test results for the 16-pulse rhythms. 
 
Human 
Judgment 
Edit Distance 
Statistical Features 
r = -0.07 
p = 0.47 
r = -0.14 
p = 0.28 
Stat. Features and 
nPVI 
r = 0.02 
p = 0.44 
r = -0.09 
p = 0.42 
nPVI only 
r = 0.24 
p = 0.21 
r = 0.07 
p = 0.43 
Normalized 
Mallows Distance 
r = 0.70 
p = 0.02 
r = 0.35 
p = 0.2 
Edit Distance* 
r = 0.76 
p = 0.02 
− 
Of all the experiments performed with the 16-pulse 
rhythms, only the Mallows distance gave statistically 
significant results, correlating highly with human 
judgments (r = 0.70, p = 0.02). This is almost as 
high as the previous result obtained with the edit 
distance (r = 0.76, p = 0.02) calculated directly on 
the rhythms themselves (Toussaint et al., 2011). 
Note that in this corpus all the rhythms have the 
same number of onsets, and therefore corpus 
normalization is equivalent to pairwise 
normalization. 
By contrast with the 16-pulse rhythms, all the 
experiments with the 8-pulse rhythms, yielded mild 
but statistically significant correlations with human 
judgments. The nPVI, a successful measure of 
rhythm complexity (Toussaint, 2012), gave the 
lowest correlation (r = 0.25, p = 0.04) when used in 
isolation, and all the other models yielded 
correlation coefficients ranging between 0.48 and 
0.43.  This represents a significant drop from the 
previously obtained result with the edit distance (r = 
0.59, p = 0.0002) when calculated directly on the 
Table 4: Mantel test results for the 8-pulse rhythms. 
 
Human 
Judgment 
Edit Distance 
Statistical Features 
r = 0.43 
p = 0.006 
r = 0.57 
p = 0.003 
Stat. Features and  
nPVI 
r = 0.46 
p = 0.003 
r = 0.57 
p = 0.003
nPVI only 
r = 0.25 
p = 0.04 
r = -0.05 
p = 0.44 
Corpus-Normalized 
Gen. Mallows Dist. 
r = 0.45 
p = 0.003 
r = 0.41 
p = 0.03
Pairwise-Normalized 
Gen. Mallows Dist. 
r = 0.48 
p = 0.001 
r = 0.21 
p = 0.1 
Edit Distance* 
r = 0.59 
p = 0.0002 
− 
rhythm sequences (Toussaint et al., 2012). In this 
corpus the number of onsets in the rhythms varies 
considerably, and therefore the results with the 
corpus and pairwise normalizations differ a little. 
Surprisingly, the statistical features calculated from 
the IOI histograms correlate quite highly with the 
edit distance. 
7 CONCLUSIONS 
One of the main conclusions we can draw from this 
study is that the statistical features calculated from 
the inter-onset interval histograms, used by Gouyon 
et al. (2004) in the context of music information 
retrieval, are much better than the music-theoretical 
structural features investigated previously by 
Toussaint et al., 2012), for predicting human 
judgments of rhythm similarity. The Mallows 
distance computed from the IOI histograms gave the 
best results obtained here, providing further 
evidence to support the hypothesis that 
transformation methods are superior to feature-based 
methods as tools for predicting human judgments of 
similarity.  
REFERENCES 
Duda, R. O., Hart, P. E., & Stork, D. G., 2000. Pattern 
Classification, Wiley-Interscience, 2
nd
 Edition. 
Gouyon, F., Dixon, S., Pampalk, E., & Widmer, G., 2004. 
Evaluating rhythmic descriptors for musical genre 
classification. Proc. 25th Int. AES Conference. 
Hahn, U., Chater, N., & L. B. Richardson, L. B., 2003. 
Similarity as transformation. Cognition, 87, 1-32. 
Levina, E. & Bickel, P. (2001). The earth mover’s distance 
is the Mallows distance: Some insights from statistics. 
Proceedings Eighth IEEE International Conference on
MeasuringMusicalRhythmSimilarity-StatisticalFeaturesversusTransformationMethods
597