GA-BASED APPROACH TO PITCH RECOGNITION OF MUSICAL
CONSONANCE
Masanori Natsui, Shunichi Kubo and Yoshiaki Tadokoro
Dept. of Information and Computer Sciences, Toyohashi University of Technology
1-1 Hibarigaoka, Tempaku-cho, Toyohashi-shi, Aichi 441-8580, Japan
Keywords:
Pitch recognition, Genetic algorithm, Musical consonance, Template matching.
Abstract:
This paper presents a novel method for the pitch recognition of the musical consonance (i.e., unison or octave)
using genetic algorithm (GA). GA is a kind of optimization techniques based on natural selection and genetics.
In our method, the pitch recognition is performed by the following two-step procedure: (i) search space
reduction using the comb filter estimation, and (ii) evolutionary parameter estimation of tone parameters such
as notes and volumes by minimizing error between a target waveform and a synthesized waveform using sound
templates with estimated parameters. The potential capability of the system is demonstrated through the pitch
estimation of randomly-generated consonances. Experimental results show that the system can successfully
estimate chords with more than 84% success rate for two-note consonances, and more than 71% success rate
for three-note consonances.
1 INTRODUCTION
Automatic music transcription is important for many
applications including music archival, music retrieval,
supports of music composition/arrangement, and also
significant problems in machine perception(Sterian
and Wakefield, 2000; Pollastri, 2002; Roads, 1985;
Roads, 1996; Piszczalski and Galler, 1977). The
study of automatic musical transcription can be clas-
sified into some categories, and that of the pitch detec-
tion is the most important task and many studies have
been done. Most of old studies are for monophony,
and based on the spectrum analysis using the fast
Fourier transform (FFT). On the other hand, the novel
technologies such as neural network, fuzzy logic, and
hidden Marcov model have also been proposed in the
recent studies(Klapuri, 2003).
For the pitch estimation of polyphonic sounds, we
have proposed a unique method based on comb filters
(H(z) = 1 − z
−N
)(Tadokoro and Yamaguchi, 2001;
Tadokoro et al., 2002; Tadokoro et al., 2003). The
comb filter can eliminate a fundamental frequency
and its harmonic components of a sound by simple
subtraction. So far, we have presented that cascade
or parallel connections of the comb filters enable the
polyphonic pitch estimation and can be effective for
the realization of the automatic music transcription
system.
A difficult problem in the polyphonic pitch estima-
tion is that some frequency components of one note
may be overlapped with harmonics of other notes.
In fact, composers often use chords containing notes
that have a simple ratio between their fundamental
frequencies, such as 1:1 (perfect unison), 2:1 (per-
fect octave), or 3:2 (perfect fifth), since these codes
called consonances typically produce sounds which
are pleasing to the human ear. If one note having a
fundamental frequency of f Hz and another note hav-
ing that of 2f Hz are produced at the same time, then
every harmonic of the upper note will be overlapped
to the even harmonics of the lower note (Fig. 1). To
infer the presence of the upper note, we have to use
some other information which is obtained by a tech-
nique except traditional methods such as spectrum
analysis.
From this viewpoint, we propose a unique method
of the pitch estimation based on genetic algorithm
(GA). GA is an optimization algorithm based on a
model of evolution in life. In this paper, we demon-
strate the possibility of the GA-based pitch estimation
method through the experimental pitch estimation of
musical consonance. The key ideas presented here
are: (i) time-domain template matching based on GA,
and (ii) search space reduction using the pitch estima-
47
Natsui M., Kubo S. and Tadokoro Y. (2006).
GA-BASED APPROACH TO PITCH RECOGNITION OF MUSICAL CONSONANCE.
In Proceedings of the Third International Conference on Informatics in Control, Automation and Robotics, pages 47-52
DOI: 10.5220/0001209400470052
Copyright
c
SciTePress