A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO

Balaji Thoshkahna, K. R. Ramakrishnan

Abstract

We propose an algorithm for sound onset detection applying principles of psychoacoustics. A popular model of loudness perception in human auditory system is used to compute a novelty function that allows for a more robust detection of onsets. The psychoacoustics paradigm also allows us to define thresholds for the novelty function that are both physically and perceptually meaningful and hence easy to manipulate according to the application. The algorithm performs well with an overall accuracy of detection of 86% for monophonic audio and 82% for polyphonic audio.

References

  1. A.Klapuri (1999). Sound onset detection by applying psychoacoustic knowledge. IEEE Conference on Audio,Speech and Signal Processing (ICASSP).
  2. B.C.J.Moore, B.Glasberg, and T.Baer (1997). A model for the prediction of thresholds,loudness and partial loudness. Journal of Audio Engineering Society(JAES),Vol.45,No.4.
  3. B.Moore and B.Glasberg (1983). Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. Journal of the Acoustical Society of America(JASA),Vol.74,No.3.
  4. C.Duxbury, J.P.Bello, M.Davies, and M.Sandler (2003). Complex domain onset detection for musical signals. International Conference on Digital Audio Effects(DAFx).
  5. D.J.Hermes (1990). Vowel onset detection. Journal of the Acoustical Society of America(JASA),Vol.87,No.2.
  6. J.P.Bello, C.Duxbury, M.Davies, and M.Sandler (2004). On the use of phase and energy for musical onset detection in the complex domain. IEEE Signal Processing Letters,Vol.11,No.6.
  7. J.P.Bello, L.Daudet, S.Abdallah, C.Duxbury, M.Davies, and M.B.Sandler (2005). A tutorial on onset detection in music signals. IEEE Transactions on Speech and Audio Processing, Vol.13(No.5).
  8. J.P.Bello and M.Sandler (2003). Phase based note onset detection for music signals. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics(WASPAA).
  9. J.Timoney, T.Lysaght, M.Schoenwiesner, and L.McManus (2004). Implementing loudness models in matlab. International Conference on Digital Audio Effects(DAFx).
  10. Lee, W.-C. and Kuo, C. (2006). Improved linear prediction technique for musical onset detection. Intelligent Information Hiding and Multimedia Signal Processing(IIH-MSP).
  11. M.Gainza, E.Coyle, and B.Lawlor (2005). Onset detection using comb filters. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics(WASPAA).
  12. N.Collins (2005). A comparison of sound onset detection algorithms with emphasis on psychoacoustically motivated detection functions. Proceedings of Audio Engineering Society Convention.
  13. S.Dixon (2006). Onset detection revisited. International Conference on Digital Audio Effects(DAFx).
  14. Thoshkahna, B. and K.R.Ramakrishnan (2008). A psychoacoustics based sound onset detection algorithm for polyphonic audio. International Conference on Signal Processing(ICSP).
  15. W.Wang, Y.Luo, J.A.Chambers, and S.Sanei (2006). Nonnegative matrix factorization for note onset detection of audio signals. IEEE International Workshop on Machine Learning for Signal Processing(WMLSP).
  16. Zhou, R. and J.D.Reiss (2007). Music onset detection combining energy based and pitch based approaches. Music Information Retrieval Evaluation eXchange(MIREX).
Download


Paper Citation


in Harvard Style

Thoshkahna B. and R. Ramakrishnan K. (2009). A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO . In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2009) ISBN 978-989-674-007-8, pages 94-99. DOI: 10.5220/0002238400940099


in Bibtex Style

@conference{sigmap09,
author={Balaji Thoshkahna and K. R. Ramakrishnan},
title={A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2009)},
year={2009},
pages={94-99},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002238400940099},
isbn={978-989-674-007-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2009)
TI - A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO
SN - 978-989-674-007-8
AU - Thoshkahna B.
AU - R. Ramakrishnan K.
PY - 2009
SP - 94
EP - 99
DO - 10.5220/0002238400940099