Large Scale Similar Song Retrieval using Beat-aligned Chroma Patch Codebook with Location Verification

Yijuan Lu, Joseph E. Cabrera

Abstract

With the popularity of song search applications on Internet and mobile phone, large scale similar song search has been attracting more and more With the popularity of song search applications on Internet and mobile phone, large scale similar song search has been attracting more and more attention in recent years. Similar songs are created by altering the volume levels, timing, amplification, or layering other songs on top of an original song. Given the large scale of songs uploaded on the Internet, it is demanding but challenging to identify these similar songs in a timely manner. Recently, some state-of-the-art large scale music retrieval approaches represent songs with a bag of audio words by quantizing local features, such as beat-chroma patches, solely in the feature space. However, feature quantization reduces the discriminative power of local features, which causes many false audio words matches. In addition, the location clues among audio words in a song is usually ignored or exploited for full location verification, which is computationally expensive. In this paper, we focus on similar song retrieval, and propose to utilize beat-aligned chroma patches for large scale similar song retrieval and apply location coding scheme to encode the location relationships among beat-aligned chroma patches in a song. Our approach is both efficient and effective to discover true matches of beat chroma patches between songs with low computational cost. Experiments in similar songs search on a large song database reveal the promising results of our approach.

References

  1. Arthur, D. and Vassilvitskii, S., 2006. How slow is the kmeans method? In Proceedings of the twenty-second annual symposium on Computational geometry, SCG 7806, pages 144-153, New York, NY, USA.
  2. Bertin-Mahieux,T., Grindlay, G., Weiss, R., and Ellis, D., 2011. Evaluating music sequence models through missing data. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
  3. Bertin-Mahieux,T., Weiss, R., and Ellis, D., 2010, Clustering beat-chroma patterns in a large music database. 11th International Conference on Music Information Retrieval (ISMIR).
  4. Casey, M., and Slaney, M., 2007. Fast recognition of remixed music audio. In Proceedings of ICASSP.
  5. Ellis, D. and Poliner, G., 2007. Identifying “cover songs” with chroma features and dynamic programming beat tracking. Proceedings of ICASSP.
  6. Maddage, N. C., Xu, C., Kankanhalli, M. S., and Shao, X., 2004. Content-based music structure analysis with applications to music semantics understanding. ACM Multimedia, pages 112-119, New York, NY, USA.
  7. Marszalek, M., Schmid,C., Harzallah,H., and Van De Weijer, J., 2007. Learning object representations for visual object class recognition. Visual Recognition Challenge workshop, in conjunction with ICCV.
  8. Nister, D. and Stewenius, H., 2006. Scalable recognition with a vocabulary tree. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). New York, pages 2161-2168.
  9. Seyerlehner, K., Widmer, G., and Knees, P., 2008. Frame level audio similarity-a codebook approach. Proceddings of the 11th International Conference on Digital Audio Effects.
  10. Van Gemert, J. C., Veenman, C. J., Smeulders, A. W. M., and Geusebroek, J.M., 2010. Visual word ambiguity. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(7):1271-1283.
  11. Vedaldi, A. and Fulkerson, B., 2010. Vlfeat: an open and portable library of computer vision algorithms. In Proceedings of the International Conference on Multimedia, pages 1469-1472, New York. USA.
  12. Wu, Z., Ke, Q., Isard, M., and Sun, J., 2009. Bundling features for large scale partial-duplicate web image search. In Proceedings of CVPR.
  13. Zhou, W., Lu, Y., Li, H., Song, Y., Tian, Q., 2010. Spatial coding for large scale partial-duplicate web image search. ACM Multimedia, Florence, Italy.
Download


Paper Citation


in Harvard Style

Lu Y. and E. Cabrera J. (2012). Large Scale Similar Song Retrieval using Beat-aligned Chroma Patch Codebook with Location Verification . In Proceedings of the International Conference on Signal Processing and Multimedia Applications and Wireless Information Networks and Systems - Volume 1: SIGMAP, (ICETE 2012) ISBN 978-989-8565-25-9, pages 208-214. DOI: 10.5220/0004129802080214


in Bibtex Style

@conference{sigmap12,
author={Yijuan Lu and Joseph E. Cabrera},
title={Large Scale Similar Song Retrieval using Beat-aligned Chroma Patch Codebook with Location Verification},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications and Wireless Information Networks and Systems - Volume 1: SIGMAP, (ICETE 2012)},
year={2012},
pages={208-214},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004129802080214},
isbn={978-989-8565-25-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications and Wireless Information Networks and Systems - Volume 1: SIGMAP, (ICETE 2012)
TI - Large Scale Similar Song Retrieval using Beat-aligned Chroma Patch Codebook with Location Verification
SN - 978-989-8565-25-9
AU - Lu Y.
AU - E. Cabrera J.
PY - 2012
SP - 208
EP - 214
DO - 10.5220/0004129802080214