AN APPROACH TO THE SEMANTIC MODELING OF AUDIO DATABASES

Mustafa Sert, Buyurman Baykal

Abstract

The modeling of multimedia databases for multimedia information systems is a complicated task. The designer has to model the structure and the dynamic behavior of multimedia objects, as well as the interactions between them. In this paper, we present a data model for audio database applications in the context of MPEG-7. The model is based on the object-oriented paradigm and as well as low-level and high-level signal features, which are standardized within the MPEG-7 framework, thus enabling interoperability of data resources. The model consists of two parts: a structural model, which provides a structural view of raw audio data, and an interpretation model, which allows semantic labels to be associated with audio data. We make use of an object-oriented approach to capture the audio events and objects in our model. Compared to similar models, particular attention is paid to integration issues of the model with commercial database management systems. Temporal relations between audio objects and events are also considered in this study.

References

  1. A. Ghias, J. Logan, e. a. (1995). Query-by-hummingmusical information retrieval in an audio database. In ACM Multimedia Conference. Proc. ACM.
  2. A. Hampapur, e. a. (1997). Virage video engine. SPIE, 3022.
  3. A. Woudstra, e. a. (1998). Modeling and retrieving audiovisual information. LNCS, 1508.
  4. Adam T. Lindsay, e. a. (2000). Representation and linking mechanism for audio in mpeg-7. Signal Processing: Image Communication, 16:193-209.
  5. Allen, J. (1983). Maintaining knowledge about temporal intervals. Communications of ACM, 26(11):832-843.
  6. E. Wold, T. Blum, e. a. (1996). Content-basedclassification, search, and retrieval of audio. IEEE Multimedia, pages 27-36.
  7. Foote, J. (1997). Content-based retrieval of music and audio. In Proceedings of SPIE'97.
  8. G. Amato, e. a. (1998). An approach to a content-based retrieval of multimedia data. Multimedia Tools and Applications, 7(1/2):5-36.
  9. Ghafoor, A. (1994). Multimedia database course notes. In ACM Multimedia Conference.
  10. Grosky, W. (1997). Managing multimedia information in database systems. Communications of the ACM, 40(12):73-80.
  11. Gudivada, V. and Raghavan, V. (1995). Content-based image retrieval systems: Guest editors' introduction. IEEE Computer, pages 18-22.
  12. John R. Smith, A. B. B. (2000). Conceptual modeling of audio-visual content. In IEEE International Conference on Multimedia and Expo (II), pages 915-. IEEE Press.
  13. J.Z. Li, M.T. Ozsu, e. a. (1997). MOQL: A Multimedia Object Query Language. In The 3rd International Workshop on Multimedia Information Systems.
  14. L. Lu, H.J. Zhang, e. a. (2003). Content-based audio classification and segmentation by using support vector machines. Multimedia Systems, 8:482-492.
  15. M. Flinker, e. a. (1995). Query by image and video content: The qbic system. IEEE Computer, 28:23-32.
  16. MPEG-7 (1999). Mpeg-7 requirements document v.8, iso/iec jtc1/sc29/wg11/n2727. Technical report, Seoul Meeting.
  17. MPEG-7 (2001). Multimedia content description interface - part4: Audio, iso/iec jtc1/sc29n. Technical report, MPEG-7.
  18. Oracle (2000). User's guide and reference: Oracle intermedia audio, image, and video. Technical report, Oracle.
  19. P. Salembier, e. a. (1999). Video ds. Proposal P185, P186, MPEG-7 Lancaster Meeting.
  20. Petkovic, M. and Jonker, W. (2000). An overview of data models and query languages for content-based video retrieval. In International Conference on Advances in Infrastructure for Electronic Business, Science, and Education on the Internet.
  21. R. Weiss, e. a. (1994). Content-based access to algebraic video. In Proc. of Int. Conf. on Multimedia Computing and Systems, pages 140-151. IEEE Press.
  22. Sert, M. and Baykal, B. (2003). A web model for querying, storing, and processing multimedia content. In IKS'03, International Conference on Information and Knowledge Sharing. ACTA Press.
Download


Paper Citation


in Harvard Style

Sert M. and Baykal B. (2004). AN APPROACH TO THE SEMANTIC MODELING OF AUDIO DATABASES . In Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE, ISBN 972-8865-15-5, pages 385-390. DOI: 10.5220/0001401503850390


in Bibtex Style

@conference{icete04,
author={Mustafa Sert and Buyurman Baykal},
title={AN APPROACH TO THE SEMANTIC MODELING OF AUDIO DATABASES},
booktitle={Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE,},
year={2004},
pages={385-390},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001401503850390},
isbn={972-8865-15-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE,
TI - AN APPROACH TO THE SEMANTIC MODELING OF AUDIO DATABASES
SN - 972-8865-15-5
AU - Sert M.
AU - Baykal B.
PY - 2004
SP - 385
EP - 390
DO - 10.5220/0001401503850390