CONTEXT AWARENESS USING ENVIRONMENTAL SOUND CUES AND COMMONSENSE KNOWLEDGE

Mostafa Al Masum Shaikh, Keikichi Hirose, Helmut Prendinger

2009

Abstract

Detecting or inferring human activity (e.g., an outdoor activity) from sensor data is often inaccurate, insufficient, difficult, and expensive. This paper therefore describes an approach that infers human activity and location from environmental sound cues and commonsense knowledge about the use of everyday objects. Our system uses mel-frequency cepstral coefficients (MFCC) and their derivatives as features, and continuous-density hidden Markov models (HMM) as acoustic models. Our work differs from others in three key ways. First, we use both indoor and outdoor environmental sound cues, annotated according to the objects associated with each sound sample, to build a mapping between sounds and the objects that produce them. Second, by using a portable microphone instead of a fixed microphone array to capture environmental sound, we can also infer outdoor environments such as being on the road or in a train station, which previous research was limited in its ability to do. Third, our model can easily be extended with new sets of activities by adding appropriately annotated sound clips and re-training the HMM-based recognizer. A perceptual test was conducted to measure human accuracy on the task and to obtain a baseline for assessing the system's performance. Although the system performs somewhat worse than humans in direct comparison, the preliminary results are encouraging, with accuracy rates above 67% and 61% for outdoor and indoor sound categories of activities, respectively.
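To make the feature and model pipeline named in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes MFCC frames are already available as a NumPy array, computes derivative (delta) features with the usual regression formula, and scores a sequence against per-class HMMs with diagonal-covariance Gaussian emissions using the forward algorithm. The function names, the window width, and the single-Gaussian-per-state emission model are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

def delta(features, width=2):
    """Regression-based derivative over +/- `width` frames.
    `features` has shape (T, D); edges are padded by repeating the end frames."""
    features = np.asarray(features, dtype=float)
    T = features.shape[0]
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = np.zeros_like(features)
    for n in range(1, width + 1):
        ahead = padded[width + n : width + n + T]    # frame t + n
        behind = padded[width - n : width - n + T]   # frame t - n
        out += n * (ahead - behind)
    return out / denom

def log_gauss_diag(x, means, variances):
    """Log density of each frame under each state's diagonal Gaussian.
    x: (T, D); means, variances: (S, D); returns (T, S)."""
    diff = x[:, None, :] - means[None, :, :]
    return -0.5 * np.sum(
        np.log(2 * np.pi * variances)[None] + diff**2 / variances[None], axis=2
    )

def hmm_log_likelihood(x, log_pi, log_A, means, variances):
    """Forward algorithm in the log domain; returns log p(x | model)."""
    log_b = log_gauss_diag(x, means, variances)
    alpha = log_pi + log_b[0]
    for t in range(1, len(x)):
        # Sum over previous states i of alpha[i] * A[i, j], in log space.
        alpha = np.logaddexp.reduce(alpha[:, None] + log_A, axis=0) + log_b[t]
    return np.logaddexp.reduce(alpha)

def classify(x, models):
    """Pick the label whose HMM gives the sequence the highest log-likelihood.
    `models` maps a (hypothetical) label to (log_pi, log_A, means, variances)."""
    scores = {label: hmm_log_likelihood(x, *p) for label, p in models.items()}
    return max(scores, key=scores.get)
```

In use, the static MFCC frames and their deltas would typically be stacked column-wise, for example `np.hstack([mfcc, delta(mfcc)])`, before being scored against one trained model per sound category. Adding a new category then amounts to training one more model and adding it to `models`.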



Paper Citation


in Harvard Style

Al Masum Shaikh M., Hirose K. and Prendinger H. (2009). CONTEXT AWARENESS USING ENVIRONMENTAL SOUND CUES AND COMMONSENSE KNOWLEDGE. In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2009) ISBN 978-989-674-007-8, pages 193-196. DOI: 10.5220/0002230901930196


in Bibtex Style

@conference{sigmap09,
author={Mostafa Al Masum Shaikh and Keikichi Hirose and Helmut Prendinger},
title={CONTEXT AWARENESS USING ENVIRONMENTAL SOUND CUES AND COMMONSENSE KNOWLEDGE},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2009)},
year={2009},
pages={193-196},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002230901930196},
isbn={978-989-674-007-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2009)
TI - CONTEXT AWARENESS USING ENVIRONMENTAL SOUND CUES AND COMMONSENSE KNOWLEDGE
SN - 978-989-674-007-8
AU - Al Masum Shaikh M.
AU - Hirose K.
AU - Prendinger H.
PY - 2009
SP - 193
EP - 196
DO - 10.5220/0002230901930196