Finally, two Multimedia Content Analysis (MCA)
modules extract high-level semantics such as the
Scene Boundary Detector (Nesvadba, 2004-1),
which segment the content items into meaningful
semantic scenes. In parallel, a Commercial Block
Detector (Dimitrova, 2002) indexes commercial
instances.
The central component, indicated by
PNX,
performs both low-level feature extraction and
MPEG2 encoding. Moreover, it generates a time-
code on video frame basis, which is fed to all
content analysis components. They use the time-
code to provide a time stamp for all generated
features. This way all subsequent processing is able
to display and store the extracted features in a
synchronized way.
Each component transmits the features it has
extracted over a network channel to another
component. Since each component generates unique
metadata features, each pair of communicating
components must
know the type of the data being
transmitted. To this purpose over 12 metadata data
types were defined, see Figure 8, enabling each
component to correctly interpret any metadata
received.
All stream processing components/ tasks stem
from many different sources/development groups
within Philips. By encapsulating them by a thin shell
– implementing the required streaming and control
interfaces – they were easily integrated into one
content analysis system.
4 CONCLUSIONS
To enable fast evaluation of content analysis
systems, to come to sensible solutions that are easy
to use, PC based prototyping is a must. This is
extremely important to be able to understand the
feasibility of future
Consumer-Electronics (CE)
storage products that heavily depend on advanced
content analysis features.
To assure that PC based system solutions are
created that can be mapped onto more resource
constrained systems with a different architecture,
standardized interfaces are used for control and
streaming (Philips, 2005) (Nieuwland, 2002) (Kock,
2000), by using interconnection technology that is
designed for the platform at hand, and by optimizing
the feature-implementation for each underlying
HW/SW platform. Experiences with the large-scale
prototyping activities we have carried out
(Nesvadba, 2003) for the assessment of future
content-analysis systems, show that a PC based
prototyping approach enables the integration of
many different media processing features in a short
time and that it allows for accurate analysis of the
resource (CPU/ memory) requirements of such
components.
REFERENCES
Haan, G. de 2000. Video processing for multimedia
systems. In ISBN: 90-9014015-8, Eindhoven.
Hollemans G., 2005. MediaBrowser: In
http://www.research.philips.com/technologies/misc/ho
melab/downloads/homelab_365.pdf
Dimitrova, N., 2002. Real time commercial detection
using MPEG features. In 9th International Conference
on Information Processing and Management of
Uncertainty in Knowledge-Based Systems, IPMU 2002.
Kock, E. de, 2000. YAPI: application modeling for signal
processing systems. In Proc. 37th DAC.
Lange, F. de., 2001, The Philips-OpenTV Product Family
Architecture for interactive Set-Top boxes. In 4
th
product family engineering, Springer Verlag, ISBN 3-
540-43659-6, pages 187-206.
Ma, Q., 2001. Virtual TV Channel Filtering, Merging
and Presenting Internet Broadcasting Channels. In
SIGNotes Information Processing Society of Japan.
Maxtor, 2004. “Big Drives”, Maxtor Technologies. In
http://www.maxtor.com/_files/maxtor/en_us/documenta
tion/white_papers/big_drives_white_papers.pdf .
Nesvadba, J., 2004-1. Low-level cross-media statistical
approach for semantic partitioning of audio-visual
content in a home multimedia environment. In Proc.
IEEE IWSSIP’04.
Nesvadba, J., 2004. Face Related Features in consumer
Electronic environments. In IEEE SMC, Den Haag,
The Netherlands.
Nesvadba, J., 2005. Local real-time multimedia analysis.
http://www.research.philips.com/technologies/storage/
cassandra
McKinney, M., 2003. Features for audio and music
classification. In 4th International Symposium on
Music Information and Retrieval. Baltimore, Maryland
Nieuwland, A., 2002. a Heterogeneous Multi-processor
Architecture and Scalable and Flexible Protocol for the
design of Embedded Signal Processing Systems. In
Journal of Design Automation for Embedded Systems,
Kluwer Academic Publishers, vol. 7, no. 3.
MultimediaN, 2005. Multimedia analysis, database
technology, and human computer interaction. In
http://homepages.cwi.nl/~mk/multimedianonline/
Ommering, R., 2002. Building product populations with
software components. In ISCE 2002, pages 255-265.
Philips, 2005. UHAPI, a new Application Programming
Interface for the CE Industry. In http://www.uhapi.org .
UPnP, 2005. The Universal Plug and Play Forum. In
http://www.upnp.org .
RAPID PROTOTYPING OF MULTIMEDIA ANALYSIS SYSTEMS: A Networked hardware/software solution
109