6 CONCLUSIONS AND FUTURE
WORKS
In this paper we proposed a new method for video
object recognition based on video object models.
The results of video object recognition, in terms of
accuracy are very encouraging (83%). We created a
video dataset of 25 video object, it consists of 360
degree-views of the objects. From the video dataset
an image dataset is also constructed by sampling the
video frames. It contains 900 views of the 25
objects. Our method for object modeling gives, as
result, a compact and complete representation of the
objects, it achieves almost 76% data compression of
the models. With regard to object recognition
method, one of the possible improvement is to refine
the selection of the frames for the query in the
objects models database. Given a video, the camera
motion could be estimated and the frame samples
extracted according to motion, for example trying to
get a frame every fixed angular displacing. Best
results should be reached using a sampling rate that
approximate the rate used in the dataset creation. If
the video is long enough to have a high number of
selected frames, the same modeling process could be
used in the query to increase time performance of
the recognition, preserving the accuracy taking only
the most relevant views.
Figure 11: Two examples of results: a false negative (the
dancer) and a true negative (unknown object).
REFERENCES
Li, Z. N., Zaiane, O. R., Tauber, Z., 1999. Illumination
Invariance and Object Model Content-Based Image
and Video Retrieval. In Journal of Visual
Communication and Image Representation, vol 10, pp
219-224.
Z. Li and B. Yan., 1996 Recognition Kernel for content-
based search. In Proc. IEEE Conf. on Systems, Man,
and Cybernetics, pages 472-477.
Day, Y. F., Dagtas, S., Iino, M., Khokhar, A., Ghafoor, A.,
1995. Object-oriented conceptual modeling of video
data. In Proceedings of the Eleventh International
Conference on Data Engineering.
Chen, L., Ozsu, M. T., 2002. Modeling of video objects in
a video databases. In Proceedings of IEEE
International Conference on Multimedia and Expo.
Sivic, J., Zisserman, A., 2006. Video Google: Efficient
visual search of videos. In Toward Category-Level
Object Recognition, pp. 127-144, Springer.
Vedaldi, A., Fulkerson, B., 2010. VLFeat: An open and
portable library of computer vision algorithms. In
Proceedings of the International Conference on
Multimedia.
Kavitha, G., Chandra, M. D., Shanmugan, J., 2007. Video
Object Extraction Using Model Matching Technique:
A Novel Approach. In 14th IWSSIP, 2007 and 6th
EURASIP Conference focused on Speech and Image
Processing, Multimedia Communications and
Services, pp. 118-121.
Mundy, Joseph L. 2006. Object recognition in the
geometric era: A retrospective. Toward category-level
object recognition. pp.3-28.
Lowe, D.G., 2004. Distinctive Image Features from Scale-
Invariant Keypoints, In International Journal of
Computer Vision n. 60 vol.2 pp. 91-110, Springer.
Turk, M., Pentland, A., 1991. Eigenfaces for recognition.
In Journal of cognitive neuroscience vol.3, n.1, pp. 71-
86, MIT press.
Zhao, L. W., Luo, S. W., Liao, L. Z., 2004. 3D object
recognition and pose estimation using kernel PCA. In
Proceedings of 2004 International Conference on
Machine Learning and Cybernetics.
Wang, X. Z., Zhang, S. F., Li, J., 2007. View-based 3D
object recognition using wavelet multiscale singular-
value decomposition and support vector machine. In
ICWAPR.
Pontil, M., Verri, A., 1998. Support vector machines for
3D object recognition. In IEEE Transactions on
Pattern Analysis and Machine Intelligence, vol.20 n.6,
pp. 637-646.
Murase, H., Nayar, S. K., 1995. Visual learning and
recognition of 3-D objects from appearance. In
International journal of computer vision, vol.14 n.1,
pp. 5-24. Springer.
Lowe, D. G., 1999. Object recognition from local scale-
invariant features. In . The proceedings of the seventh
IEEE international conference on Computer vision.
Chang, P., Krumm, J., 1999. Object recognition with color
cooccurrence histograms. In IEEE Computer Society
Conference on Computer Vision and Pattern
Recognition.
Wu, Y. J., Wang, X. M., Shang, F. H., 2011. Study on 3D
Object Recognition Based on KPCA-SVM. In
International Conference on Information and
Intelligent Computing, vol.18 pp. 55-60. IACSIT
Press, Singapore.
Fischler, Martin A and Bolles, Robert C.,1981. Random
sample consensus: a paradigm for model fitting with
applications to image analysis and automated
cartography In Communications of the ACM, vol. 24,
num.6, pp. 381–395.
VideoObjectRecognitionandModelingbySIFTMatchingOptimization
669