annotation techniques. The proposed technique show
a good improvement although must be said that a
straight comparison is impossible due to the
different adopted features and the number of classes.
Table 3: Comparision of proposed technique with state of
art annotation techniques.
TM
CMR
M
ME MBRM
Propo
sed
Tech.
Prec 0.06 0.10 0.09 0.24 0.36
Recall 0.04 0.09 0.12 0.25 0.36
6 CONCLUSION AND FUTURE
WORKS
Image annotation needs to exploit information from
different orthogonal features to capture the visual
elements carrying a symbolic meaning matched with
the text labels.
The shown techniques use information from
different features and merge together visual
information represented in term of scores related to
different labels. Different information fusion
techniques have been compared showing that, for
this application, the weighted sum of g-scores
produces better results than other fusion techniques.
The information fusion produced putting a HL
LDF to summarize the results of the first stage
LDFs, allows an improvement in performance when
the characterization of input images, through g-units
scores, is adherent to their content. Decision trees
have a reduced utility in this case mainly due to the
reduced generalization capability.
Further investigations will be focused on the
training of the images in terms of more specific
classes or sub-classes that despite a reduced number
of samples for each category are more specific as
content. The application of more complex models
instead of LDF can also allow capturing the positive
and negative classes in a more flexible way and
allow a better performance for fusion algorithms.
ACKNOLEDGEMENTS
Authors would like to thank Kobus Barnard and
Shen Gao for their help with the images data set and
Rulequest company for the evaluation version of the
See5/C5.0 software for decision trees building.
REFERENCES
Barnard K., Duygulu P., Forsyth D., de Freitas N., Blei D.,
Jordan. M., 2003, “Matching words and pictures”,
Journal of Machine Learning Research, Vol.3, pp
1107-1135.
Bellegarda J.-R., 2000, “Exploiting latent semantic
information in statistical language modelling”, Proc.
of the IEEE, Vol. 88, No. 8, pp 1279-1296.
Blei D., Jordan M.-I., 2003, “Modeling annotated data”,
ACM SIGIR.
Carbonetto P., de Freitas N., Barnard K., 2004, “A
statistical model for general contextual object
recognition”, Proc. of ECCV.
Duygulu P., de Freitas N., Barnard K., Forsyth D., 2002,
“Object recognition as machine translation: Learning a
lexicon for a fixed vocabulary”, Proc. of ECCV.
Feng S.-L., Manmatha R., Lavrenko V., 2004, “Multiple
Bernoulli relevance models for image and video
annotation,” , Proc of CVPR’04.
Gao S., Wang D.-H., Lee C.-H., 2006, “Automatic Image
Annotation through Multi-Topic Text Categorization”,
Proc. of ICASSP.
Gao S., Wu W., Lee C.-H., Chua T.-S. , 2004, “A MFoM
learning approach to robust multiclass multi-label text
categorization”, Proc. of ICML.
He X.-M., Zemel R. S., Carreira-Perpiñán M. A., 2004,
“Multiscale conditional random fields for automatic
image annotation”, Proc. of CVPR
Jeon J., Manmatha R., 2004., “Using maximum entropy
for automatic image annotation”, Proc of ICVR.
Jeon J., Manmatha R., 2003, “Automatic image annotation
and retrieval using cross-media relevance models”,
ACM SIGIR.
Landgrebe T.C.W., Paclik P., Duin R.P.W., Bradley A.P.,
2006, "Precision-recall operating characteristic (P-
ROC) curves in imprecise environments", Proc. of the
18th Int. Conf. on Pattern Recognition
Linde Y., Buzo A., Gray R., 1980. “An Algorithm for
Vector Quantizer Design”. IEEE Transaction on
Communications, vol. 28 (1), pp 84–94.
Mitchell T.M., 1997, Machine Learning, McGrawHill
Mori Y, Takahashi H., Oka R., 1999, Image-to-word
transformation based on dividing and vector
quantizing images with words, In Proc of MISRM'99
Quinlan J.R., 2006, Data Mining Tools See5 and C5.0,
from Rule Quest web site: www.rulequest.com/ see5-
info.html
Salton G., 1971, The SMART Retrieval System, Prentice-
Hall, Englewood Cliffs, NJ
Sebastiani F., 2002, “Machine Learning in Automated
Text Categorization”, ACM Computer Surveys, Vol.
34, No. 1, pp 1-47.
Wang D.-H, Gao S., Tian Q., Sung W.-K, 2005,
“Discriminative fusion approach for automatic image
annotation”, Proc. of IEEE 7
th
Workshop on
Multimedia Signal Processing
INFORMATION FUSION TECHNIQUES FOR AUTOMATIC IMAGE ANNOTATION
67