generally have sufficed.
From these experiments we have also observed
that there is little diversity in the 10 coefficients in
the recovered solutions along the main diagonal in-
dicating a stability in the coefficients across different
objects that is consistent with the prior training data.
Also, we have detected a difference in the optimisa-
tion behaviour of the two algorithms, DE and sim-
plex, and how much earlier the latter can reach the
global minimum. DE is much slower, but it has the
advantage that it can avoid locally optimum solutions,
which the simplex sometimes cannot.
Finally, in order to obtain a more specific and
complete idea of the characteristics of the posterior
surface, we have used a Markov-Chain Monte Carlo
(MCMC) (Gelman et al., 1995) approach in order to
generate a sample of the distribution and further anal-
yse it. We chose a single experiment (matching to a
frontal view of object 1 at 0
0
) and generated a set of
10000 samples of the posterior (3) from areas of high
probability using a single Markov Chain. We then ran
a k-means clustering algorithm (Bishop, 1995) which
recovered 3 main clusters in close proximity and all
near the global optimum. This indicates that, for this
example, the distribution is approximately unimodal
though perhaps with some subsidiary, nearby peaks
caused by noise effects. The main point is that there
is no significant local optimum elsewhere nearby in
the distribution.
A final examination of the kurtosis and skew-
ness of the sample has shown that the distributions
of the samples of all coefficients, except b
1
, are quite
strongly skewed, reflecting strong influence of the
likelihood near the optimum posterior, a property that
is highly desirable. This is due to the shape of the
likelihood function since the priors are symmetric.
The values for the kurtosis are small for some coeffi-
cients whose posteriors are therefore almost Gaussian
near the optimum, whilst other coefficients strongly
affected by the priors are leptokurtic.
4 CONCLUSIONS
Our approach to view-based object recognition in-
volves synthesising intensity images using a linear
combination of views and comparing the sythesised
images to the target, scene image. We incorporate
prior probabilistic information on the LCV parame-
ters by means of a Bayesian model. Matching and
recognition experiments carried out on data from the
COIL-20 public database have shown that our method
works well for pose variations where the target view
lies between the basis views. The experiments further
show the beneficial effects of the prior distributions in
“regularising” the optimisation. In particular, priors
could be chosen that produced a good basin of attrac-
tion surrounding the desired optimum without unduly
biasing the solution.
Nevertheless, additional work is required. In order
to avoid the overcompleteness of the LCV equations,
we would like to reformulate the LCV equations (1)
by using the affine tri-focal tensor and introducing the
appropriate constraints in the LCV mapping process.
In addition, in this paper we have only addressed ex-
trinsic viewpoint variations, but it should also be pos-
sible to include intrinsic, shape variations using the
approach described by (Dias and Buxton, 2005).
REFERENCES
Bishop, C. M. (1995). Neural Networks for Pattern Recog-
nition. Oxford University Press.
Dias, M. B. and Buxton, B. F. (2005). Implicit, view invari-
ant, linear flexible shape modelling. Pattern Recogni-
tion Letters, 26(4):433–447.
Gelman, A., Carlin, J. B., Stern, H. S., and Rubin, D. B.
(1995). Bayesian Data Analysis. Chapman and Hall,
London, 2nd edition.
Koufakis, I. and Buxton, B. F. (1998). Very low bit-rate
face video compression using linear combination of
2dfaceviews and principal components analysis. Im-
age and Vision Computing, 17:1031–1051.
Nelder, J. A. and Mead, R. (1965). A simplex method
for function minimization. Computer Journal, 7:308–
313.
Nene, S. A., Nayar, S. K., and Murase, H. (1996). Columbia
Object Image Library (COIL-20). Technical Re-
port CUCS-006-96, Department of computer science,
Columbia University, New York, N.Y. 10027.
Shashua, A. (1995). Algebraic functions for recognition.
IEEE Transactions on Pattern Analysis and Machine
Intelligence, 17(8):779–789.
Shewchuk, J. R. (2002). Delaunay refinement algorithms
for triangular mesh generation. Computational Ge-
ometry: Theory and Applications, 22:21–74.
Storn, R. and Price, K. V. (1997). Differential evolution - a
simple and efficient heuristic for global optimization
overcontinuous spaces. Journal of Global Optimiza-
tion, 11(4):341–359.
Ullman, S. and Basri, R. (1991). Recognition by linear com-
binations of models. IEEE Transactions on Pattern
Analysis and Machine Intelligence, 13(10):992–1006.
Zografos, V. and Buxton, B. F. (2007). Pose-invariant
3d object recognition using linear combination of 2d
views and evolutionary optimisation. ICCTA, pages
645–649.
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
298