
ON PROJECTION MATRIX IDENTIFICATION FOR CAMERA
CALIBRATION
Michał Tomaszewski and Władysław Skarbek
Institute of Radioelectronics, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warszawa, Poland
Keywords:
Projection matrix, homographic matrix, camera calibration, intrinsic parameters, Housholder symmetry.
Abstract:
The projection matrix identification problem is considered with application to calibration of intrinsic camera
parameters. Physical and orthogonal intrinsic camera models in context of 2D and 3D data are discussed. A
novel nonlinear goal function is proposed for homographic calibration method having the fast convergence of
Levenberg-Marquardt optimization procedure. Three models (linear, quadratic, and rational) and four opti-
mization procedures for their identification were compared wrt their time complexity, the projection accuracy,
and the intrinsic parameters accuracy. The analysis has been performed for both, the raw and the calibrated
pixel data, too. The recommended technique with the best performance in all used quality measures is the
Housholder QR decomposition for the linear least square method of the linear form of projection equations.
1 INTRODUCTION
Camera calibration is the fundamental generic prob-
lem in computer vision (Y. Ma, 2004). In case of
pinhole camera model, the problem usually refers to
estimation of camera intrinsic parameters K and to
camera pose R and camera locationC estimation with
respect to a selected coordinates frame. Both kinds
of parameters define, modulo constant factor, a pro-
jection matrix M which is the algebraic model in ho-
mogenous coordinates of the imaging geometry for
the given view of 3D or 2D scene:
p ≡ MP, p =
x
y
1
, P =
X
Y
Z
1
M = [M
3
,m
4
], M
3
∈ R
3×3
, m
4
∈ R
3
(1)
where projective relation ≡ makes equivalent all
points collocated in the same projective line. In al-
gebraic notation it means that for any P there exists
a scaling factor λ(P) for which the equation is true:
p = λ(P)MP.
The matrix K ∈ R
3×3
of intrinsic parameters de-
scribes the transformation from scene to camera pixel
coordinates on the projection plane. Since the choice
of coordinate axis for the camera is not unique the K
is not unique, too. However, the following decompo-
sition formula always holds:
M ≡ KR
−1
[I
3
,−C] (2)
where the pose matrix R = [r
x
,r
y
,r
z
] consists of the
camera frame axis defined by unit length vectors with
coordinates wrt the scene frame I
3
= [e
1
,e
2
,e
3
], and
C is the origin of the camera frame.
The matrix equivalence used in (2) is the equality
modulo constant factor: M
1
≡ M
2
if and only there
exists λ 6
= 0 such that M
2
= λM
1
.
Since any rotation in projection plane can be mod-
elled by the matrix R, the intrinsic matrix K is the
upper triangular matrix with positive elements on the
diagonal. In principle there are two approaches to
make the matrix K unique. In the most popular case,
the requirement of orthogonality R
t
R = I
3
makes by
QR decomposition of M
−1
3
, the unique selection of K
(O. Faugeras, 2001). We call this case of calibration
as orthogonal calibration and identify the fivefree pa-
rameters for the inverse matrix:
K
−1
o
=
k
1
k
2
k
3
0 k
4
k
5
0 0 1
(3)
92
Tomaszewski M. and Skarbek W. (2007).
ON PROJECTION MATRIX IDENTIFICATION FOR CAMERA CALIBRATION.
In Proceedings of the Second International Conference on Computer Vision Theory and Applications, pages 92-97
DOI: 10.5220/0002068000920097
Copyright
c
SciTePress