(a) Coordinate systems (b) Sample images (c) Cube faces (d) VRML view
Figure 1: Cube based background representation. (a) Coordinate systems of the cube, {X,Y,Z} and the pan-tilt camera,
{X
t
,Y
t
,Z
t
} with zero tilt and non-zero pan. (b) A number of the images captured to build a mosaic. (c) Left, front and right
cube mosaicked-faces. (d) VRML view of the cube model showing one of the acquired images.
to encompass radiometric calibration to estimate the
radiometric response and vignetting functions of a
camera. Grossberg and Nayar introduced a camera
response model based on a large database of response
functions obtained from well controlled illumination
and color pattern setups (Grossberg and Nayar, 2003).
Kim and Pollefeys proposed estimating the vignetting
and radiometric response functions for high dynamic
range mosaics, from a set of images with different ex-
posures values (Kim and Pollefeys, 2008). Lin et al.
proposed to estimate the radiometric response func-
tion from images without changes in the exposure, us-
ing histograms of the edges regions (Lin and Zhang,
2005). However, vignetting is not considered in the
estimation of the radiometry. Zheng et al. also pro-
posed the correction of vignetting from a single im-
age, but requiring large piecewise flat regions in the
image, which is highly dependent on the scene con-
tents (Zheng et al., 2009).
Alternatively, Wonpil Yu proposed to correct vi-
gnetting based on a white pattern (Yu, 2004). The
white image, decreasing in brightness towards the
borders due to a vignetting distortion function, was
approximated with a 2D hypercosine function. This
calibration methodology is however cumbersome due
to the requirement of having to use very large patterns
when the cameras to calibrate are far away, e.g. out-
doors at a second level floor, as it is usual with surveil-
lance cameras.
In this work we propose therefore using the geo-
metric calibration procedures adapted to pan-tilt cam-
eras, and propose exploring the pan and tilt degrees
of freedom instead of requiring large constant color
areas in the scenarios, or color calibrating patterns.
This paper is organized as follows: Section 2 de-
scribes the geometrical model and the background
representation for pan-tilt cameras, Section 3 dis-
cusses and proposes methodologies to correct the ef-
fect of vignetting on the background variance and
event detection, Section 4 shows experiments test-
ing the proposed methodologies, and finally Section
5 summarizes the work and draws some conclusions.
2 PANORAMIC SCENE
REPRESENTATION
The background scene of a pan-tilt camera can be rep-
resented in various ways, such as a plane, a cylin-
der, a sphere or a cube. In particular we select the
cube based representation as it can handle a complete
spherical field-of-view (FOV), 360
o
× 360
o
, which is
not possible in the planar or cylindric mosaics, and
maps perspective images to/from the background us-
ing just homographies (as compared to using spheri-
cal mappings). See Fig. 1.
Building the cube based representation is a two
steps process: (i) obtaining a back-projection for each
image point and (ii) projecting the back-projection to
the right face of the cube. If one knows the intrin-
sic parameters matrix, K and the orientation R of the
camera, then each image point, m can be easily back-
projected to a 3D world point [x y z]
T
= (KR)
−1
m .
Projecting the world point to the right face of the cube
involves determining the face, namely front, back,
left, right, top or bottom (see Fig. 1a), and then com-
puting the 2D coordinates within that face. The cube
face where to project a world point is determined di-
rectly by inspecting the point coordinates. Defining
v = max(|x|,|y|, |z|), one has that [x y z]
T
is imaged in
the right, left, bottom, top, front or back face of the
cube if v ≡ x,−x, y,−y,z or −z, respectively.
Having identified the cube faces for mapping the
image points, the mapping process consists simply in
projecting the back-projections of the image points
using a projection matrix P
WF
= K
F
[R
WF
0
3×1
],
where K
F
is an intrinsic parameters matrix charac-
terizing the resolution (size) of the cube faces, and
R
WF
are rotation matrices defining optical axis or-
thogonal to the cube faces. The rotation matrices R
WF
VIGNETTING CORRECTION FOR PAN-TILT SURVEILLANCE CAMERAS
639