camera) similar to our system. Due to the use of back-
ground subtraction as the object tracking algorithm,
the active camera must stop its motion for image cap-
ture. Therefore, moving target images captured by
this system may appear blurred or out of focus. More-
over, the tracking performance of this system is a few
dozen degrees per second.
System (d) was constructed using binocular cam-
eras similar to our system. The advantage of this sys-
tem is that it seldom fails in tracking a target that is
at a different depth when compared to its surrounding
objects. This is a fast-driven system; however, it is
very expensive because it uses very complex special
hardware.
In (e), the system was constructed using binocu-
lar camera head fixed on the robot arm, and In (f), it
presented the effect of the introduction the dynamic
control (like feed-forward control) into closed-loop
tracking system. In these manuscripts, the methods
of object tracking are not described clearly.
As is evident from the details stated above, the re-
lated works require complex, expensive, and special
hardware; in addition, their operation is seldom sta-
ble in a real environment. Moreover, their systems
obtain target object images regardless of their quality,
because the architecture of these systems is based on
the concept of only tracking the object.
1.2 Our Approach
In our system, we use two computers and video cam-
eras that are available in the market to obtain the ob-
ject image, track the target object, and control the ac-
tive cameras at 30 fps. In our system, the camera
is controlled such that it moves at the same angular
speed and direction as the target object. Therefore, we
can obtain an image with a blurred background and
clear target at the image center (Fig.1). Additionally,
we use binocular active cameras to track the object.
We can then estimate the 3D position and velocity of
the object. The 3D information can be used for ad-
justing the focus and zoom of the camera. Therefore,
we can obtain target images that are clearer than those
Figure 1: Obtained by conventional method (left) and pro-
posed method (right).
obtained by using only a single active camera. The re-
quirements for our system are as follows:
(1) binocular active cameras to focus their optic axis
on a point in 3D space
(2) target to appear at the center of the images
This is because condition (1) helps us to avoid
the contradiction between detection with two cam-
eras. Epipolar geometry can work stably only if there
is no contradiction between the two cameras.
Condition (2) is required because the view angle
will become narrow when zooming in and the target
will easily escape from the image. In such cases, the
object tracking may completely fail, and thus, there
will be no means to control the cameras.
In fact, it is difficult to estimate the absolute cor-
rect epipolar line because of the errors in object track-
ing or estimation of camera directions. The system
becomes unstable if it is controlled according to an in-
correct epipolar line. Further, the system may become
unstable and lose its smoothness if the time response
deteriorates under the influence of excessive interac-
tions between the two control loops, or if the actions
of the cameras are overly restricted. If the tracking
system’s action is not smooth, the target in the im-
age will appear blurred and the accuracy of the object
tracking will deteriorate. Therefore, the control of the
active cameras will become increasingly unstable. As
previously described, to focus the optic axis of two
cameras on one 3D point, we must solve the follow-
ing problems.
(A) The information sent from the other camera may
be incorrect or of low accuracy if a tracking failure
occurs.
(B) Excessive constraint from the other camera will
make the tracking system’s action unstable.
The conventional related studies on active vision
tracking systems do not mention the methods for con-
trolling the binocular active camera with an emphasis
on the quality of the captured images by solving prob-
lems (A) and (B).
Therefore, in this paper, we have proposed a new
method for solving these problems and constructing
a high-speed-tracking active camera system, that can
continuously obtain high quality images. Our system
can automatically control the direction, zoom, and
focus of the two cameras to focus on a point in 3D
space.
To solve problem (A), we introduce the concept
of reliability into the K-means tracker and propose a
method to constrain the camera action by using this
reliability, which is based on the calculation of the
distance from the K-means clusters.
CLEAR IMAGE CAPTURE - Active Cameras System for Tracking a High-speed Moving Object
95