(MRF) framework and the optimization is performed
through graph cuts. The silhouettes obtained for all
viewpoints are used to build the VH of the object.
The paper is organized as follows: in section 2
an overview of the related work is given. Section 3
introduces research hypotheses and the notation used
in the paper. In sections 4-6, the details of the esti-
mation of background and object likelihoods as well
as the segmentation framework are presented. Sec-
tion 7 presents the experiments and discusses the re-
sults. The last section provides some conclusions on
the proposed approach and identifies directions for fu-
ture work.
2 RELATED WORK
SFS was first introduced by Baumgart (Baumgart,
1974), this concept suggests to fuse silhouettes of an
object in 3D to obtain the VH. Since the object’s sil-
houette is the key element for VH construction, the
following review concentrates on silhouette extrac-
tion approaches.
The obvious and easy way to implement tech-
niques for silhouette extraction is chroma key-
ing (Smith and Blinn, 1996). This approach is based
on the knowledge of the scene background. An object
is imaged against a uniform or known background,
then the silhouette is extracted by thresholding the
background color or by background subtraction. Due
to implementation simplicity, this technique was used
in many SFS works (Matusik et al., 2002; Jagers et al.,
2008). Even though this method provides fairly good
results, there are some drawbacks. Firstly, it implies
preliminary scene background manipulations for each
camera viewpoint, which limits possible camera po-
sitions on a hemisphere since the background has to
be visible from all viewpoints. Secondly, the case
when part of the object has the same color as the back-
ground may lead to incorrect segmentation.
Chroma keying was extended in other works
where instead of a static background, an active back-
ground system was used (Zongker et al., 1999; Ma-
tusik et al., 2002). As an active background, a con-
trolled display was installed around an object. A
scene was captured with and without an object with
different background patterns for a fixed viewpoint.
Even though such an approach allows the extraction
of the alpha matte of the silhouette of an object made
from material with complex optical properties such
as glass, the hardware requirement seriously compli-
cates the acquisition process and limits the method’s
application area. The major drawback is the inability
to move the camera with respect to the background
screens, since images with and without an object have
to be aligned at the pixel level.
Another group of algorithms with explicit back-
ground modeling is based on background subtraction.
A good review can be found in (Piccardi, 2004; Radke
et al., 2005; Parks and Fels, 2008). A background
subtraction technique is based on the construction of
a background model of a scene at first, followed by
the classification of pixels that do not fit this model
as foreground pixels. The major drawback of these
methods is the requirement of an explicit estimation
of the background. This requirement imposes that an
update of the background model needs to be done ev-
ery time the position of the camera is changed which
can be difficult for non uniform backgrounds.
An original segmentation technique that is worth
mentioning was presented in (Sun et al., 2007). The
idea is to use two images: with flash and without flash.
It is assumed that the appearance of a foreground ob-
ject in a ”flash image” is different from that of a ”with-
out flash image”. However, the background remains
similar in both images. The main requirement in this
method is that the background has to be sufficiently
far from the object so that it is not affected by the
camera flash. Unfortunately, this condition is not met
in our experimental environment.
A more universal way to segment images is to rely
on user initialization (Boykov and Jolly, 2001). Here,
user input is used to obtain initial information about
object and background properties. This information
is used to construct a graph and the object segmen-
tation problem is considered as a graph energy min-
imization. A graph cuts is applied to find the global
minimum. In the approach presented in this paper, an
energy minimization via graph cuts is also performed
to obtain optimal segmentation. However, our goal is
to find the initial information required to construct an
MRF automatically.
Single image segmentation by graph cut was fur-
ther extended to automatic silhouette extraction in
multiple views in (Campbell et al., 2007; Lee et al.,
2007). Although these methods may work well, the
usage of explicit object and background color mod-
eling limits the type of objects that can be recon-
structed. Another drawback related to color model-
ing is when the same color belongs to the object and
background model. In this case, the result may lead
to over- or under-estimation of the silhouette. In our
work, we avoid explicit color modeling of an object
and background in order to overcome these limita-
tions.
ShapefromSilhouetteinSpace,TimeandLightDomains
369