2 RELATED WORK AND
CONTRIBUTION
Robust Planar Reconstruction. A key feature in
our work is our robust planar reconstruction method
using color segmentation. Most segmentation-based
algorithms rely on initial matched pairs in each seg-
ment, as discussed earlier. However in a scene con-
taining T-junctions or large parallax motions, the ini-
tial matched pairs may contain numerous outliers,
which can cause incorrect plane fits.
Our approach is more robust and overcomes many
potential problems such as T-junctions due to a wide-
baseline, since we find an optimal plane by directly
remapping each segment, using the homographies
from the candidate plane, onto all visible target im-
ages. One related approach is Patch-based Multi-
View Stereo (PMVS2) (Furukawa and Ponce, 2009)
that uses small patches (e.g., 5 × 5) on which to fit
planes. PMVS2 also relies on initial matched pairs
(e.g., Harris corner with LoG operators) so the algo-
rithm may lead to incorrect patches. When PMVS2
fails to expand a patch’s size, holes or gaps may also
occur.
In addition, most segmentation-based stereo algo-
rithms use a disparity map between a stereo-rectified
image pair. This is not practical in multi-view cases
where we find a plane that satisfies multiple images in
which the segment is visible. Moreover, in multi-view
cases with widely different camera positions and di-
rections, camera poses cannot be manipulated to pre-
vent large distortions after stereo-rectification.
Another related approach is Iterative Plane Fit-
ting (Habbecke and Kobbelt, 2006) by computing a
plane that approximates part of the scene. However,
this method requires manual plane initialization. In
addition, since it relies on intensity differences be-
tween reference and comparison images, it may not
be sufficient to find the correct plane in wide-baseline
stereo. Our method, on the other hand, improves
planar reconstruction by exploiting several matching
constraints. The Iterative Plane Fitting method does
not detect occlusion, whereas our method simultane-
ously detects occluded planar segments in the recon-
struction process.
Plane Sweeping is another approach to reconstruct
planar scenes. Gallup et al. (Gallup et al., 2007)
proposed a real-time plane sweeping stereo method
for outdoor scenes. However, their method requires
initial sparse correspondences and uses intensity dif-
ferences only, which has limitations in wide-baseline
stereo, as mentioned earlier.
In addition, our method use a simple segment
graph to filter out plane outliers, that is, any segment
that has a large 3D discontinuity between adjacent
plane segments is filtered out. Some papers such as
(Taguchi et al., 2008) use similar techniques that op-
timize segment planes. However, their optimization
again uses a simple energy function based on a dis-
parity map between a stereo image pair, which is not
applicable to multi-view cases. We believe it is more
practical and reliable to directly look at planes in 3D
instead of a disparity map in an image domain.
Hybrid Reconstruction with Segmentation. We
also seek an integration of two heterogeneous meth-
ods for aerial and outdoor urban scene reconstruction.
Our algorithm uses color segmentation to obtain seg-
mented regions to be categorized by potential 3D ge-
ometry. We define a simple planarity criterion by seg-
ment color and size so that each segment, treated as
either planar or non-planar, is reconstructed as a plane
or a set of points, respectively.
GPU-friendliness. For a CPU implementation, our
planar reconstruction could potentially be slower than
other matching approaches when a segment is too
large, since for every pixel in the segment, bilinear in-
terpolation at its reprojected target position is required
for every target view. Likewise, the descriptor com-
parison for point-to-point matches can be very expen-
sive when a scene demands many per-pixel matches.
Our algorithm is portable to GPU systems, where it
produces matches in a fraction of the time of a CPU
version, using the texture mapping hardware in the
GPU to do the bilinear interpolation for the remap-
ping.
3 RECONSTRUCTION
OVERVIEW
Our reconstruction process begins with a color seg-
mentation, in particular, mean-shift color segmenta-
tion (Comaniciu and Meer, 2002). Once segmenta-
tion is completed, any tiny segment inside a larger
segment that has a similar color is merged with
it. Sparse feature-point matching using SURF (Bay
et al., 2008), followed by triangulation, is additionally
performed to determine an approximate 3D bounding
box of the scene. This bounding box is later used to
limit the extent of plane candidates in the planar re-
construction and to filter outliers in the point recon-
struction.
During the reconstruction process, each segment
is classified as either planar or non-planar. How-
ever, it is extremely difficult to automatically recog-
VISAPP 2012 - International Conference on Computer Vision Theory and Applications
256