0 50 100 150 200
confidence in translation
frame 0 50 100 150 200
confidence in translation
frame 0 50 100 150 200
confidence in translation
frame 0 50 100 150 200
confidence in translation
frame 0 50 100 150 200
confidence in translation
frame 0 50 100 150 200
confidence in translation
frame 0 50 100 150 200
confidence in translation
frame
singular
eigenenval
global
GMDL
Figure 4: Confidence of different criteria in a camera trans-
lation. White background indicates static camera, yellow
background a pure rotation and green background a general
motion including translation.
homography does not suffice to identify planes. First
we enforced that the purely geometric homographies
represent physical scene planes, then the case of a
global homography resulting from zero camera trans-
lation was analyzed. Finally, the overall effectiveness
of plane detection was shown in experiments.
Defining coplanarity only via the geometric trans-
fer function of a homography, it is not possible to de-
cide, whether a plane is only geometrically present or
corresponding to a physical scene plane. The key idea
was to use points in a closed image area for the def-
inition of planar patches, as the contiguous 3D plane
surfaces have to be mapped to contiguous 2D areas.
Finally, planes can not be detected in every sit-
uation. If there was no camera translation between
two frames and the optical centers are identical, no
information on coplanarity can be gained. Demand-
ing validity of the detected homographies for frames
with non-zero camera translation allows to handle this
degeneracy. An automatic classification of the camera
motion allows the detection of coplanar feature points
also in handheld image sequences.
REFERENCES
Baker, S., Szeliski, R., and Anandan, P. (1998). A layered
approach to stereo reconstruction. In Proc. Computer
Vision and Pattern Recognition, pages 434–441, Santa
Barbara, CA.
Bartoli, A. and Sturm, P. (2003). Constrained structure and
motion from multiple uncalibrated views of a piece-
wise planar scene. International Journal of Computer
Vision, 52(1):45–64.
Fraundorfer, F., Schindler, K., and Bischof, H. (2006).
Piecewise planar scene reconstruction from sparse
correspondences. Image and Vision Computing,
24(4):395–406.
Gorges, N., Hanheide, M., Christmas, W., Bauckhage, C.,
Sagerer, G., and Kittler, J. (2004). Mosaics from arbi-
trary stereo video sequences. In Proc. 26th DAGM
Symposium, pages 342–349, Heidelberg, Germany.
Springer-Verlag.
K¨ahler, O. and Denzler, J. (2006). Detection of planar
patches in handheld image sequences. In Proceedings
Photogrammetric Computer Vision 2006, volume 36
of International Archives of the Photogrammetry, Re-
mote Sensing and Spatial Information Sciences, pages
37–42.
Kanatani, K. (2004). Uncertainty modeling and model
selection for geometric inference. IEEE Transac-
tions on Pattern Analysis and Machine Intelligence,
26(10):1307–1319.
Lourakis, M., Argyros, A. A., and Orphanoudakis, S. C.
(2002). Detecting planes in an uncalibrated image
pair. In Proc. British Machine Vision Conference
(BMVC2002), pages 587–596.
Odone, F., Fusiello, A., and Trucco, E. (2002). Layered
representation of a video shot with mosaicing. Pattern
Analysis and Applications, 5(3):296–305.
Rother, C. (2003). Linear multi-view reconstruction of
points, lines, planes and cameras using a reference
plane. In Proceedings ICCV 2003, pages 1210–1217,
Nice, France.
Shi, J. and Tomasi, C. (1994). Good features to track.
In IEEE Conference on Computer Vision and Pattern
Recognition CVPR, pages 593–600.
Torr, P. H., Fitzgibbon, A., and Zisserman, A. (1999). The
problem of degeneracy in structure and motion recov-
ery from uncalibrated image sequences. International
Journal of Computer Vision, 32(1):27–45.
Torr, P. H. S. (1997). An assessment of information criteria
for motion model selection. In IEEE Conference on
Computer Vision and Pattern Recognition, pages 47–
53.
0
10
20
30
40
50
60
70
80
90
100
0 10 20 30 40 50 60 70 80 90 100
false detections
detected translations
GAIC
GBIC1
GBIC2
GMDL
0.12
1.17
1.17
singular
eigenval
global
Figure 5: ROC-curve for different methods of detecting
camera translation. An optimal method had 100% of de-
tected translations with 0% of false detections, which is sit-
uated in the lower right corner.