vertical direction, the displacement is computed by
matching salient features and RANSAC is used to
remove outliers.
The optimization framework is tested on a PC
with two Xeon CPUs (2.00 GHz and 1.99 GHz) and
1.50GB ram. The global optimization of the result in
Figure 17(a) (with 980 482 × 429 input frames) takes
around 12 minutes and the result in Figure 17(b) (with
1200 395 × 227 input frames) takes around 8 minutes
2
. The local adjustment of these two results both takes
around 4 minutes.
6 CONCLUDING REMARKS
This paper presents a framework for producing multi-
perspective panoramas of street scenes. Our approach
uses an estimation of 3D scene structure to eliminate
the sampling error caused by the depth parallax. Then
an automatic optimization is performed to create the
panorama with minimal aspect ratio distortions. Af-
ter that, a further local adjustment step is applied to
remove artifacts caused by inverse perspectives. In
principle, our approach is restricted to straight cam-
era trajectories and approximately fronto-parallel pic-
ture surfaces. For non-straight camera trajectories, we
assume they are piece-wise linear. However, for tra-
jectories with abrupt direction changes, although our
rendering system can handle this situation, the result
of our global optimization is not theoretically accu-
rate, as the aspect ratio distortion in this case is not
yet clear.
REFERENCES
Agarwala, A., Agrawala, M., Cohen, M., Salesin, D., and
Szeliski, R. (2006). Photographing long scenes with
multi-viewpoint panoramas. ACM Transactions on
Graphics, 25(3):853 – 861.
Boykov, Y. and Kolmogorov, V. (2004). An experimental
comparison of min-cut/max-flow algorithms for en-
ergy minimization in vision. IEEE Transactions on
Pattern Analysis and Machine Intelligence, 26:1124–
1137.
Chen, S. and Williams, L. (1993). View interpolation
for image synthesis. Computer Graphics, 27(Annual
Conference Series):279–288.
Davis, J. (1998). Mosaics of scenes with moving objects.
In Proceedings of the IEEE Conference on Computer
Vision and Pattern Recognition, pages 354–360.
2
To eliminate the redundant computation of matching
points, a dense disparity map for each consecutive frame
pair is pre-computed before optimization.
Goesele, M., Curless, B., and Seitz, S. (2006). Multi-view
stereo revisited. In Proceedings of IEEE Conference
on Computer Vision and Pattern Recognition, pages
2402–2409.
Hartley, R. and Zisserman, A. (2004). Multiple view geom-
etry in computer vision. Cambridge University Press,
2 edition.
Kumar, R., Anandan, P., Irani, M., Bergen, J., and Hanna,
K. (1995). Representation of scenes from collections
of images. In Proceedings of IEEE Workshop on Rep-
resentation of Visual Scenes, pages 10–17.
Roman, A., Garg, G., and Levoy, M. (2004). Interactive
design of multi-perspective images for visualizing ur-
ban landscapes. In Proceedings of IEEE Visualization,
pages 537–544.
Roman, A. and Lensch, H. (2006). Automatic multiper-
spective images. In Proceedings of 17th Eurographics
Symposium on Rendering, pages 161–171.
Shum, H. and Szeliski, R. (2000). Construction of
panoramic image mosaics with global and local align-
ment. International Journal of Computer Vision,
36(2):101–130.
Szeliski, R. and Shum, H. (1997). Creating full view
panoramic image mosaics and environment maps. In
Proceedings of SIGGRAPH 97, Computer Graphics
Proceedings, Annual Conference Series, volume 31,
pages 251–258.
Wexler, Y. and Simakov, D. (2005). Space-time scene man-
ifolds. In Proceedings of the International Conference
on Computer Vision, volume 1, pages 858 – 863.
Yu, J. and McMillan, L. (2004). General linear cameras.
In Proceedings of European Conference on Computer
Vision, pages 14–27.
Zheng, J. (2003). Digital route panoramas. IEEE Multime-
dia, 10(3):57– 67.
Zhu, Z., Riseman, E., and Hanson, A. (2001). Parallel-
perspective stereo mosaics. In Proceedings of the
International Conference on Computer Vision, vol-
ume 1, pages 345–352.
Zomet, A., Feldman, D., Peleg, S., and Weinshall, D.
(2003). Mosaicing new views: The crossed-slits pro-
jection. IEEE Transactions on Pattern Analysis and
Machine Intelligence, 25(6):741– 754.
GRAPP 2009 - International Conference on Computer Graphics Theory and Applications
198