8 CONCLUSIONS
We are entering an exciting period in which capture,
processing, and display are increasingly scalable in
the number of video streams supported. In support of
this view, we capture 24 uncompressed VGA
streams in real-time on a single PC and leverage
multiprocessors for video processing. Coupling
these developments with recent commercial
hardware advances brings forth end-to-end
scalability that enables new applications of camera
arrays and other multi-video sources. We selected a
mosaic camera as our first application in this space.
We presented several innovations in this paper.
First, we described how to use a digital projector as
a calibration instrument for a mosaic camera. We
also presented an adaptive technique for reducing
the number of calibration images using the solution
constraints accumulated thus far. We described how
the mosaic camera is built using our camera array,
and how our dataflow architecture combined with
chip multiprocessors and graphics cards present new
abilities and options for mosaic processing. Finally,
we showed initial results for 18 cameras with small
error and real-time performance.
REFERENCES
Authors. References withheld to preserve anonymity
during review process.
Arvind, D. Culler, R. Iannucci, V. Kathail, K. Pingali, R.
Thomas, 1984. The tagged token dataflow
architecture. Technical report, MIT Laboratory for
Computer Science.
Buck, J., S. Ha, E. Lee, and D. Messerschmitt, 1994.
Ptolemy: A framework for simulating and prototyping
heterogeneous systems. In International Journal of
Computer Simulation, April 1994.
Canny, J., 1986. A computational approach to edge
detection. In IEEE Trans. on Pattern Analysis and
Machine Intelligence, vol. 8, pp. 679-698.
Gibbs, W. Wayt, 2004. A split at the core. In Scientific
American, Nov. 2004.
Gortler, S.J., R. Grzeszczuk, R. Szeliski, M.F. Cohen,
1996. The Lumigraph, In Proc. ACM SIGGRAPH,
New Orleans, USA.
GPGPU. General-purpose computation using graphics
hardware. http://www.gpgpu.org.
Hartley, R., A. Zisserman, 2000. Multiple view geometry
in computer vision, Cambridge University Press.
Kanade T., P.J. Narayanan, P.W. Rander, 1995.
Virtualized reality: Concepts and early results. In
Proc. IEEE Workshop on Representation of Visual
Scenes.
Kongetira, P., K. Aingaran, K Olukotun, 2005. Niagara: A
32-way multithreaded SPARC processor. In IEEE
Micro, vol. 25, no. 2, pp. 21-29.
Lee, E., T. Parks, 1995. Dataflow Process Networks. In
Proceedings of the IEEE, May 1995.
Levoy, M., B. Chen, V. Vaish, M. Horowitz, L.
McDowall, M. Bolas, 2004. Synthetic aperture
confocal imaging. In ACM Trans on Graphics
(SIGGRAPH 2004).
Levoy, M., P. Hanrahan, 1996. Light field rendering. In
Proc. ACM SIGGRAPH, New Orleans, USA.
Lohse, M., M. Repplinger, P. Slusallek, 2002. An open
middleware architecture for network-integrated
multimedia. In Protocols and Systems for Interactive
Distributed Multimedia Systems, Proceedings of
IDMS/PROMS'2002 Joint International Workshops on
Interactive Distributed Multimedia Systems /
Protocols for Multimedia Systems, Coimbra, Portugal,
November 26-29, 2002.
Mayer-Patel, K., Rowe, L., 1997. Design and performance
of the Berkeley Continuous Media Toolkit. In
Multimedia Computing and Networking, Proc. SPIE
3020, pp. 194-206.
McDougall, R., 2005. Extreme software scaling. In ACM
Queue, vol. 3, no.7, pp. 36-46.
Microsoft Corp., DirectShow. http://msdn.microsoft.com.
Peleg, S., J. Herman, 1997. Panoramic mosaicing with
VideoBrush. In DARPA Image Understanding
Workshop, May 1997, pp.261-264.
Rasure, J., C. Williams, 1991. An integrated visual
language and software development environment. In
Journal of Visual Languages and Computing, vol. 2,
pp. 217-246.
Robinson, D., P. Milanfar, 2003. Statistical performance
and analysis of super-resolution image reconstruction.
In
Proceedings of Intl. Conf. on Image Processing.
Sawhney, H.S., S. Hsu, R. Kumar, 1998. Robust video
mosaicing through topology inference and local to
global alignment. In Proc. of the 5th European
Conference on Computer Vision, vol. II, 1998, pp.
103-119.
Shum, H.-Y., R. Szeliski, 2000. Construction of
panoramic mosaics with global and local alignment. In
International Journal of Computer Vision, February
2000, vol. 36, no.2, pp. 101-130.
Sun Corp., Java Media Framework.
http://java.sun.com/products/java-media/jmf.
Wilburn, B., N. Joshi, V. Vaish, M. Levoy, M. Horowitz,
2004. High speed video using a dense camera array. In
Proc. Computer Vision Pattern Recognition.
Wilburn, B., M. Smulski, H-H. Kelin Lee, M. Horowitz,
2002. The light field video camera. In Proc. Media
Processors, SPIE Electronic Imaging, vol. 4674, 29-
36.
Yang, R., M. Pollefeys, S. Li, 2004. Improved real-time
stereo on commodity graphics hardware. In
Proceedings of the 2004 IEEE Computer Society
Conference on Computer Vision and Pattern
Recognition Workshops (CVPRW’04).
ACHIEVING HIGH-RESOLUTION VIDEO USING SCALABLE CAPTURE, PROCESSING, AND DISPLAY
169