ACHIEVING HIGH-RESOLUTION VIDEO USING SCALABLE CAPTURE, PROCESSING, AND DISPLAY
Donald Tanguay, H. Harlyn Baker, Dan Gelb
2006
Abstract
New video applications are becoming possible with the advent of several enabling technologies: multicamera capture, increased PC bus bandwidth, multicore processors, and advanced graphics cards. We present a commercially-available multicamera system and a software architecture that, coupled with industry trends, create a situation in which video capture, processing, and display are all increasingly scalable in the number of video streams. Leveraging this end-to-end scalability, we introduce a novel method of generating high-resolution, panoramic video. While traditional point-based mosaicking requires significant image overlap, we gain significant advantage by calibrating using shared observations of lines to constrain the placement of images. Two non-overlapping cameras do not share any scene points; however, seeing different parts of the same line does constrain their spatial alignment. Using lines allows us to reduce overlap in the source images, thereby maximizing final mosaic resolution. We show results of synthesizing a 6 megapixel video camera from 18 smaller cameras, all on a single PC and at 30 Hz.
References
- Arvind, D. Culler, R. Iannucci, V. Kathail, K. Pingali, R. Thomas, 1984. The tagged token dataflow architecture. Technical report, MIT Laboratory for Computer Science.
- Buck, J., S. Ha, E. Lee, and D. Messerschmitt, 1994. Ptolemy: A framework for simulating and prototyping heterogeneous systems. In International Journal of Computer Simulation, April 1994.
- Canny, J., 1986. A computational approach to edge detection. In IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 8, pp. 679-698.
- Gibbs, W. Wayt, 2004. A split at the core. In Scientific American, Nov. 2004.
- Gortler, S.J., R. Grzeszczuk, R. Szeliski, M.F. Cohen, 1996. The Lumigraph, In Proc. ACM SIGGRAPH, New Orleans, USA.
- Hartley, R., A. Zisserman, 2000. Multiple view geometry in computer vision, Cambridge University Press.
- Kanade T., P.J. Narayanan, P.W. Rander, 1995. Virtualized reality: Concepts and early results. In Proc. IEEE Workshop on Representation of Visual Scenes.
- Kongetira, P., K. Aingaran, K Olukotun, 2005. Niagara: A 32-way multithreaded SPARC processor. In IEEE Micro, vol. 25, no. 2, pp. 21-29.
- Lee, E., T. Parks, 1995. Dataflow Process Networks. In Proceedings of the IEEE, May 1995.
- Levoy, M., B. Chen, V. Vaish, M. Horowitz, L. McDowall, M. Bolas, 2004. Synthetic aperture confocal imaging. In ACM Trans on Graphics (SIGGRAPH 2004).
- Levoy, M., P. Hanrahan, 1996. Light field rendering. In Proc. ACM SIGGRAPH, New Orleans, USA.
- Lohse, M., M. Repplinger, P. Slusallek, 2002. An open middleware architecture for network-integrated multimedia. In Protocols and Systems for Interactive Distributed Multimedia Systems, Proceedings of IDMS/PROMS'2002 Joint International Workshops on Interactive Distributed Multimedia Systems / Protocols for Multimedia Systems, Coimbra, Portugal, November 26-29, 2002.
- Mayer-Patel, K., Rowe, L., 1997. Design and performance of the Berkeley Continuous Media Toolkit. In Multimedia Computing and Networking, Proc. SPIE 3020, pp. 194-206.
- McDougall, R., 2005. Extreme software scaling. In ACM Queue, vol. 3, no.7, pp. 36-46.
- Peleg, S., J. Herman, 1997. Panoramic mosaicing with VideoBrush. In DARPA Image Understanding Workshop, May 1997, pp.261-264.
- Rasure, J., C. Williams, 1991. An integrated visual language and software development environment. In Journal of Visual Languages and Computing, vol. 2, pp. 217-246.
- Robinson, D., P. Milanfar, 2003. Statistical performance and analysis of super-resolution image reconstruction. In Proceedings of Intl. Conf. on Image Processing.
- Sawhney, H.S., S. Hsu, R. Kumar, 1998. Robust video mosaicing through topology inference and local to global alignment. In Proc. of the 5th European Conference on Computer Vision, vol. II, 1998, pp. 103-119.
- Shum, H.-Y., R. Szeliski, 2000. Construction of panoramic mosaics with global and local alignment. In International Journal of Computer Vision, February 2000, vol. 36, no.2, pp. 101-130.
- Wilburn, B., N. Joshi, V. Vaish, M. Levoy, M. Horowitz, 2004. High speed video using a dense camera array. In Proc. Computer Vision Pattern Recognition.
- Wilburn, B., M. Smulski, H-H. Kelin Lee, M. Horowitz, 2002. The light field video camera. In Proc. Media Processors, SPIE Electronic Imaging, vol. 4674, 29- 36.
- Yang, R., M. Pollefeys, S. Li, 2004. Improved real-time stereo on commodity graphics hardware. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW'04).
Paper Citation
in Harvard Style
Tanguay D., Harlyn Baker H. and Gelb D. (2006). ACHIEVING HIGH-RESOLUTION VIDEO USING SCALABLE CAPTURE, PROCESSING, AND DISPLAY . In Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, ISBN 972-8865-40-6, pages 162-169. DOI: 10.5220/0001374301620169
in Bibtex Style
@conference{visapp06,
author={Donald Tanguay and H. Harlyn Baker and Dan Gelb},
title={ACHIEVING HIGH-RESOLUTION VIDEO USING SCALABLE CAPTURE, PROCESSING, AND DISPLAY},
booktitle={Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP,},
year={2006},
pages={162-169},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001374301620169},
isbn={972-8865-40-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP,
TI - ACHIEVING HIGH-RESOLUTION VIDEO USING SCALABLE CAPTURE, PROCESSING, AND DISPLAY
SN - 972-8865-40-6
AU - Tanguay D.
AU - Harlyn Baker H.
AU - Gelb D.
PY - 2006
SP - 162
EP - 169
DO - 10.5220/0001374301620169