Integration of Tracked and Recognized Features for Locally and Globally Robust Structure from Motion
Chris Engels, Friedrich Fraundorfer, David Nistér
2008
Abstract
We present a novel approach to structure from motion that integrates wide baseline local features with tracked features to rapidly and robustly reconstruct scenes from image sequences. Rather than assume that we can create and maintain a consistent and drift-free reconstructed map over an arbitrarily long sequence, we instead create small, independent submaps generated over short periods of time and attempt to link the submaps together via recognized features. The tracked features provide accurate pose estimates frame to frame, while the recognizable local features stabilize the estimate over larger baselines and provide a context for linking submaps together. As each frame in the submap is inserted, we apply real-time bundle adjustment to maintain a high accuracy for the submaps. Recent advances in feature-based object recognition enable us to efficiently localize and link new submaps into a reconstructed map within a localization and mapping context. Because our recognition system can operate efficiently on many more features than previous systems, our approach easily scales to larger maps. We provide results that show that accurate structure and motion estimates can be produced from a handheld camera under shaky camera motion
References
- Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60 (2004) 91-110
- Nistér, D., Stewénius, H.: Scalable recognition with a vocabulary tree. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, New York City, New York. (2006)
- Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Proc. 9th IEEE International Conference on Computer Vision, Nice, France. (2003) 1470-1477
- Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proc. 13th British Machine Vision Conference, Cardiff, UK. (2002) 384-393
- Lowe, D.: Object recognition from local scale-invariant features. In: Proc. 7th International Conference on Computer Vision, Kerkyra, Greece. (1999) 1150-1157
- Kalman, R.: A new approach to linear filtering and prediction problems. Transactions of the ASME: Journal of Basic Engineering (1960) 35-45
- Triggs, B., McLauchlan, P., Hartley, R., Fitzgibbon, A.: Bundle adjustment: A modern synthesis. In: Vision Algorithms Workshop: Theory and Practice. (1999) 298-372
- Engels, C., Stewénius, H., Nistér, D.: Bundle adjustment rules. In: Photogrammetric Computer Vision. (2006)
- Bosse, M., Newman, P., Leonard, J., Teller, S.: An atlas framework for scalable mapping. In: IEEE International Conference on Robotics and Automation. (2003) 1234-1240
- Leonard, J.J., Newman, P.M.: Consistent, convergent, and constant-time slam. In: International Joint Conference on Artificial Intelligence. (2003) 1143-1150
- Davison, A., Reid, I., Molton, N., Stasse, O.: Monoslam: Real-time single camera slam. IEEE Transactions on Pattern Analysis and Machine Intelligence 29 (2007) 1052-1067
- Sim, R., Little, J.J.: Autonomous vision-based exploration and mapping using hybrid maps and Rao-Blackwellised particle filters. In: Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), Beijing, IEEE/RSJ, IEEE Press (2006) 2082-2089
- Montemerlo, M., Thrun, S., Koller, D., Wegbreit, B.: Fastslam: A factored solution to the simultaneous localization and mapping problem. In: Proc. of the AAAI National Conference on Artificial Intelligence. (2002) 593-598
- Nistér, D., Naroditsky, O., Bergen, J.: Visual odometry. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC. (2004) I: 652-659
- Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey Vision Conference. (1988)
- Tomasi, C., Kanade, T.: Detection and tracking of point features. Technical Report CMUCS-91-132, Carnegie Mellon University (1991)
- Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Van Gool, L.: A comparison of affine region detectors. International Journal of Computer Vision 65 (2005) 43-72
- Ni, K., Steedly, D., Dellaert, F.: Out-of-core bundle adjustment for large-scale 3d reconstruction. In: Proc. 11th IEEE International Conference on Computer Vision, Rio de Jeneiro, Brazil, IEEE (2007)
Paper Citation
in Harvard Style
Engels C., Fraundorfer F. and Nistér D. (2008). Integration of Tracked and Recognized Features for Locally and Globally Robust Structure from Motion . In VISAPP-Robotic Perception - Volume 1: VISAPP-RoboPerc, (VISIGRAPP 2008) ISBN 978-989-8111-23-4, pages 13-22. DOI: 10.5220/0002341800130022
in Bibtex Style
@conference{visapp-roboperc08,
author={Chris Engels and Friedrich Fraundorfer and David Nistér},
title={Integration of Tracked and Recognized Features for Locally and Globally Robust Structure from Motion},
booktitle={VISAPP-Robotic Perception - Volume 1: VISAPP-RoboPerc, (VISIGRAPP 2008)},
year={2008},
pages={13-22},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002341800130022},
isbn={978-989-8111-23-4},
}
in EndNote Style
TY - CONF
JO - VISAPP-Robotic Perception - Volume 1: VISAPP-RoboPerc, (VISIGRAPP 2008)
TI - Integration of Tracked and Recognized Features for Locally and Globally Robust Structure from Motion
SN - 978-989-8111-23-4
AU - Engels C.
AU - Fraundorfer F.
AU - Nistér D.
PY - 2008
SP - 13
EP - 22
DO - 10.5220/0002341800130022