functionalities. Hence, it is more convenient to base
further perceptual processes on a more general rep-
resentation of the visual signal. The harmonic repre-
sentation discussed in this paper is a reasonable rep-
resentation of early vision process since it allows for
an efficient and complete representation of (spatially
and temporally) localized structures. It is character-
ized by: (1) compactness (i.e., minimal uncertainty
of the band-pass channel); (2) coverage of the fre-
quency domain; (3) robust correspondence between
the harmonic descriptors and the perceptual ‘sub-
stances’ in the various modalities (edge, motion and
stereo). Through a systematic analysis we investi-
gated the advantages of anisotropic
vs
isotropic filter-
ing approaches for a complete harmonic description
of the visual signal. We observed that it is prefer-
able to construct a multichannel, multiorientation rep-
resentation, thus avoiding an “early condensation” of
basic features. The harmonic content is then com-
bined in the phase-orientation space at the final stage,
only, to come up with the ultimate perceptual deci-
sions. An analysis of possible advantages of the ag-
gregation of the information in the monogenic im-
age in mid- and high-level perceptual tasks (e.g., im-
age classification) would require further investigation,
and it is deferred to a future work.
ACKNOWLEDGEMENTS
This work results from a cross-collaborative effort
within the EU Project IST-FET-16276-2 “DrivSco”.
REFERENCES
Adelson, E., Anderson, C., Bergen, J., Burt, P., and Ogden,
J. (1984). Pyramid methods in image processing. RCA
Engineer, 29(6):33–41.
Barron, J., Fleet, D., and Beauchemin, S. (1994). Perfor-
mance of optical flow techniques. Int. J. of Comp.
Vision, 12:43–77.
Bergen, J., Anandan, P., Hanna, K., and Hingorani, R.
(1992). Hierarchical model-based motion estimation.
In Proc. ECCV’92, pages 237–252.
Daugman, J. (1985). Uncertainty relation for resolution in
space, spatial frequency, and orientation optimized by
two-dimensional visual cortical filters. J. Opt. Soc.
Amer. A, A/2:1160–1169.
Diaz, J., Ros, E., Pelayo, F., Ortigosa, E., and Mota, S.
(2006). FPGA based real-time optical-flow system.
IEEE Trans. Circuits and Systems for Video Technol-
ogy, 16(2):274–279.
Felsberg, M. and Sommer, G. (2001). The monogenic sig-
nal. IEEE Trans. Signal Processing, 48:3136–3144.
Fleet, D. and Jepson, A. (1993). Stability of phase in-
formation. IEEE Trans. Pattern Anal. Mach. Intell.,
15(12):1253–1268.
Fleet, D., Jepson, A., and Jenkin, M. (1991). Phase-based
disparity measurement. CVGIP: Image Understand-
ing, 53(2):198–210.
Fleet, D. J. and Jepson, A. D. (1990). Computation of com-
ponent image velocity from local phase information.
Int. J. of Comp. Vision, 1:77–104.
Freeman, W. and Adelson, E. (1991). The design and use
of steerable filters. IEEE Trans. Pattern Anal. Mach.
Intell., 13:891–906.
Gautama, T. and Van Hulle, M. (2002). A phase-based ap-
proach to the estimation of the optical flow field us-
ing spatial filtering. IEEE Trans. Neural Networks,
13(5):1127–1136.
Harris, J. and Watamaniuk, S. N. (1995). Speed discrimina-
tion of motion-in-depth using binocular cues. Vision
Research, 35(7):885–896.
Kehtarnavaz, N. and Gamadia, M. (2005). Real-Time Im-
age and Video Processing: From Research to Reality.
Morgan & Claypool Publishers.
Koenderink, J. and van Doorn, A. (1987). Representation
of local geometry in the visual system. Biol. Cybern.,
55:367–375.
Kovesi, P. (1999). Image features from phase congruency.
Videre, MIT Press, 1(3):1–26.
Kr
¨
uger, N. and Felsberg, M. (2003). A continuous formula-
tion of intrinsic dimension. In Proc. British Machine
Vision Conference, Norwich, 9-11 September 2003.
Kr
¨
uger, N. and Felsberg, M. (2004). An explicit and com-
pact coding of geometric and structural information
applied to stereo matching. Pattern Recognition Let-
ters, 25(8):849–863.
Marr, D. (1982). Vision. New York: Freeman.
Nestares, O., Navarro, R., Portilla, J., and Tabernero, A.
(1998). Efficient spatial-domain implementation of a
multiscale image representation based on Gabor func-
tions. J. of Electronic Imaging, 7(1):166–173.
Owens, R. (1994). Feature-free images. Pattern Recogni-
tion Letters, 15:35–44.
Pauwels, K. and Van Hulle, M. (2006). Optic flow from
unstable sequences containing unconstrained scenes
through local velocity constancy maximization. In
Proc. British Machine Vision Conference, Edinburgh,
4-7 September 2006.
Sabatini, S., Solari, F., Cavalleri, P., and Bisio, G. (2003).
Phase-based binocular perception of motion in depth:
Cortical-like operators and analog VLSI architectures.
EURASIP J. on Applied Signal Proc., 7:690–702.
Sanger, T. (1988). Stereo disparity computation using Ga-
bor filters. Biol. Cybern., 59:405–418.
Scharstein, D. and Szeliski, R. (2002). A taxonomy and
evaluation of dense two-frame stereo correspondence
algorithms. Int. J. of Comp. Vision, 47(1–3):7–42.
Solari, F., Sabatini, S., and Bisio, G. (2001). Fast technique
for phase-based disparity estimation with no explicit
calculation of phase. Elect. Letters, 37:1382–1383.