The case shown on the right in Figure 4 has a
pixel on pyramid level 1 as center; the average of the
two adjacent pixels on level 0 may be taken for the
second diagonal test; however, with less effort better
results are obtained by using pixels on the next
higher pyramid level.
6 COMPARISON OF EFFORT
NEEDED AND EXPERIMENTAL
RESULTS
Beside the quality of results delivered (consistency,
completeness, localization accuracy), the effort to
obtain them is a criterion for general acceptance of a
method. The new method has been investigated
extensively with special test images and with single
real-world images from video (-fields and full
images). Qualitative results look promising so that a
real-time implementation for video-rate is underway.
With respect to computing effort needed, Table
1 shows a comparison with other proven methods;
the corner evaluation is done for every second pixel
(m
r
in total) in every second row (n
c
in total),
yielding m
r
·n
c
·0.25 locations. This is considered a
fair comparison to the pyramid concept in the new
method. The pyramid stages are a byproduct needing
just storage and only little additional computation
(see Eq.(2) with a single pixel as mask element).
With the size reduction by one quarter for each
stage, three stages need (1 + 0.25 + 0.0625) = 1.3125
times the operations per pixel. Assuming 15%
additional effort for removing corner candidates
stemming from nearly diagonal edges from the first
test (~ 5% of image locations and 3 times the basic
effort) requires another factor of 1.15 for the total
operations needed (~ 1.51 times the operations per
pixel, see square brackets in last column).
Table 1: Comparison of mathematical operations needed
with several corner extraction methods.
Since reusing intermediate results has been
taken into account computing the effort needed,
applying the known methods to every pixel location
requires only about doubling the numbers given. For
achieving the same localization accuracy, the new
method would have to start from twice the image
resolution, and the numbers in the last column have
to be multiplied by four. This would cut the ratio
between former and the last column in half, leaving
still some advantage to be expected for the new
method. However, since real-time visual perception
runs at 25 (33⅓) Hz with smoothing by recursive
estimation, this increased effort may not be
necessary.
Of course, these numbers can only yield a
rough estimate of computing times required by full
algorithms, since hardware capabilities and
programming proficiency also play an important role
for the results finally achieved. Future has to show
actual results.
Figure 5 shows results with two diagonal tests
but without consistent pixel centering on the
different pyramid levels. From the figure and many
other examples investigated it has been concluded
that the effort for precise superposition of mask
centers may be desirable for smooth tracking in real
time; an approach with two images on each higher
pyramid level, one shifted by (1, 1) relative to the
other (see Fig. 2), is under study and looks
promising.
Note that the approach requires no computation
of gradients at all. Just the sums of two pixels on the
diagonal have to be computed in the framework of
the fit of an intensity plane with least sum of errors
squared. The difference not only yields the residues
ε of the planar fit, but its square is also directly
proportional to the trace of the structural matrix, i.e.
the sum of the eigenvalues (Eq.(12)).
The use of two rotated planar fits allows
lowering the threshold traceN for achieving
detection of fainter real corners. If the threshold is
set too high, candidates resulting from real corners
but with low differences in intensity are lost. The
second rotated planar fit eliminates, or at least
strongly reduces, the number of candidates
stemming from noise-corrupted edges.
For real-time applications it is not so important
to detect all corners (also unreliable ones) but to
obtain sufficiently many consistent candidates for
tracking at high image rates. Edges are picked up by
separate operators anyway. Figure 5 shows that
some corner candidates are obtained on single
pyramid levels only, while others are detected on
two or even three levels; of course, these latter ones
are those best suited for tracking.
n
c
·m
r
·0.25
[·1.51]
n
c
·m
r
·0.25
n
c
·m
r
·0.25
n
c
·m
r
·0.25
number of
locations
1 [1.5]1519 361compare
1 [1.5]00 0 13mult./div.
5 [7.5]2228 5327add/subtr.
Dickmanns
2008
~ 4
FAST
2004
3.5
SUSAN
1997
2.5 3.5
Harris,
1988
~ 2.5
method,
year
radius r
16 pixel
37 pixel
2r = 2 · 3.5
2r = 2 · 3.5
1
2
3
4
−
6
87
5
1
7
5
3
refe-
rence
pixel
y
z
CORNER DETECTION WITH MINIMAL EFFORT ON MULTIPLE SCALES
319