2 LOGO EXTRACTION
The first step in our logo recognition algorithm is
logo extraction. The logo extraction itself can be
further divided into five steps, as illustrated in
Figure 2. The first three steps, i.e., contrast stretch,
automatic gamma correction, and bilateral filtering,
pre-process the input image to improve the quality
of the subsequent steps in the logo extraction. Next,
a dynamic region growing process searches the logo
in the neighborhood of the click position. As soon as
the region growing finishes, the algorithm makes an
image crop, i.e., the result of our logo extraction.
Figure 2: Logo extraction by region growing.
Automatic gamma correction
Since logo extraction works on all kinds of images,
gamma correction needs to calculate a different
gamma value for each individual image. To
automatically generate an appropriate value we
created an automatic gamma correction. This
correction starts with an RGB to HSL color space
conversion. Using the histogram of the HSL
lightness component, our algorithm computes the
mean and standard deviation of the lightness
component. Based on these values the algorithm
computes the gamma value using (Eq. 1).
Bilateral filtering
Noise makes logo extraction more difficult. Many
solutions, i.e., filters, exist in literature to remove
image noise. However the majority of these filters
have the undesirable side-effect of blurring the
edges. For region growing these edges are very
important and must be easily distinguishable.
Bilateral filtering prevents averaging across
edges, while still averaging within smooth regions. It
is a non-linear filtering technique introduced by
(Tomasi and Manduchi, 1998). It extends the
concept of Gaussian smoothing by weighting the
Gaussian filter coefficients with their corresponding
relative pixel intensities, i.e., combining gray levels
or colors based on their geometric closeness and
their photometric similarity.
Dynamic region growing
Since the proposed pre-processing operates on the
global image, and, standard region growing grows
unbounded, the computational cost can become very
high and the retrieved image crop is sometimes too
big to find a unique match. Dynamic region growing
solves these problems since it investigates only a
small part of the image, i.e., a 100 by 100 pixel area
centered at the click position. If this small image
crop contains enough information to retrieve a
unique match, the dynamic region growing finishes.
Otherwise the pixel area is extended and the logo
extraction restarts.
3 FEATURE EXTRACTION
Many algorithms exist to recognize logo-like
objects, e.g., global features, shape description- &
matching techniques, and local features (invariant to
transformations and variations). Based on our results
and on studies comparing recognition techniques
(Veltkamp, 2001), local features seem to perform
best for solving real-life image matching problems.
Local feature-based image matching is usually
done in two steps. The first step is the feature
detection, i.e., keypoint or interest point detection.
The second step involves computing descriptors for
each detected interest point. These descriptors are
then used for matching keypoints of the input image
with keypoints in the logo database.
During the last decade, a lot of different
detectors and descriptors have been proposed in
literature (Mikolajczyk et al., 2004). For logo
recognition the descriptor should be distinctive and
at the same time robust to changes in viewing
conditions. The feature should also be resilient to
changes in illumination, image noise, uniform
scaling, rotation, and minor changes in viewing
direction. The descriptor should also minimize the
probability of mismatch and finally, it should also be
(1)
VISAPP 2009 - International Conference on Computer Vision Theory and Applications
374