image understanding; and (ii) the sensory gap between the object in the world and the
information in a (computational) description derived from a recording of that scene [1].
Basically, all systems use the assumption of equivalence of an image and its rep-
resentation in the feature space. These systems often use measurement systems, such
as the easily understandable Euclidean vector space model for measuring distances be-
tween a query image (represented by its features) and possible results, representing
all images as feature vectors in an n-dimensional vector space. Nevertheless, metrics
have been shown to not correspond well to the human visual perception. Several other
distance measures do exist for the vector space model such as the city-block distance,
the Mahalanobis distance or a simple histogram intersection. Still, the use of high-
dimensional feature spaces has shown to cause problems. Also, caution should be taken
when choosing the distance measure in order to retrieve meaningful results. These prob-
lems with a similarity definition in high-dimensional feature spaces is also known as the
“curse of dimensionality”, and has also been discussed in the domain of medical imag-
ing [2].
Beyer et. al. proved in [3] that the increasing in the number of features (and con-
sequently in the dimensionality of the data) leads to losing the significance of each
feature value. Thus, to avoid decreasing the discrimination accuracy, it is important to
keep the number of features as low as possible, establishing a trade-off between the
discrimination power and the feature vector size.
Aimed at overriding the problems of the semantic gap and the “curse of dimension-
ality”, this paper shows a simple but powerful feature extractor based on multiresolution
wavelet transforms, which uses the approximation subspace to compose the feature vec-
tor to represent the image. The results of applying our method achieves 90% regarding
the precision in the retrieval of medical images that asks up to 65% of the image set.
2 Background - Wavelets
Our proposed technique works on image subspaces generated by applying wavelet
transforms through the multiresolution method. Wavelets are mathematical functions
that separate the signal in different components of frequency, and then examine each
component with a combined resolution with its scale.
It is interesting to compare the wavelet transform to the Fourier transform. While the
Fourier transform analyzes a signal according to the frequency, the wavelet transform
analyzes it according to the scale. Thus, the wavelets can remove statistical redundancy
among pixels, providing a more compact representation of the image information. It is
believed that image indexing generated over the wavelet transformed domain are more
efficient than those designed over the spatial domain. This is due to the fact that the
transformed coefficients have better defined distributions than image pixels. Besides,
the wavelets have a multiresolution property that make it easier to extract the image
features from transformed coefficients [4].
The central element of a multiresolution analysis is a function φ(t), called the scal-
ing function, whose role is to represent a signal at different scales. The translations of
the scaling function constitute the “building blocks” of the representation of a signal at