implementing complex applications for content
based query (Chigrik, 2007, Kratochvil, 2005,
Oracle, 2005).
This paper presents a DBMS based on the
relational model, which is less expensive. The
DBMS is platform independent and can easily
manage medium sized image collections and
alphanumerical information from the medical
domain. It has a visual interface for building content
based retrieval using color and texture
characteristics and can be easily used by any person
working in this area, even if he does not have
advanced knowledge in using the computer.
2 DATA ORGANIZATION IN
THE DATABASE
MANAGEMENT SYSTEM
In this section it is described how the information is
organized in the DBMS.
For each database a new folder is created with
the same name as the name provided by the user.
This folder is in the Database folder which is also
created in the application main folder when the
software is installed. Each table in the database is
represented by a file with “.tbl” extension stored in
corresponding database folder. The file has two
components:
− A header – is created in the design phase
− Data area – is updated when executing
traditional operations of Insert, Update and
Delete
The header structure is made of:
- the number of records in table header (in the
header there will be a record for each column in
table, a record for primary key, and a record for each
external key defined in the table).
- the size of each record from the header (a
header record has information about a column of the
table: name, type, length – in case of char strings; it
can also store information about primary or foreign
key/keys).
- the header records.
The DBMS has three types of data: int, char
(fixed length strings) and image.
For the Image data type, in the data area the
following attributes are stored:
− Image type (bmp, jpg or gif);
− Image height and width
− Number of bytes needed to store the image
− The image in binary;
− 166 integer values, representing the color
histogram.
− 12 integer values, representing texture vector.
A series of methods frequently used in the
medical domain are also implemented for the Image
data type: rotating, zooming, pseudo-colors, the
similarity distance between two images, a thumbnail
representation, etc. The Image data type is in
compliance with the SQL/MM standard (SQLMM,
2001).
There are described below the methods used for
extracting color and texture information from an
image and the reason why they where chosen.
The color is the visual feature that is
immediately perceived on an image. The color space
used for representing color information in an image
has a great importance in content-based image
query, so this direction of research was intensely
studied (Del Bimbo, 2001).
There is no color system that it is universally
used, because the notion of color can be modeled
and interpreted in different ways (Gevers, 2004).
There were created several color spaces, for
different purposes (Gevers, 2004). The color
systems were studied taking into consideration
different criteria imposed by content-based visual
query (Gevers and Smeulders, 1999): the
independence of the imaging device; perceptual
uniformity; linear transformation; intuitive for user;
robust against imaging conditions (invariant to a
change in viewing direction, invariant to a change in
object geometry, invariant to a change in direction
and intensity of the illumination and invariant to a
change in the spectral power distribution of the
illumination).
It was proved that the HSV color system has the
following properties (Gevers, 2004): it is close to the
human perception of colors; it is intuitive; it is
invariant to illumination intensity and camera
direction. The studies made on nature and medical
images have shown that in the case of the HSV,
RGB, l1l2l3 and CieLuv color systems, the HSV
color space produces the best results in content
based retrieval (Gevers and Smeulders, 1999,
Gevers, 2004, Smith, 1997, Stanescu et al, 2006).
The operation of color system quantization is
needed in order to reduce the number of colors used
in content-based visual query. The quantization of
the HSV color space to 166 colors, solution
proposed by J.R. Smith, is the idea used in this
MMDBMS (Smith, 1997), having as result the color
histogram which is memorized together with the
image in the data area of the file.
Together with color, texture is a powerful
characteristic of an image, existent in nature and
medical images, where a disease can be indicated by
changes in the color and texture of a tissue. A series
of methods have been studied to extract texture
features (Del Bimbo, 2001), but there is not a certain
SIGMAP 2007 - International Conference on Signal Processing and Multimedia Applications
372