7. Image analysis problem decomposition and synthesis.
8. Problems 1-7 for dynamical images with a complex background and with
considering the ways of an image acquisition, formation and representation.
In solving these subproblems an automated/interactive IA system is faced with
the specific subproblems as follows: a) extract meaningful 2-D grouping of intensity-
location-time values via identification of groups of image entities – pixels by means
of similarity of intensity value or by similarity discontinuity or similarity of change or
constancy over time; typical groupings are edges, regions, and flow vectors; b) infer
3-D surfaces, volumes, boundaries, shadows, occlusion, depth, color, motion using
the groupings of pixels and their characteristics; c) group information into unique
physical entities; d) transform image-centered representations into world-centered
representations; e) label entities depending on system goals and world model; f) infer
relations among entities; g) construct a consistent internal description.
Image understanding is considered as an emulation of human visual capabilities.
In particular, as the derivation of high-level (abstract) information from an image or
series of images and more specifically as the derivation of knowledge on 3-D world
from 2-D images and the construction of the description of 3-D scene represented as
2-D image (s). The result of an image understanding is a symbolic description of the
image in terms of its elements, relations between them and the image properties. The
description should ensure the decision making in a real 3-D environment (recognition
of 3-D objects, automated navigation, etc.). The image understanding process is
implemented by combining the results of image processing, analysis and recognition
with the knowledge on a scene.
In IA is used a wide spectrum of mathematical techniques from algebra,
geometry, discrete mathematics, mathematical logics, probability theory,
mathematical statistics, calculus, as well as the techniques of mathematical theory of
pattern recognition, digital signal processing, and physics (in particular, optics).
The transition to practical, reliable and efficient automation of image-mining is
directly dependent on introducing and developing of theoretical means for IAE.
The natural way to overcome the above mentioned contradiction between
“pictorial nature” of an image and the “formal” (symbolic) foundations of IAE is to
introduce pattern recognition oriented image models and necessary means and
techniques for reduction of an image to a recognizable form without loss of an image
specificity. The careful study of the challenge revealed the opportunity to solve it via
a theory establishing reasonable ties between an image nature, IAE applications,
pattern recognition philosophy, image representations and models, IAE transforms,
and corresponding information technologies.
In a whole the success of IAE depends mainly on the success of an image
reduction to a recognizable form, which could be accepted by appropriate image
analysis/recognition algorithms. It appeared that an image reduction to a recognizable
form is a crucial issue for IA applications, in particular, for qualified decision making
on the base of image mining. The main tasks and problems of an image reduction to a
recognizable form are:
1. Formal description of images: 1) study and construction of image models; 2) study
and construction of image representations by multiple models.
2. Description of image classes reducible to a recognizable form: 1) introduction of
new mathematical settings of an image recognition problem; 2) establishing and study
55