Image Representation: From Raw Data to Models

I. Gurevich, Vera Yashina

Dorodnicyn Computing Center, Russian Academy of Sciences

Vavilov str. 40, Moscow, 119333 Russian Federation

Abstract. A space of image representations used in Descriptive Theory for

Image Analysis is considered. The main types of image representations are

introduced in accordance with image and its reducing to a recognizable form.

1 Introduction

This paper describes research results related to the Descriptive Theory for Image

Analysis (DTIA), which was proposed and substantiated by I.B. Gurevich and is

being developed by members of his scientific school [1, 2]. The main contribution of

this paper is that it describes a system of concepts characterizing initial data (images)

in recognition problems and uniquely defines a hierarchical system of relations

introduced on classes of concepts. This system underlies the formal definition of

methods for synthesizing image models and descriptive image models intended for

image recognition problems.

At present, the crucial points in the development of image analysis and recognition

include the understanding of the nature of initial data (images); methods for image

representation and description that provide the construction of image models for

image recognition problems; a mathematical language for a unified description of

image models and their transformations ensuring the construction of image models

and the solution of image recognition problems; and models for solving image

recognition problems in the form of standard algorithmic schemes that, in the general

case, ensure transitions from the original image to its model and from the latter to a

desired solution.

DTIA specifies a unified conceptual structure for the development and implemen-

tation of such models and mathematical language. The main goal of DTIA is to

represent various methods, operations, and representations used in image analysis and

recognition in a structured and standardized form. DTIA provides the conceptual and

mathematical foundation for Image Mining. The axiomatics and formal structures of

DTIA provide methods and tools for representing and describing images for their

This work was partially supported by the Russian Foundation for Basic Research Grant № 08-01-00469,

by the project “Algorithmic schemes of descriptive image analysis” of the Program of Basic Research

“Algebraic and Combinatorial Techniques of Mathematical Cybernetics” of the Department of

Mathematical Sciences of the RAS and by the project of the Program of the Presidium of the Russian

Academy of Sciences “Fundamental Problems of Computer Science and Information Technologies” (the

project no. 2.14)

Gurevich I. and Yashina V. (2009).

Image Representation: From Raw Data to Models.

In Proceedings of the 2nd International Workshop on Image Mining Theory and Applications, pages 20-29

DOI: 10.5220/0001962900200029

 SciTePress

subsequent analysis and estimation.

Overall, the following results were obtained by analyzing and adapting descriptive

image models (DIMs) for automated image analysis and recognition: (1) a system of

concepts was introduced that characterizes initial data (images) in image recognition

problems; (2) a system of concepts was introduced that characterizes and defines

DIMs for image recognition; (3) a hierarchical scheme was constructed that represents

and unifies the conceptual apparatus, definitions, mathematical objects, and transfor-

mations defining the construction of an image model in image recognition based on

DTIA; (4) a scheme for constructing image models was proposed; and (5) schemes

for constructing four classes of DIMs were introduced.

2 Images as Initial Data in Image Recognition Problems

To develop automated image recognition methods, we need techniques for effective

image formalization that reflects the image semantics and the information carried by

the internal image structure and the structure of external connections in the actual

world part (scene) reproduced by the image. No systematic mathematical methods for

image formalization and analysis are available at present. The overwhelming majority

of image-related methods are heuristic, and their merits are determined by how

effectively the pictorial nature of images is overcome by non-pictorial tools.

An image is an object with a complex information structure reproducing

information on the original scene using the brightness of discrete image elements

(pixels); configurations of image fragments and sets of pixels; and spatial and logical

relations between configurations, sets of pixels, and individual pixels. In contrast to

other ways of data representation, images are highly informative, illustrative,

structured, and naturally perceived by humans. An image is a mixture of original

(raw, "actual") data, their representations, and deformations arising in the formation

and transformation of digital images. Representations reflect the information and

physical nature of objects, events, and processes represented by images, while

deformations are caused by the technical characteristics of tools used to record, form,

and transform images in the construction of hierarchical representations. Thus, while

developing methods for a formal description of images, we have to take into account

not only the brightness of image pixels but also additional explicit and implicit

information associated with images.

A natural hypothesis underlying the formalization of image descriptions and its

conceptual apparatus is that an original image is specified not only by a set of its

digital realizations but also by contextual and semantic information associated with

the methods used for image recording and formation or with some specific aspects of

the image.

To construct a formalized description of an image, transformations admissible for

the given type of images have to be applied to the entire information available on the

image. Thus, it is a necessity to study (i) the types of information carried by the image

(the space of initial data) and (ii) the transformations to be applied to the original

image to reduce it to a form acceptable for recognition algorithms (the space of trans-

formations).

The descriptions of procedures for serial and/or parallel applications of transfor-

mations form the space of transformations to initial data from the space of initial data

constitute a set of schemes for constructing formal descriptions of images (the space

of image representations).

To ensure that recognition algorithms can be applied to the resulting formal image

descriptions, the schemes constructed (image representations) have to be imple-

mented; i.e., image models have to be constructed by reducing the original image

(with allowance for the entire information on it) to a form acceptable for recognition

algorithms. The space of image representations is intermediate between the space of

initial data and the space of image models.

Thus, the construction of image models involves the synthesis and application of

objects from the set of initial data (i.e., images), the set of image transformations for

Reducing Images to a Recognizable Form (RIRF), the set of image representations

(i.e., schemes for constructing formal image descriptions), and the set of image

models.

DTIA deals with three classes of admissible image transformations: procedural, pa-

rametric, and generating transformations (see Definitions 2-4 below). These classes of

transformations generate three classes of image representations and three classes of

image models, respectively.

The following concepts are used to characterize images in DTIA: initial data

(image as a whole with its legend), transformations of initial data, representations of

initial data (by a representation, we mean a formal scheme for describing an image

and the objects it involves), and models of initial data (by an image model, we mean

an image description acceptable for recognition algorithms).

Additional objects are introduced to define the types of representations through

which an original image goes in the course of image model construction and to estab-

lish the relations between these types. These additional objects include generating

rules, structuring elements, semantic and contextual information on an image, digital

image realizations, classes of image representations, realizations of image representa-

tions, classes of image models, and a correct image model.

The following relations between the objects have been revealed by analyzing the

basic concepts related to image description construction: 1) there are deterministic

(obvious) relations between the initial data (image) and: a) the transformations

applied to it; b) methods for obtaining its digital realizations; c) the results of applying

transformations to digital image realizations; 2) there are relations inherent in DTIA:

a) between the classes of image transformations and the classes of admissible image

representations; b) between the classes of admissible image representations and the

classes of image descriptions in a form acceptable for recognition algorithms (classes

of image models); 3) special relations were revealed: a) between some classes of

image models; b) between some class of image models and initial data; c) between the

results of applying transformations to digital image realizations and the classes of

image models.

The study of these relations has led to the construction of a hierarchy of DTIA con-

cepts. According to the hierarchical scheme shown in Fig. 1, the concepts introduced

can be structured so that they can be used to develop algorithmic schemes for image

analysis and recognition and to describe images with the help of DTIA. Based on the

hierarchy, several axioms of DTIA were formulated in [2].

The scheme in Fig.1 reflects several levels of relations between DTIA concepts.

I. An open arrow denotes the relation whereby object 1 is associated with objects 2,

3, …: the original image I is associated with the following three sets: 1) the set of

transformations

{}O



(the set of structuring elements

{}S



is auxiliary for

{}O



), 2) the

set of initial data

{}



; 3) the set of image models

{}



II. The application of

{}O



{}



generates a set

{}



of correct image models

(this assertion is proved in Theorem 1).

III. A solid heavy arrow denotes the relation "object 1 consists of object 2, object

3…" 1) The set of transformations

{}O



consists of three subsets: procedural

transformations

{}



, parametric transformations

{}



, and generating transforma-

tions

{}



. Moreover,

{}O



(as well as its subsets

{}



{}



{}



) is specified

together with the set

{}S



of structuring elements, which can be applied to the image

together with transformations. The rules for applying generating transformations from

{}



to initial data from

{}



are described by a set of generating rules

{}R



. Thus, a

subset of

{}



is associated with a fixed subset of

{}R



. 2) The set of image models

{}



consists of four subsets: procedural image models

{}



, parametric image

models

{}



, generating image models

{}



, and image I-model

{}



. 3) The

initial data

{}



contain contextual and semantic information

{}B

on the image and

the set of realizations

{}

′′

∈



of I that represent the given object or scene.

IV. A dashed line denotes the relation "an object generates another object”. 1) The

three classes

{}



{}



{}



of transformations generate the following three

classes

Fig. 1. Hierarchy of DTIA concepts.

of image representations: procedural representations

{()}

ℜ



, parametric

representations

{()}

ℜ



, and generating representations

{()}

ℜ



. 2) It can be

proved that any image T-model

{}

MM∈



generates an image realization

{}

′

∈



(see Proposition 1).

V. The dashed circle in the scheme stresses that the image realizations

{'}



are re-

lated to the semantic and context information

{}B

. This relation means that the

various types of initial data are used together in solving problems.

3 Descriptive Image Models

In DTIA it is assumed that an image is described by a set of initial data

{}I



. The

composition of this set is determined below.

Lemma 1. The set

{}



of initial data consists of two subsets

{}

′



and

{}B



: (1) the

set of realizations

{}

′′

∈



of I representing the given object or scene such that

{( , ( ))}

Ixfx

∈

′

is the set of points

lying in the domain

of the image

realization and the set of values

()

at each point of

; and (2) semantic and con-

textual information

{}B



on the image.

Definition 1. An I-model of an image is any element

′

of a set

{}

′



of image

realizations.

Consider a set of transformations

{}O



introduced over data given in the form of

images. Below we define the basic classes of image transformations (procedural, pa-

rametric, and generating) and introduce the related concepts of a structuring element,

a generating rule, and a correct generating transformation.

Definition 2. A Procedural Transformation

{}

OO∈



of arity r over a set of

images

1...

{}

is an operation such that its application to

1...

{}

transforms it into

another set of images, into an image, or into image fragments.

Accordingly, a procedural transformation

{}

OO∈



of arity r over a set of image

I-models

1...

{}

′

is an operation such that its application to

1...

{}

′

transforms it into

another set of image I-models, into an image I-model, or into a set of I-models of

image fragments. The operands of this operation can be I-models of a single original

image or I-models of several different original images.

Definition 3. A Parametric Transformation

{}

OO∈



over an image I is an

operation such that its application to I transforms it into a numerical characteristic p

that correlates with the properties of geometric objects, brightness characteristics, or

configurations formed by regular repetitions of the geometric objects and brightness

characteristics of the original image.

To calculate a numerical characteristic p of I, we can use both the set of image

realizations and semantic or contextual information on the image.

Definition 4. A Generating Transformation

{}

OO∈



over an image I is an

operation generating a particular representation of I that reflects some specific

properties of I.

For the definition of a representation, see Definition 8.

Examples of such transformations are functions describing curves, the conjunction

function, the disjunction function, and image coding functions.

Definition 5. A Generating Rule

is a rule for constructing an image model that

determines a strict sequence of generating transformations applied to the image in

order to construct its model.

For the definition of an image model, see Definition 9.

Definition 6. A Generating Transformation

is correct for a given image if and

only if there are generating rules according to which

ensures the construction of a

generating image model.

Note that a generating model of an image (image G-model) is constructed using a

realization of a generating image representation (G-representation). For the

definitions of a generating representation and a realization of a generating

representation, see Definitions 17 and 18.

Definition 7. A Structuring Element

{}SS∈



is a two-dimensional spatial object

whose convolution with an image yields a partition of the image into a system of

fragments suitable for local analysis. A structuring element is specified by parameters

defining its form and numerical and geometric characteristics.

Definition 8. An Image Representation

()I

ℜ

is a formal scheme for obtaining a

standardized formal description of the surfaces, point configurations, and shapes

forming the image and the relations between them.

Definition 9. An Image Model

()

is a formal image description generated by a

realization of an image representation

()I

ℜ

Definition 10. A realization of an image representation is the application of the

representation to realizations of the original image with particular parameter values

specified for the transformations involved in the representation.

{}O



{}

′



{}B



{}S



{}{}SI

′

∗



{()}

ℜ



{}



{}I

′



Fig. 2. Construction of an image model.

The construction of an image model with the help of the objects and concepts

introduced above is shown schematically in Fig. 2.

Definition 11. A Correct Representation of an Image I is an element of the set of

image representations constructed from contextual and semantic information

{}B



applying transformations

{}O



and structuring elements

{}S



, where the sets

{}B



{}O



, and

{}S



are associated with I and the set

{}S



may be empty.

Definition 12. A Correct Image Model is an element of a set of image models

generated by implementing correct image representations on the set of initial data

{}



Theorem 1. Any element m of the set

{}



generated by applying transformations

from

{}O



to the set of initial data

{}



is a correct image model.

Corollary. Given an image I, the set of correct models of I is closed under trans-

formations from

{}O



as applied to the set of initial data

{}



with the use of struc-

turing elements from the set

{}S



associated with I.

The proof of this corollary is based on Definitions 11 and 12.

All transformations applied to an image model or an image are introduced to

achieve one of the following goals: the construction of a new model; RFSR; or the

construction of an aggregate model estimate, i.e., the transition from the space of ini-

tial data to a space of estimates on which classifying decision making procedures are

implemented in image recognition.

Schemes 1 and 2 illustrate the relations between image representations and image

models.

{()} {,}:{} { }

OS I Mℜ= ⇒



(1)

12 1

{()}() { , ,..., , }():{}{} { }

Ip OO OSpI I M M

′′

ℜ= ∈⇒∈



(2)

Definition 13. A T-representation

(, )

ημ

ℜ

of an image I is a formal scheme for

deriving a standardized formal description of I. This scheme is constructed from con-

textual and semantic information

{} {}BB⊂



by applying procedural transformations

{()}{}

⊂



and structuring elements

{( )} {}SS

⊂



(where

are parameters

of the procedural transformations and the structuring elements, respectively).

The set of all correct T-representations is denoted by

{(,)}

ℜ



Scheme 3 illustrates Definition 13.

{

}

{}

{},{ } {( )},{ ( )} ( , )

SO S O

μη ημ

⎯⎯⎯→⎯⎯→ℜ



(3)

Definition 14. A Realization of a T-representation

(, )

ημ

ℜ

of an image I is the

application of

(, )

ημ

ℜ

to realizations

{} {}

′

⊂



of I, where (

ημμ

)

are parameter values chosen for the transformations involved in

(, )

ημ

ℜ

. Scheme 4

illustrates Definition 14.

00 0 0

(, ){}{(), ( )}{'}

SO I

μημ

′

ℜ∗= ∗

(4)

Proposition 1. Any T-representation

(, ) { (, )}

μημ

ℜ∈ℜ



of an image generates a

set of image T-models

{(,}

ημ

by setting parameter values

ηη

and

μμ

for the procedural transformations and structuring elements.

The set of all correct image Т-models is denoted by

{}



Proposition 2. Any image T-model

{}

MM∈



generates an image realization I',

i.e., an image I-model

{}{'}

IM I

′

≡∈ ≡



Definition 15. A P-representation

(, )

ημ

ℜ

of an image I is a formal scheme for

deriving a standardized formal description of I. This scheme is constructed from the

contextual and semantic information

{} {}BB⊂



by applying parametric transforma-

tions

{()}{}

⊂



and structuring elements

{( )} {}SS

⊂



(where

are

parameters of the parametric transformations and the structuring elements, respec-

tively).

The set of all correct P-representations is denoted by

{(,)}

ℜ



. Scheme 5

illustrates Definition 15.

{}

{},{ } {( )},{ ( )} ( , )

{}

SO S O

ηημ

⎯⎯⎯→⎯⎯→ℜ



(5)

Definition 16: A realization of a P-representation

(, )

ημ

ℜ

of an image I is the

application of

(, )

ημ

ℜ

to realizations

{} {}

′

⊂



of I, where (

ημμ

)

are parameter values chosen for the transformations involved in

(, )

ημ

ℜ

Scheme 6 illustrates Definition 16.

00 0 0

(, ){}{(), ()}{'}

ISO I

ημ η μ

′

ℜ∗= ∗

(6)

Proposition 3. Any P-representation

(, ) { (, )}

μημ

ℜ∈ℜ



generates a set of image

P-models

{(,}

ημ

by setting parameter values

ηη

and

μμ

for the

parametric transformations and structuring elements, respectively.

The set of all correct P-models of an image is denoted by

{}



Definition 17. A G-representation

(,, )

ημ

ℜ

of an image I is a formal scheme for

deriving a standardized formal description of I. This scheme is constructed from the

contextual and semantic information

{} {}BB⊂



according to generating rules

{()}R



that completely define the sequence of generating transformations

{()}{}

⊂



and structuring elements

{( )} {}SS

⊂



applied to I (here,

ημ

are parameters of the generating rules, generating transformations, and structuring

elements, respectively).

The set of all correct G-representations is denoted by

{(,,)}

ημ

ℜ



. Scheme 7

illustrates Definition 17.

{}

{},{},{} {()},{()},{()} (,,)

{}

RSO R S O

GPG

μη λημ

⎯⎯⎯→⎯⎯→ℜ





(7)

Definition 18. A realization of a G-representation

(,, )

ημ

ℜ

of an image I is the

application of

00 0

(,, )

ημ

ℜ

to realizations

{} {}

′

⊂



of I, where (

00 0

ληη μ μ

===

) are parameter values chosen for the transformations in-

volved in

(,, )

ημ

ℜ

Scheme 8 illustrates Definition 18.

(,, ){}{(),(), ( )}{'}

00 0 0 0 0

RSO I

λη μ λ η μ

′

ℜ∗= ∗

(8)

Proposition 4. Any G-representation

(,, ) { (,,)}

ημ λημ

ℜ∈ℜ



generates a set of

image G-models

00 0

{(,,}

ημ

by setting parameter values

ηη

, and

μμ

for the generating rules, generating transformations, and structuring elements,

respectively.

The set of all correct image G-models is denoted by

{}



The interrelations between the concepts discussed in Section 2 are shown in Fig. 1.

4 Conclusions

At present, automated extraction of information from images is a major strategic goal

of fundamental research in image analysis, recognition, and understanding.

DTIA was proposed and is being developed as a conceptual and logical foundation

for image analysis and recognition. It includes a collection of methods for image

analysis and recognition, methods for RIRF, a system of concepts for image analysis

and recognition, classes of DIMs, various settings in image analysis and recognition,

and a base image recognition model.

The above description of DTIA concepts and the formal apparatus introduced for

descriptive image models provide the prerequisites for the formulation of the basic

DTIA axioms. Below are examples of such axioms.

Axiom 1. Any image I can be uniquely associated with a collection of sets (

{}



{}O



{}



), where

{}



is the set of initial data,

{}O



is the set of transformations applied

{}



, and

{}



is the set of results produced by applying

{}O



{}



Scheme (9) illustrate Axiom 1:

{}:( { }) { }

OI I M≅⇒



(9)

Axiom 2. The set of transformations

{}O



is specified by a set

{}S



of structuring

elements, by a set

{}R



of generating rules, and by three subsets of transformations

{}



{}



, and

{}



, namely, (1) procedural transformations

{}



, (2)

parametric

transformations

{}



, and (3) generating transformations

{}



To be continued…

References

1. I.B. Gurevich: The Descriptive Framework for an Image Recognition Problem. Proceedings

of the 6th Scandinavian Conference on Image Analysis, Vol.1, Pattern Recognition Society

of Finland (1989) – 220 – 227

2. I.B. Gurevich, V.V. Yashina: Descriptive Approach to Image Analysis: Image Models.

Pattern Recognition and Image Analysis: Advances in Mathematical Theory and Applica-

tions, Vol.18, No.4. MAIK "Nauka/Interperiodica"/Pleiades Publishing, Inc. (2008) 518-541