Health care and social inference systems:

An unauthorized inference control based on fuzzy logic

Souhila Kaci

, Abdeslam Ali-Laouar

, and Fr

eric Cuppens

Centre de Recherche en Informatique de Lens (C.R.I.L.–C.N.R.S.)

Rue de l’Universit

e SP 16 62307 Lens France

Institut de Recherche en Informatique de Toulouse (I.R.I.T-C.N.R.S)

118 route de Narbonne 62077 Toulouse France

Abstract. In this paper, we address the problem of unauthorized inference of

conﬁdential information in the ﬁeld of health care and social information systems.

More precisely, we will focus on the problem of inference control of conﬁdential

information from statistical databases which contain information about patients

and propopse a method based on fuzzy logic to avoid unauthorized inference.

Information provided using our approach remains relevant because it is without

loss of quality.

1 Introduction

The security of information systems is a very important problem which has been mainly

addressed in military applications. This led to security policies which are applicable

only in environments which accept a rigid bluk-heading of information and services

handling this information. Indeed, these models cannot be used in other domains which

also require security policies like for example the health care domain where it is impor-

tant to guarantee the conﬁdentiality, integrity and availability of pieces of information

contained in medical ﬁles of patients. The conﬁdentiality consists in expressing who

has the right to reach which information about which, when, and possibly under which

conditions. The integrity is the property which ensures that information is modiﬁed only

by the users authorized under the conditions normally envisaged. Lastly, the availabil-

ity is the aptitude of an information system for being able to be employed by the users

competent under the conditions of accesses and use normally envisaged.

In this paper, we particularly address the problem of security of information systems

in the ﬁeld of health care and social. Let us note that in spite of the development of

security policies in this context [6, 7], it is always possible for an external attacker and,

especially, for an internal user badly disposed, to try to circumvent the mechanisms of

access control to the resources in order to attack the conﬁdentiality, the integrity or the

availability of information.

To prevent the infringements against the intimacy of the patients, the medical databases

must protect not only conﬁdential information, but also information not explicitly con-

ﬁdential which can be employed to obtain conﬁdential information. This paper treats

Kaci S., Ali-Laouar A. and Cuppens F. (2004).

Health care and social inference systems: An unauthorized inference control based on fuzzy logic.

In Proceedings of the 2nd International Workshop on Security in Information Systems, pages 217-226

DOI: 10.5220/0002674902170226

 SciTePress

detection and the limitation of the situations for which there is a risk of illegal infer-

ence (called also illegitimate inference). This problem is called unauthorized inference

problem. It can also be simply deﬁned in the following way. Suppose that a user is au-

thorized to access to some information. The crucial question now is: can this user use

this information to deduce a conﬁdential information for which she would not have the

right of access? A possible solution to this problem is to refuse to answer when this

may allow to deduce conﬁdential information however this solution is not interesting

because it does not respect the availability condition. Another possible solution is the

use of false answers for users having a restricted access to the information system. In-

deed this method allows to protect conﬁdential information by providing false but not

very signiﬁcant answers. The problem of this method is that the user to whom one pro-

vides false answers can make bad decisions. It is also difﬁcult to provide a coherent set

of false answers. The solution that we propose in this paper does not consist to provide

a false answer to the user but a ”vague” information formalized in fuzzy logic [8, 4].

Section 2 describes the problem of illegitimate information from databases contain-

ing information about the patients. We also describe a well-known method to attack

such databases. In section 3, we ﬁrst present the general principle of our approach. We

then give some necessary background on fuzzy logic on which our approach is based.

Lastly, section 4 gives a detailed description of our approach.

2 Illegitimate inference in statistical databases

The main difference between a statistical database (SDB for short) and a traditional one

relates to the interrogation interface more limited in the SDB. The queries on a SDB are

limited to operations like counting (COUNT), sum (SUM), the average (AVG) and other

statistical calculus, which are carried out on subsets of data. Although these operations

seem to be without consequence, it should be made sure that signiﬁcant information on

the individuals are not revealed. This problem becomes particularly difﬁcult if we ac-

cept the possibility that a sequence of general queries, each one by itself does not allow

to deduce conﬁdential information, can be employed to deduce signiﬁcant information.

Let us now give an example to illustrate the difﬁcult nature of the inference problem

in the statistical databases. We consider a database, given in Table 1, which contains

Table 1. Example of a statistical database.

Name Sex Age Department salary

Jean M 27 Mathematics 2.000

Thomas M 43 computer science 3.000

Name Sex Age Department salary

Isabelle F 27 Mathematics 2.600

Justine F 31 computer science 3.200

information concerning the employees. Let us suppose that the policy of the company

imposes that the salary of the employees is a conﬁdential information which should

not be revealed. To achieve this goal, the database does not return an answer to a query

like: how much is the salary of the employee whose name is Isabelle? since the answer

218

is conﬁdential. Similarly, the base does not answer any query when, for example, the

average is calculated on the basis of a simple record, i.e. a query concerning only one

individual. Consequently, it refuses to answer for example the query: how much is the

average salary of the women employees who work for the computer science depart-

ment? because the average here is calculated from only one record.

A query on a SDB R consists to compute a subset of R using a characteristic formula

C, which is a logical formula built from the values of the attributes of R by using the

logical operators ∧ (and), ∨ (or), and ¬ (not). For example, the subset of records rep-

resenting the women employees who work for the computer science department, can be

represented by the following characteristic formula:

C = (sex=F) ∧ (department=computer science).

The set of records which satisfy the characteristic formula C, denoted by X

, is called

the result of the query. Applying the formula C on the relation R given in Table 1, we

get: COUNT (C) = 1, AV G(Age, C) = 31 and SUM(Salary, C) = 3200.

Generally, a statistical query taken separately does not allow to deduce conﬁdential

information. For this reason, a user with good intentions should be able to form any

interesting characteristic formula, and to carry out any statistical measurement on the

resulting set of the records. However, it is possible that a user forms statistical queries

which can be employed to deduce speciﬁc values of a ﬁeld of the database, which is not

acceptable if the values represent conﬁdential information. In this case, we say that the

database has been compromised.

A characteristic formula used in order to compromise a database is called a tracker

[2, 3]. This formula is chosen so that it gives as a result a set X

whose size is equal

to 1. Denning et col. [2] have shown that for any real database, a tracker can always be

found.

In the next section, we propose a new strategy to prevent attacks based on trackers.

3 Our approach

In the everyday life and particularly in the medical ﬁeld, medical analyses are gener-

ally expressed by linguistic descriptions (Example: Temperature of the body is raised,

normal, etc). This is especially used for the non-specialists in the medical ﬁeld. In this

paper, we take as a starting point this method to deal with the illegitimate inference

problem in statistical databases. More precisely, we replace the results of the statistical

queries (quantitative answers) by linguistic descriptions (qualitative answers) in order

to limit the risk of illegitimate inference.

For this, our idea consists in replacing the numerical answers (e.g. numbers of patients

= 10) by linguistic descriptions (e.g. medium) formalized in fuzzy logic framework.

Intuitively, each numerical answer is associated to a given class then a qualitative an-

swer is associated to each class. Thus, the formalization of our approach requires two

steps: classiﬁcation and fuzziﬁcation. Let us recall these two concepts:

– Classiﬁcation is the procedure which consists in decomposing the scale of the used

numerical values into non-empty classes so that each numerical value belongs to

one and only one class.

Let I be a set of elements. We say that Q(I) is a partition of I if there exists a set

219

, q

, · · · , q

} satisfying the following conditions:

i=1,···,k

= I with q

6= ∅ and q

∩ q

= ∅ for i 6= j.

To be relevant, a partition should be made up of deﬁnitely individualized classes.

Among existing classiﬁcation methods, we recall one method, that we will use later,

based on the aggregation around the centers using a ﬁxed number of classes. The

principle of this method is to determine a partition of I composed of k classes, the

number k being ﬁxed a priori by the user of the method. k centers c

, · · · , c

are

chosen which are either arbitrarily points in the space of the variables, or elements

of the set I.

Each element of the set I is associated to one and only one class whose center is

one of the k centers c

, · · · , c

according to the following assignment rule:

i belongs to the class q

of center c

iff ||i − c

|| = min

l=1,···,k

||i − c

||.

After the classiﬁcation step, we have to associate an appropriate linguistic variable

to each class. For example, if the numerical scale corresponds to the temperature

then the linguistic variable which corresponds to the interval [20, 25] may be tepid.

This can be formalized in fuzzy logic [8].

– Fuzziﬁcation: A principal characteristic of the human reasoning is that it is based

on vague or incomplete data. Thus, to determine if a temperature is hot or cold

is easy for any individual without necessarily knowing its exact value. Fuzzy logic

has the aim of studying the representation of vague knowledge and the approximate

reasoning. A principal characteristic of fuzzy logic is that an object may belong to

a set and at the same time to its complement. Thus, a temperature of 22 may at the

same time be hot and not hot.

A linguistic variable is a triple (X, V, F

), where X is a variable (age, temper-

ature, etc) deﬁned on a set of reference V (the set of integers, reals, etc). F

, A

, · · ·} is a ﬁnite or inﬁnite set of subsets of V used to characterize X (old,

young, hot, cold, etc). Each fuzzy subset represents a linguistic description.

The variable may belong to one or more subsets of this element of reference. For

example, the temperature T = 28 may belong to the subset ”pleasant” but may also

belong partly to the subset ”hot”.

The membership relation between a variable and a subset is called membership

function. In other terms, we speak about the membership degree of a variable x to

a subset F , denoted by µ

(x).

A fuzzy set F of universe Ω (a fuzzy subset of Ω) is deﬁned by a membership func-

tion µ

which associates to each element x of Ω a value in the interval [0, 1].

: Ω → [0, 1]

x 7→ µ

(x)

(x) represents the membership degree of x to the set F . By deﬁnition, if µ

(x) =

0 then x does not belong to F and more µ

(x) approaches 1, more the value x be-

longs to F . If µ

(x) = 1 then x belongs completely to F .

A fuzzy subset is said to be convex if and only if:

∀x, y; x > y, ∀z ∈ [x, y], µ

(z) ≥ min(µ

(x), µ

(y)).

Generally, we express numerical quantities by vague linguistic descriptions such as

”approximately 100”. The results of fuzzy measurements or an error analysis are

220

modelled by fuzzy sets called fuzzy quantities. A fuzzy quantity Q is a fuzzy set in

the universe R of real numbers. It is supposed to be normalized.

A fuzzy interval N is a convex fuzzy quantity. It is a generalization of a real interval

whose extremities are fuzzy in order to model concepts such as ”approximately”,

”roughly”, etc.

– Representation of a L-R fuzzy interval A fuzzy interval of type LR has a mem-

bership function built from a quadruplet A = (m

, m

, a, b), where m

, m

, a and

b are strictly positive real numbers, and of two functions L and R from R

into the

interval [0, 1] semi-continuous, non-increasing and satisfying the conditions:

– L(0) = R(0) = 1,

– L(1) = 0 or ∀x ∈ R

, L(x) > 0 and lim

x→+∞

L(x) = 0,

– R(1) = 0 or ∀x ∈ R

, R(x) > 0 and lim

x→+∞

R(x) = 0.

The membership function is deﬁned as follows:

(x) =











−x

) if m

− a ≤ x ≤ m

1 if m

< x < m

x−m

) if m

≤ x ≤ m

+ b

0 if x < m

− a or x > m

+ b

When m

= m

= m, the fuzzy interval P = (m, m, a, b)

is called a fuzzy

number, denoted by P = (m, a, b)

and whose membership function is deﬁned

as follows:

(x) = L(

m−x

)if x < m, µ

(x) = 1if x = m and µ

(x) = R(

x−m

)if x > m.

Let P

= (p

, α

, β

)

and P

= (p

, α

, β

)

be two LR-fuzzy numbers.

Then the addition ⊕, the substraction ª and multiplication ⊗ are deﬁned by [4]:

⊕ P

= (p

+ p

, α

+ α

, β

+ β

)

ª P

= (p

− p

, α

+ α

, β

+ β

)

⊗ P

= (p

∗ p

, p

∗ α

+ p

∗ α

, p

∗ β

+ p

∗ β

)

Contrary to the addition and subtraction, the multiplication P

⊗ P

is not of

type LR. An approximate value of type LR is given when P

and P

have a

support included in R

, α

and β

are small w.r.t. p

and, α

and β

are small

w.r.t. of p

To apply a linguistic representation to a quantitative variable, the principle consists

in breaking up all possible values of the given quantitative variable into subsets (a

set of classes of values), so that the borders of the classes are not clearly given. This

treatment allows to transform a numerical input into a fuzzy subset. The decompo-

sition should not be arbitrary but founded on criteria, such as the homogeneity of

the classes, the uniform partition of the universe, the subsets are totally ordered.

These subsets are also called ”linguistic variables”.

The subsets are characterized by their associated membership functions; we asso-

ciate a membership function to each subset. Their positions and overlappings can

be chosen arbitrarily provided that the following conditions are veriﬁed: their form

should be convex, the subsets (often in the form of trapezoid) should be partially

overlapped so that there are no unspeciﬁed ranges and lastly to avoid to imbricating

more than two subsets.

221

µµ

−

Fig. 1. Representation of the temperature in fuzzy logic.

Example 1. Let us consider the temperature input T = 31. According to the mem-

bership function given in Figure 1, we obtain the following values:

(very cold temperatures) = µ

(cold temperatutres) = 0, µ

(pleasant temperatures) =

.6, µ

(hot temperatures) = .35 and µ

(very hot temperatures) = 0.

Now, it seems important to answer some questions : How many classes is it nec-

essary to represent each quantitative variable? Which are the best linguistic val-

ues for each class? For the ﬁrst question, more the number of linguistic values is

high, more the partitioning quality is good. It is necessary however that the rate:

Card(Ω)/Number of Partitions is not equal to 1, otherwise this simply means that

there is no fuzziﬁcation.

For the second question, we compute the membership degree of each element x to

all the subsets F

of the universe Ω. Let µ

(x) be the membership degree of x to

. We say that x ∈ F

only if ∀F ∈ Ω, µ

(x) ≤ µ

(x).

4 Detailed description of our approach

The principle of our method consists, in a ﬁrst step, to decompose the set of values of

the conﬁdential attributes into subclasses of values. Each subclass contains values ac-

cording to a given criterion. In this paper, we will use the classiﬁcation method based

on a ﬁxed number of classes.

After the classiﬁcation into subclasses the fuzziﬁcation comes. We transform each class

into a fuzzy quantity i.e., a fuzzy number with a membership function. Then, we as-

sociate a linguistic variable to each number (small, large, etc). Next, for each answer

provided by the database management system, we compute the membership degree of

this answer to each fuzzy subset (linguistic variables). The answer of our system is the

linguistic variable which has the highest membership degree.

Let us note that the simplest version of a statistical query SQL is written as follows:

SELECT f( <attributes>) FROM <relations> WHERE <conditions>,

where f is a statistical function such as Sum, Avg, Count, etc.

In this paper, we focus on queries which compute statistical quantities, i.e. queries

which deduce information on aggregation such as sum, average, max and min.

Let us consider the example of relation R (patient, H/F, age, sickness insurance com-

pany, leucocyte rate) given in the Table 2 (borrowed from [5]).

The number of patients is 10 and the normal leucocyte rate in mm3 of blood is 4500.

In this example, we suppose that the leucocyte rate is a conﬁdential attribute. To control

the illegitimate inference on this attribute, we will transform the answers to the queries

222

Table 2. Example of a database.

Patient M/F Age Sick. ins. Leucocyte

Dufour M 45 MAAF 4000

Dulac F 35 MMA 7000

Dulon M 55 MGEN 3500

Dumas M 40 Rempart 3800

Dumont M 38 MMA 7500

Patient M/F Age Sick. ins. Leucocyte

Dupont M 30 MMA 6000

Dupr F 32 IPECA 7200

Dupuis F 50 MGEN 6800

Durand F 25 LMDE 3000

Duval M 45 IPECA 5500

concerning this attribute by giving qualitative answers.

We proceed in the same way for the answers to the queries which compute the number

of patients who verify a given condition. For this, we fuzzify the number of patients and

the leucocyte rate.

Let us start with the number of patients and decompose this variable as follows: A ﬁrst

We now transform each class into a fuzzy number A

(m, a, b) where m is the center of

the class, a and b represent the degrees of inaccuracy.

For each number, we associate a linguistic variable (see also Figure 2-a):

– The ﬁrst class is fuzziﬁed by the fuzzy number ”small” = (2, 2, 2)

– The second class is fuzziﬁed by the fuzzy number ”medium” = (5, 2, 2)

– The third class is fuzziﬁed by the fuzzy number ”great” = (8, 2, 2)

µµ

1000

2000

3000

4000

5000

6000

7000

cce

ood

Fig. 2. (a) Fuzziﬁcation of the number of patients. (b) Fuzziﬁcation of the leucocyte rate.

We now classify the leucocyte rate for a patient as follows:

– 1

– 3

We now propose the following fuzziﬁcation (see also Figure 2-b):

– The ﬁrst class is fuzziﬁed by the fuzzy number ”weak” = (2000, 1000, 1000)

– The second class is fuzziﬁed by the fuzzy number ”acceptable” = (3500, 1000, 1000)

– The third class is fuzziﬁed by the fuzzy number ”good” = (5000, 1000, 1000)

– The fourth class is fuzziﬁed by the fuzzy number ”high” = (6000, 1000, 1000)

Let us now suppose that a user is authorized to carry out statistical queries and she

wants to discover the leucocyte rate of ”Dulon”. Let us also suppose that this user knows

moreover that ”Dulon” has the MGEN as a sickness insurance company. Consider now

the following queries:

1) SELECT Count(Patient) FROM R WHERE M/F=’M’ AND Sick. ins. =’MGEN’

223

Result = 1 (R

)

Let us compute the membership degrees µ

) of the result (R

) w.r.t. each

fuzzy subset. We get: µ

small

) = 1, µ

medium

) = 0 and µ

great

) = 0.

So the answer provided after fuzziﬁcation is ”small” since it corresponds to the

highest membership degree.

2) SELECT AVG(Leucocyte) FROM R WHERE M/F=’M’ AND Sick. ins. = ’MGEN’

Result = 3500 (R

)

We compute the membership degrees µ

) of the result (R

) w.r.t. each fuzzy

subset: µ

weak

) = µ

good

) = µ

high

) = 0 and µ

acceptable

) = 1.

Then the answer is ”acceptable”.

Note that the deduction of conﬁdential information when handling numerical an-

swers is very easy. It is clear that from (R

) and (R

), the user may directly deduce

that the leucocyte rate of ”Dulon” is equal to 3500.

The case of the qualitative answers is less simple: from (R

), the user knows that

the size of the set to which ”Dulon” belongs is ”small”, and from (R

), she deduces

that their average of the leucocyte rate (the set ”small”) is ”acceptable”.

Let us now see what may the user deduce from these two information. For this, we

know that the average is deﬁned by the equation

x =

. It is clear that when

n is equal to 1, the average is equal to x

. To see the impact of the fuzziﬁcation on

the reasoning of the user, we will analyze the use of the fuzziﬁcation step by step:

– Let us suppose that the number of patients is not fuzziﬁed whereas the leucocyte

rate is. The answer given to the user in this case is then: the number of patients

is equal to 1 (as an answer to the query (R

)) and their average leucocyte rate

is ”acceptable”, which allows to deduce that the leucocyte rate of Dulon is

”acceptable”. However, the fuzziﬁcation of the number of patients makes that

the answer provided to the user (also as an answer to the query (R

)) is ”small”,

which does not allow to know precisely how many patients correspond to this

answer ”small”.

– Let us now suppose that the user knows moreover that the maximum size of the

fuzzy quantity ”small” is equal for example to two. However even if the user

has this information, she deduces nothing as we will show on the following

example: It is known that an ”acceptable” leucocyte rate lies between 3000

and 4500. Let the size of ”small” be equal to 2. From a leuycocyte average

of two patients x

and x

equal to ”acceptable”, we may have the following

possibilities for x

and x

– x

= 2500 ≡ ”weak”

, x

= 4000 ≡ ”acceptable”

– x

= 2500 ≡ ”weak”, x

= 5000 ≡ ”good”

– x

= 1500 ≡ ”weak”, x

= 6500 ≡ ”high”

– x

= 3500 ≡ ”acceptable”, x

= 5000 ≡ ”good”

– x

= 3500 ≡ ”acceptable”, x

= 4000 ≡ ”acceptable”.

From these results, one can say that from a leucocyte average equal to ”accept-

able” computed for two patients, one concludes nothing on the leucocyte rate

of one of the two patients.

The equivalence means here that the number (e.g. 2500) corresponds to the given class (e.g.

”weak”) after fuzziﬁcation.

224

Note that to have an average rate ”acceptable”, we have ﬁve possibilities for

the leucocyte rate for each of the two patients. In only one case, the rate of

the two patients is ”acceptable”. In the other cases, it varies between ”Weak”,

”acceptable”, ”Good” and ”High”. So we have four cases with x

or x

equal

to ”acceptable” and six cases different from ”acceptable”.

Then we may say that it is totally possible that the leucocyte rate of ”Dulon”

is equal to ”acceptable”, but it is also totally possible that it is different from

”acceptable”. Indeed, we are in a situation of total ignorance.

Let us note that in the real case, the database may contain thousands of patients and

the fuzzy quantity ”small” may reach several hundreds of patients. Consequently,

the possibilities of inference are even weaker when the cardinality which corre-

sponds to the fuzzy quantity is larger. The user deduces nothing on the leucocyte

rate of ”Dulon” when all the possible cases are considered.

3) SELECT Count(Patient) FROM R. Then, Result = 10 (R

)

The answer after fuzziﬁcation is ”great” (R

)

The user tries thereafter to know the number of patients different from ”Dulon”. For

this, she gives the following query:

4) SELECT Count(Patient) FROM R WHERE NOT (M/F=’M’ AND Sick. ins.=’MGEN’);

Result = 9 (R

)

The answer after fuzziﬁcation is ”great” (R

)

From these two answers, the user may construct the following reasoning: The dif-

ference between (R

) and (R

) (10-9=1) corresponds to the number of male pa-

tients who have the MGEN as a sickness insurance company (i.e., the number of

patients having the same properties as ”Dulon”).

With a similar reasoning, she concludes that the difference between (R

) and (R

)

is equal to

| ”great” ª ”great” |= | (8, 2, 2)

ª (8, 2, 2)

| =| (8, 2, 2)

⊕

(−8, 2, 2)

| =| (0, 4, 4)

| which is equivalent to (0, 0, 4)

after removing

the negative part, since there is no negative leucocyte rate.

So we have (R

) ª (R

) ∼ ”small”. Indeed, we get the same result as for (R

)

after fuzziﬁcation.

To know the average of the leucocyte rate for all the patients, the user gives the

following query:

5) SELECT AVG(Leucocyte) FROM R. Then, Result = 5430 (R

)

The answer after fuzziﬁcation is ”good” (R

)

To compute the average of the leucocyte rate of all the patients different from ”Du-

lon”, the user gives the following query:

6) SELECT AVG(Leucocyte) FROM R WHERE NOT (M/F=’M’ AND Sick. ins.=’MGEN’),

Result = 5644 (R

)

The answer after fuzziﬁcation is ”high” (R

)

In the case of numerical answers, to know the leucocyte rate of ”Dulon”, the user

computes the following value: 10 ∗ 5430 − 9 ∗ 5644 = 3500.

With a similar reasoning, in the case of qualitative answers, she may try to proceed

Since the values are not known a priori but supposed to be positive, the subtraction is translated

into fuzzy logic by the absolute value.

225

in the following way. The leucocyte rate of ”Dulon” is equal to:

| ((R

) ⊗ (R

)) ª ((R

) ⊗ (R

)) | ∼| (−8000, 38000, 38000)

From the obtained number, the user deduces nothing because the leucocyte rate is

never negative. Even if she can deduce some information (if the fuzziﬁcation is

changed), the situation is similar to the ﬁrst case since the user does not know the

exact number of patients. Let us also note that we lost the precision on the com-

putation of the leucocyte rate because of the multiplication which we carried out

(recall that in the case of the multiplication, the computation is only approximate).

We have shown on this example that the user may use different ways to deduce con-

ﬁdential information however the use of qualitative answers makes difﬁcult the imple-

mentation of attacks by trackers because after fuzziﬁcation, it is difﬁcult to identify the

individual concerned by the conﬁdential information. Indeed, required information is

not distinguished after fuzziﬁcation.

5 Conclusion

We have proposed a ﬁrst attempt to limit the risk of inference of conﬁdential information

from a database using fuzzy logic. It is difﬁcult to afﬁrm here that we eliminate any

risk of illegal inference. The goal is nevertheless to continue to answer the queries as

well as possible using non-conﬁdential information. So our aim is to limit at least as

possible the restrictions of legitimate access on databases while ensuring that the risk

of unauthorized inference remains below an acceptable threshold.

An immediate prospect for this work would be to implement our approach and to

validate it on great databases. We showed in this paper that our approach particularly

enables us to control the attacks by trackers. We expect to see how this approach could

be used to control other types of attacks like linear systems [2, 1]. Lastly, it would be

interesting to see to what extent our approach is sensitive to the classiﬁcation method

used, i.e. to see if the use of other classiﬁcation methods give sensitively different re-

sults.

References

1. F. Cuppens. A logical analysis of authorized and prohibited information ﬂows. In IEEE Sym-

posium on Research in Security and Privacy, 1993.

2. D. Denning, P. Denning and Schwartz. The tracker: A Threat to Statistical Database Security.

ACM Transactions on Database Systems, 4(1): 76-96, 1979.

3. D. Denning and J. Schlorer. A Fast Algorithm for Calculating a Tracker in Statistical Databse.

ACM Transactions on Database Systems, 5(1), 1980.

4. D. Dubois and H. Prade. La logique ﬂoue. In Quaderni, 50-73, 1995.

5. A.A. El Kalam. MP6, Sous-projet 3: Politiques de scurit pour les SICSS. Informations protger

et menaces. Rapport technique.

6. R. Sandhu, E. Coyne, H. Feinstein and C. Youman. Role-based access control models. IEEE

Computer, 29:38-47, 1996.

7. S. Solms. The management of computer security proﬁles using a role-oriented approach. Com-

puter and Security, 13(8), 673-680, 1994.

8. L. Zadeh. Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets and Systems, 1, 3-28,

1978.

226