A DISTORTION FREE WATERMARK FRAMEWORK
FOR RELATIONAL DATABASES
Sukriti Bhattacharya and Agostino Cortesi
Dipartimento di Informatica, Universita Ca’ Foscari di Venezia, Via Torino 155, 30170 Venezia, Italy
Keywords:
Database watermarking, HMAC, Abstract interpretation.
Abstract:
In this paper we introduce a distortion free invisible watermarking technique for relational databases. The
main idea is to build the watermark after partitioning tuples with actual attribute values. Then, we build hash
functions on top of this grouping and get a watermark as a permutation of tuples in the original table. As the
ordering of tuples does not affect the original database, this technique is distortion free. Our contribution can
be seen as an application to relational databases of software watermarking ideas developed within the Abstract
Interpretation framework.
1 INTRODUCTION
A watermark can be considered to be some kind of
information that is embedded into underlying data
for tamper detection, localization, ownership proof,
and/or traitor tracing purposes (Agrawal et al., 2003).
Watermarking techniques apply to various types of
host content. Here, we concentrate on relational
databases. Rights protection for such data is crucial in
scenarios where data are sensitive, valuable and nev-
ertheless they need to be outsourced. A good exam-
ple is data mining application, where data are sold in
pieces to parties specialized in mining it, e.g. sales
patterns database, oil drilling data, financial data.
Other scenarios involvefor example online B2B inter-
actions, e.g., airline reservation and scheduling por-
tals, in which data are made available for direct, in-
teractive use. Given the nature of most of the data,
it is hard to associate rights of the originator over it
(Lafaye, 2007). Watermarking can be used to solve
these issues. Unlike encryption and hash description,
typical watermarking techniques modify ordinal data
as a modulation of the watermark information and
inevitably cause permanent distortion to the original
data and therefore cant not meet the integrity require-
ment of the data as required in some applications.
The first well-known database watermarking
scheme for relational databases was proposed by
Agrawal and Kiernan (Agrawalet al., 2003) for water-
marking numerical values. The fundamental assump-
tion is that the watermarked database can tolerate a
small amount of errors. Since any bit change to a
categorical value may render the value meaningless,
Agrawal and Kiernan’s scheme cannot be directly ap-
plied to watermarking categorical data. To solve this
problem, Sion (Sion, 2004) proposed to watermark a
categorical attribute by changing some of its values to
other values of the attribute (e.g., ’red’ is changed to
’green’) if such change is tolerable in certain applica-
tions.
All of the work cited so far, assume that minor dis-
tortions caused to some attribute data can be tolerated
to some specified precision grade. However some ap-
plications in which relational data are involved can-
not tolerate any permanent distortions and data’s in-
tegrity needs to be authenticated. To meet this re-
quirement, we further strengthen this approach and
propose a distortion free watermarking algorithm for
relational databases based on reordering tuples. The
robustness of the proposed watermarking obviously
depends on the size of the individual groups so it is
specifically designed for large databases. The result-
ing watermark is robust against various forms of ma-
licious attacks and updates to the data in the table.
Database watermarking consists of two basic pro-
cesses: watermark insertion and watermark detection
(Agrawal et al., 2003), as illustrated in Figure 1. For
watermark insertion, a key is used to embed water-
mark information into an original database so as to
produce the watermarked database for publication or
distribution. Given appropriate key and watermark
information, a watermark detection process can be
applied to any suspicious database so as to deter-
mine whether or not a legitimate watermark can be
229
Bhattacharya S. and Cortesi A. (2009).
A DISTORTION FREE WATERMARK FRAMEWORK FOR RELATIONAL DATABASES.
In Proceedings of the 4th International Conference on Software and Data Technologies, pages 229-234
DOI: 10.5220/0002256402290234
Copyright
c
SciTePress