lows to determine the location of the transform coef-
ficients and the amount that they can variate without
being noticeable in the spatial domain.
In the watermark embedding scheme in
(Podilchuk and Zeng, 1998), the watermark is
modulated by the JND, and the coefficients are
marked whenever they are greater than the JND
threshold, i.e.
b
X
w
(u,v) =
b
X(u,v) + J(u, v)w(ℓ)
b
X(u,v) > J(u,v)
b
X(u,v) othewise
(1)
where
b
X(u, v) and
b
X
w
(u,v) are the DWT coefficients
of the original image and the watermarked image re-
spectively, J(u, v) is the JND matrix at the u, v fre-
quency in the DWT domain, and w(ℓ) is a zero mean,
unit variance, normally distributed random sequence.
In this way, the watermark weighted by the JND
thresholds has lower power than the maximum power
that can be inserted without causing noticeable distor-
tions in the image.
The JND thresholds are computed based on the
perceptual model of the Human Visual System (HVS)
introduced in (Watson et al., 1997). This model takes
into account frequency sensitivity, local luminance
and contrast masking effects to determine an image-
dependent quantization matrix, which provides the
maximum possible quantization error in the DWT co-
efficients which is not perceptible by the HVS.
In the watermark detection scheme the JND are
calculated using the original image, then, the DWT
coefficients of the original image are subtracted from
the ones of the image suspected to be watermarked,
and this difference is divided by the JND in order to
obtain the received watermark. The correlation be-
tween the extracted watermark and the original one
is then performed and the maximum value is deter-
mined, i.e.
w
e
(ℓ) =
b
X
w
(u,v) −
b
X(u, v)
J(u, v)
if
b
X(u, v) > J(u, v) (2)
r
w,w
e
=
w
e
(ℓ) ∗w(−ℓ)
E
w
e
.E
w
(3)
where E
w
e
and E
w
are the energies of the extracted
watermark sequence, w
e
(ℓ), and the original water-
mark sequence, w(ℓ), respectively.
The following modification to the IADWT inser-
tion scheme in (1) can be introduced
b
X
w
(u,v) =
b
X(u,v) + J(u, v)w(ℓ)
b
X(u,v) > J(u,v) > T
b
X(u,v) othewise
(4)
This modified insertion scheme will be hereafter
denoted as IADWT
T
. The rationale for the con-
strain J(u, v) > T is that when the JND thresholds
are too small, the magnitude of the marking term in
(4) becomes negligible. The introduction of the lower
bound T has then the advantage of reducing the wa-
termark length, improving in this way the fidelity and
also the robustness, as will be illustrated in section 5.
The detection scheme in (2) has to be modified
to take into account the modification in the insertion
scheme, as follows
w
e
(ℓ) =
b
X
w
(u,v) −
b
X(u, v)
J(u, v)
if
b
X(u, v) > J(u,v) > T
(5)
3 FIDELITY EVALUATION
In the evaluation of image watermarking methods it
may be of interest to judge the fidelity of the wa-
termarked image, that is the similarity between the
images before and after the watermark insertion. To
avoid the dependence on human judgement in the fi-
delity evaluation, it would be desirable to objectively
quantify the fidelity of watermarked images based on
a metric that takes into account the characteristics of
the HVS.
Image fidelity metrics appeared in the context of
imaging applications to quantify the distortion in im-
ages produced by image processing algorithms such
as compression, halftoning, printing, etc. Different
metrics have been proposed in the literature to mea-
sure image distortion (Winkler, 2005), (Zhang et al.,
2004)). Among them, the ones based on the charac-
teristics of the HVS have proved to deliver the best re-
sults, since they take into account the different sensi-
tivity of the human eye for color discrimination, con-
trast masking and texture masking.
A metric widely used to measure image fidelity
is the S-CIELAB metric (Zhang, 1996) (based on
CIE94 (CIE: International Commission on Illumina-
tion, 1995)) that specifies how to transform physi-
cal image measurements into perceptual differences
(∆E
94
) and incorporates the different spatial sensitiv-
ities of the three opponent color channels. In (Zhang
and Wandell, 1998) the authors test how well the S-
CIELAB metric predicts image fidelity for a set of
color images by comparison with the widely used root
mean square error (RMSE) computed in un-calibrated
RGB values.
Since the S-CIELAB metric takes into account the
perceptual characteristics of the HVS, such as color
discrimination, different spatial sensitivity, etc., this
metric represents a natural choice for the quantifica-
tion, in an objective way, of the fidelity of the water-
marked image. To the best of the authors’ knowledge,
SIGMAP 2007 - International Conference on Signal Processing and Multimedia Applications
394