Non-local Huber Regularization for Image Denoising

A Hybrid Approach of Two Non-local Regularizations

Suil Son, Deokyoung Kang and Suk I. Yoo

School of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea

Keywords:

Denoising, Non-local Means, Total Variation Regularization, Non-local Total Variation Regularization,

Non-local H

Regularization.

Abstract:

Non-local Huber regularization is proposed for image denoising. This method improves the non-local total

variation regularization and the non-local H

regularization approaches. The non-local total variation regular-

ization preserves edges better than the non-local H

regularization; however, it leaves a little noise. In contrast,

the non-local H

regularization eliminates noise better than the non-local total variation regularization; how-

ever, it blurs edges. To take both advantages of the two methods, the proposed method applies the non-local

total variation to large non-local intensity differences and applies the non-local H

regularization to small

non-local intensity differences. A boundary value to determine whether the intensity difference comes from

edges or noise is also suggested. The experimental results of the proposed method is compared to the result

from the non-local total variation regularization and to the result from the non-local H

regularization; The

effect of the boundary value is illustrated as PSNR changes with respect to the various values of the boundary

values.

1 INTRODUCTION

Denoising has been consistently studied in the im-

age processing ﬁeld. As a method to enhancing the

quality of images, it is widely applied to many image

processing applications’ pre-processing step. Since a

good image quality enables the better result of the ap-

plications, the denoising is still active research area.

Smoothing is a primitive denoising method (Lin-

denbaum et al., 1994). A noisy image usually con-

tains large intensity differences between neighbour-

ing pixels; The large differences can be eliminated by

the Gaussian convolution. Therefore, eliminating the

noise of an image, the convolution can reduce the de-

gree of the intensity differences between neighbours.

The convolution however blurs the edge of images.

Though an anisotropic ﬁlter can reduce this blurring

effect (Perona and Malik, 1990), such a simple ﬁl-

tering approach still has a difﬁculty in restoing detail

textures of complex images.

Deﬁning denoising as optimization of a signal,

approaches reconstructing the signal have been pro-

posed. Under this formulation, the Tikhonov regu-

larization (Tikhonov and Arsenin, 1977) and Wiener

ﬁlter(Yaroslavsky, 1985) method have been solutions

to the signal reconstruction problem. Rudin, Osher,

and Fatemi also proposed the total variation mini-

mization (Rudin et al., 1992) having good properties

such as preserving edges while eliminating noises.

This method has been widley extended in many re-

searches (Chambolle, 2004; Osher et al., 2003; Osher

et al., 2005; Marquina, 2009).

Transforming an image into another domain and

analyzing it have been also suggested. Fourier trans-

form, discrete cosine transform, and the wavelet

transform are the methods (Yaroslavsky, 1996;

Donoho and Johnstone, 1995; Donoho, 1995; Tem-

izel and Vlachos, 2005). Applied by these meth-

ods, various components in an image can be separated

in the transformed domain. Based on this property,

the noise component was selectively eliminated, and

the denoised images were reconstructed by an inverse

transform from the remained components. The trans-

form approaches however have distorted images re-

sulting in some artifacts.

Statistical approaches have been also applied to

denoising problems. Assuming that a distribution of

noise obeys a zero mean Gaussian distribution, noise

can be successfully eliminated by taking a statistical

mean image if there are several images for a single

scene. Even though given just a single image, the sta-

tistical approach can be still applicable using small

554

Son S., Kang D. and I. Yoo S. (2013).

Non-local Huber Regularization for Image Denoising - A Hybrid Approach of Two Non-local Regularizations.

In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods, pages 554-559

DOI: 10.5220/0004263705540559

 SciTePress

image patches located at several parts of the image.

Buades et al. have suggested a non-local means al-

gorithm calculating similarity weighted intensity for

each pixel in an image(Buades et al., 2005). To de-

termine an intensity of each pixel in a denoised im-

age, several image patches are selected, weighted by

a similarity between the patch whose center is the

target pixel, and then averaged. Since this method

preserves a detail texture of a patterned image, many

approaches to extend it have been proposed(Buades

et al., 2006; Gilboa and Osher, 2008; Lee et al., 2011).

The non-local means algorithm has been extended

to combine various regularizations in the optimization

ﬁled (Gilboa et al., 2006; Peyre et al., 2008; Lou et al.,

2010). In these works, the non-local total variation

and the non-local H

norm are deﬁned to link the tra-

ditional optimization problem and the non-local ap-

proach. The non-local total variation regularization

particularly showed the good restoration of detailed

texture while preserving edges.

In this paper, we propose a more improvement in

non-local regularization. The proposed non-local Hu-

ber regularization is the improvement of the non-local

total variation regularization and the non-local H

regularization. Though the non-local total variation

regularization preserves edges, it sometimes leaves

small noise. To completely eliminate this small noise,

the non-local H

regularization is better than the non-

local total variation regularization. However, the non-

local H

regularization sometime blurs edges com-

pared to the non-local total variation regularization.

Both of the noise and the edge cause intensity differ-

ences between pixels. If the distinction of the cause

between the noise and the edge is possible, the noise

can be eliminated by the non-local H

regularization

and the edge can be preserved by the total variation

regularization. Given a boundary value by which the

distinction is possible, the proposed non-local Huber

regularization applies a linear penalty, the non-local

total variation regularization, for larger values than

the boundary and a quadratic penalty, the non-local

regularization, for smaller values. A guideline to

selecting the boundary value between the linear and

the quadratic regularization is also suggested.

The remainder of this paper is organized as fol-

lows: The related denoising approeaches are intro-

duced in Section 2, and our non-local Huber regular-

ization approach is explained in Section 3. Specif-

ically, the non-local Huber regularization is formu-

lated in Section 3.1, and the method to the bound-

ary selection is suggested in Section 3.2. The ex-

perimental results in Section 4 demonstrate that our

approach improves existing non-local denoising ap-

proaches. Finally, we conclude in Section 5.

2 NON-LOCAL

REGULARIZATIONS

Given Ω ⊂ R

, let u ∈ Ω → R be an original image,

and let o ∈ Ω → R be an observed image for the image

u as follows:

o = K u + n (1)

where K is a linear operator representing image trans-

formation, and the n is a white Gaussian noise. Then,

the denoising is a process obtaining the u from the o.

In the regularized optimization formulation, the

denoised image can be obtained from following equa-

tion

u = arg min

||o − K u||

+ λJ(u), (2)

where ||·|| is the l

norm, J is a regularization applied

to the u, and λ > 0 is a Lagrange multiplier.

Under the formulation of the equation (2), J is

called as several names depending on its conﬁgura-

tion. Among the conﬁgurations, the total variation is

deﬁned as

(u) =

Ω

|∇u(x)|dx (3)

(Rudin et al., 1992).

Non-local means algorithm deﬁnes a new inten-

sity of pixel x as a weighted average of intensities

from non-local pixels, using a function w

(Buades

et al., 2005):

(x) =

C(x)

Ω

(x,y)o(y)dy, (4)

where the C is a normalization function, and the w

a similarity function deﬁned as

(x,y) = exp



− d

(o(x),o(y))



(5)

. The function d

computes a distance between two

image patches like

(o(x),o(y)) =

Ω

(t)|o(x +t)−u(y+t)|

dt (6)

where the G

is a Gaussian function whose standard

deviation is a. C in the equation (4) is deﬁned as

C(x) =

Ω

(x,y)dy (7)

Using the equation (4), a non-local mean image

can be obtained by applying the non-local regulariza-

tion,

NLM

(u) =

Ω

(u(x) − NL

(x))

dx (8)

, into the equation (2) (Buades et al., 2005).

Non-local total variation was suggested to com-

bine the non-local means approach and the total vari-

ation approach. In this approach, the non-local gra-

dient ∇

u : Ω → Ω × Ω and the non-local divergence

Non-localHuberRegularizationforImageDenoising-AHybridApproachofTwoNon-localRegularizations

555

div

−→

v : Ω × Ω → Ω are deﬁned (Gilboa and Osher,

2008):

(∇

u)(x,y) := (u(y) −u(x))

w(x,y), x, y ∈ Ω (9)

(div

−→

v )(x) :=

Ω

(v(x,y) − v(y,x))

w(x,y)dy

(10)

Using these non-local operator, non-local regulariza-

tions are deﬁned (Lou et al., 2010; Gilboa and Osher,

2008) as

NL/TV

(u) =

Ω

|∇

u(x)|dx

Ω

(u(x) − u(y))

w(x,y)dydx

(11)

NL/H

(u) =

Ω

|∇

u(x)|

Ω

(u(x) − u(y))

w(x,y)dydx

(12)

where the equation (11) is called as a non-local to-

tal variation regularization, and the equation (12) is

called as a non-local H

regularization. These regu-

larizations are a non-local linear regularization and a

non-local quadratic regularization respectively.

3 THE PROPOSED METHOD

3.1 Non-local Huber Regularization

The Huber function is a combination of two functions:

HU B

(x) =

M(2|x| −M), |x| > M

, |x| ≤ M

(13)

This function computes a linear equation for x larger

than a boundary value M and a quadratic equation for

smaller values.

Similarly, the non-local Huber regularization can

be deﬁned using the non-local linear and quadratic

regularizations in (11) and (12). We deﬁne the non-

local Huber regularization as

HU B

(u) =

Ω

|∇

u(x)|

T (x)



|∇

u(x)|



(1−T (x))

(14)

where the T is a threshold function given as

T (x) =

1, |∇

u(x)| > B(x)

0, |∇

u(x)| ≤ B(x)

(15)

Where the value of the function B(x) determines

whether the non-local intensity difference at pixel x

is caused by noise or edge. When the difference is

larger than the value of the B(x), its cause is deter-

mined as the edge, and smaller than or equal to the

value of the B(x), it is determined as the noise. From

this process, the difference from edges is applied to

the linear penalty, the non-local total variation regu-

larization, and the difference from noise is applied to

the quadratic penalty, the non-local H

regularization.

3.2 Non-local Huber Boundary

Selection

Assuming that intensity differences caused by edge

are larger than the difference by noise, the value of

B(x) should be smaller than the difference by edge

and be larger than the difference by noise. This con-

ﬁguration enables that the linear penalty is applied to

the edge and the quadratic penalty is applied to the

noisy area.

If the noise level is known, B(x) can be deﬁned by

the noise level for each pixel x.

B(x) = N

(x) (16)

However, the noise level of an image is not known in

most cases. For simplicity, consider an image I hav-

ing a uniform intensity for all pixels, and the image

obtained from the image I by adding zero mean white

Gaussian noise. Then, the noise level of the image

is proportional to the standard deviation of intensities

of the image.

(x) ∝ σ

(17)

In addition, the noise level of each pixel may be

different from each other. In this case, it has a follow-

ing relationship:

(x) ∝ 1 −

Ω

(x,y)dy

×W

(18)

The second term in the equation (18) represents the

average similarity of a pixel x between other pixels.

Because of the zero mean white Gaussian noise as-

sumption, pixels with small noise are more than ones

with large noise, and they have larger similarities to

other pixels. Therefore, if a pixel has a large average

similarity to other pixels, it has a small noise level. On

the other hand, if a pixel has a small average similar-

ity, it has a large noise level. Because the maximum

value of the w

(x,y) is one, the maximum value of the

Ω

(x,y)dy is equal to the area of the image. There-

fore, the second term is smaller than or equal to one,

and the whole term is proportional to the noise level

of a pixel x.

Furthermore, consider the noise level in the case

where the w

(x,y) has its maximum value. In this

case, the pixel x has no difference in intensity to all

of other pixels. This means that its noise level is zero.

ICPRAM2013-InternationalConferenceonPatternRecognitionApplicationsandMethods

556

On the other hand, in the case where the w

(x,y) has

its minimum value, the average similarity is zero, and

the noise level of a pixel can be obtained only by the

entire image’s noise level. Therefore, combining the

equation (17) and the equation (18), the noise level

for each pixel x is computed as

(x) = σ



1 −

Ω

(x,y)dy

×W



(19)

Most of images do not have a uniform intensity.

The standard deviation of intensities for an image o

having non-uniform intensity is represented by the

sum of the each standard deviation caused by the orig-

inal intensity changes and noise.

= σ

original

+ σ

noise

(20)

noise

= σ

− σ

original

= η · σ

where η =

− σ

original

(21)

The σ

original

is zero for a uniform intensity image; it

is larger than zero for a non-uniform intensity image;

and it is smaller than or equal to the σ

for any kind

of image. Therefore, the η is 0 ≤ η ≤ 1. This value

is close to one when the original image has more uni-

formity in its intensity and is close to zero when the

image has large intensity differences. Reﬂecting these

properties, we suggest a guide-line to the value of

B(x):

B(x) = N

(x) = σ

noise



1 −

Ω

(x,y)dy

×W



= η · σ



1 −

Ω

(x,y)dy

×W



(22)

4 EXPERIMENTAL RESULTS

In this section, the results of denoising for four classes

of images added with a zero mean white Gaussian

noise are illustrated. The image patch size is 3 × 3,

and the non-local similarity is computed in the 21 ×

21 window range to reduce the computation time.

The results of denoising for each image are shown

in ﬁgure 1 to ﬁgure 4. In each ﬁgure, the result of the

proposed non-local Huber regularization is compared

to the results of the non-local total variation regular-

ization and to the results of the non-local H

regular-

ization.

Figure 1(a) is an original airplane image of 128 ×

128, and the ﬁgure 1(b) is the noisy image obtained

from the ﬁgure 1(a) adding Gaussian noise of stan-

dard deviation 20. Figure 1(c), 1(d), and 1(e) are de-

noised images obtained from the non-local H

regu-

larization, the non-local Huber regularization, and the

Table 1: PSNRs of the denoised images from Figure 1, 2, 3,

and 4.

images NLH

NLHUB NLTV

airplane 22.80 23.52 19.93

barba 20.76 24.08 22.92

boat 21.82 21.31 20.29

lenna 21.10 22.03 20.70

non-local total variation regularization, respectively.

Similarly, ﬁgure 2(a), 3(a), and 4(a) are original im-

ages, and their noised images using the same Gaus-

sian noise to previous one are shown in ﬁgure 2(b),

3(b), and 4(b). Figure 2(c), 3(c), and 4(c) are their

denoised images obtraind from the non-local H

reg-

ularization, Figure 2(d), 3(d), and 4(d) are from the

non-local Huber regularization, and Figure 2(e), 3(e),

and 4(e) are from the non-local total variation regular-

ization. All of the denoising above have used η = 0.1.

As shown in the results, the non-local H

regu-

larization results in blurry denoised images, and the

non-local total variation regularization gives clearer

results than the result of the non-local H

regulariza-

tion. Compared to the results from other approaches,

the proposed non-local Huber regularization gives a

similar result to the one of the non-local total variation

regularization. Particularly, for ﬁgure 1, the non-local

Huber regularization gives clearer result than the one

from the non-local total variation regularization.

The peak signal to noise ratio (PSNR) results of

the ﬁgure 1, 2, 3, and 4 are shown in the table 1. As

shown in the table, except the boat image, the PSNRs

from the non-local Huber regularization show the

highest values compared to other approaches. Though

the non-local H

regularization shows the highest

PSNR for the boat image, The non-local Huber reg-

ularization is better at visualizing the denoised image

than the non-local H

regularization.

Using various values for η, the PSNR changes

of the images from the non-local Huber regulariza-

tion are shown in the ﬁgure 5. Figure 5(a) shows the

denoised results for the noisy images obtained from

the images in the ﬁgure 1(a), 2(a), 3(a), and 4(a) by

adding a zero mean white Gaussian noise whose stan-

dard deviation is 10, and the ﬁgure 5(b) shows the de-

noised results for the same images with noise whose

standard deviation is 20. As shown in the ﬁgure 5, the

PSNR with η ≤ 0.1 is larger than the one with η = 0,

and the highest PSNR results around η = 0.1. Since

the non-local Huber regularization with η = 0 is equal

to the non-local total variation regularization, the non-

local Huber regularization shows better PSNR than

non-local total variation regularization for η ≤ 0.1.

Non-localHuberRegularizationforImageDenoising-AHybridApproachofTwoNon-localRegularizations

557

(a) (b) (c) (d) (e)

Figure 1: Airplane images with 128 × 128 size. 1(a) is an original image. 1(b) is an image with noise of standard deviation 20

from the original image. 1(c) is a result from the non-local H

regularization. 1(d) is from the non-local Huber regularization.

1(e) is from the non-local total variation regularization.

(a) (b) (c) (d) (e)

Figure 2: Barba images with 128 × 128 2(a) is an original image. 2(b) is an image with noise of standard deviation of 20 from

the original image. 2(c) is a result from non-local H

regularization. 2(d) is from non-local Huber regularization. 2(e) is from

non-local total variation regularization.

(a) (b) (c) (d) (e)

Figure 3: Boat images with 128 × 128 3(a) is an original image. 3(b) is an image with noise of standard deviation of 20 from

the original image. 3(c) is a result from the non-local H

regularization. 3(d) is from the non-local Huber regularization. 3(e)

is from the non-local total variation regularization.

(a) (b) (c) (d) (e)

Figure 4: Lenna images with 128 × 128 4(a) is an original image. 4(b) is an image with noise of standard deviation of 20 from

the original image. 4(c) is from the non-local H

regularization. 4(d) is from the non-local Huber regularization. 4(e) is from

the non-local total variation regularization.

5 CONCLUSIONS

In this paper, we have proposed the non-local Huber

regularization enhancing the non-local total variation

regularization for image denoising. This method ap-

plied the non-local total variation and the non-local

regularization depending on the degree of the non-

ICPRAM2013-InternationalConferenceonPatternRecognitionApplicationsandMethods

558

(a) (b)

Figure 5: PSNR plot of denoised images for various η values. 5(a) is results from images with noise of standard deviation of

10 from the original images. 5(b) is results from images with noise of standard deivation of 20 from the original images.

local intensity differences based on a boundary value.

To properly apply this method, selection of the bound-

ary value was also suggested. Using the proposed

method, higher PSNR results and clearer denoised

images were obtained compared to the results from

the non-local total variation regularization and to the

results from the non-local H

regularization.

REFERENCES

Buades, A., Coll, B., and Morel, J.-M. (2005). A non-local

algorithm for image denoising. IEEE Preceedings of

Computer Vision and Pattern Recognition.

Buades, A., Coll, B., and Morel, J. M. (2006). Image en-

hancement by non-local reverse heat equation. Tech-

nical Report.

Chambolle, A. (2004). An algorithm for total variation min-

imization and applications. Journal of Mathematical

Imaging and Vision, pages 89–97.

Donoho, D. L. (1995). De-noising by soft-thresholding.

IEEE Transaction on Information Theory, 41:612–

627.

Donoho, D. L. and Johnstone, I. M. (1995). Adapting to un-

known smoothness via wavelet shrinkage. Journal of

the American Statistical Association, 90:1200–1224.

Gilboa, G., Darbon, J., Osher, S., and Chan, T. (2006).

Nonlocal convex functionals for image regularization.

UCLA CAM Report.

Gilboa, G. and Osher, S. (2008). Nonlocal operators with

applications to image processing. Multiscale Model

and Simulation, 7:1005–1028.

Lee, C., Lee, C., and Kim, C.-S. (2011). Mmse nonlocal

means denoising algorithm for poisson noise removal.

IEEE International Conference on Image Processing.

Lindenbaum, M., Fischer, M., and Bruckstein, A. M.

(1994). On gabor contribution to image enhancement.

Pattern Recognition, pages 1–8.

Lou, Y., Zhang, X., Osher, S., and Bertozzi, A. (2010). Im-

age recovery via nonlocal operators. Journal of Scien-

tiﬁc Computing, pages 185–197.

Marquina, A. (2009). Nonlinear inverse scale space meth-

ods for total variation blind deconvolution. Society of

Industrial Applied Mathematics, 2:64–83.

Osher, S., Burger, M., Goldfarb, D., Xu, J., and Yin, W.

(2005). An iterative regularization method for total

variation-based image restoration. Multiscale Model-

ing and Simulation, 4:460–489.

Osher, S., Sole, A., and Vese, L. (2003). Image decomposi-

tion and restoration using total variation minimization

and the h

−1

norm. Society for Industial and Applied

Mathematics, 1:349–370.

Perona, P. and Malik, J. (1990). Scale space and edge de-

tection using anisotropic diffusion. IEEE Transaction

on Pattern Analysis and Machine Intelligence, pages

629–639.

Peyre, G., Bougleux, S., and Cohen, L. (2008). Non-local

regularization of inverse problems. European Confer-

ence on Computer Vision, pages 57–68.

Rudin, L. I., Osher, S., and Fatemi, E. (1992). Nonlinear to-

tal variation based noise removal algorithms. Physica,

pages 259–268.

Temizel, A. and Vlachos, T. (2005). Wavelet domain image

resolution enhancement using cycle-spinning. IEEE

Electronics Letters, 41:119–121.

Tikhonov, A. and Arsenin, V. (1977). Solution of ill-posed

problems. Wiely, New York.

Yaroslavsky, L. (1985). Digital picture processing - an in-

troduction. Springer Verlag.

Yaroslavsky, L. (1996). Local adaptive image restoration

and enhancement with the use of dft and dct in a run-

ning window. Proceedings of Wavelet Applications in

Signal and Image Processing, pages 1–13.

Non-localHuberRegularizationforImageDenoising-AHybridApproachofTwoNon-localRegularizations

559