A Multi-features Fusion of Multi-temporal Hyperspectral Images

using a Cooperative GDD/SVM Method

Selim Hemissi

and Imed Riadh Farah

ITI, Telecom Bretagne, Brest, France

RIADII, ENSI, Manouba, Tunisia

Keywords:

Hyperspectral Data, Feature Fusion, Hyperion, Remote Sensing, SVM, Generalized Dirichlet Distribution,

Generative/Discriminative Model.

Abstract:

Considering the emergence of hyperspectral sensors, feature fusion has been more and more important for

images classiﬁcation, indexing and retrieval. In this paper, a cooperative fusion method GDD/SVM (General-

ized Dirichlet Distribution/Support Vector Machines), which involves heterogeneous features, is proposed for

multi-temporal hyperspectral images classiﬁcation. It differentiates, from most of the previous approaches, by

incorporating the potentials of generative models into a discriminative classiﬁer. Therefore, the multi-features,

including the 3D spectral features and textural features, can be integrated with an efﬁcient way into a uniﬁed

robust framework. The experimental results on a series of Hyperion images conﬁrm the improved performance

and show that this cooperative fusion approach has consistence over different testing datasets.

1 INTRODUCTION

Presently, the considerable archive produced by satel-

lite remote-sensing sensors is becoming an increas-

ingly valuable source of information. This leads to a

better interpretation of land-cover and land-use evolu-

tion by analyzing the spectral response of the different

earth’s surface elements. Hyperspectral signatures af-

ford a compact recording of reﬂectance values over a

large domain of the electromagnetic spectrum. They

allow practitioners to map, quantify and qualify effec-

tively the spatio-temporal variations of land surface

(Heinz et al., 2010). The goal of fusion techniques is

to extract complementary information from different

sources to allow for a more informed decision than

one could gain from any of the sources alone. In the-

ory, data fusion provides signiﬁcant advantages over

single source of features. In addition to the statisti-

cal advantage, the use of multiple kinds of features

may increase the possibility of a target of interest be-

ing observed and characterized resulting in a reduced

error rate. In contrast, fusion may not always result in

an improved decision over simply selecting the most

appropriate source for the task because accurate data

may be fused with very inaccurate data (Nakariyakul

and Casasent, 2004).

2 PROBLEM STATEMENT

The semantic categorization of remote-sensing im-

ages requires analysis of many features of the images

such as texture, spectral proﬁles, etc. Current feature

fusion approaches commonly concatenate different

features. It gives, generally good results and several

approaches have been proposed using this schema.

However, most of them have various conditional con-

straints, such as noise and imperfection, which might

retain the use of such systems under degraded perfor-

mance. However, how to fuse heterogeneous features

in a ﬂexible way is still an open research question.

Similarly, in the area of Supervised Machine

Learning (SML), diversity with respect to the er-

rors committed by component classiﬁers has received

much attention (Bishop, 2006). Generative and dis-

criminative approaches are two distinct schools of

probabilistic machine learning. It has shown that

discriminative approaches such as SVM (Cristianini

and Shawe-Taylor, 2000) outperform model based ap-

proaches due to their ﬂexibility in decision bound-

aries estimation. Conversely, since that discrimina-

tive methods are concerned with boundaries, all the

classes need to be estimated conjointly (Ulusoy and

Bishop, 2006). Complementary, one of the interest-

ing characteristics, that generative models have over

discriminative ones, is that they are learnt indepen-

681

Hemissi S. and Riadh Farah I..

A Multi-features Fusion of Multi-temporal Hyperspectral Images using a Cooperative GDD/SVM Method.

DOI: 10.5220/0004377406810685

In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods (PRG-2013), pages 681-685

ISBN: 978-989-8565-41-9

 2013 SCITEPRESS (Science and Technology Publications, Lda.)

dently for each class. Moreover, following their mod-

eling power, generative models are able to deal with

missing data. An ideal fusion method should combine

these two approaches in order to improve the classiﬁ-

cation accuracy (LeBlanc and Safﬁotti, 2007).

3 COOPERATIVE SVM/GDD

METHOD FOR

HYPERSPECTRAL IMAGES

CLASSIFICATION

3.1 Overview of the proposed Fusion

Schema

In this paper, we propose a new technique in remote-

sensing images classiﬁcation by fusing heterogeneous

representations. The proposed approach involve sev-

eral steps including preprocessing; features extrac-

tion; features fusion; matching and classiﬁcation

stages. The block diagram of the proposed technique

is shown in Fig. 1. In our previous work (Farah et al.,

2010), we proposed a novel 3D model which design

the spectral signature as a three dimensional function

which are the time, reﬂectance, and wavelength band

(equation 1). For each pixel, we generated a surface

(3D Mesh) which generalizes the usual signature by

adding a time dimension. We call this new representa-

tion the multi-temporal spectral signature. Interested

readers can refer to (Farah et al., 2010).

Figure 1: General workﬂow of the proposed approach.

3.2 Images Pre-processing and Features

Extraction

In this study multi-temporal hyperspectral images

constitutes the source data. Spectral and textural fea-

tures are the foundational data for this kind of images.

The 3D spectral features are extracted from the rel-

ative mesh of a given pixel (multi-temporal spectral

signature) while the textural ones are derived directly

from images. Mainly, two features vectors are gener-

ated for each pixel as follows:

Heat Kernel Signature (HKS). The HKS is a sig-

nature computed only from the intrinsic geometry of

an object. Suppose (m,g) is a compelte Rieman-

nian manifold, g is the Riemannian metric. δ is the

Laplace-Beltrami operator. The eigenvalues {λ

} and

eigenfunctions {φ

} of δ are δφ

= λ

, where φ

normalized to be orthonormal in L

(M). The Laplace

spectrum is given by 0 = λ

< λ

≤ λ

≤ . . . ,λ

→ ∞.

4 is the Laplace-Beltrami operator. As a local shape

descriptor, Sun et al. (Sun et al., 2009) deﬁned the

heat kernel signature (HKS) by :

h(x,t) = K

x,t

(x,x) =

∞

∑

i=0

−λ

(x) (1)

where λ

,λ

,··· ≥ 0 are eigenvalues and φ

,φ

,...

are the corresponding eigenfuctions on the Laplace-

Beltrami operator, satisfying δ

= λ

. Let’s de-

note this vector by Y .

Spatio-temporal Gabor Filters. Texture is one of

the important characteristics used in identifying ob-

jects or regions of interest. It contains important infor-

mation about the structural arrangement of surfaces.

Fusing texture with 3D spectral information is con-

ducive to the interpretation of remote seeing image

(Wang and Chua, 2005). We use a method for dy-

namic texture modeling based on spatio-temporal Ga-

bor ﬁlters. Brieﬂy, the sequence of images is con-

volved with a bank of spatiotemporal Gabor ﬁlters

and a feature vector is constructed with the energy of

the responses as components. Let’s denote this vector

by Y

3.2.1 Multi-features Fusion based on a

Cooperative GDD/SVM Classiﬁer

In this section, we present an approach that combines

an SVM classiﬁer (Burges, 1998) with a generatively

trained GDD model and proﬁts, accordingly, from the

advantages of both techniques. The key idea here is to

concatenate the extracted features into one vector and

to project it in a new space. First, a straightforward

feature combination approach is used to concatenate

feature vectors (Y and Y

) to a single feature vector

X = (X

,...,X

idim

). The dim size may differ from

one pixel to another making the fusion and classiﬁ-

cation a challenging tasks. To overcome this limit,

we use the Generalized Dirichelet Distribution (GDD)

model (Bouguila and Ziou, 2010) to map each fea-

ture vector into its Fisher score. Therefore, the Fisher

kernel function from the GDD is used to replace the

Gaussian kernel in the classical SVM.

Let (X

,...,X

) denote a collection of N multi-

temporal hyperspectral pixels. Each data X

is as-

sumed to have dim size, X = (X

,...,X

idim

). Each

ICPRAM2013-InternationalConferenceonPatternRecognitionApplicationsandMethods

682

data X

is assumed to be drawn from the following

ﬁnite mixture model :

p(X

/θ) =

∑

j=1

p(X

/ j,θ

)P( j) (2)

where M is the number of components, the P( j),

(0 < P( j) < 1 and

∑

dim

j=1

P( j) = 1) are the mixing

proportions and p(X / j,θ

) is the Probability Density

Function PDF. θ is the set of parameters to be esti-

mated : θ = (α

,...,α

,P(1),...,P(M)).

If the random vector X = (X

,...,X

idim

) follows

a Dirichelet distribution, the joint density function is

given by :

X = (X

,...,X

idim

) =

τ(

)

∏

dim+1

i=1

τ(α

)

dim+1

∏

i=1

−1

(3)

Since that each feature vector X may has an arbitrary

dimension, the proposed method deﬁnes the fusion

as a projection from one feature vector space (spec-

tral bands) to another with a ﬁxed dimentionnality.

Accordingly, the feature-level fusion is done by pro-

jecting the vector X combining into one vector in the

Fisher space. Thus, the generative model will have

its impact on the ﬁnal classiﬁcation result through the

projection of the extracted features in this new space.

SVM classiﬁer is used to classify the fused fea-

tures and the multi-temporal dataset of images. Given

the generative model obtained by GDD with parame-

ters θ, we compute for each sample X the Fisher score

= 5

logP(x|θ) (the gradient of the log likehood

of x for model θ). The Fisher kernel operates in the

gradient space of the generative mode and provides a

natural similarity measure between data samples. For

each sample, this score is a vector of ﬁxed dimention-

ality. Using this score, the Fisher Information matrix

is deﬁned as I = E





. After Fisher score

normalization, we compute the Fisher kernel function

on the basis of the Euclidean distance between the

scores of the new sample and the training samples :

K(X,X

) = U

−1

(4)

In the second stage, suppose our training set S

consists of labels input vectors (X

),i = 1, . . . ,m

where X

∈ R

and z

∈ {±1}. Given a kernel ma-

trix and a set of labels z

for each sample, the SVM

proceeds to learn a classiﬁer of the form,

z(x) = sign(

∑

)K(X

,X)) (5)

where the coefﬁcients α

are determined by solv-

ing a constrained quadratic program which aims to

maximize the margin between classes. In our exper-

iments we used the LIBSVM package. Our research

deals with multi-class problem. The One-Vs-One ap-

proach is adopted to extend the proposed approach to

multi-temporal hyperspectral classiﬁcation.

4 EXPERIMENTAL RESULTS

The images set used in this experiment were jointly

collected from the Tunisian Institute of Remote-

Sensing (CNT) and the USGS library through the

Glovis Viewer (Clark et al., 2007). Some earlier re-

sults and ground truth maps produced by the CNT

were also used to perform the analysis of the selected

test sites and for validation purposes.

The studied area is being within the line between

the northwest tip of Djerba island on the southeast and

Ras Yonga on the northwest. The centroid for the

study area is at 33

◦

N 10

◦

E. It is char-

acterized by typical Mediterranean climate with max-

imum temperatures reaching, in the period between

June and August (48

◦

), whereas the coldest tempera-

tures are measured between December and February.

Due to the sea proximity, the climate of the study area

slightly differs from the typical arid or semi-arid ar-

eas. The rainfall is very irregular and ranges between

150 − 240mm with an average of 30 rainy days per

year (September/October). The region has been cho-

sen not only due to the great interest from govern-

mental and non-governmental organizations, but also

because of the coexistence of several oasis such as

Mareth and Teboulbou including various types of veg-

etations that change over time. The vegetation has a

cover of 40% to 60%, comprising predominantly an-

nual plants which develop from the autumn rains and

persist until the end of the following spring. The veg-

etation cover is marked by the predominant species,

Palm, Lythracea (Henn

e) and Carex. In this set of

experiments, two time series are available, and thus,

the season spectral variability can be well mapped

through this set of images. An external digital eleva-

tion model and a reference land-cover map provided

by the Tunisian Institute of Remote-Sensing (CNT)

were also available for results assessment. Consider-

ing the differences in multi-temporal images acqui-

sition, we ﬁrst perform a pre-processing step. Im-

ages were geometrically corrected and geo-coded to

the Universal Transverse Mercator (UTM) coordinate

system based on a topographic map of the study area.

45 regularly distributed ground control points (GCPs)

were used for this purpose. Then, Hyperion images

were converted to reﬂectance and co-registered and

re-sampled to 30 × 30 m with the nearest neighbor

algorithm. The registration was performed at a sub-

pixel level, obtaining a rootmean-squared error of

about 0.65 pixels. After co-registration, all images

were radiometrically corrected to surface reﬂectance

by Atmosphereic CORrection Now (ACRON) soft-

ware, which is based on the MODTRAN-4 radiative

transfer code. In the following experiments, we se-

AMulti-featuresFusionofMulti-temporalHyperspectralImagesusingaCooperativeGDD/SVMMethod

683

Table 1: Sequence of real images (red/green/blue (rgb) composition, bands [6,19,33]) and their corresponding true classiﬁca-

tion maps (ﬁrst time serie: 2009/2010).

03/06/2009 23/09/2009 27/12/2009 09/01/2010 30/04/2010

Real Hyperionimages

Thematic map True Map

Table 2: Confusion matrix of the proposed approach for real experience set.

Percentages Classiﬁcation Data

Reference Data Carex Lythracea Bare soil Palm Water Producer’s Accuracy Errors of Omission

Carex 243 17 6 5 0 88.47% 11.52%

Lythracea(henn

e) 29 514 18 7 11 87.35% 12.64%

Bare Soil 7 33 722 12 25 89.65% 10.66%

Water 2 5 13 279 8 89.96% 10.03%

Palm 0 9 12 19 308 87.01% 12.98%

User’s accuracy 84.63% 87.56% 93.65% 84.58% 85.11% OA=88.48%

Errors of commission 15.63% 12.45% 06.78% 15.41% 14.28%

Table 3: Evaluation of the Proposed Approach Aiganest Several Conventionnal Approaches.

Classiﬁer Overall Accuracy Kappa

Proposed Approach 88.48 0.73

Maximum-Likelihood Classiﬁer 81.46 0.69

Support Vector Machines (SVMs) 87.84 0.71

lected subset images from the whole temporal data

set of images containing 3300 pixels-per-image in ar-

eas with substantial changes. Pixels belonging to un-

known classes were not considered. Once the features

were extracted from the reconstructed images, their

potential use for image classiﬁcation is investigated

in the following steps. Tables 2 and 3 the obtained

results.

5 CONCLUSIONS AND FUTURE

WORKS

We have presented a novel fusion method in the con-

text of multi-temporal hyperspectral images, mixtures

of dirichlet and SVM classiﬁers. Accordingly, the

generalization capacity of generative models can con-

ICPRAM2013-InternationalConferenceonPatternRecognitionApplicationsandMethods

684

siderably be enhanced by training them discrimina-

tively. Our experiments show that the cooperative

generative-discriminative model can lead to a the-

matic maps with superior quality. There are two ob-

vious extensions to the work that has been covered in

this project. The ﬁrst is to improve the estimation ac-

curacy, and the second is to examine the possibility of

using other 3D feature vectors.

REFERENCES

Bishop, C. M. (2006). Pattern Recognition and Ma-

chine Learning (Information Science and Statistics).

Springer-Verlag New York, Inc., Secaucus, NJ, USA.

Bouguila, N. and Ziou, D. (2010). A dirichlet process mix-

ture of generalized dirichlet distributions for propor-

tional data modeling. IEEE Transactions on Neural

Networks, 21(1):107–122.

Burges, C. J. C. (1998). A tutorial on support vector ma-

chines for pattern recognition. Data Min. Knowl. Dis-

cov., 2(2):121–167.

Clark, R., Swayze, G., Wise, R., Livo, E., Hoefen, T.,

Kokaly, R., and Sutley, S. (2007). Usgs digital spec-

tral library splib06a: U.s. geological survey. In Digital

Data Series 2312007. NASA, USGS.

Cristianini, N. and Shawe-Taylor, J. (2000). An Introduction

to Support Vector Machines. Cambridge University

Press, Cambridge, UK.

Farah, I. R., Hemissi, S., Ettabaa, K. S., and Souleiman, B.

(2010). Multi-temporal Hyperspectral Images Unmix-

ing and Classiﬁcation Based on 3D Signature Model

and Matching. Piers Online, 6:480–484.

Heinz, D., Davidson, C., and Ben-David, A. (2010).

Temporal-spectral detection in long-wave ir hyper-

spectral imagery. IEEE Sensors Journal, 10(3):509

–517.

LeBlanc, K. and Safﬁotti, A. (2007). Cooperative infor-

mation fusion in a network robot system. In Proceed-

ings of the 1st international conference on Robot com-

munication and coordination, RoboComm ’07, pages

42:1–42:4, Piscataway, NJ, USA. IEEE Press.

Nakariyakul, S. and Casasent, D. (2004). Hyperspectral fea-

ture selection and fusion for detection of chicken skin

tumors. In Proc. SPIE, pages 128–139.

Sun, J., Ovsjanikov, M., and Guibas, L. (2009). A concise

and provably informative multi-scale signature based

on heat diffusion. In Proceedings of the Symposium

on Geometry Processing, SGP ’09, pages 1383–1392.

Eurographics Association.

Ulusoy, I. and Bishop, C. M. (2006). Comparison of gen-

erative and discriminative techniques for object detec-

tion and classiﬁcation. Toward CategoryLevel Object

Recognition, pages 173–195.

Wang, Y. and Chua, C. (2005). Face recognition from 2d

and 3d images using 3d gabor ﬁlters. Image and Vision

Computing, 23(11):1018–1028.

AMulti-featuresFusionofMulti-temporalHyperspectralImagesusingaCooperativeGDD/SVMMethod

685