its peers. To achieve this aim, it is important for this
organisation to better understand the markets in
which they are operating and have a personalised
local view of the retail stores within these markets.
Hence, store segmentation is proposed to cater to
these business requirements. Given the complexity of
data and market dynamics, it is imperative to apply
some sophisticated clustering techniques which
would address the limitations of traditional
techniques like K-mean and agglomerative
clustering.
This paper proposes the use of advance machine
learning techniques like Self Organizing Maps
(SOM), Gaussian Mixture Models (GMM), Fuzzy C-
means (FCM) for clustering offline stores of different
European markets. The results of these techniques are
also compared with results of legacy clustering
technique like hierarchical to prepare a comparative
analysis for each market.
This paper is structured as follows. Section 2
presents the related literature available in this domain.
Section 3 describes the different data sources,
variables and techniques used in the analysis. Section
4 presents the comparative results of the techniques
applied across different markets and the paper is
concluded in Section 5.
2 RELATED WORKS
Various algorithms have been proposed by
researchers relating to clustering applications for
retailers in the literature and results from clustering
have been presented.
Researchers have classified internet retail sites for
an e-commerce company. 35 observable internet
retail store’s attributes are used, and hierarchical
clustering technique is applied to classify store into
five distinct web catalog interface categories:
superstores, promotional stores, plain sales stores,
one-page stores, and product listings. The classified
online stores differ primarily on the three dimensions:
size, service offerings, and interface quality (Spiller
and Lohse, 2015).
Researchers analyze the data of a supermarket
chain which has 73 stores in Turkey. Data related to
stores such as store size, number of competitors
nearby, trade area demographics like distribution of
population by age, marital status are used for
conducting the segmentation. Hierarchical clustering
is applied, and effective target marketing strategy is
designed for each store segment (Bilgic, Kantardzic,
and Cakir, 2015).
Researchers have applied artificial neural
networks (ANNs) as an alternative means of
segmenting customers in retail space. Hopfield–
Kagmar (HK) clustering algorithm, an ANN
technique based on Hopfield networks, is compared
with K-means clustering algorithms. Purchase
behavior such as the total number of orders, days
since first purchase, the number of credit cards etc is
used for profiling the customers. The results indicate
that ANNs could be more useful to retailers for
segmentation because they provide more
homogeneous segmentation solution than K-means
clustering algorithms and are less sensitive to initial
starting conditions (Boone and Roehm, 2002).
Researchers have applied clustering techniques
namely K-means clustering, Mountain clustering, and
Subtractive clustering on the dataset for medical
diagnosis of heart disease. It is observed that K-means
overperformed in cases where many dimensions are
present. Mountain clustering is suitable only for
problems with two or three dimensions (Hammouda
and Karray, 2002).
Most of the papers have applied hard clustering
techniques like K-means and hierarchical. Most of
them have been used for customer segmentation
rather than for store segmentation. Even if there is
some research in the store segmentation space, it is
predominantly focused on online channel than the
traditional offline channel. To add further, the
attributes used for store clustering are mostly related
to firmographics, customer demographics or
competitor information. In this paper, store clustering
is performed for a retail organisation. Attributes
related to purchase pattern, transaction pattern,
customer behaviour, store dimensions are used for
clustering. Both hard clustering technique such as
hierarchical clustering and soft clustering techniques
such as Self Organizing Maps (SOM), Gaussian
Mixture Models (GMM), Fuzzy C-means (FCM) are
applied for clustering stores for four different
European markets. A comparative study on the results
derived from these different techniques for different
markets has been presented in this paper.
3 DATA AND METHODLOGY
The retailer considered is a UK based multinational
organization offering convenience retail services to
.consumers. The company operates through various
channels. Some of the stores are owned and operated
by the company itself, however, there are some which
are owned and operated by a franchise or a dealer.