2.2 Data warehouse multidimensional
model
There is a huge number of information related to
hazard analysis that are either stored or computed
from the data sources such as risk indicator, parcel
information: area, date of completion, building
material, presence or not of heating, population
information, company information, etc. A user may
like to view risk patterns on a map by building, by
parcel, by cadastral section, by elevation and by
different combinations of population and company
activities information, or even like to dynamically
drill-down or roll-up along any dimension to explore
desired patterns, such as high or low risky regions.
Spatial data can be analysed by taking into account
different aspects, for example, whether the type of
the predicates and the results is spatial or non-
spatial. Moreover, another classification criterion
can be used by taking into account the fact that
topological or non-topological relationships are or
not the main focus of analysis. In this paper, we use
the terminology usually used in multidimensional
modelling such as dimension type, fact relationship,
hierarchy, level.
2.2.1 Measures
Measures are attributes representing the specific
elements of analysis, such as Locative Value, area.
In general, they can be summed or averaged in order
to understand the particular aspects in consideration.
We distinguish two types of measures in a spatial
data warehouse (Han, 1997):
Numerical measure: a numerical measure is a
measure containing only numerical data. For
example, one measure could be total revenue of a
building, and a roll-up may give the total revenue by
parcel, by cadastral region, and so forth.
A simple example of numerical measure computing
is:
Select Sum(TotalLocativeValue)
From Built Prop
Group By ParcelId.
Numerical measures can be further classified into
distributive, algebraic, and holistic (Kimbal, 1996).
Spatial measure: A spatial measure is a measure
which contains a collection of pointers to spatial
objects. For example, during the generalization
procedure, the parcels with the same range of risk
indicator are grouped into the same cell, and the
measure so formed contains a collection of pointers
to those parcels.
2.2.2 Dimensions
Dimension is an object that includes attributes
allowing the user to explore the measures from
different perspectives of analysis. In the context of
spatial data warehouse, we distinguish three types of
dimensions according to whether or not it has spatial
references.
Non-Spatial dimension: A non-spatial dimension is a
dimension containing only non-spatial data. In our
case, risk indicator can be considered as a non-
spatial dimension. It contains non-spatial data
corresponding to risk value, whose generalization is
also non-spatial, such as low-risky, and high-risky.
Spatial-to-non-spatial dimension: A spatial-to-non-
spatial dimension is a dimension whose primitive
level data is spatial but whose generalization,
starting high level, becomes non-spatial. We will not
handle this type of dimension in our case.
Spatial Dimension: A spatial-to-spatial is a
dimension whose primitive level and all of its high
level generalized data are spatial. For example, in
our case, building, parcel, cadastral section, and
Commune are all spatial elements of the location
spatial dimension.
2.2.3 Star Model
The most known logical model used for the DW
design is called star scheme having his center
represented by a fact table surrounded with several
dimension tables forming star-like appearance. This
model is also called multidimensional model due to
the fact that several dimensions (multiple
dimensions) are used to analyze the measures. Since
the data warehouse is subject-oriented, and in
natural hazard framework, we focus our analysis on
vulnerability aspect; we will build our model with an
emphasis on vulnerability measures that can be
obtained or computed from data sources such as
total revenue, effective employees, and turnover
sales. Hence, the model (see figure 3 and 4) contains
one fact table (with vulnerability measures),
surrounded by (spatial and non spatial) dimensions
tables.
2.3 OLAP operations
With the above specified dimensions, OLAP
operations can be performed by stepping up and
down along any dimension shown in Figure 4, we
will use popular OLAP operations and analyze how
they are performed on a spatial data cube:
Slicing and dicing: each of which selects a portion of
the cube based on the constant(s) in one or a few
dimensions. For example, one may be interested
TOWARDS DATA WAREHOUSES FOR NATURAL HAZARDS
417