which randomly picks objects based on their
assigned weights.
For testing purposes, we generated a set of 10
query patterns for schema 1 each consisting of 3
queries and a set of 50 query patterns for schema 2
each consisting of 9 queries. The patterns were
generated by randomly choosing a node as start node
and then generating the sequence of queries from the
start node. Each new pattern denotes a change in the
context. Our approach is affected by the context
change, since its selection is based on views that are
best suited for the current context. Dynamat,
however, is not affected by the context change since
it does not exploit the user access patterns.
5.3 Results
Performance was measured under different space
constraints (i.e. view pool size expressed as a
percentage of the full data cube size). The DCSR per
view (in decreasing order of savings) for schema 1
and schema 2 (for space constraints of 5%, 10% and
20%) are shown in Figure 6 and Figure 7,
respectively. The CRC for schema 1 and schema 2
are shown in Figure 8 and Figure 9, respectively.
The global pre-fetching scheme clearly outperforms
the dynamat approach, especially when the available
space is low. As the available space increases, the
query performance (DCSR) of dynamat gradually
approaches to that of ours. Dynamat chooses views
for materialization as and when new queries are
asked. Our pre-fetching approach selects views for
materialization at the beginning of every context.
With more available space, more views can be
materialized, as a result of which the probability of
finding a matching view to answer a query is high.
Additionally, the global pre-fetching scheme uses
the access patterns information, which further
optimizes the selection of views in any given
context, as seen by the high DCSR values. On the
other hand, when the space constraints are high,
dynamat, which updates its selection at each stage,
requires replacing a lot of views. In the process, the
DCSR per view drops since more views have to be
answered from the base fact table. The global pre-
fetch, however, continues to perform better since it
selects views at the beginning of every context and
the selection is such that the queries in the given
context are likely to be answered from the
materialized view pool, instead of the base fact
table. Additionally, the global pre-fetch requires
fewer number of replacements since many of the
selections persist over different contexts, as a result
of which the reusability of these already materialized
views for answering queries from other contexts
increases.
It has been experimentally proved in (Kotidis,
2001) that dynamat outperforms the optimal static
view selection. The results above show that our
approach outperforms dynamat and thus, also the
optimal static view selection.
6 CONCLUSIONS
Pre-computation of views is an essential query
optimization strategy for decision support systems.
To meet the changing user needs, the views may be
fetched (or selected) on demand (on-demand
fetching) or they may be pre-fetched using some
prediction strategy. In this paper, we proposed a
global pre-fetching scheme that uses user access
pattern information to pre-fetch certain candidate
views that could be used for efficient query
processing within the specified user context. Our
approach optimizes the selection of views for
efficient drill-down analysis, which is the most
natural way of querying an OLAP system. Roll-up
analysis is not explicitly emphasized since such
queries can always be answered from the most
recently materialized views.
We compare our scheme against dynamat, a
dynamic view management system that uses on-
demand fetching and is already known to outperform
optimal static view collection. The DCSR results
show that the average cost savings of answering a
query using our proposed scheme clearly exceeds
the dynamat approach. The CRC results show that
our scheme is more robust than dynamat since it
requires relatively fewer number of view
replacements.
In future, we plan to test our approach by varying
the granularity of the materialized results and also
on large real-world data sets.
REFERENCES
Baralis, E., Paraboschi, S., Teniente, E., 1997.
Materialized View Selection in a Multidimensional
Database. In Proc of 23
rd
VLDB Conf., pp. 156-165.
Bauer, A., Lehner, W., 2003. On Solving the View
Selection Problem in Distributed Data Warehouse
Architectures, In Proc. of SSDBM Conf., pp. 43-51.
Harinarayan, V., Rajaraman, A., Ullman, J., 1996.
Implementing Data Cubes Efficiently. In ACM
SIGMOD Conference, pp. 205-216.
Gupta, H., 1997. Selection of Views to Materialize in a
Data Warehouse. In Proc. of Intl. Conf. on DB Theory,
pp. 98-112.
Gupta, H., Harinarayan, V., Rajaraman, A., 1997. Index
Selection for OLAP. In 13
th
Conf. on Data Engg, pp.
208-219.
ICEIS 2005 - DATABASES AND INFORMATION SYSTEMS INTEGRATION
66