relatively small (at least order of magnitude). The
number of node accesses in temporal index of
STAH-tree is presented in equation 9. This formula
may be derived from equations (5,7,8).
)(2
10sin
SELSELNANA
gleMVB
+=
(9)
8 EXPERIMENTS
We performed multiple which proved correctness of
all assumptions presented in the paper. The
experiments prove that all presented modifications
(AGG, BATCH, MAP) result in high acceleration.
Objects selectivity prediction was highly accurate
(about 5-10% and less). The selectivity for leaves
was correct with the same precision when
aggregation was performed only on the leaves level.
When aggregation was allowed on all aR-tree levels
– the error reached 30% for large values (when more
than 50% of space was covered by a query). For
smaller queries the error did not rise above 10%.
Detailed results description is not included as paper
page count is limited.
9 CONCLUSIONS AND FUTURE
WORK DIRECTIONS
STAH-tree is a complete system that allows to pose
various queries over spatio-temporal data. The
system performance is highly data parameter-
independent. Even for parameters that have large
impact on performance in standard solutions (for
example relational database).
STAH-tree is based on widely recognized and
valued solutions from spatial and temporal data
processing domain (R-tree, aR-tree, MVB-tree,…).
These ideas are combined and adapted is order to
work together and assure required features. The
original solution was extended with additional
accelerating techniques (BATCH, AGG, MAP). The
presented solution is more robust than the one using
the components separately. Disadvantage is higher
storage space consumption.
Selectivity prediction model for aggregate R-tree
was introduced. Its accuracy was proved by
experiments. The selectivity model along with node
accesses formula for MVB-tree (available in
literature) are merged in order to achieve full
performance model for STAH-tree (with respect to
node accesses).
Several main directions of future fork have been
recognized: (1) selectivity model for non uniform
data distribution; (2) extension of presented
selectivity model for aR-tree nodes with the
possibility of aggregating on higher aR-tree levels;
(3) selectivity model for different types of spatial
queries (KNN, Spatial Join etc.); (4) replace spatial
and temporal indexes with other techniques (quad
trees, OVB-trees, …). (5) use more sophisticated
way to aggregate spatial objects (not only point
aggregates) as it is done in (Zhang, 2002). Work on
some of these issues has already started.
REFERENCES
Manolopoulos, Y., Nanopoulos, A., Papadopoulos A. N.,
Theodoridis Y. 2006. R- Trees: Theory and
Applications. Springer
Beckerman N., Kriegel H.P., Schneider R., Seeger B.
1990. The R*-tree: An efficient and robust access
method for points and rectangles. Proc. SIGMOD
International Conference on Management of Data,
pages 322-331
Guttman A. 1984 R-trees: A dynamic index structure for
spatial searching Proc. SIGMOD International
Conference on Management of Data, pages 47-57
Kamel I. Faloutsos. 1994. Hilbert R-tree: An improved R-
tree using fractals. Proc. International Conference on
Very Large Databases, pages 500-509
Kamel I. Faloutsos. 1993. On packing R-trees. Proc.
International Conference on Information and
Knowledge Management, pages 490-499
Becker B., Gschwind S., Ohler T., Seeger B., Windmayer
O. 1996 An Asymptotically Optimal Multiversion B-
tree VLDB Journal, 5(4), pages 264-275
Tao Y., Papadias D., Zhang J. 2002 Efficient Cost Models
for Overlapping and Multi-Version Structures. TODS,
27(3), pages 299-342
Tao Y., Papadias D. 2004 Range Aggregate Processing in
Spatial Databases. IEEE Trans. Knowl. Data Eng.
16(12), pages 1555-1570
Theodoridis Y., Sellis T., 1996. A model for the Prediction
of R-tree Performance. Proc. Symp. Principles of
Database Systems
Comer D., 1979. The Ubiquitous B-tree. ACM Computing
Surveys, Vol. 11, No 2, pages 121-137
Zhang D., Tsotras V. J., Gunopulos D. Efficient
aggregation over Objects with Extent. PODS 2002,
pages 121-132
Zhang D., Markowetz A., Tsotras V. J., Gunopulos D.,
Seeger B. Efficient Computation of Temporal
Aggregates with Range Predicates. PODS 2001
Bercken J., Seeger B. Query Processing Techniques for
Multiversion Access Methods. VLDB 1996
ICEIS 2007 - International Conference on Enterprise Information Systems
118