memory consumption when the number of tuples
increases. Memory consumption of the hybrid
version H-Frag is always lower than Frag-Cubing
approach. When compared with Frag-Cubing, H-
Frag has similar performance in point queries, but
H-Frag approach outperforms Frag-Cubing in
inquire queries, producing answers 9 times faster
than Frag-Cubing approach. H-Frag is designed for
queries types proposed in qCube (Silva et al., 2013),
so H-Frag is also a range cube approach. In the
experiments, we had scenarios where Frag-Cubing
approach failed to index the data cube caused by
lack of main memory. The H-Frag hybrid memory
approach is, on average, 3 times slower than Frag-
Cubing in indexing a cube, which can be also
considered a promising result, since H-Frag uses
external memories to support huge data cubes. A
massive test with 60 dimensions and 10
9
tuples was
conducted to prove that H-Frag is robust and can be
used in extreme scenarios.
There are some improvements to H-Frag
approach. Among them, we can mention computing
and updating experiments for holistic measures,
which are extremely costly and important for
decision making. Top-k multidimensional queries is
part of our interest, since inverted index is also
useful for this type of problem.
ACKNOWLEDGEMENTS
This work was partially supported by ITA, UFOP,
FATEC-MC and by FAPESP under grant No.
2012/04260-4 provided to the authors.
REFERENCES
Brahmi, H., Hamrouni, T., Messaoud, R., and Yahia, S.
“A new concise and exact representation of data
cubes,” Advances in Knowledge Discovery and
Management, Studies in Computational Intelligence
(vol. 398), Springer, Berlin-Heidelberg, 2012, pp. 27–
48.
Codd, E. F. “Relational completeness of data base
sublanguages,” R. Rustin (ed.), Database Systems,
Prentice Hall and IBM Research Report (RJ 987), San
Jose, California, 1972, 65-98.
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A.,
Reichart, D., Venkatrao, M., Pellow, F., and Pira-hesh,
H. “Data cube: a relational aggregation operator
generalizing group-by, cross-tab, and sub-totals,” Data
Mining and Knowledge Discovery (1), 1997, 29–53.
Han, J. Data Mining: Concepts and Techniques. Morgan
Kaufmann Publishers Inc., San Francisco, CA, USA,
2011.
Li, X., Han, J., and Gonzalez, H. “High-dimensional
OLAP: a minimal cubing approach,” Proceedings of
the International Conference on Very Large Data
Bases, 2004, pp. 528–539.
Lima, J. d. C. and Hirata, C. M. “Multidimensional cyclic
graph approach: representing a data cube without
common sub-graphs,” Information Sciences 181 (13),
July 2011, 2626–2655.
Ruggieri, S., Pedreschi, D., and Turini, F. “Dcube:
discrimination discovery in databases,” Proceedings of
ACM SIGMOD International Conference on
Management of Data, New York, NY, USA, 2010, pp.
1127–1130.
Silva, R. R., Lima, J. d. C., and Hirata, C. M. “qCube:
efficient integration of range query operators over a
high dimension data cube,” Journal of Information
and Data Management 4 (3), 2013, 469–482.
Sismanis, Y., Deligiannakis, A., Roussopoulos, N., and
Kotidis, Y. “Dwarf: shrinking the petacube,”
Proceedings of ACM SIGMOD International
Conference on Management of Data, New York, NY,
USA, 2002, pp. 464–475.
Xin, D., Shao, Z., Han, J., and Liu, H. “C-cubing: efficient
computation of closed cubes by aggregation-based
checking,” International Conference on Data
Engineering, Atlanta, Georgia, USA, 2006, pp. 4.
AHybridMemoryDataCubeApproachforHighDimensionRelations
149