Figure 3: Comparison of all algorithms for joining three
tables.
6 CONCLUSIONS
Including of top-k algorithms into a real relational
environment is not usual by that time. For exclusion
see, e.g., (Li, Chang, Ilyas, and Song, 2005)
extending relational algebra and query optimization.
Other examples are prototype implementations in
PostgreSQL (Khalefa et al., 2011), (Kini and
Naughton, 2007).
We have made comparison of several algorithms
implementing rank join operator. Our results
confirmed that huge processing time can be saved
using optimal algorithm and appropriate source-
choosing strategy. In simple join conditions it is
possible to boost the algorithms with use of a hash
table, which brings significant improvement,
especially for higher values of k.
We have developed the .NET library NRank,
which implements these algorithms and is ready to
be used in a real-world application. For a future
work, a rank-aware optimization framework would
be beneficial enabling to use data statistics stored in
a usual RDBMS. The library can be used practically
for processing arbitrary data.
ACKNOWLEDGEMENTS
This research has been partially supported by the
grant GACR No. P202/10/0761.
REFERENCES
Abid, A. and Tagliasacchi, M. (2011). Parallel Data
Access for Multiway Rank Joins. In ICWE 2011, S.
Auer, O. Diaz, and G.A. Papadopoulos (Eds.), LNCS
6757, 44–58.
Bruno, N., Chaudhuri, S., and Gravano, L. (2002). Top-k
Scoring Queries over Databases: Mapping strategies
and Performance Evaluation. ACM Transactions on
Database Systems, Vol. 27, No. 2, 153-187.
Fagin, R., Lotem, A. and Naor, M. (2003). Optimal
aggregation algorithms for middleware. Journal of
Computer and System Sciences 66, 614–656.
Finger, J. and Polyzotis, N. (2009). Robust and Efficient
Algorithms for Rank Join Evaluation. In ACM
SIGMOD International Conference on Management of
Data, Providence, Rhode Island, USA, June 29 - July
2, 415-428.
Horničák, E., Ondreička, M., Pokorný, J., Vojtáš, P.
(2011). Multi-user Searching of Top-k Objects with
Data on Remote Servers. In: ADBIS 2011 – Research
Communications, J. Eder, M. Bielikova, and A Min
Thoa (Eds.), Oesterreichische Comp. Gesellschaft,
Vienna, Austria, Sep. 20-30, 200-211.
Ilyas, I. F., Aref, W. G. and Elmagarmid, A. K. (2004).
Supporting Top-k Join Queries in Databases. The
VLDB Journal (2004) 13: 207–221.
Ilyas, I. F., Shah, R., Aref, W. G., Vitter, J. S. and
Elmagarmid, A. K. (2004). Rank-aware query
optimization. In ACM SIGMOD International
Conference on Management of Data, Paris, France,
June 13-18, 203-214.
Ilyas, I. F., Beskales, G. and Soliman, M. A. (2008). A
Survey of Top-k Query Processing Techniques in
Relational Database Systems. ACM Computing
Surveys, Vol. 40, No. 4, Article 11, 58 p.
Khalefa, M. E., Mokbel, M. F. and Levandoski, J. J.
(2011). PrefJoin: An Efficient Preference-aware Join
Operator. In 27th International Conference on Data
Engineering (ICDE), IEEE, 995-1006.
Kini, A. and Naughton, J. F. (2007). Database Support for
Weighted Match Joins. In: Proceedings of 19th
International Conference on Scientific and Statistical
Database Management (SSDBM 2007), IEEE, 10 p.
Li, Ch., Chang, K. Ch., Ilyas, I. F. and Song, S. (2005).
RankSQL: Query Algebra and Optimization for Top-k
Queries. In ACM SIGMOD International Conference
on Management of Data, Baltimore, Maryland, USA,
June 14-16, 131-142.
Mamoulis, N., Yiu, M. L., Cheng, K. H., Cheung, D. W.
(2007). Efficient Top-k Aggregation of Ranked Inputs.
ACM Transactions on Database Systems, Vol. 32,
August, Article 19.
AA nternational onference on ata echnoloies and Applications