Burdakov, A., Ermakov, E., Panichkina, A., Ploutenko, A.,
Grigorev, U., Ermakov, O., & Proletarskaya, V. (2019).
Bloom Filter Cascade Application to SQL Query
Implementation on Spark. In 2019 27th Euromicro
International Conference on Parallel, Distributed and
Network-Based Processing (PDP) (pp. 187-192). IEEE
Chi, Y., Moon, H. J. and Hacigümüş, H. (2011) iCBS:
incremental cost-based scheduling under piecewise
linear SLAs //Proceedings of the VLDB Endowment. –
2011. – Т. 4. – №. 9. – pp. 563-574.
Date, C. J., and Darwen, H. (1993). A Guide to the SQL
Standard (Vol. 3). Reading: Addison-wesley.
Dean, J. and Ghemawat, S. (2004) MapReduce: Simplified
data processing on large clusters. In Proceedings of the
Sixth Conference on Operating System Design and
Implementation (Berkeley, CA, 2004).
Ganapathi, A. et al. (2009) Predicting multiple metrics for
queries: Better decisions enabled by machine learning
//Data Engineering, 2009. ICDE'09. IEEE 25th
International Conference on. – IEEE, 2009. – pp. 592-
603.
Guirguis, S. et al. (2009) Adaptive scheduling of web
transactions //Data Engineering, 2009. ICDE'09. IEEE
25th International Conference on. – IEEE, 2009. – pp.
357-368.
Mishra, C. and Koudas, N. (2009) The design of a query
monitoring system //ACM Transactions on Database
Systems (TODS). – 2009. – Т. 34. – №. 1.
Leis, V. et al. (2015) How good are query optimizers,
really? //Proceedings of the VLDB Endowment. –
2015. – Т. 9. – №. 3. – pp. 204-215.
Mistrík, I., Bahsoon, R., Ali, N., Heisel, M., & Maxim, B.
(Eds.). (2017). Software Architecture for Big Data and
the Cloud. Morgan Kaufmann.
Odersky, M., Spoon, L., & Venners, B. (2008).
Programming in scala. Artima Inc.
Seber, G. A., and Lee, A. J. (2012). Linear regression
analysis (Vol. 329). John Wiley & Sons.
Tarkoma, S., Rothenberg, C. and Lagerspetz, E. (2012)
“Theory and practice of bloom filters for distributed
systems” IEEE Comms. Surveys and Tutorials, vol. 14,
no. 1, pp. 131–155, 2012.
Tozer, S., Brecht, T. and Aboulnaga, A. (2010) Q-Cop:
Avoiding bad query mixes to minimize client timeouts
under heavy loads //Data Engineering (ICDE), 2010
IEEE 26th International Conference on. – IEEE, 2010.
– pp. 397-408.
TPC org. (2019) “Documentation on TPC-H performance
tests”, tpc.org. [Online]. Available:
http://www.tpc.org/tpc_documents_current_versions/p
df/tpc-h_v2.17.2.pdf. [Accessed: Sept. 22, 2019]
Vavilapalli, V.K., et al. (2013) "Apache hadoop yarn: Yet
another resource negotiator." Proceedings of the 4th
annual Symposium on Cloud Computing. ACM, 2013,
p. 5
Wasserman, T. J. et al. (2004) Developing a
characterization of business intelligence workloads for
sizing new database systems //Proceedings of the 7th
ACM International Workshop on Data Warehousing
and OLAP. – ACM, 2004. – pp. 7-13.
Wu, W. et al. (2013) Predicting query execution time: Are
optimizer cost models really unusable? //Data
Engineering (ICDE), 2013 IEEE 29th International
Conference on. – IEEE, 2013. – pp. 1081-1092.
Xiong, P. et al. (2011) ActiveSLA: a profit-oriented
admission control framework for database-as-a-service
providers //Proceedings of the 2nd ACM Symposium
on Cloud Computing. – ACM, 2011. – P. 15.
Zukerman, M. (2019) Introduction to Queueing Theory and
Stochastic Teletrac Models. [Online]. Available:
http://www.ee.cityu.edu.hk/~zukerman/classnotes.pdf.
[Accessed: Sept. 22, 2019].