ing to panel three. In order to assess the user’s query
performance skill, we employed a checklist as edu-
cational strategy to get the participants’ feedback by
selecting the correct or best response from a provided
list (as shown in Figure 7
b
). The system prompts the
user to answer some questions, like why query has a
lower/upper cost? with several answers such as ”Is
the data for this query benefit from cached data” etc.
The interactions and feedback of users are saved in
the the event logs. This latter indicates users’ per-
formance based on collected data from users’ actions
during interaction with CF-CBO.
5 CONCLUSION
This paper presents a Conceptual and Methodologi-
cal Framework to explain Database Query Optimizer
to users dealing with query performance and to help
transferring the domain knowledge effectively to that
users. The work is motivated by the complexity of the
CBO entity compared to its cardinality estimates, plan
properties, database operations and its algorithms.
Our approach supports that MDE can be applied to a
real problem in advanced courses of database, in par-
ticular, the cost based optimization. The main contri-
butions of this work are as follows. First, (i) we built
a metamodel to explicit Database Query Optimizer
knowledge and to provide a domain vocabulary that
helps learning query plan, understanding and combin-
ing hints, and customizing query plan. Second, (ii) we
developed an MDE based process that offers different
bricks (operations, hints) that are required in CBO to
facilitate query plan optimization. Finally, (iii) we de-
veloped the tool support of the whole approach. Cur-
rently, we are evaluating how the CF-CBO can sig-
nificantly increase efficiency and effectiveness of the
students understanding in the query optimization do-
main. Nevertheless, this work opens several direc-
tions of further research. First, we are studying to
explore more intensively DBMS inside in order to de-
liver a more informative instruction on database query
optimization at a high level of abstraction. Second,
we project to develop a rule-based errors diagnosis
related to users selection of required hints to enhance
CBO learnability.
REFERENCES
Agarawal, S. and Others (2000). Automated selection
of materialized views and indexes for Sql databses.
In Proceedings of 26th International Conference on
VLDB, Cairo, Egypt, pages 191–207.
Al-Dmour, A. (2010). A cognitive apprenticeship based
approach to teaching relational database analysis and
design. Journal of Information & Computational Sci-
ence, 7(12):2495–2502.
Allenstein, B. and Others (2008). A query simulation sys-
tem to illustrate database query execution. In Pro-
ceedings of the 39th SIGCSE technical symposium on
Computer science education, pages 493–497.
Almeida, A. C. and Others (2019). An ontological perspec-
tive for database tuning heuristics. In International
Conference on ER, pages 240–254. Springer.
Bhagat, M. and Shukla, S. (2014). Tuning the tpc-c bench-
mark using oracle 10 g. IJEMR, 4(5):179–182.
Brown, D. P., Chaware, J., and Koppuravuri, M. (2009).
Index selection in a database system. US Patent
7,499,907.
Chaudhuri, S. and Narasayya, V. (1998). Autoadmin “what-
if” index analysis utility. ACM SIGMOD Record,
27(2):367–378.
Galaktionov, V. and Chernishev, G. (2019). Designing a
DBMS development course with automatic assign-
ment evaluation. In Fourth Conference on SEIM-2019
(full papers), volume 8, page 48.
Gallinucci, E. and Golfarelli, M. (2019). Sparktune: tun-
ing spark Sql through query cost modeling. In EDBT,
pages 546–549.
Jarke, M. and Koch, J. (1984). Query optimization in
database systems. ACM Computing surveys (CsUR),
16(2):111–152.
Keberle, N. and Utkin, I. V. (2012). Teaching conceptual
modeling in er: Chen worlds. In ICTERI, pages 222–
227.
Leventidis, A. and Other (2020). Queryvis: Logic-based di-
agrams help users understand complicated Sql queries
faster. In Proceedings of the 2020 ACM SIGMOD,
pages 2303–2318.
Liu, S., Bhowmick, S. S., Zhang, W., Wang, S., Huang,
W., and Joty, S. (2019). Neuron: Query execution
plan meets natural language processing for augment-
ing DB education. In Proceedings of the 2019 Inter-
national Conference on Management of Data, pages
1953–1956.
Manegold, S., Boncz, P., and Kersten, M. L. (2002).
Generic database cost models for hierarchical mem-
ory systems. In VLDB, pages 191–202.
Ouared, A., Ouhammou, Y., and Bellatreche, L. (2016).
Costdl: a cost models description language for per-
formance metrics in database. In 2016 21st Interna-
tional Conference on Engineering of Complex Com-
puter Systems (ICECCS), pages 187–190. IEEE.
Pavlo, A. et al. (2017). Self-driving database management
systems. In CIDR, volume 4, page 1.
Scherzinger, S. (2019). Have your Students build their own
Mini Hive in just Eight Weeks. In LWDA, pages 38–
41.
Sciore, E. (2007). Simpledb: a simple java-based multiuser
syst for teaching database internals. ACM SIGCSE
Bulletin, 39(1):561–565.
Siddiqui, T. and Other (2020). Cost models for big data
query processing: Learning, retrofitting, and our find-
ings. In Proceedings of the 2020 ACM SIGMOD,
pages 99–113.
MDI4SE 2021 - Special Session on Model-Driven Innovations for Software Engineering
536