In the extended chunking approach, we divided
the overall index selection problem into multiple
smaller sub-problems, which are solved individually.
The selected indexes of these sub-problems are then
put together and the best selection among these can-
didates will be determined in a final step. We showed
that, compared to the optimal solution of the basic
problem, this heuristic performs near-optimal and al-
lows to significantly reduce the overall solution time.
For the multi-index configuration extension, the
granularity of the possible options was changed from
the index level to the index configuration level, where
each configuration represents a combination of in-
dexes (e.g., a maximum of two). We showed that our
formulation is viable for standard solvers. The results
show that the execution time overhead is substantial
in small scenarios but decreases with an increasing
number of indexes.
The extension to stochastic workloads takes mul-
tiple workload scenarios into account. Such different
scenarios may be derived from historical data within
specific time spans. In this framework, different ob-
jectives were used to minimize: (1) the total workload
costs, (2) the worst-case workload costs, (3) a mean-
variance criterion, and (4) a weighted combination of
the first three objectives. Our results show that the tar-
geted effect to avoid bad and uneven performances is
achieved.
In the fourth extension with transition costs, we
addressed the additional challenge to create and re-
move indexes in the presence of an existing configu-
ration while balancing performance and minimal re-
quired reconfiguration costs. In our approach, we
used an extended penalty-based objective to endog-
enize creation and removal costs. We find that in-
volving transition costs makes it possible to iden-
tify minimal-invasive reconfigurations of index selec-
tions, which helps to manage them over time, e.g.,
under changing workloads.
Finally, our concepts, i.e., chunking, multi-index
configurations, stochastic workloads, and transition
costs, are designed such that they can be combined.
REFERENCES
Casey, R. G. (1972). Allocation of copies of a file in an
information network. In AFIPS, pages 617–625.
Chaudhuri, S. and Narasayya, V. (2020). Anytime Algo-
rithm of Database Tuning Advisor for Microsoft SQL
Server. https://www.microsoft.com/en-us/research/
publication/anytime-algorithm-of-database-tuning-\
advisor-for-microsoft-sql-server, visited 2020-06-04.
Chaudhuri, S. and Narasayya, V. R. (1997). An efficient
cost-driven index selection tool for Microsoft SQL
Server. In Proc. VLDB’97, pages 146–155.
Dash, D., Polyzotis, N., and Ailamaki, A. (2011). CoPhy:
A scalable, portable, and interactive index advisor for
large workloads. PVLDB, 4(6):362–372.
Finkelstein, S. J., Schkolnick, M., and Tiberio, P. (1988).
Physical database design for relational databases.
ACM Trans. Database Syst., 13(1):91–128.
Fourer, R., Gay, D., and Kernighan, B. (2003). AMPL: A
Modeling Language for Mathematical Programming.
Thomson/Brooks/Cole.
Kormilitsin, M., Chirkova, R., Fathi, Y., and Stallmann,
M. (2008). View and index selection for query-
performance improvement: Algorithms, heuristics
and complexity. In CIKM08: Proceedings of the 17th
ACM conference on Information and knowledge man-
agement, volume 2, pages 1329–1330.
Kossmann, J., Halfpap, S., Jankrift, M., and Schlosser, R.
(2020). Magic mirror in my hand, which is the best in
the land? An experimental evaluation of index selec-
tion algorithms. In PVLDB, volume 13, pages 2382–
2395.
Kossmann, J., Kastius, A., and Schlosser, R. (2022). Swirl:
Selection of workload-aware indexes using reinforce-
ment learning. In working paper.
Kossmann, J. and Schlosser, R. (2020). Self-driving
database systems: A conceptual approach. Distributed
and Parallel Databases, 38(4):795–817.
Papadomanolakis, S., Dash, D., and Ailamaki, A. (2007).
Efficient use of the query optimizer for automated
database design. In Proc. VLDB 2007, pages 1093–
1104.
Pavlo, A. et al. (2017). Self-driving database management
systems. In CIDR 2017.
Schlosser, R. and Halfpap, S. (2020). A decomposition
approach for risk-averse index selection. In SSDBM,
pages 16:1–16:4.
Schlosser, R., Kossmann, J., and Boissier, M. (2019). Effi-
cient scalable multi-attribute index selection using re-
cursive strategies. In ICDE, pages 1238–1249.
Schnaitter, K., Polyzotis, N., and Getoor, L. (2009). In-
dex interactions in physical design tuning: Modeling,
analysis, and applications. In Proc. VLDB’09, vol-
ume 2, pages 1234–1245.
Sharma, A., Schuhknecht, F. M., and Dittrich, J. (2018).
The case for automatic database administration using
deep reinforcement learning. CoRR, abs/1801.05643.
Valentin, G., Zuliani, M., Zilio, D. C., Lohman, G. M., and
Skelley, A. (2000). DB2 Advisor: An optimizer smart
enough to recommend its own indexes. In Proc. ICDE,
pages 101–110.
ICORES 2022 - 11th International Conference on Operations Research and Enterprise Systems
38