One limitation of our work is that we only looked
at Pbench data, available to us through Red Hat.
There might be other datasets out there that might be
a better fit for Linux configuration tuning, but these
are not easily available due to privacy concerns. If
a sufficiently diverse dataset becomes available, we
would like to attempt tuning with that data as future
work. Also, our analysis of the dataset filters features
by correlation and thus might ignore non-linear re-
lationships. These could be studied in the future as
well.
REFERENCES
Acher, M., Martin, H., Alves Pereira, J., Blouin, A., Ed-
dine Khelladi, D., and J
´
ez
´
equel, J.-M. (2019a). Learn-
ing From Thousands of Build Failures of Linux Kernel
Configurations. Technical report, Inria ; IRISA.
Acher, M., Martin, H., Pereira, J. A., Blouin, A., J
´
ez
´
equel,
J.-M., Khelladi, D. E., Lesoil, L., and Barais, O.
(2019b). Learning Very Large Configuration Spaces:
What Matters for Linux Kernel Sizes. Research re-
port, Inria Rennes - Bretagne Atlantique.
Agarwal, R. The 5 feature selection algorithms every data
scientist should know. https://towardsdatascience.c
om/the-5- feature-selection-algorithms- every-dat
a-scientist-need-to-know-3a6b566efd2. [Accessed
12-11-2021].
B.V, E. The heart of the free and open elastic stack. https:
//www.elastic.co/elasticsearch/. [Accessed 03-08-
2020].
B.V, E. Your window into the elastic stack. https://www.el
astic.co/kibana. [Accessed 05-01-2020].
Cao, Z., Kuenning, G., and Zadok, E. (2020). Carver: Find-
ing important parameters for storage system tuning. In
18th USENIX Conference on File and Storage Tech-
nologies (FAST 20), pages 43–57, Santa Clara, CA.
USENIX Association.
Cao, Z., Tarasov, V., Tiwari, S., and Zadok, E. (2018).
Towards better understanding of black-box Auto-
Tuning: A comparative analysis for storage sys-
tems. In 2018 USENIX Annual Technical Confer-
ence (USENIX ATC 18), pages 893–907, Boston, MA.
USENIX Association.
Chase, J., Gallatin, A., and Yocum, K. (2001). End system
optimizations for high-speed tcp. IEEE Communica-
tions Magazine, 39(4):68–74.
Chen, H., Jiang, G., Zhang, H., and Yoshihira, K.
(2009). Boosting the performance of computing sys-
tems through adaptive configuration tuning. In Pro-
ceedings of the 2009 ACM Symposium on Applied
Computing, SAC ’09, page 1045–1049, New York,
NY, USA. Association for Computing Machinery.
Corporation, S. P. E. Spec’s benchmarks and tools. https:
//www.spec.org/benchmarks.html. [Accessed 03-10-
2021].
Mesnier, M. P., Wachs, M., Sambasivan, R. R., Zheng,
A. X., and Ganger, G. R. (2007). Modeling the rela-
tive fitness of storage. In Measurement and Modeling
of Computer Systems.
Nadgir, N., Tuxen, M., and Kononenko, V. (2009). uperf.
https://github.com/uperf/uperf. [Accessed 08-25-
2019].
Ozisikyilmaz, B., Memik, G., and Choudhary, A. (2008).
Machine learning models to predict performance of
computer system design alternatives. In 2008 37th In-
ternational Conference on Parallel Processing, pages
495–502.
RedHat (2018). Performance tuning guide for rhel 7. https:
//access.redhat.com/documentation/en-us/red\ hat
\ enterprise\ linux/7/html/performance\ tuning\ g
uide/index. [Accessed 07-16-2019].
Reeves, B. M., Hunsaker, J., Moravec, P., and Castillo, J.
(2014). Sos. https://github.com/sosreport/sos. [Ac-
cessed 04-02-2020].
Saboori, A., Jiang, G., and Chen, H. (2008). Autotun-
ing configurations in distributed systems for perfor-
mance improvements using evolutionary strategies.
In 2008 The 28th International Conference on Dis-
tributed Computing Systems, pages 769–776.
Team, G. L. Understanding xgboost algorithm — what is
xgboost algorithm? https://www.mygreatlearning.co
m/blog/xgboost-algorithm/. [Accessed 06-15-2019].
Theurer, A., Portante, P., Dokos, N., and Rister, K. (2015).
pbench. https://github.com/distributed-system-analy
sis/pbench. [Accessed 06-09-2019].
Xi, B., Liu, Z., Raghavachari, M., Xia, C. H., and Zhang, L.
(2004). A smart hill-climbing algorithm for applica-
tion server configuration. In Proceedings of the 13th
International Conference on World Wide Web, WWW
’04, page 287–296, New York, NY, USA. Association
for Computing Machinery.
Yiu, T. Understanding random forest: how the algorithm
works and why it is so effective. https://towardsdatas
cience.com/understanding-random-forest-58381e060
2d2. [Accessed 08-10-2019].
Zhu, Y., Liu, J., Guo, M., Ma, W., and Bao, Y. (2017). Acts
in need: Automatic configuration tuning with scal-
ability guarantees. In Proceedings of the 8th Asia-
Pacific Workshop on Systems, APSys ’17, New York,
NY, USA. Association for Computing Machinery.
ICPRAM 2024 - 13th International Conference on Pattern Recognition Applications and Methods
778