
Table 2: Overview of all four scenarios including the overarching as well as the time dependent view (*actual LoS of 0 left
out).
LoS Error Occupancy Error
Scenario Mean MAE MAPE Mean MAE MAPE
Overarching
Scenario 1 0.98 2.34 134.25 42.05 42.10 21.80
Scenario 2 −0.25 2.35 134.28 −7.53 9.82 7.40*
Scenario 3 0.49 1.17 67.12 22.13 22.20 11.95
Scenario 4 −0.12 1.17 67.14 −2.68 5.78 4.04
Dependent
Scenario 1 0.98 2.34 134.25 −5.99 29.87 88.58
Scenario 2 −0.25 2.35 134.28 −13.44 28.30 89.01
Scenario 3 0.49 1.17 67.12 4.13 34.08 89.08
Scenario 4 −0.12 1.17 67.14 −2.74 31.57 89.01
7 CONCLUSION
Overall we made three major contributions in the pa-
per. First, we introduced a translation scheme from
well-researched LoS prediction to the bed occupancy
that is needed for a hospital administrator to work
with. Second, we show-cased how different improve-
ments in the state-of-the-art LoS prediction would
impact the accuracy of the bed occupancy predic-
tion and thus gave clear tasks for further research
in the machine learning community. Third, we dis-
cussed a time-depended hospital administrator view,
that showed the importance of individual information
about patients for adequately predicting a realistic bed
occupancy.
There are a couple of further research questions
that can be tackled based on this paper. One future
research direction is to include more intelligent han-
dling of the time-depended view, i.e. a better way of
including yet unknown patients based on seasonal or
other time-depended patterns. Another research di-
rection would be to validate the approach in a clinic
where patient’s LoS is recorded independently from
bed occupation. There might be effects (e.g. block-
ings, room dependencies, etc.) that lead to a more
noisy relationship between LoS and bed occupancy
than assumed in this paper which could be interesting
to research. Additionally, not only CatBoost should
be considered as a model to predict the LoS of a pa-
tient’s admission and it would be interesting to test
different models on different datasets. Many fac-
tors have a high impact on the LoS of a patient’s
admission, as shown by Winter et al. (2023), where
some are directly available in the dataset and others
are engineered from available features in the dataset.
However, some are hidden in the hospitals policies,
staffing levels, etc. which are not available in the data.
In the future, we aim to collaborate with an hospi-
tal on an interdisciplinary level, ensuring these factors
are thoroughly considered and addressed.
ACKNOWLEDGEMENTS
The research for the paper was funded by the state
of Schleswig-Holstein as part of the APONA project,
project no. 220 23 020.
REFERENCES
Baek, H., Cho, M., Kim, S., Hwang, H., Song, M., and
Yoo, S. (2018). Analysis of length of hospital stay
using electronic health records: A statistical and data
mining approach. PLOS ONE, 13(4):e0195901.
Belciug, S. and Gorunescu, F. (2015). Improving hospital
bed occupancy and resource utilization through queu-
ing modeling and evolutionary computation. Journal
of biomedical informatics, 53:261–269.
Buttigieg, S. C., Abela, L., and Pace, A. (2018). Vari-
ables affecting hospital length of stay: a scoping re-
view. Journal of Health Organization and Manage-
ment, 32(3):463–493.
Farmer, R. and Emami, J. (1990). Models for forecasting
hospital bed requirements in the acute sector. Journal
of Epidemiology & Community Health, 44(4):307–
312.
Forster, A. J., Stiell, I., Wells, G., Lee, A. J., and Van Wal-
raven, C. (2003). The effect of hospital occupancy on
emergency department length of stay and patient dis-
position. Academic Emergency Medicine, 10(2):127–
133.
Gentimis, T., Alnaser, A. J., Durante, A., Cook,
K., and Steele, R. (2017). Predicting hospital
length of stay using neural networks on MIMIC
III data. 2017 IEEE 15th Intl Conf on De-
pendable, Autonomic and Secure Computing, 15th
Intl Conf on Pervasive Intelligence and Comput-
ing, 3rd Intl Conf on Big Data Intelligence and
Computing and Cyber Science and Technology
Congress(DASC/PiCom/DataCom/CyberSciTech).
Gorunescu, F., McClean, S. I., and Millard, P. H. (2002). A
queueing model for bed-occupancy management and
planning of hospitals. Journal of the operational Re-
search society, 53:19–24.
Aggregating Predicted Individual Hospital Length of Stay to Predict Bed Occupancy for Hospitals
183