4 CONCLUSIONS
We actually know that there are not universal
datamining techniques or methodologies to deal with
every kind of problem or task. For length of hospital
stay prediction we think that only unsupervised
models can achieve the best results, because there is
a lack precise guidelines and best practices capable
to infer exactly the period of staying of patients,
especially in those contexts characterized by rapid
changes in technologies and organizational settings.
In other words the knowledge of human experts in
these cases cannot be exploited to define an accurate
LoS prediction system.
For this reason in our research we have focused on
unsupervised machine learning algorithms, in
particular clustering algorithms and self-organizing
networks.
We have obtained encouraging results through the
use of subsymbolic models like the Growing Neural
Gas by B. Fritzke in a previous research work, but
now we are trying to develop more “intelligent” data
analysers which are also capable to give a human-
understandable explanation of their predictions. A
response produced according to a logic formalism
could indeed support decision makers in their health
resources and services management activities.
That is why we have chosen an A-type unorganised
Turing machine to process the admission forms of
hospital patients. The structure itself of the model
could be used like a kind of “dynamic” guideline to
be taken into consideration by a group of human
experts in order to optimally organize the healthcare
activities performed on patients.
The knowledge acquired by an unorganised Turing
machine through its pattern of NAND gates
connections could also be used to produce an
explanation of the reasons that led the system to its
LoS predictions as we have demonstrated in this
preliminary work.
We stopped the training just after having reached the
prediction accuracy of the most performant decision
tree algorithm represented by the J48. Also this
model could be used to build a knowledge
representation to approach the LoS prediction
problem. But its tree-like structure probably is too
simple to generate the complex set of rules to be
used in these kind of decision processes.
We think that these first results can be further
improved adopting another unorganised Turing
machine model, that is the B-type one (Turing,
1948). Also a B-type may contain any number of
NAND gates connected in any pattern. Turing just
added the further condition that each unit-to-unit
connection must pass through a modifier device. The
modifier state can be set in “pass mode”, in which
the output of a NAND gate passes through it
unchanged, or in “interrupt mode”, in which the
signal is always 1, no matter what the output of the
NAND gate is (Copeland and Proudfoot, 1996). The
presence of the modifiers can enable what Turing
described as “appropriate interference, mimicking
education”.
We are going to design and test a two-phase
training, similar to the one proposed by Teuscher
and Sanchez (2000), with a first “evolutive” phase
where the best network configuration is selected,
and a “learning” phase where the switches of
NAND gates are enabled and properly configured to
optimize the prediction accuracy rate.
ACKNOWLEDGEMENTS
Special thanks go to Eng. Antonio Di Giorgio for his
support in Weka datamining processes.
REFERENCES
Agrawal R., Srikant R., 1994. Fast Algorithms for Mining
Association Rules. Proc. Of the 20th VLDB
Conference, Santiago, Chile, 1994.
Arab M., Zarei A., Rahimi A., Rezaiean F., Akbari F.,
2010. Analysis of factors affecting length of stay in
public hospitals in Lorestan Province, Iran, Hakim
Res, Vol. 12, No.4, 2010, pp.27-32.
Baluja S., Caruana R., 1995. Removing the genetics from
the standard genetic algorithm ICML.
Chang K.C., Tseng M.C., Weng H.H., Lin Y.H., Liou
C.W., Tan T.Y., 2002. Prediction of length of stay of
first-ever ischemic stroke, Stroke, Vol. 33, No.11,
2002 pp.2670-4.
Copeland B.J., Proudfoot D., 1996, Alan Turing’s
forgotten ideas in computer science. Sci.Am. n.280,
pp. 76-81.
Fritzke B., 1994. A Growing Neural Gas Network Learns
Topologies. Part of: Advances in Neural Information
Processing Systems 7, NIPS, 1994.
Gomez V., Abasolo J.E., 2009. Using data mining to
describe long hospital stays, Paradigma, Vol. 3, No.1,
2009, pp.1-10.
Gorunescu F., El-Darzi E., Belciug S., Gorunescu M.,
2010. Patient grouping optimization using hybrid Self-
Organizing Map and Gaussian Mixture Model for
length of stay-based clustering system, Intelligent
Systems (IS), 2010 5
th
International Conference.
Holte R.C., 1993. Very simple classification rules perform
well on most commonly used datasets, Machine
Learning, 1993.
HEALTHINF 2018 - 11th International Conference on Health Informatics
406