dictor features, that is to allow abstentions not
only in the target variable of both the input (i.e.,
training) and the output (i.e., predicted) data, but
also in regard to any other feature of the ground
truth, and of the new instances to classify;
• Finally, in this study we considered only binary
classification problems. Thus, we plan to extend
this study considering also multi–class classifica-
tion tasks and the more general case of learning
from partial labels.
ACKNOWLEDGEMENTS
The authors are grateful to Giuseppe Banfi, for grant-
ing access to the anonymized data of the Datareg reg-
istry and promoting this research.
REFERENCES
Altman, N. S. (1992). An introduction to kernel and nearest-
neighbor nonparametric regression. The American
Statistician, 46(3):175–185.
Berg, M. (1997). Rationalizing medical work: decision-
support techniques and medical practices. MIT press.
Bishop, C. M. (2006). Pattern Recognition and Ma-
chine Learning (Information Science and Statistics).
Springer-Verlag, Berlin, Heidelberg.
Boser, B. E., Guyon, I. M., and Vapnik, V. N. (1992). A
training algorithm for optimal margin classifiers. In
Proceedings of the 5th Annual Workshop on Compu-
tational Learning Theory, COLT ’92, pages 144–152,
New York, NY, USA. ACM.
Breiman, L., Friedman, J. H., Olshen, R. A., and Stone,
C. J. (1984). Classification and regression trees.
Wadsworth statistics/probability series. Wadsworth &
Brooks/Cole Advanced Books & Software.
Brown, J., Bryan, S., and Warren, R. (1996). Mammogra-
phy screening: an incremental cost effectiveness anal-
ysis of double versus single reading of mammograms.
BMJ, 312(7034):809–812.
Cabitza, F., Ciucci, D., and Locoro, A. (2017). Exploiting
collective knowledge with three-way decision theory:
Cases from the questionnaire-based research. Inter-
national Journal of Approximate Reasoning, 83:356 –
370.
Cabitza, F., Ciucci, D., and Rasoini, R. (2019a). A giant
with feet of clay: On the validity of the data that feed
machine learning in medicine. In Organizing for the
Digital World, pages 121–136, Cham. Springer Inter-
national Publishing.
Cabitza, F., Locoro, A., Alderighi, C., Rasoini, R., Com-
pagnone, D., and Berjano, P. (2019b). The elephant in
the record: on the variability of data recording work.
Health Informatics Journal.
Campagner, A. and Ciucci, D. (2018). Three-way and semi-
supervised decision tree learning based on orthopar-
titions. In Information Processing and Management
of Uncertainty in Knowledge-Based Systems. Theory
and Foundations: 748–759. Springer Int. Pub.
Castaneda, C., Nalley, K., Mannion, C., Bhattacharyya, P.,
Blake, P., Pecora, A., Goy, A., and Suh, K. S. (2015).
Clinical decision support systems for improving di-
agnostic accuracy and achieving precision medicine.
Journal of clinical bioinformatics, 5(1):4.
Ciucci, D. (2011). Orthopairs: A simple and widely used
way to model uncertainty. Fundam. Inform., 108:287–
304.
Ciucci, D. (2016). Orthopairs and granular computing.
Granular Computing, 1:159–170.
Cour, T., Sapp, B., and Taskar, B. (2011). Learning from
partial labels. J. Mach. Learn. Res., 12:1501–1536.
Deo, R. (2015). Machine learning in medicine. Circulation
132(20), 1920–1930.
Djulbegovic, B. (2004). Lifting the fog of uncertainty from
the practice of medicine. BMJ, 329(7480):1419–1420.
Dowding, D. and Thompson, C. (2004). Using decision
trees to aid decision-making in nursing. Nursing
times, 100:36–39.
Dowie, J. (1996). The research-practice gap and the role of
decision analysis in closing it. Health Care Analysis,
4(1):5–18.
Eakins, C., Ellis, W. D., Pruthi, S., Johnson, D. P., Hernanz-
Schulman, M., Yu, C., and Kan, J. H. (2012). Sec-
ond opinion interpretations by specialty radiologists
at a pediatric hospital: rate of disagreement and clini-
cal implications. American Journal of Roentgenology,
199(4):916–920.
Ellerman, D. (2013). An introduction to logical entropy and
its relation to shannon entropy. International Journal
of Semantic Computing, 7(2):121–145.
Fawcett, T. (2006). An introduction to roc analysis. Pattern
Recognition Letters, 27(8):861 – 874. ROC Analysis
in Pattern Recognition.
Ferri, C. and Hern
´
andez-Orallo, J. (2004). Cautious clas-
sifiers. In ROC Analysis in Artificial Intelligence, 1st
International Workshop, ROCAI-2004, pages 27–36.
Fisher, R. A. (1936). The use of multiple measurements in
taxonomic problems. Annals of Eugenics, 7(2):179–
188.
Fox, R. C. (2000). Medical uncertainty revisited, pages
409–425. SAGE Publications Ltd, London.
Friedman, J. H. (2001). Greedy function approximation: A
gradient boosting machine. The Annals of Statistics,
29(5):1189–1232.
Glorot, X., Bordes, A., and Bengio, Y. (2011). Deep sparse
rectifier neural networks. In Procs of the 14th Interna-
tional Conference on Artificial Intelligence and Statis-
tics: 315–323 (15).
Glynne-Jones, R. and Hughes, R. (2012). Critical appraisal
of the ‘wait and see’approach in rectal cancer for clini-
cal complete responders after chemoradiation. British
Journal of Surgery, 99(7):897–909.
Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep
Learning. MIT Press.
Exploring Medical Data Classification with Three-Way Decision Trees
157