Authors:
Lerina Aversano
1
;
Mario Luca Bernardi
1
;
Vincenzo Calgano
2
;
Marta Cimitile
3
;
Concetta Esposito
2
;
Martina Iammarino
1
;
Marco Pisco
2
;
Sara Spaziani
2
and
Chiara Verdone
1
Affiliations:
1
University of Sannio, Department of Engineering, Benevento, Italy
;
2
University of Sannio, Optoelectronic Division - Engineering Department, Benevento, Italy
;
3
Unitelma Sapienza University, Rome, Italy
Keyword(s):
Machine Learning, Classification, Raman Spectroscopy Analisys, Health Informatics.
Abstract:
Since cancer represents one of the leading causes of death worldwide, the development of approaches capable of discerning healthy from diseased cells would be of fundamental importance to support diagnostic and screening techniques. Raman spectroscopy is the most effective molecular analysis technique currently available and provides information on the molecular composition, bonds, chemical environment, phase, and crystalline structure of the samples under examination. This work exploits a combination of Raman spectroscopy and machine learning models to discriminate patients’ liver cells between tumor and non-tumor. The research uses real patient data, provided by the Center for Nanophotonics and Optoelectronics for Human Health (CNOS), which analyzed the cells of a patient with liver cancer. Specifically, the dataset has been built through a long data collection process, which first involved the analysis of the cells with Raman spectroscopy and then the training of two classifiers,
Decision Tree and Random Forest. The results show good performance for the trained classifiers, especially those relating to the Random Forest, which reaches an accuracy of 90%.
(More)