Authors:
Wendy Oude Nijeweme – d’Hollosy
1
;
2
;
Lex van Velsen
1
;
Mannes Poel
3
;
Catharina G. M. Groothuis-Oudshoorn
4
;
Remko Soer
5
;
6
;
Patrick Stegeman
6
and
Hermie Hermens
1
Affiliations:
1
Roessingh Research and Development, eHealth Cluster, Enschede, The Netherlands
;
2
University of Twente, EEMC/Biomedical Signals & Systems, Techmed, Personalised eHealth Technology, The Netherlands
;
3
University of Twente, EEMC/Data Science, The Netherlands
;
4
University of Twente, BMS/Health Technology and Services Research, Enschede, The Netherlands
;
5
Saxion University of Applied Science, Enschede, The Netherlands
;
6
University of Groningen, University Medical Center Groningen, Spine Center, Groningen, The Netherlands
Keyword(s):
Classification Algorithms, Clinical Decision Support Systems, Low Back Pain, Machine Learning.
Abstract:
The objective of this pilot study was to determine whether machine learning can be applied on patient-reported data to model decision-making on treatments for low back pain (LBP). We used a database of a university spine centre containing patient-reported data from 1546 patients with LBP. From this dataset, a training dataset with 354 features (input data) was labelled on treatments (output data) received by these patients. For this pilot study, we focused on two treatments: pain rehabilitation and surgery. Classification algorithms in WEKA were trained, and the resulting models were validated during 10-fold cross validation. Next to this, a test dataset was constructed - containing 50 cases judged on treatments by 4 master physician assistants (MPAs) - to test the models with data not used for training. We used prediction accuracy and average area under curve (AUC) as performance measures. The interrater agreement among the 4 MPAs was substantial (Fleiss Kappa 0.67). The AUC values
indicated small to medium (machine) learning effects, meaning that machine learning on patient-reported data to model decision-making processes on treatments for LBP seems possible. However, model performances must be improved before these models can be used in real practice.
(More)