Another aspect to be discussed is the selection of
the thresholds τ defining the binary feature space. For
this paper, an automated approach was used, selecting
initial thresholds based on the quantiles of the data
distribution. In later stages, adjacent intervals thus
defined are merged if their coefficients are equal.
6 CONCLUSION
In this paper, ICS* was introduced as an extension
of ICS. It allows to infer relevant effects, including
interactions, from given data and construct a scoring
system by solving a minimization problem. After in-
troduction of the changes applied to ICS, ICS* was
subjected to a sensitivity study on synthetic data. The
study showed that resampling can be used to improve
the robustness of the method. Furthermore, it also
indicated robustness to noise, training set size and
the number of additional non-informative variables.
However, both set size and number of variables were
shown to have a large impact on execution time. Fi-
nally, ICS* was applied to two UCI data sets with
good results.
Future work will investigate the formulation of
a more advanced approach to the initial estimation
of the τ thresholds. A better estimation of the final
thresholds from the beginning reduces the complexity
of the problem to be solved, since it relates directly to
the dimensionality of the expanded feature space.
Another goal is the formulation of the quadratic
transformation of ICS*. This would ensure the
uniqueness of the solution for a given data set. Fur-
thermore, row-action methods could be applied to
achieve a reduction of the execution time. More gen-
erally, approaches other than the LP, e.g. sparse inte-
ger solutions, could have interesting characteristics.
Finally, the problem to be solved is essentially
a combination of variable selection (sparsity on the
level of the original variables) and minimization of
the number of steps within each effect (sparsity on
the level of coefficient differences). Such a combined
criterion can be tackled by methods as group sparse
LASSO (Simon et al., 2013) for fast convergence to
the optimal solution.
ACKNOWLEDGEMENTS
This research was supported by: Bijzonder Onder-
zoeksfonds KU Leuven (BOF), Center of Excellence
(CoE): PFV/10/002 (OPTEC); KULeuven IDO fund-
ing: #3E140722 Sensor-based Platform for the Accu-
rate and Remote monitoring of Kine(ma)tics Linked
to E-health (SPARKLE); Belgian Federal Science
Policy Office: IUAP #P7/19/ (DYSCO, ‘Dynami-
cal systems, control and optimization’, 2012-2017).
VVB is a postdoctoral fellow of the Research Foun-
dation - Flanders (FWO).
REFERENCES
Chowriappa, P., Dua, S., and Todorov, Y. (2014). Introduc-
tion to machine learning in healthcare informatics. In
Machine Learning in Healthcare Informatics, pages
1–23. Springer.
da Rocha Neto, A. R., Sousa, R., de A. Barreto, G., and
Cardoso, J. S. (2011). Diagnostic of pathology on
the vertebral column with embedded reject option. In
Vitri, J., Sanches, J., and Hernndez, M., editors, Pat-
tern Recognition and Image Analysis, volume 6669 of
Lecture Notes in Computer Science, pages 588–595.
Springer Berlin Heidelberg.
Davenport, M., Duarte, M., Eldar, Y., Kutyniok, G., et al.
(2012). Compressed sensing: theory and applications.
Cambridge University Press Cambridge.
Duch, W., Adamczak, R., Grabczewski, K., Ishikawa, M.,
and Ueda, H. (1997). Extraction of crisp logical rules
using constrained backpropagation networks. In Proc.
of the European Symposium on Artificial Neural Net-
works (ESANN).
Jeong, B.-H., Koh, W.-J., Yoo, H., Um, S.-W., Suh, G. Y.,
Chung, M. P., Kim, H., Kwon, O. J., and Jeon,
K. (2013). Performances of prognostic scoring sys-
tems in patients with healthcare-associated pneumo-
nia. Clinical Infectious Diseases, 56(5):625–632.
Lichman, M. (2013). UCI machine learning repository.
http://archive.ics.uci.edu/ml. last accessed 20/5/2015.
Mounzer, R., Langmead, C. J., Wu, B. U., Evans, A. C.,
Bishehsari, F., Muddana, V., Singh, V. K., Slivka, A.,
Whitcomb, D. C., Yadav, D., Banks, P. A., and Pa-
pachristou, G. I. (2012). Comparison of existing clin-
ical scoring systems to predict persistent organ failure
in patients with acute pancreatitis. Gastroenterology,
142(7):1476 – 1482.
Simon, N., Friedman, J., Hastie, T., and Tibshirani, R.
(2013). A sparse-group lasso. Journal of Computa-
tional and Graphical Statistics.
Sra, S. (2006). Efficient large scale linear programming
support vector machines. In ECML 2006, pages
767–774, Berlin, Germany. Max-Planck-Gesellschaft,
Springer.
Suykens, J. A., Van Gestel, T., De Brabanter, J., De Moor,
B., Vandewalle, J., Suykens, J., and Van Gestel, T.
(2002). Least squares support vector machines, vol-
ume 4. World Scientific.
Ustun, B., Trac, S., and Rudin, C. (2013). Supersparse lin-
ear integer models for predictive scoring systems. In
Proceeding of the 27th AAAI Conference on Artificial
Intelligence (AAAI-13), pages 128–130.
Van Belle, V., Van Calster, B., Timmerman, D., Bourne,
T., Bottomley, C., Valentin, L., Neven, P., Van Huf-