Automatic Feature Selection in the SOPFs Dissolution Profiles Prediction Problem

J. E. Salazar Jiménez, J. D. Sánchez Carvajal, B. Quiros-Gómez, J. D. Arias-Londoño

2017

Abstract

This work addressed the problem of dimensionality reduction in the drug dissolution profile prediction task. The learning problem is assumed as a multi-output learning task, since dissolution profiles are recorded in non-uniform sampling times, which avoid the use of basic function-on-scalar regression approaches. Ensemblebased tree methods are used for prediction, and also for the selection of the most relevant features, because they are able to deal with high dimensional feature spaces, when the number of training samples is small. All the drugs considered corresponds to rapid release solid oral pharmaceutical forms. Six different feature selection schemes were tested, including sequential feature selection and genetic algorithms, along with a feature scoring procedure, which was proposed in order to get a consensus about the best subset of variables. The performance was evaluated in terms of the similitude factor used in the drug industry for dissolution profile comparison. The feature selection methods were able to reduce the dimensionality of the feature space in 79.2%, without loss in the performance of the prediction system. The results confirm that in the dissolution profile prediction problem, especially for different solid oral pharmaceutical forms, variables from different components and phases of the drug development must be considered.

Download


Paper Citation


in Harvard Style

Salazar Jiménez J., Sánchez Carvajal J., Quiros-Gómez B. and Arias-Londoño J. (2017). Automatic Feature Selection in the SOPFs Dissolution Profiles Prediction Problem. In - BIOINFORMATICS, (BIOSTEC 2017) ISBN , pages 0-0. DOI: 10.5220/0006141800001488


in Bibtex Style

@conference{bioinformatics17,
author={J. E. Salazar Jiménez and J. D. Sánchez Carvajal and B. Quiros-Gómez and J. D. Arias-Londoño},
title={Automatic Feature Selection in the SOPFs Dissolution Profiles Prediction Problem},
booktitle={ - BIOINFORMATICS, (BIOSTEC 2017)},
year={2017},
pages={},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006141800001488},
isbn={},
}


in EndNote Style

TY - CONF

JO - - BIOINFORMATICS, (BIOSTEC 2017)
TI - Automatic Feature Selection in the SOPFs Dissolution Profiles Prediction Problem
SN -
AU - Salazar Jiménez J.
AU - Sánchez Carvajal J.
AU - Quiros-Gómez B.
AU - Arias-Londoño J.
PY - 2017
SP - 0
EP - 0
DO - 10.5220/0006141800001488