sionalites better. The difference in PSO run time is
more than 10-fold between MLR and SVM, on the
other hand, resulting in a trade-off between classifica-
tion performance and speed of computation. Since the
PSO algorithm is iterative in nature, and thus fairly
computer intensive, the MLR alternative can be pre-
ferred during the feature selection process.
The feature selection frequency maps were thresh-
olded by including only voxels in the outlier range
(figure 2), which does not, naturally, guarantee that
these voxels differ significantly between conditions.
Proper significance thresholding can, however, be
easily performed using non-parametric permutation
testing.
When comparing the feature selection frequency
with the univariate feature ranking (figure 2), it can
be seen that, with both classifiers, roughly the same
features have a high selection frequency. These are
all located in an area that is consistent with the known
anatomical location of the secondary somatosensory
cortex, as confirmed by the general linear model T-
map. Moreover, the voxel selection maps (figure
2) appear virtually identical, and it can thus be as-
sumed that, for multivariate activation localization, ei-
ther SVM or MLR can be used with similar results.
Also, if maximal classification scores are required,
the SVM can be applied on the final selected voxel
subset.
The most frequently selected voxels, however, do
not have the highest ranking values (table 2), showing
that univariate ranking of the features influences but
does not dominate the feature selection process.
The low scores achieved when using only the out-
lier voxels as input into a classifier, indicates that
high-scoring subsets must contain a large variety of
features, including some specific key voxels. These
key voxels appear essential to high accuracy discrim-
ination of conditions, but are poor as individual pre-
dictors. The key voxels can be identified by repeti-
tions of the PSO-algorithm, and subsequent investi-
gation of the feature selection frequency. This is evi-
dence for a distributed nature of brain activation pat-
terns, where optimal voxel subsets may include fea-
tures that, when analyzed individually, do not indicate
any significant difference between conditions. More-
over, this prompts the need for multivariate feature
selection allowing for distributed voxel subsets.
5 CONCLUSIONS
Our proposed particle swarm optimization approach
is effective for fMRI pattern classification, and, more-
over, warrants a user-friendly implementation. Also,
the algorithm can be used to localize voxels that are
highly involved in processing of given conditions.
Simple and fast multiple linear regression approach
appears suitable for the localization of relevant vox-
els, whereas for situations where high-accuracy clas-
sification is required SVMs are highly recommended.
ACKNOWLEDGEMENTS
This study was supported by the Swedish Research
Council (grant K2007-63X-3548) and the Sahlgren-
ska University Hospital (grant ALFGBG 3161). We
are grateful to L. L
¨
oken and K. Rylander for supply-
ing the fMRI data.
REFERENCES
˚
Aberg, M. C., L
¨
oken, L., and Wessberg, J. (2008). An evo-
lutionary approach to multivariate feature selection for
fMRI pattern analysis. Proceedings of the Interna-
tional Conference on Bio-inspired Systems and Signal
Processing.
Bellman, R. E. (1961). Adaptive Control Processes. Prince-
ton University Press, Princeton, NJ.
Bj
¨
ornsdotter
˚
Aberg, M. and Wessberg, J. (2008). An evolu-
tionary approach to the identification of informative
voxel clusters for brain state discrimination. IEEE
Journal of selected Topics in Signal Processing, ((in
press)).
Blum, A. and Langley, P. (1997). Selection of relevant fea-
tures and examples in machine learning. Artificial In-
telligence, 97(1-2):245–271.
Friston, K. J., Holmes, A. P., Worsley, K. J., Poline, J. P.,
Frith, C. D., and Frackowiak, R. S. J. (1994). Statisti-
cal parametric maps in functional imaging: A general
linear approach. Human Brain Mapping, 2(4):189–
210.
Kennedy, J. and Eberhart, R. (1995). Particle swarm opti-
mization. Proceedings of IEEE International Confer-
ence on Neural Networks, 4:1942–1948.
Norman, K. A., Polyn, S. M., Detre, G. J., and Haxby, J. V.
(2006). Beyond mind-reading: multi-voxel pattern
analysis of fMRI data. Trends in Cognitive Sciences,
10(9):Pages 424–430.
Olausson, H., Lamarre, Y., Backlund, H., Morin, C., Wallin,
B. G., Starck, G., S.Ekholm, Strigo, I., Worsley, K.,
Vallbo, A. B., and Bushnell, M. C. (2002). Unmyeli-
nated tactile afferents signal touch and project to insu-
lar cortex. Nature Neuroscience, 5(9):900–904.
Suykens, J., Gestel, T. V., Brabanter, J. D., Moor, B. D., and
Vandewalle, J. (2002). Least Squares Support Vector
Machines. World Scientific, Singapore.
Wang, X., Yang, J., Teng, X., Xia, W., and Jensen, R.
(2007). Feature selection based on rough sets and par-
ticle swarm optimization. Pattern Recognition Letters,
28:459–471.
BIOSIGNALS 2009 - International Conference on Bio-inspired Systems and Signal Processing
284