function. The same features were used by Mariani et
al. (2011a) to feed a soft-margin SVM with
Gaussian kernel and by Mariani et al. (2013), using
a variable window, to feed three LDA classifiers (the
first for the background classification, the second for
the A1 classification and the third for the A2 and A3
classification). The A phase classification was
determined by combining the classification vectors.
SVM achieved the highest average results.
Mariani et al. (2012) used the same features and
four classifiers (NN, LDA, SVM and the Adaptive
Boosting classifier, AdaBoost, with 20 weak
learners) verifying that LDA provided the best
results. Machado et al. (2016) used a macro-micro
structure descriptor, the Teager energy operator
(TEO), Lempel-Ziv complexity, Zero-Crossing,
empirical mode decomposition, Shannon entropy
and variance as features to feed three classifiers,
LDA, SVM and kNN. It was determined that SVM
produces the highest accuracy.
By analysing the A phase detection proposals it
was possible to identify the features indicated as the
most relevant: five frequency band descriptors;
Hjorth activity; differential variance; TEO; Lempel-
Ziv complexity; Zero-Crossing; Shannon entropy;
empirical mode decomposition; macro-micro
structure descriptor. It is also possible to determine
that LDA, SVM, kNN and NN are the most suited
classifiers.
The main objective of this work is to propose
new features using the LDA, since it was determined
to be the classifier that achieved the highest
performance in the state of the art analysis. A
comparison with the features indicated by Mariani et
al. (2012) was also implemented since this work
reported the highest performance of the
bibliographical analysis. The results were achieved
using the LDA.
The majority of the presented works remove the
REM periods from the analysis, increasing the
overall performance of the algorithms. In this work a
different approach was used, keeping all the sleep
data, making the developed algorithms of this work
more suitable for automatic system implementation.
3 MATERIALS AND METHODS
A systematic review was performed to determine the
best approach for CAP classification. The chosen
method first classifies the A and B phases and after
uses a FSM to determine the CAP. A public
database was used for training and testing the
classifier and the FSM in a programming
environment.
The employed features are a mix of some
identified in the state of the art as the most relevant
and some new ones proposed. The first test involved
the use of all features and sequential feature
selection (SFS) was applied in the second test to
choose the best features for the classifier.
Principal component analysis (PCA) was used in
the third test to generate the features independently
from the classifier and the final test was the use of
the features indicated by Mariani et al. (2012) in the
developed algorithm.
3.1 Database
A public database from PhysioNet (Terzano et al.,
2001), with specific annotations of the macro and
microstructure made by trained neurologists, was
employed in the tests. A total of 14 recordings were
used, being recorded using the 10-20 international
system and monopolar derivations (C4-A1 or C3-
A2). The annotations include the sleep stage, event
description and duration.
The sleep analysis varies between six hours and
thirty minutes and nine hours and fifty minutes. The
subjects age varies between 23 and 78 years, being
nine males and five females. 50000 samples were
used in average in each of the employed datasets
(data from three subjects), either for test or training.
In both cases train/test with two datasets and
validate with the left off subject, repeating multiple
times until all subjects were used at least one time
for validation. The EEG signals were imported to the
programming environment Matlab 9.0 (The
Mathworks Inc.) for the analysis.
3.2 Feature Set
The features determined in the review as the best
ones for A phase detection were tested. A two
second time window was used, chosen due to be the
minimum A phase duration.
TEO and Shannon entropy presented good
discriminatory capabilities. The five band
descriptors provided a lower accuracy when
compared to the analysis of power spectral density
(PSD) of each band. The same conclusion occur
when comparing the differential variance with the
autocovariance. The time series analysis could also
be used, since the average power and the standard
variation presented a good correlation with the
presence of the A phases. Other relevant feature
ICPRAM 2018 - 7th International Conference on Pattern Recognition Applications and Methods
396