Figure 9: Scalability.
ing number of cores, the processing time decreased
almost linearly and then became constant just under 2
seconds once we reached 28 cores and we were able
to process our EEG data in real time. The experiment
shows that Spark streaming scales well for EEG data
with increasing number of cores with low latency.
Overall, Spark streaming performs well with EEG
data, but we also notice that it has some limitations,
e.g., it does not allow making windows on the basis of
input data time stamps and is only allowed on the ba-
sis of time duration that gives variable number of data
records in each window in a distributed environment.
7 CONCLUSIONS AND FUTURE
WORK
We have presented a parallel lightweight method for
epileptic seizure detection in large EEG data as real
time streams. We provided the architecture, work-
flow and Spark streaming implementation of our al-
gorithm. In an experimental scenario, our lightweight
algorithm was able to detect seizures in real time with
low latency and with good overall seizure detection
rate. Also, we have introduced a new feature, “top-k
amplitude measure” for data reduction. On the basis
of results, we believe that this method can be used
in clinics for epileptic patients. We noticed that al-
though Spark streaming scales well and produces re-
sults in real time with low latency, currently it is bet-
ter suitable for processing unstructured data and needs
improvements in providing facilities for scientific ap-
plications. Overall, we believe that Spark streaming
has potential for real time EEG analysis. In future we
might improve it for easier use for scientists, and this
study is a great first step.
REFERENCES
(2000). Chb-mit scalp eeg database. In Available at
http://physionet.org/pn6/chbmit/.
Ahmed, L. (2015). Github, inc. In
https://github.com/laeeq80/RealTimeEEG.
Bifet, A. (2013). Mining big data in real time. In Informat-
ica 37.1.
Dagum, L. and Enon., R. (1998). Openmp: an industry stan-
dard api for shared-memory programming. In Com-
putational Science & Engineering, IEEE 5.1 (1998):
46-55.
Dean, J. and Ghemawat, S. (2008). Mapreduce: simplified
data processing on large clusters. In Communications
of the ACM 51.1 (2008): 107-113.
Dutta, Haimonti, e. a. (2011). Distributed storage of large-
scale multidimensional electroencephalogram data us-
ing hadoop and hbase. In Grid and Cloud Database
Management. Springer Berlin Heidelberg, 2011. 331-
347.
Ekanayake, Jaliya, S. P. and Fox, G. (2008). Mapreduce
for data intensive scientific analyses. In IEEE Fourth
International Conference on e-science. IEEE.
Esteller, R. (2000). Detection of seizure onset in epileptic
patients from intracranial eeg signals. In Vol. 1.
Ferrarelli, Fabio, e. a. (2010). Reduced sleep spindle activ-
ity in schizophrenia patients. In The American journal
of psychiatry 164.3 (2007): 483-492.. Vol. 10. 2010.
Gropp, William, E. L. and Skjellum, A. (1999). Us-
ing mpi: portable parallel programming with the
message-passing interface. In Vol. 1. MIT press.
Hadoop (2009). Apache hadoop. In Available at
https://hadoop.apache.org.
Jayapandian, Catherine P., e. a. (2013). Cloudwave: dis-
tributed processing of big data from electrophysio-
logical recordings for epilepsy clinical research us-
ing hadoop. In AMIA Annual Symposium Proceed-
ings. Vol. 2013. American Medical Informatics Asso-
ciation.
Kang, Sol Ji, S. Y. L. and Lee, K. M. (2014). Performance
comparison of openmp, mpi, and mapreduce in prac-
tical problems. In Advances in Multimedia.
Kramer, Uri, e. a. (2011). A novel portable seizure detec-
tion alarm system: preliminary results. In Journal of
Clinical Neurophysiology 28.1 (2011): 36-38.
Kreps, Jay, N. N. and Rao, J. (2011). Kafka: A distributed
messaging system for log processing. In Proceedings
of the NetDB.
Litt, Brian, e. a. (2001). Epileptic seizures may begin hours
in advance of clinical onset: a report of five patients.
In Neuron 30.1 (2001): 51-64.
Nunez, P. L. and Srinivasan, R. (2006). Electric fields of
the brain: the neurophysics of eeg. Oxford university
press.
Qu, H. and Gotman, J. (1997). A patient-specific algo-
rithm for the detection of seizure onset in long-term
eeg monitoring: possible use as a warning device. In
Biomedical Engineering, IEEE Transactions on 44.2
(1997): 115-122. IEEE.