variety of evaluated scenarios in Sect. 4, which is ac-
companied by a discussion. Finally, we summarise
our findings and outline future work.
2 RELATED WORK
The development of e-rehabilitation systems is conti-
nuously rising because of a higher demand and a lack
of personnel resources. With the release of the Mi-
crosoft Kinect, cost-effective depth sensors became
affordable and e-rehabilitation applications that em-
ploy the Kinect made its breakthrough. In the last
years, several Kinect-based e-rehabilitation systems
were developed, such as proposed by Su et al. (Su
et al., 2014) or Gal et al. (Gal et al., 2015). Howe-
ver, to date there is no study that evaluates a patient’s
performance during exercises based on remotely de-
termined vital parameters.
There are four main vital parameters, i. e. heart
rate, respiration rate, oxygen saturation and blood
pressure. In this study, we focus on remote heart
rate determination by means of optical sensors using
principles of photoplethysmography (PPG). In clini-
cal environments, the heart rate is normally obtai-
ned by electrocardiography (ECG) or pulse oxime-
ters. The basics of PPG were first described by Hertz-
man and Spealman (Hertzman and Spealman, 1937).
They measured the volumetric changes of the blood
flow with an optical sensor. The light that transmits
through thin body parts, such as fingers or earlobes, is
received by an optical sensor (Allen, 2007). This met-
hod is called transmissive PPG. Next to transmissive
PPG, there exists the reflective PPG as well, which
measures the light reflected from a tissue. Due to
the reflection, the signal-to-noise ratio (SNR) for this
method is decreased by a factor of ten compared to
the transmissive PPG. Still, for both of these met-
hods, sensors have to be attached to the body. In
order to overcome this issue, Humphreys et al. de-
veloped a first concept for remote photoplethysmo-
graphy (rPPG) (Humphreys et al., 2005). This was
followed by first experiments in the infrared spectrum
(Garbey et al., 2007) and the visible light spectrum
(Verkruysse et al., 2008).
In 2007, Verkruysse et al. recorded probands a
with small distance to an RGB camera. These pro-
bands were instructed not to move during the recor-
dings in order to avoid motion artefacts. They de-
tected a region of interest (ROI) within a face, per-
fomed a spatial averaging of the colour channels and
determined the heart rate with the Fast Fourier Trans-
form (FFT). This method was followed by the first au-
tomated approach by Poh et al. (Poh et al., 2010; Poh
et al., 2011). They used an automated face detection
and an independent component analysis (ICA). In or-
der to increase the speed, Lewandoska et al. (Lewan-
dowska et al., 2011) suggested to use a principal com-
ponent analysis (PCA) instead of an ICA. Further
works proposed to improve these methods by using
temporal filters (van Gastel et al., 2014), autoregres-
sive models (Tarassenko et al., 2014) or an adaptive
filtering (Wiede et al., 2016a). All these approaches
belong to the group of methods called intensity-based
methods.
A different group of approaches are the so-called
motion-based methods, which were first proposed by
Balakrishnan et al. (Balakrishnan et al., 2013). They
made use of small head motions caused by the he-
art bump triggered blood flow. By using several dis-
tinctive feature points in the person’s face, small head
motions can be tracked over time with a Kanade-
Lucas-Tomasi (KLT) point tracker. After that, a PCA
determined the principal components of the trajecto-
ries of the points. At last, the heart rate was obtained
by using a peak detection.
As outlined by Wiede et al. (Wiede et al., 2016b),
intensity- and motion-based methods have different
advantages and disadvantages. Intensity-based met-
hods are less sensitive to motion artefacts, whereas the
motion-based methods suffer from fast motions. This
is because the motion artefacts and the heart bump in-
duced motion signal share the same frequency bands.
In contrast to that, motion-based methods are less
prone to illumination artefacts, such as reflections and
shadows. The ratio-based method exploits these facts
by using an intensity-based method when less inten-
sity artefacts occur and a motion-based method when
less motion artefacts are present (Wiede et al., 2016b).
Consequently, the ratio-based method can not com-
pletely eliminate such artefacts, because it only choo-
ses the method with the smallest amount of artefacts.
Thus, the main problems originate from the under-
lying sources of artefacts. If these sources can be re-
duced or eliminated, the accuracy will increase signi-
ficantly. For that, we propose an intensity-based met-
hod, which can overcome the motion artefacts by an
accurate tracking and which significantly reduces in-
tensity artefacts with a skin colour model.
3 METHODS
3.1 Overview
The major steps for the proposed robust remote he-
art rate determination are shown in Figure 1. After
acquiring an RGB image, white balancing was app-
VISAPP 2018 - International Conference on Computer Vision Theory and Applications
492