Motion Evaluation of Therapy Exercises by Means of

Skeleton Normalisation, Incremental Dynamic Time Warping and

Machine Learning: A Comparison of a Rule-Based and a

Machine-Learning-Based Approach

Julia Richter, Christian Wiede, Ulrich Heinkel and Gangolf Hirtz

Department of Electrical Engineering and Information Technology

Technische Universit

at Chemnitz, Reichenhainer Str. 70, 09126 Chemnitz, Germany

Keywords:

Health Care, Medical Training Therapy, Motion Quality Assessment, Assistance Systems, Machine Learning,

Dynamic Time Warping, Skeleton Normalisation.

Abstract:

The assessment of motions by means of technical assistance systems is attracting widespread interest in ﬁelds

such as competitive sports, ﬁtness and rehabilitation. Current research has achieved to generate feedback that

concerns quantity or the grade of similarity with regard to correct reference motions. In view of post-operative

rehabilitation exercises, such type of feedback is regarded as insufﬁcient. That is why recent research aims

at providing a qualitative feedback by communicating motion errors. While existing systems investigated the

use of manually deﬁned rules to detect motion errors, we suggest to employ machine learning techniques in

combination with dynamic time warping and to train classiﬁers with sample exercise executions represented by

3-D skeletons joint trajectories. This study describes both a rule-based and a machine-learning-based approach

and compares them with regard to their accuracy. In the second place, this study seeks to investigate the effect

of using normalised hierarchical coordinates on the classiﬁcation accuracy if data of different persons is used

for the machine-learning-based approach. The results reveal that the performance of the machine-learning-

based method compares well with the rule-based concept. Another outcome to emerge from this study is that

normalised hierarchical coordinates allow to use data of different persons.

1 INTRODUCTION

Latest results from interviews with medical experts

from rehabilitation centres revealed that nowadays a

therapist has to supervise up to ﬁfteen patients simul-

taneously (L

osch et al., 2018). Consequently, patients

do not receive continuous feedback from a therapist

while they are performing therapy exercises, so that

incorrect motion executions are not detected and can-

not be instantly corrected.

A variety of studies have already demonstrated

that the manner of motion execution inﬂuences the

muscle activity during an exercise (Kang et al., 2016),

(Caterisano et al., 2002), (Gorsuch et al., 2013). For

this reason, it is of high importance to control the mo-

tion execution by means of feedback. In this way, mo-

tion errors can be avoided so that the intended muscles

are strained and the desired effects can be achieved.

A possible solution to provide feedback to a pa-

tient could be a technical assistance system that imita-

tes the therapist’s feedback. Such a system could not

only be beneﬁcial in the post-operative rehabilitation

phase, but it could also be used by healthy persons at

home as a preventive measure for maintaining their

mobility.

Therefore, the overall objective that is pursued in

our work is to develop an assistance system to su-

pervise persons during their medical training therapy

when no therapist is present to supervise their perfor-

mance, either in rehabilitation centres or at home. For

this, the therapist’s knowledge and his or her visual

perception as well as the therapist’s real-time feed-

back shall be reproduced. This work includes the

recognition of exercise-speciﬁc motion errors during

the exercise execution to ensure the exercise quality.

The paper is structured as follows: Section 2 gives

an overview about extant systems and recent research

on motion analysis, whereupon the research issue is

described. Section 3 introduces the developed assis-

tance system. The system evaluation is presented in

Section 4. Finally, Section 5 summarises the paper

and draws conclusions about future work.

Richter, J., Wiede, C., Heinkel, U. and Hirtz, G.

Motion Evaluation of Therapy Exercises by Means of Skeleton Normalisation, Incremental Dynamic Time Warping and Machine Learning: A Comparison of a Rule-Based and a

Machine-Learning-Based Approach.

DOI: 10.5220/0007260904970504

In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019), pages 497-504

ISBN: 978-989-758-354-4

497

2 RELATED WORK

The following sections introduce state-of-the-art

feedback systems and recent research that has been

carried out so far.

2.1 Extant Systems for Motion Analysis

A variety of systems that provide feedback to the user

can be found in the ﬁeld of ﬁtness training. Examples

of prominent providers are Technomex, Balori, Erg-

oﬁt, HUR and TecnoBody, which equip ﬁtness cen-

tres with their devices. Table 1 provides an overview

about the above mentioned systems.

When taking a closer look at these existing solu-

tions, however, it becomes obvious that the focus lies

on monitoring exercise quantity rather than exercise

quality. Additionally, the feedback is mainly gene-

rated by means of sensors integrated in the devices,

such as force, acceleration and distance sensors. In

that way, motion amplitudes, velocities and the num-

ber of repetitions can be determined based on indi-

rectly measured parameters in the training machine,

e. g. the calculated torsional moment in the lever arm.

Another major drawback of these systems is that they

do not detect speciﬁc motion errors. Consequently,

information about the motion quality is not available.

A step further goes the system developed by Pix-

formance. Their system processes 3-D skeleton joints

in combination with manually deﬁned rules in order

to detect motion errors. Similar to Pixformance, Zhao

et al. (Zhao et al., 2014) manually deﬁned exercise-

speciﬁc rules in XML format in cooperation with cli-

nicians. Joint distances, joint angles, body segment

orientations and repetition velocities were evaluated

to assess motions.

2.2 Recent Research on Motion

Analysis

A possibility to identify incorrect motions and to sub-

sequently assess the person’s performance is to match

the executed motion against pre-deﬁned references or

templates. Thereby, a variety of approaches employed

sensors that do not provide direct information about

joint positions. Yurtman et al. distributed wearable

motion sensors, i. e. accelerometers, gyroscopes and

magnetometers, over the human body in order to clas-

sify a therapy exercise as correct or incorrect (Yurt-

man and Barshan, 2013). They searched for occurren-

ces of deﬁned template signals by using dynamic time

warping (DTW). For post-stroke rehabilitation, Tor-

mene et al. employed DTW on data provided by strain

sensors that were worn into a long-sleeve shirt (Tor-

mene et al., 2009). Tak et al. published a method for

human abnormality detection in video sequences, for

which they used DTW as well (Tak et al., 2011). Their

method compared distance curves calculated from the

detected foreground regions in each frame to ﬁnd mo-

tion similarities.

Driven by the launch of the Kinect version 1.0 in

2010, researchers have increasingly investigated as-

sessment methods that utilise the Kinect skeleton mo-

del (Shotton et al., 2013). Existing work examined

motions during sports, such as dancing, ballet training

and Tai Chi exercises, using the Kinect skeleton (Gal

et al., 2015), (Huang et al., 2013), (Muneesawang

et al., 2015), (Lin et al., 2013). These works introdu-

ced a variety of distance metrics in order to compare

a performed motion against a reference. Such me-

trics were, for example, angles between joints (Gal

et al., 2015), angles between body motion vectors

(Huang et al., 2013) or the mean Euclidean distance

between joint trajectories (Lin et al., 2013). Although

the presented algorithms originally were not designed

for therapy exercise assessment, their principle work-

ﬂows were still inspiring for similarity assessment in

view of therapy.

Hence, in the ﬁeld of physical rehabilitation, simi-

lar principles can be found. In order to assess moti-

ons, several of the presented works successfully de-

monstrated the use of DTW (Su et al., 2014), (Antu-

nes et al., 2016) or variants of DTW, such as Incre-

mental Dynamic Time Warping (IDTW) (Khan et al.,

2014) and Subsequence DTW (Baptista et al., 2017).

The principle of these approaches is to align the two

sequences and to calculate similarity measures or sco-

res. Antunes et al., for example, calculated and visu-

alised feedback vectors that show the difference bet-

ween the performed and the template motion of body

parts (Antunes et al., 2016). Baptista et al. exten-

ded this approach by investigating the alignment of

a template and the performed exercise by means of

Subsequence DTW and Temporal Commonality Dis-

covery, whereas one sequence is a subsequence of the

other (Baptista et al., 2017). Khan et al. introduced

a method for continuous real-time evaluation, which

is called IDTW, (Khan et al., 2014). The system they

developed provides a similarity score but does not de-

termine speciﬁc errors.

2.3 Research Issue

The purpose of this investigation is to explore approa-

ches that allow the recognition of errors that typically

occur during the performance of three selected exerci-

ses of therapeutic relevance: hip abduction, hip exten-

VISAPP 2019 - 14th International Conference on Computer Vision Theory and Applications

498

Table 1: Selection of feedback systems on the market.

System/ Provi-

der

Application Field Description

Technomex Fitness games with focus on mo-

tivation

Skeleton-based measurement, not meant for reha-

bilitation, no motion error detection

Pixformance Motion feedback in the ﬁeld of ﬁt-

ness training

Skeleton-based measurement, rule-based motion

error detection, manual rule speciﬁcation

Balori Coordination and balance training Balance training based on pressure plate

HUR, Ergoﬁt Motion feedback in the ﬁeld of ﬁt-

ness

Indirect measurement via sensors integrated in the

machine, feedback is visualised by means of inten-

sity bars and amplitude plots

TecnoBody Motion feedback in the ﬁeld of ﬁt-

ness

Skeleton-based measurement, no motion error de-

tection by means of joint trajectories

sion and hip ﬂexion, which are performed on a pulley

machine and are error-prone due to the unrestricted

motion range the machine allows.

We ﬁrstly investigated a rule-based approach to

detect motion errors in real-time. Exercise-speciﬁc

rules had to be deﬁned and suitable thresholds had

to be determined. The results of this approach serve

as a reference benchmark for the comparison against

the proposed machine-learning-based approach. Se-

condly, we designed, implemented and evaluated a

machine-learning base approach, which avoids the

manual speciﬁcation of rules. The principle of the ap-

proach has already been presented in (Richter et al.,

2017b). In order to avoid taking pre-recordings and

examples of incorrect motions for new patients, we

investigated a normalisation method. Therefore, the

performance of two types of coordinate representati-

ons, i. e. local and normalised hierarchical coordina-

tes, was compared against each other by using three

different scenarios. Since we intend to give real-time

feedback, the approach adapted the IDTW algorithm

(Khan et al., 2014).

Two research questions arise at this point and shall

be answered in the course of this study:

a) Can we train a machine by means of example se-

quences, so that the classiﬁcation performance is

similar to that of the rule-based approach?

b) Does the normalisation of skeleton data allow the

usage of training data that is different from the

person who ﬁnally uses the system?

3 DEVELOPED ASSISTANCE

SYSTEM

3.1 Concept Overview

The developed assistance system has already been

presented in (Richter et al., 2017a) and (Richter et al.,

2017b), whereas the focus lied on hip abduction.

Figure 1 presents the concept of the assistance sy-

stem: It consist of a sensor, i. e. a Kinect version 1.0,

that measures the 3-D joint positions of a person’s

skeleton. Based on these positions, motion errors are

determined in real-time. This information is sent to

a feedback unit and visualised in form of feedback

messages and joints of an animated avatar that are co-

loured depending on their involvement in the detected

error. The feedback system is shown in Figure 2.

Figure 1: Concept of the assistance system (Richter et al.,

2017a).

Figure 2: Feedback display: In this example, the person per-

forms the exercise hip ﬂexion and does not bend the knee in

the correct way. A message with a suggestion how to cor-

rect the motion appears in real time (see top right, transla-

ted: Bend your knee!).

Motion Evaluation of Therapy Exercises by Means of Skeleton Normalisation, Incremental Dynamic Time Warping and Machine Learning:

A Comparison of a Rule-Based and a Machine-Learning-Based Approach

499

3.2 Motion Errors

This section presents the three exercises the study fo-

cusses on with their speciﬁc motion errors that have

to be detected.

In the Figures 3, 4 and 5, the gray part of the ske-

leton indicates the moving body parts. The regions in

red mark the error occurrences. The left skeletons re-

present the correct exercise executions with the class

label C. The remaining class labels UB, FO, BK, WP

and NBK correspond to the motion errors.

UBFO WP

Figure 3: Motion errors for the exercise hip abduction.

BKFO WP

Figure 4: Motion errors for the exercise hip extension.

NBKFO

Figure 5: Motion errors for the exercise hip ﬂexion.

3.3 Rule-Based Approach

3.3.1 Rule Modules

The rules are based on three function that are descri-

bed in the following sections. These rules were app-

lied on the 3-D joint positions represented in the 3-D

Kinect camera coordinate system.

M1 – Angle between 3-D Articulated Joints. The

angle between two articulated joints is calculated by

this module. The input parameters of the module are

two arbitrary articulated joints a

and b

in a frame t,

which are deﬁned by their corresponding start posi-

tion a

1,t

and b

1,t

as well as their end position a

2,t

and

2,t

respectively. These positions are given by 1×3

vectors. The output angle ρ

is calculated according

to Equation 1.

= arccos

2,t

− a

1,t

) · (b

2,t

− b

1,t

)

||a

2,t

− a

1,t

|| · ||b

2,t

− b

1,t

. (1)

M2 – Normalised Euclidean Distance between 3-D

Joints. The normalised Euclidean distance dj

bet-

ween two joints a and b, whose positions are given by

1×3 vectors, is calculated according to Equation 2. In

order to normalise this distance with regard to a per-

son’s height, the distance is divided by the sum of the

right lower and upper leg leg

l,t

and leg

u,t

. The right

leg was chosen here instead of the left supporting leg,

because in contrast to the left leg, the right leg is al-

ways visible during the exercise execution.

||a

− b

leg

u,t

+ leg

l,t

. (2)

M3 – Normalised Signed Euclidean Distance of a

3-D Joint to a Deﬁned Plane. This module allows

to calculate the distance dp

of a joint of interest p

oi,t

to the x-y, the x-z or the y-z plane of a Cartesian coor-

dinate system. This coordinate system is deﬁned by

three joints a

, b

and c

as well as a joint p

0,t

that de-

ﬁnes the origin of the system. To obtain the distance,

the following steps have to be performed: Firstly, a

coordinate system with the rotation R and the origin

0,t

is calculated using a

, b

and c

. Secondly, p

oi,t

transformed into this coordinate system and results in

the transformed coordinates p

oi,trans,t

. These steps are

represented by the following Equations:

− a

(3)

= (b

− a

) × (a

− c

) (4)

× x

(5)





(6)

oi,trans,t

= R



oi,t

− p

0,t



. (7)

Finally, depending on the plane to which the norma-

lised signed distance shall be calculated, the x, y or z

coordinate of the transformed joint is returned accor-

ding to Equation 8.











oi,trans,t

(z)

leg

u,t

+leg

l,t

, if distance to x-y plane,

oi,trans,t

(y)

leg

u,t

+leg

l,t

, if distance to x-z plane,

oi,trans,t

(x)

leg

u,t

+leg

l,t

, if distance to y-z plane.

(8)

VISAPP 2019 - 14th International Conference on Computer Vision Theory and Applications

500

3.3.2 Rules

Due to limited space, this section presents only the

rule of BK for hip abduction in Table 2. If a rule is ful-

ﬁlled for a frame t, then the corresponding error was

detected in this frame. If none of the deﬁned errors

were detected, the performed motion was considered

to be correct.

Table 2: Hip abduction: Bent Knee (BK).

Module: M1

: Angle between up-

per and lower right leg

Rule:

< thr

Abd,BK

1,t

2,t

3.3.3 Thresholds

The thresholds for every rule were determined by me-

ans of ROC (receiver operating characteristic) curves.

The principle is described for the error BK (hip ab-

duction). In this study, the data of all recorded persons

was intentionally used, which results in the optimal

thresholds for this very test group. This means that, in

case of manual threshold selection, the results cannot

exceed the accuracy that is determined in this experi-

ment. The determined optimal threshold thr

Abd,BK

the error BK for hip abduction was 159

◦

0 0.2 0.4

0.6

0.8 1

0.2

0.4

0.6

0.8

FPR

TPR

Figure 6: ROC curve for hip abduction, Bent Knee (BK).

Figure 6 presents the according ROC curve with

the optimal operating point, i. e. the threshold that

separates the samples representing the error classes

from the rest of the samples. The red dot on the curve

indicates the optimal operating point. Figure 7 shows

the separation of the data by means of this determined

threshold.

100

150

200

400

600

Angle in degree

Absolute occurence

˜BK

Figure 7: Samples separated by the determined threshold

for hip abduction, Bent Knee (BK).

3.4 Machine Learning-Based Approach

The machine learning-based approach has already

been published for hip abduction by Richter et al.

(Richter et al., 2017b). Their algorithm transforms

the obtained 3-D skeleton to normalised hierarchical

coordinates. The aim of this transformation is to nor-

malise skeletons of different persons and to make the

joint data rotation- and translation-invariant. In the

next step, the current motion of a person performing

an exercise was aligned with a correctly performed re-

ference by means of IDTW. The new idea in the work

of Richter et al. was to calculate a difference vector

between the reference and the current joint positions,

which was then fed to a set of support vector machi-

nes. Thereby, every machine voted for one single er-

ror. As a result, the machine could classify whether a

current exercise motion is correct or whether a speci-

ﬁc motion error occurred.

4 EVALUATION

4.1 Evaluation Measures

To compare the performance of the different approa-

ches presented in this work, the accuracy of the single

classiﬁers η

and the overall accuracy η

all

, which is

the mean accuracy of all classiﬁers S for an exercise,

are used. Thereby, the overall accuracy is deﬁned by

all

∑

s=1

, (9)

whereas S is the number of classiﬁers or error classes

used for one exercise.

Motion Evaluation of Therapy Exercises by Means of Skeleton Normalisation, Incremental Dynamic Time Warping and Machine Learning:

A Comparison of a Rule-Based and a Machine-Learning-Based Approach

501

4.2 Dataset Conﬁgurations

To investigate the effect of local and normalised hier-

archical coordinates on the classiﬁcation accuracy,

three different scenarios were deﬁned by Richter et

al. (Richter et al., 2017b):

Scenario 1. For every patient, an individual machine

was trained and tested with the individual patient’s

data and his or her own reference. In practice, trai-

ning data as well as the patient’s reference have to be

recorded for every new patient, which is rather im-

practical.

Scenario 2. In contrast to scenario 1, one single ma-

chine was trained with the motion data of three per-

sons and their individual reference data. Just as in

scenario 1, the test was performed with each test per-

son’s individual reference. In practice, only the new

patient’s reference, but not his or her training data, has

to be recorded.

Scenario 3. The machine was trained with the motion

data of the three persons mentioned in scenario 2. In

contrast to scenario 1 and scenario 2, the reference

has not been changed according to the person. For

testing, the very same reference was used for all test

samples of the ten test persons. This means that, in

practice, neither training data nor a reference has to

be recorded for a new patient.

4.3 Results and Discussion

4.3.1 Usage of Normalised Skeleton Data

Table 3 and Figure 8 present the results to answer the

research question b) raised in Section 2.3.

Table 3: Overall accuracies η

all

for the three scenarios S1,

S2 and S3 with local (L) and normalised hierarchical (H)

coordinates and accuracy differences D = S1 - S3 between

S1 and S3.

Abduction Extension Flexion

L H L H L H

S1 0.86 0.87 0.85 0.85 0.98 0.99

S2 0.79 0.82 0.80 0.82 0.94 0.96

S3 0.77 0.84 0.78 0.82 0.92 0.95

D 0.09 0.03 0.07 0.03 0.06 0.04

Table 3 compares the results of local and norma-

lised hierarchical coordinates for the three different

scenarios and Figure 8 visualised these numbers in a

graph.

When taking a closer look at the inﬂuence of hier-

archical and local coordinates, we notice the follo-

wing: In general, the accuracies for hierarchical coor-

dinates are higher than for local coordinates. Moreo-

ver, and what is the most important, is that hierarchi-

S3 S2 S1

0.8

0.9

Scenario

Accuracy η

all

Abduction H

Abduction L

Extension H

Extension L

Flexion H

Flexion L

Figure 8: Overall accuracies η

all

for the three scenarios S1,

S2 and S3 with local (L) and normalised hierarchical (H)

coordinates.

cal coordinates outperform local coordinates especi-

ally for S3, which is the scenario of highest practical

relevance. From the graph and the table above we

can see that the usage of local, i. e. non-normalised

coordinates results in a considerable deterioration of

the overall accuracy from scenario 1 to scenario 3.

This implies that, for non-normalised local coordina-

tes, the less personalised data is used, the higher is

the negative inﬂuence of factors, such as body size

and proportions, on the accuracy. The difference to

local coordinates becomes obvious when we have a

look at the overall accuracies when using normalised

hierarchical coordinates. It is apparent that, in con-

trast to local coordinates, the accuracy only slightly

decreases when using normalised hierarchical coordi-

nates. These results suggest to use normalised hierar-

chical coordinates instead of local coordinates should

the recording of personalised data for each new pa-

tient be avoided without loosing classiﬁcation perfor-

mance, which correspond to scenario 3. Still, for sce-

nario 2 and even for hip ﬂexion and abduction in sce-

nario 1, normalised hierarchical coordinates outper-

form the local ones. These results indicate that, in ge-

neral, normalised hierarchical coordinates are a suita-

ble representation of therapy exercise trajectories.

4.3.2 Rule-Based Versus

Machine-Learning-Based

Table 4 and Table 5 present the results to answer

the research question a), whereas Table 4 shows

the achieved performance of the rule-based approach

and Table 5 reveals the performance of the machine-

learning-based approach for the scenario 3.

If we compare the results of the rule-based ap-

proach in Table 4 with the results of the machine-

learning-based approach in Table 5, it can be seen

that the performance of the machine-learning-based

method is comparable to the rule-based method. As

a consequence, these results demonstrate that the de-

tection of motion errors by means of machine lear-

VISAPP 2019 - 14th International Conference on Computer Vision Theory and Applications

502

Table 4: Confusion matrices and corresponding accuracies η

and η

all

and overall accuracies η

all

for rule-based approach.

Abduction Extension Flexion

L L η

C 0.72 0.28

0.81

C 0.73 0.27

0.83

C 0.91 0.09

0.94

C 0.09 0.91 C 0.08 0.92 C 0.04 0.96

BK 0.88 0.12

0.91

UB 0.97 0.03

0.98

UB 1.00 0.00

1.00

BK 0.06 0.94 UB 0.02 0.98 UB 0.00 1.00

FO 0.78 0.22

0.73

FO 0.70 0.30

0.69

FO 0.89 0.11

0.90

FO 0.32 0.68 FO 0.31 0.69 FO 0.09 0.91

UB 0.91 0.09

0.93

BK 0.87 0.13

0.87

NBK 1.00 0.00

0.99

UB 0.05 0.95 BK 0.12 0.88 NBK 0.01 0.99

WP 0.84 0.16

0.83

WP 0.84 0.16

0.86

WP 0.18 0.82 WP 0.13 0.87

all

0.84 0.85 0.96

Table 5: Confusion matrices and corresponding accuracies η

and η

all

for scenario 3 for machine-learning-based approach.

Classiﬁers are regarded independently from each other.

Abduction Extension Flexion

L L η

C 0.84 0.16

0.80

C 0.92 0.08

0.84

C 0.84 0.16

0.91

C 0.24 0.76 C 0.24 0.76 C 0.01 0.99

BK 0.95 0.05

0.95

UB 0.90 0.10

0.90

UB 0.98 0.02

0.98

BK 0.05 0.95 UB 0.09 0.91 UB 0.02 0.98

FO 0.83 0.17

0.70

FO 0.54 0.46

0.67

FO 0.91 0.09

0.92

FO 0.44 0.56 FO 0.20 0.80 FO 0.08 0.92

UB 0.89 0.11

0.88

BK 0.87 0.13

0.86

NBK 1.00 0.00

0.99

UB 0.13 0.87 BK 0.16 0.84 NBK 0.02 0.98

WP 0.80 0.20

0.87

WP 0.73 0.27

0.84

WP 0.07 0.93 WP 0.04 0.96

all

0.84 0.82 0.95

ning techniques is equally reliable as the detection by

means of manually deﬁned rules. In conclusion, we

provided evidence that a system can be trained with

a set of example sequences, so that motion errors can

be detected. This ﬁnally avoids the manual deﬁnition

of rules for every exercise, because the machine can

automatically derive the rules based on the provided

example sequences.

5 CONCLUSIONS AND FUTURE

WORK

To sum up, this study has found that the introdu-

ced machine-learning-based approach works equally

accurate as the rule-based version. As a consequence,

the manual deﬁnition of rules can be avoided because

the machine is able to learn motion errors by means of

annotated sample sequences. In addition, this study

has identiﬁed that normalised hierarchical coordina-

tes allow to employ training samples of persons that

are different from those who ﬁnally use the system.

Hence, we lay a foundation towards a more quali-

tative feedback during therapy exercises. This work

contributes to an automatic inclusion of new exercises

by simply recording sample sequences of an exercise

along with sequences containing speciﬁc errors. In

future work, we therefore aim at an automated anno-

tation process for the training data, which is labelled

manually at the moment. Finally, further exercises

shall be included, for example by a therapist who spe-

ciﬁes both the correct execution and the errors that

shall be detected. In this way, it could be possible

in future to remotely create exercises for different pa-

tients or to simply pre-train systems with new exerci-

ses without taking pre-recordings of every person that

uses the system.

REFERENCES

Antunes, M., Baptista, R., Demisse, G., Aouada, D., and

Ottersten, B. (2016). Visual and human-interpretable

feedback for assisting physical activity. In Euro-

Motion Evaluation of Therapy Exercises by Means of Skeleton Normalisation, Incremental Dynamic Time Warping and Machine Learning:

A Comparison of a Rule-Based and a Machine-Learning-Based Approach

503

pean Conference on Computer Vision, pages 115–129.

Springer.

Baptista, R., Antunes, M., Aouada, D., and Ottersten, B.

(2017). Video-based feedback for assisting physical

activity. In Proceedings of the 12th International Joint

Conference on Computer Vision, Imaging and Com-

puter Graphics Theory and Applications - Volume 5:

VISAPP, (VISIGRAPP 2017), pages 274–280.

Caterisano, A., Moss, R., Pellinger, T., Woodruff, K., Le-

wis, V., Booth, W., and Khadra, T. (2002). The effect

of back squat depth on the EMG activity of 4 superﬁ-

cial hip and thigh muscles. The Journal of Strength &

Conditioning Research, 16(3):428–432.

Gal, N., Andrei, D., Neme, D. I., Ndan, E., and Stoicu-

Tivadar, V. (2015). A Kinect based intelligent e-

rehabilitation system in physical therapy. Digital He-

althcare Empowering Europeans, pages 489–493.

Gorsuch, J., Long, J., Miller, K., Primeau, K., Rutledge, S.,

Sossong, A., and Durocher, J. J. (2013). The effect of

squat depth on multiarticular muscle activation in col-

legiate cross-country runners. The Journal of Strength

& Conditioning Research, 27(9):2619–2625.

Huang, T.-C., Cheng, Y.-C., and Chiang, C.-C. (2013). Au-

tomatic Dancing Assessment Using Kinect. In Advan-

ces in Intelligent Systems and Applications-Volume 2,

pages 511–520. Springer.

Kang, S.-Y., Choung, S.-D., and Jeon, H.-S. (2016). Modi-

fying the hip abduction angle during bridging exercise

can facilitate gluteus maximus activity. Manual ther-

apy.

Khan, N. M., Lin, S., Guan, L., and Guo, B. (2014). A

visual evaluation framework for in-home physical re-

habilitation. In Multimedia (ISM), 2014 IEEE Inter-

national Symposium on Multimedia, pages 237–240.

IEEE.

Lin, T.-Y., Hsieh, C.-H., and Lee, J.-D. (2013). A kinect-

based system for physical rehabilitation: Utilizing tai

chi exercises to improve movement disorders in pa-

tients with balance ability. In 2013 7th Asia Modelling

Symposium, pages 149–153. IEEE.

osch, C., Weigert, M., Nitzsche, N., Richter, J., Wiede,

C., and Schulz, H. (2018). Einsatz und Bedeutung

von Seilz

ugen in der Medizinischen Trainingstherapie

am Beispiel H

uft-Totalendoprothese Eine Experten-

perspektive. In Bewegungstherapie und Gesundheits-

sport.

Muneesawang, P., Khan, N. M., Kyan, M., Elder, R. B.,

Dong, N., Sun, G., Li, H., Zhong, L., and Guan, L.

(2015). A machine intelligence approach to virtual

ballet training. IEEE MultiMedia, 22(4):80–92.

Richter, J., Wiede, C., Apitzsch, A., Nitzsche, N., L

osch, C.,

Weigert, M., Kronfeld, T., Weisleder, S., and Hirtz, G.

(2017a). Assisted Motion Control in Therapy Envi-

ronments Using Smart Sensor Technology: Challen-

ges and Opportunities. In Ambient Assisted Living, 9.

AAL-Kongress, Frankfurt/M, Germany, April 20 - 21,

2016, pages 119–132. Springer Verlag.

Richter, J., Wiede, C., Shinde, B., and Hirtz, G. (2017b).

Motion Error Classiﬁcation for Assisted Physical

Therapy - A Novel Approach using Incremental Dyn-

amic Time Warping and Normalised Hierarchical Ske-

leton Joint Data. In Proceedings of the 6th Internati-

onal Conference on Pattern Recognition Applications

and Methods - Volume 1: ICPRAM, pages 281–288.

Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook,

M., Finocchio, M., Moore, R., Kohli, P., Criminisi,

A., Kipman, A., et al. (2013). Efﬁcient human pose

estimation from single depth images. IEEE Transacti-

ons on Pattern Analysis and Machine Intelligence,

35(12):2821–2840.

Su, C.-J., Chiang, C.-Y., and Huang, J.-Y. (2014). Kinect-

enabled home-based rehabilitation system using Dy-

namic Time Warping and fuzzy logic. Applied Soft

Computing, 22:652–666.

Tak, Y.-S., Rho, S., and Hwang, E. (2011). Mo-

tion Sequence-Based Human Abnormality Detection

Scheme for Smart Spaces. Wireless Personal Commu-

nications, 60(3):507–519.

Tormene, P., Giorgino, T., Quaglini, S., and Stefanelli, M.

(2009). Matching incomplete time series with dyn-

amic time warping: an algorithm and an application

to post-stroke rehabilitation. Artiﬁcial intelligence in

medicine, 45(1):11–34.

Yurtman, A. and Barshan, B. (2013). Detection and evalu-

ation of physical therapy exercises by dynamic time

warping using wearable motion sensor units. In Infor-

mation Sciences and Systems 2013, pages 305–314.

Springer.

Zhao, W., Lun, R., Espy, D. D., and Reinthal, M. A. (2014).

Rule based realtime motion assessment for rehabilita-

tion exercises. In IEEE Symposium on Computational

Intelligence in Healthcare and e-health (CICARE),

2014, pages 133–140. IEEE.

VISAPP 2019 - 14th International Conference on Computer Vision Theory and Applications

504