using a threshold value of 0.8. In fact, this algorithm
has some advantages such as reducing overfitting and
being extremely flexible. However, Random Forests
(depending on the dataset) are time-consuming.
Random Forest presents the best recall value
(93.8%). If the scope of this research was related to
health-related systems (e.g, a person has a disease or
not), the recall would be a better measure than preci-
sion. That is, it is far preferable to not miss any person
with the disease even if that means “signaling” some
patients as having a disease that actually do not have
it. As here we study the detection of new physical ac-
tivities, false-negatives are less of a concern. Then,
precision is preferable here.
We highlight that the precision values for Random
Forest and k-NN are indeed very close (85.1% and
84.9%). This means that they both are good at detect-
ing novelty activities of all activities that were pre-
dicted as a novelty.
5 CONCLUSIONS
In order to put our research in retrospect, we recall
that our motivation is to study novelty detection in
the context of activity recognition and be able to de-
tect new activities. To achieve this goal, we propose a
method that involves experimenting with three differ-
ent algorithms by creating three classification models
in a example set that contained three classes (or activ-
ities). We apply these models in a test set that con-
tained five classes, two of which were new, not being
present in the original training set. When comparing
the model’s confidence predictions with four thresh-
old values, we are able to detect how many of these
five activities were in fact novel (or not). We now
point some general observations.
Firstly, by increasing the threshold value, it means
that more activities are classified as a novel, which
leads to higher accuracy, recall but in a lower pre-
cision. Furthermore, lowering the threshold means
that fewer activities are classified as a novel, which
leads to lower recall and higher precision. Finally, by
choosing a threshold bigger than 0.8 would make it
possible to detect more novel activities. However, it
would make the model less precise. The best results
go for the Random Forest algorithm with a threshold
value equal to 0.8. However, k-NN is not far behind,
as both of them achieve a very close precision.
For future work, it would be relevant to study a
mechanism that would allow us to divide novel activ-
ities into different categories. Although we are detect-
ing if these examples are novel or not, it does not nec-
essarily mean that they belong to the same activity.
Another improvement would be using a clustering-
based technique to take into account outliers, to avoid
classifying an example as a novelty activity since it
also is a detached occurrence. Besides that, the use of
the latest deep learning techniques can help improve
the performance of novelty detection.
To sum up, we see a promising outlook for this re-
search area in the future, as novelty detection can help
us by recognizing and monitor our daily actions with
the fruitful purpose of providing useful information.
REFERENCES
Clifton, D. A., Clifton, L., Hugueny, S., Wong, D., and
Tarassenko, L. (2012). An extreme function theory for
novelty detection. IEEE Journal of Selected Topics in
Signal Processing, 7(1):28–37.
Clifton, L., Clifton, D. A., Watkinson, P. J., and Tarassenko,
L. (2011). Identification of patient deterioration in
vital-sign data using one-class support vector ma-
chines. In 2011 federated conference on computer sci-
ence and information systems (FedCSIS), pages 125–
131. IEEE.
Filippone, M. and Sanguinetti, G. (2010). Information
theoretic novelty detection. Pattern Recognition,
43(3):805–814.
Ghoting, A., Parthasarathy, S., and Otey, M. E. (2008). Fast
mining of distance-based outliers in high-dimensional
datasets. Data Mining and Knowledge Discovery,
16(3):349–364.
Hassan, A., Mokhtar, H. M., and Hegazy, O. (2011). A
heuristic approach for sensor network outlier detec-
tion. Int J Res Rev Wirel Sens Netw (IJRRWSN), 1(4).
Hawkins, S., He, H., Williams, G., and Baxter, R. (2002).
Outlier detection using replicator neural networks. In
International Conference on Data Warehousing and
Knowledge Discovery, pages 170–180. Springer.
Hazan, A., Lacaille, J., and Madani, K. K. (2012). Extreme
value statistics for vibration spectra outlier detection.
In International Conference on Condition Monitoring
and Machinery Failure Prevention Technologies, page
p.1, Londres, United Kingdom.
Helali, R. G. M. (2010). Data mining based network intru-
sion detection system: A survey. In Novel Algorithms
and Techniques in Telecommunications and Network-
ing, pages 501–505. Springer.
Le, T., Tran, D., Ma, W., and Sharma, D. (2010). An op-
timal sphere and two large margins approach for nov-
elty detection. In The 2010 International Joint Confer-
ence on Neural Networks (IJCNN), pages 1–6. IEEE.
Lee, K., Lee, K., Min, K., Zhang, Y., Shin, J., and Lee,
H. (2018). Hierarchical novelty detection for visual
object recognition. In Proceedings of the IEEE Con-
ference on Computer Vision and Pattern Recognition,
pages 1034–1042.
Marchi, E., Vesperini, F., Eyben, F., Squartini, S., and
Schuller, B. (2015). A novel approach for automatic
ICAART 2021 - 13th International Conference on Agents and Artificial Intelligence
864