Things You Might Not Know about the k-Nearest Neighbors Algorithm

Aleksandra Karpus; Marta Raczyńska; Adam Przybylek

doi:10.5220/0008365005390547

2019

Abstract

Recommender Systems aim to suggest potentially interesting items to a user. The most common kind of Recommender System is Collaborative Filtering, which follows the intuition that users who liked the same things in the past are more likely to be interested in the same things in the future. One Collaborative Filtering method is the k Nearest Neighbors algorithm, which finds the k users most similar to an active user and then computes recommendations based on this subset of users. The main aim of this paper is to compare two implementations of the k Nearest Neighbors algorithm, from the Mahout and LensKit libraries, as well as six similarity measures. We investigate how implementation differences between the libraries influence the optimal neighborhood size k and the prediction error. We also show that measures like F1-score and nDCG are not always a good choice for selecting the best neighborhood size k. Finally, we compare the similarity measures according to the average time of generating recommendations and the prediction error.
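
To make the user-based k Nearest Neighbors idea from the abstract concrete, here is a minimal Python sketch. It is not the Mahout or LensKit implementation: the toy ratings, the function names, the choice of cosine similarity over co-rated items, and the similarity-weighted average are all assumptions made for illustration only.

```python
from math import sqrt

# Toy ratings: user -> {item: rating}. Purely illustrative data.
ratings = {
    "alice": {"a": 5.0, "b": 3.0, "c": 4.0},
    "bob":   {"a": 4.0, "b": 3.0, "c": 5.0, "d": 4.0},
    "carol": {"a": 1.0, "b": 5.0, "d": 2.0},
    "dave":  {"a": 5.0, "c": 4.0, "d": 5.0},
}

def cosine_similarity(u, v):
    """Cosine similarity over items both users rated; 0.0 if none overlap."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    norm_u = sqrt(sum(u[i] ** 2 for i in common))
    norm_v = sqrt(sum(v[i] ** 2 for i in common))
    return dot / (norm_u * norm_v)

def nearest_neighbors(active, item, k):
    """Return up to k (similarity, user) pairs among users who rated the item.

    Assumption: candidates are filtered to raters of the item BEFORE the
    top-k cut. Picking the k neighbors first and filtering afterwards is an
    equally plausible design and can give different predictions.
    """
    candidates = [
        (cosine_similarity(ratings[active], ratings[other]), other)
        for other in ratings
        if other != active and item in ratings[other]
    ]
    candidates.sort(reverse=True)
    return candidates[:k]

def predict(active, item, k):
    """Similarity-weighted average of the neighbors' ratings, or None."""
    neighbors = nearest_neighbors(active, item, k)
    total_sim = sum(sim for sim, _ in neighbors)
    if total_sim == 0:
        return None
    return sum(sim * ratings[user][item] for sim, user in neighbors) / total_sim

if __name__ == "__main__":
    for k in (1, 2, 3):
        print(k, predict("alice", "d", k))
```

Even on this tiny data set the predicted rating changes with k, because a larger neighborhood pulls in less similar users. Details such as the order of filtering and neighbor selection are the kind of choice that can differ between implementations, although this sketch makes no claim about how either library handles them.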

Paper Citation

in Harvard Style

Karpus A., Raczyńska M. and Przybylek A. (2019). Things You Might Not Know about the k-Nearest Neighbors Algorithm. In Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 1: KDIR; ISBN 978-989-758-382-7, SciTePress, pages 539-547. DOI: 10.5220/0008365005390547

in Bibtex Style

@conference{kdir19,
author={Aleksandra Karpus and Marta Raczyńska and Adam Przybylek},
title={Things You Might Not Know about the k-Nearest Neighbors Algorithm},
booktitle={Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 1: KDIR},
year={2019},
pages={539-547},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008365005390547},
isbn={978-989-758-382-7},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 1: KDIR
TI - Things You Might Not Know about the k-Nearest Neighbors Algorithm
SN - 978-989-758-382-7
AU - Karpus A.
AU - Raczyńska M.
AU - Przybylek A.
PY - 2019
SP - 539
EP - 547
DO - 10.5220/0008365005390547
PB - SciTePress