Authors: Corrado Grappiolo 1; Eline Verwielen 2 and Nils Noorman 3

Affiliations: 1 ESI (TNO), Eindhoven, The Netherlands; 2 Unaffiliated, Valkenswaard, The Netherlands; 3 Philips Healthcare, Best, The Netherlands

Keyword(s): String Clustering, N-Grams, Operational Usage Modelling, System Verification Testing.

Abstract: Connected high-tech systems allow the gathering of operational data at unprecedented volumes. A direct benefit of this is the possibility to extract usage models, that is, generic representations of how such systems are used in their field of application. Usage models are extremely important, as they can help in understanding the discrepancies between how a system was designed to be used and how it is used in practice. We interpret usage modelling as an unsupervised learning task and present a novel algorithm, hereafter called Growing N-Grams (GNG), which relies on n-grams (arguably the most popular modelling technique for natural language processing) to cluster and model, in a two-step rationale, a dataset of strings. We empirically compare its performance against some other common techniques for string processing and clustering. The gathered results suggest that the GNG algorithm is a viable approach to usage modelling.

CC BY-NC-ND 4.0

Paper citation in several formats:
Grappiolo, C.; Verwielen, E. and Noorman, N. (2019). The Growing N-Gram Algorithm: A Novel Approach to String Clustering. In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-351-3; ISSN 2184-4313, SciTePress, pages 52-63. DOI: 10.5220/0007259200520063

@conference{icpram19,
author={Corrado Grappiolo and Eline Verwielen and Nils Noorman},
title={The Growing N-Gram Algorithm: A Novel Approach to String Clustering},
booktitle={Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2019},
pages={52-63},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007259200520063},
isbn={978-989-758-351-3},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - The Growing N-Gram Algorithm: A Novel Approach to String Clustering
SN - 978-989-758-351-3
IS - 2184-4313
AU - Grappiolo, C.
AU - Verwielen, E.
AU - Noorman, N.
PY - 2019
SP - 52
EP - 63
DO - 10.5220/0007259200520063
PB - SciTePress