loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Madhuka Udantha ; Surangika Ranathunga and Gihan Dias

Affiliation: University of Moratuwa, Sri Lanka

Keyword(s): Web Usage Mining, Pattern Mining, Regular Expressions, Anomaly Detection.

Related Ontology Subjects/Areas/Topics: Classification ; Clustering ; Combinatorial Optimization ; Feature Selection and Extraction ; Pattern Recognition ; Similarity and Distance Learning ; Theory and Methods

Abstract: Mining web access log data is a popular technique to identify frequent access patterns of website users. There are many mining techniques such as clustering, sequential pattern mining and association rule mining to identify these frequent access patterns. Each can find interesting access patterns and group the users, but they cannot identify the slight differences between accesses patterns included in individual clusters. But in reality these could refer to important information about attacks. This paper introduces a methodology to identify these access patterns at a much lower level than what is provided by traditional clustering techniques, such as nearest neighbour based techniques and classification techniques. This technique makes use of the concept of episodes to represent web sessions. These episodes are expressed in the form of regular expressions. To the best of our knowledge, this is the first time to apply the concept of regular expressions to identify user access patterns in web server log data. In addition to identifying frequent patterns, we demonstrate that this technique is able to identify access patterns that occur rarely, which would have been simply treated as noise in traditional clustering mechanisms. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.224.30.6

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Udantha, M. ; Ranathunga, S. and Dias, G. (2016). An Episode-based Approach to Identify Website User Access Patterns. In Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-173-1; ISSN 2184-4313, SciTePress, pages 343-350. DOI: 10.5220/0005752703430350

@conference{icpram16,
author={Madhuka Udantha and Surangika Ranathunga and Gihan Dias},
title={An Episode-based Approach to Identify Website User Access Patterns},
booktitle={Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2016},
pages={343-350},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005752703430350},
isbn={978-989-758-173-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - An Episode-based Approach to Identify Website User Access Patterns
SN - 978-989-758-173-1
IS - 2184-4313
AU - Udantha, M.
AU - Ranathunga, S.
AU - Dias, G.
PY - 2016
SP - 343
EP - 350
DO - 10.5220/0005752703430350
PB - SciTePress