Authors:
Ricardo Campos
1
;
Gaël Dias
2
and
Alípio Mário Jorge
3
Affiliations:
1
Polytechnic Institute of Tomar, University of Beira Interior, Portugal
;
2
University of Beira Interior, Portugal
;
3
University of Oporto, Portugal
Keyword(s):
Temporal Information Retrieval, Time-Based Clustering, Topic Clustering, Web Content Mining.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Clustering and Classification Methods
;
Information Extraction
;
Knowledge Discovery and Information Retrieval
;
Knowledge-Based Systems
;
Symbolic Systems
Abstract:
With so much information available on the web, looking for relevant documents on the Internet has become a difficult task. Temporal features play an important role with the introduction of a time dimension and the possibility to restrict a search by time, recreating a particular moment of a web page set. Despite its importance, temporal information is still under-considered by current search engines, limiting themselves to the capture of the most recent snapshot of the information. In this paper, we describe the architecture of a temporal search engine which uses timelines to browse search results. More specifically, we intend to add a time measure to cluster web page results, by analyzing web page contents, supporting the search of temporal and non-temporal information embedded in web documents.