Authors: Sybille Peters ; Claus-Peter Rückemann and Wolfgang Sander-Beuermann

Affiliation: Leibniz Universität Hannover (LUH), Germany

Keyword(s): Focused crawling, Search engine, Vertical search engine, Metadata, Educational research, Link analysis.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Data Engineering ; Digital Libraries ; Knowledge Management and Information Sharing ; Knowledge-Based Systems ; Metadata and Metamodeling ; Ontologies and the Semantic Web ; Ontology and the Semantic Web ; Searching and Browsing ; Symbolic Systems ; Web Information Systems and Technologies ; Web Interfaces and Applications

Abstract: Search engines typically consist of a crawler, which traverses the web and retrieves documents, and a search frontend, which provides the user interface to the acquired information. Focused crawlers refine the crawler by intelligently directing it to predefined topic areas. Search engines today are evolving rapidly by offering additional search capabilities, such as searching metadata as well as the content text. Semantic web standards supply methods for augmenting webpages with metadata. Machine learning techniques are used where necessary to gather more metadata from unstructured webpages. This paper analyzes the effectiveness of techniques for vertical search engines with respect to focused crawling and metadata integration, using the field of “educational research” as an example. A search engine implemented for these purposes within the EERQI project is described and tested. The enhancement of focused crawling through link analysis and anchor text classification is implemented and verified. A new heuristic score calculation formula has been developed for focusing the crawler. Full texts and metadata from various multilingual sources are collected and combined into a common format.
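
The abstract does not state the heuristic score formula itself, so the following is only a minimal sketch of the general idea of focusing a crawler by scoring links. It assumes a hypothetical topic vocabulary (`TOPIC_TERMS`), a simple term-overlap measure standing in for anchor text classification, and illustrative weights (`w_anchor`, `w_parent`); none of these names or values come from the paper.

```python
import heapq
import re

# Hypothetical topic vocabulary (an assumption for illustration only).
TOPIC_TERMS = {"education", "educational", "research", "pedagogy", "curriculum"}


def term_overlap(text, terms=TOPIC_TERMS):
    """Return the fraction of tokens in `text` found in the topic vocabulary."""
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return 0.0
    return sum(token in terms for token in tokens) / len(tokens)


def link_score(anchor_text, parent_relevance, w_anchor=0.6, w_parent=0.4):
    """Illustrative weighted sum of anchor text overlap and parent page relevance.

    This is NOT the formula from the paper; the weights are arbitrary assumptions.
    """
    return w_anchor * term_overlap(anchor_text) + w_parent * parent_relevance


class Frontier:
    """Crawl frontier that hands out the highest-scoring unseen URL first."""

    def __init__(self):
        self._heap = []
        self._seen = set()

    def push(self, url, score):
        # Skip URLs that were already queued, so each URL is crawled at most once.
        if url not in self._seen:
            self._seen.add(url)
            # heapq is a min-heap, so store the negated score.
            heapq.heappush(self._heap, (-score, url))

    def pop(self):
        neg_score, url = heapq.heappop(self._heap)
        return url, -neg_score

    def __len__(self):
        return len(self._heap)


if __name__ == "__main__":
    frontier = Frontier()
    frontier.push("http://example.org/edu",
                  link_score("educational research journal", 0.8))
    frontier.push("http://example.org/shop",
                  link_score("buy shoes online", 0.8))
    print(frontier.pop())
```

In this sketch a link whose anchor text matches the topic vocabulary is dequeued before an off-topic link found on the same page, which is the behaviour a focused crawler aims for. A real system would replace `term_overlap` with a trained classifier and add link analysis signals.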

CC BY-NC-ND 4.0

Paper citation in several formats:
Peters, S.; Rückemann, C. and Sander-Beuermann, W. (2010). A NEW APPROACH TOWARDS VERTICAL SEARCH ENGINES - Intelligent Focused Crawling and Multilingual Semantic Techniques. In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST; ISBN 978-989-674-025-2; ISSN 2184-3252, SciTePress, pages 181-186. DOI: 10.5220/0002777901810186

@conference{webist10,
author={Sybille Peters and Claus{-}Peter Rückemann and Wolfgang Sander{-}Beuermann},
title={A NEW APPROACH TOWARDS VERTICAL SEARCH ENGINES - Intelligent Focused Crawling and Multilingual Semantic Techniques},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST},
year={2010},
pages={181--186},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002777901810186},
isbn={978-989-674-025-2},
issn={2184-3252},
}

TY - CONF
JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST
TI - A NEW APPROACH TOWARDS VERTICAL SEARCH ENGINES - Intelligent Focused Crawling and Multilingual Semantic Techniques
SN - 978-989-674-025-2
IS - 2184-3252
AU - Peters, S.
AU - Rückemann, C.
AU - Sander-Beuermann, W.
PY - 2010
SP - 181
EP - 186
DO - 10.5220/0002777901810186
PB - SciTePress