Building a Query Engine for a Corpus of Open Data

Mauro Pelucchi; Giuseppe Psaila; Maurizio Toccu

Topics: Searching and Browsing; Web Information Filtering and Retrieval

In Proceedings of the 13th International Conference on Web Information Systems and Technologies WEBIST - Volume 1, 126-136, 2017, Porto, Portugal

Authors: Mauro Pelucchi; Giuseppe Psaila; Maurizio Toccu

Affiliation: University of Bergamo, Italy

Keyword(s): Retrieval of Open Data, Blind Querying, Single Item Extraction.

Related Ontology Subjects/Areas/Topics: Searching and Browsing; Web Information Systems and Technologies; Web Interfaces and Applications

Abstract: Public Administrations openly publish many data sets concerning citizens and territories in order to increase the amount of information made available to people, firms and public administrators. As a result, Open Data corpora have become so large that it is impossible to deal with them by hand; consequently, tools that include innovative querying techniques are necessary. In this paper, we present a technique to select open data sets containing specific pieces of information, and to retrieve them from a corpus published by an open data portal. In particular, users can formulate structured queries that are blindly submitted to our search engine prototype (i.e., the users are unaware of the actual structure of the data sets). Our approach reinterprets and mixes several known information retrieval approaches, while at the same time giving a database view of the problem. We implemented this technique in a prototype that we tested on a corpus containing more than 2000 data sets. We noted that our technique provides focused results with respect to the baseline experiments performed with Apache Solr.

CC BY-NC-ND 4.0

Paper citation in several formats:

Pelucchi, M., Psaila, G. and Toccu, M. (2017). Building a Query Engine for a Corpus of Open Data. In Proceedings of the 13th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-246-2; ISSN 2184-3252, SciTePress, pages 126-136. DOI: 10.5220/0006308801260136

@conference{webist17,
author={Mauro Pelucchi and Giuseppe Psaila and Maurizio Toccu},
title={Building a Query Engine for a Corpus of Open Data},
booktitle={Proceedings of the 13th International Conference on Web Information Systems and Technologies - WEBIST},
year={2017},
pages={126-136},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006308801260136},
isbn={978-989-758-246-2},
issn={2184-3252},
}

TY - CONF
JO - Proceedings of the 13th International Conference on Web Information Systems and Technologies - WEBIST
TI - Building a Query Engine for a Corpus of Open Data
SN - 978-989-758-246-2
IS - 2184-3252
AU - Pelucchi, M.
AU - Psaila, G.
AU - Toccu, M.
PY - 2017
SP - 126
EP - 136
DO - 10.5220/0006308801260136
PB - SciTePress