Authors: Jun Liu ; Zhaohui Wu ; Lu Jiang ; Qinghua Zheng and Xiao Liu

Affiliation: Xi’an Jiaotong University, China

Keyword(s): Deep Web, Deep Web Surfacing, Minimum Executable Pattern, Adaptive Query.

Related Ontology Subjects/Areas/Topics: Searching and Browsing ; Web Information Systems and Technologies ; Web Interfaces and Applications

Abstract: This paper proposes the concept of the Minimum Executable Pattern (MEP) and presents an MEP generation method and an MEP-based adaptive query method for the Deep Web. The query method extends the query interface from a single textbox to a set of MEPs, and generates a locally optimal query by choosing an MEP and a keyword vector for that MEP. The method partly overcomes the problem of “data islands”, which results from the limitations of current methods. Experimental results on six real-world Deep Web sites show that the method outperforms existing methods in terms of query capability and applicability.
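The abstract gives no algorithmic detail, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes that an MEP can be modelled as a tuple of form fields submitted together, and that the adaptive step greedily picks the (MEP, keyword vector) pair expected to return the most unseen records. The names `RECORDS`, `MEPS`, `SEED`, `submit` and `crawl` are hypothetical, the "site" is a four-record in-memory list, and the gain estimate is an oracle that peeks at the toy data where a real crawler would need a statistical estimate.

```python
from itertools import product

# Toy stand-in for a hidden database: one dict per record (field -> value).
RECORDS = [
    {"title": "deep web", "author": "liu", "year": "2009"},
    {"title": "web crawling", "author": "wu", "year": "2009"},
    {"title": "query forms", "author": "liu", "year": "2008"},
    {"title": "data islands", "author": "zheng", "year": "2008"},
]

# Hypothetical MEP set: each pattern is a tuple of fields submitted together.
MEPS = [("author",), ("year",), ("author", "year")]
SEED = (("author",), ("liu",))


def submit(pattern, keywords):
    """Simulate a form submission: ids of records matching every field."""
    return {
        i for i, rec in enumerate(RECORDS)
        if all(rec[f] == k for f, k in zip(pattern, keywords))
    }


def crawl(meps, seed, max_queries=10):
    """Greedy crawl: keep issuing the candidate query with the largest gain."""
    harvested, issued = set(), set()
    seen = {}  # field -> values observed in harvested records
    query = seed
    while query is not None and len(issued) < max_queries:
        issued.add(query)
        harvested |= submit(*query)
        for i in harvested:
            for field, value in RECORDS[i].items():
                seen.setdefault(field, set()).add(value)
        # Score every unissued candidate. ASSUMPTION: the gain below is an
        # oracle on the toy data; a real crawler can only estimate it.
        best_gain, query = 0, None
        for pattern in meps:
            for keywords in product(*(sorted(seen.get(f, ())) for f in pattern)):
                candidate = (pattern, keywords)
                if candidate in issued:
                    continue
                gain = len(submit(pattern, keywords) - harvested)
                if gain > best_gain:
                    best_gain, query = gain, candidate
    return harvested


if __name__ == "__main__":
    print(sorted(crawl(MEPS, SEED)))          # crawl with the full pattern set
    print(sorted(crawl([("author",)], SEED)))  # crawl with a single field only
```

In this toy setup, a crawl restricted to the single `author` field cannot leave the records written by the seed author, which is one way to picture a "data island"; allowing the `year` pattern as well lets the crawl reach the remaining records.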

CC BY-NC-ND 4.0


Paper citation in several formats:
Liu, J.; Wu, Z.; Jiang, L.; Zheng, Q. and Liu, X. (2009). CRAWLING DEEP WEB CONTENT THROUGH QUERY FORMS. In Proceedings of the Fifth International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-8111-81-4; ISSN 2184-3252, SciTePress, pages 629-637. DOI: 10.5220/0001830806290637

@conference{webist09,
author={Jun Liu and Zhaohui Wu and Lu Jiang and Qinghua Zheng and Xiao Liu},
title={CRAWLING DEEP WEB CONTENT THROUGH QUERY FORMS},
booktitle={Proceedings of the Fifth International Conference on Web Information Systems and Technologies - WEBIST},
year={2009},
pages={629-637},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001830806290637},
isbn={978-989-8111-81-4},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the Fifth International Conference on Web Information Systems and Technologies - WEBIST
TI - CRAWLING DEEP WEB CONTENT THROUGH QUERY FORMS
SN - 978-989-8111-81-4
IS - 2184-3252
AU - Liu, J.
AU - Wu, Z.
AU - Jiang, L.
AU - Zheng, Q.
AU - Liu, X.
PY - 2009
SP - 629
EP - 637
DO - 10.5220/0001830806290637
PB - SciTePress