SCRAWLER: A SEED-BY-SEED PARALLEL WEB CRAWLER

Joo Yong Lee; Sang Ho Lee; Yanggon Kim

Topics: Web Services

In Proceedings of the Second International Conference on e-Business (ICETE 2007) - ICE-B, pages 151-156, 2007, Barcelona, Spain

Authors: Joo Yong Lee ¹ ; Sang Ho Lee ¹ and Yanggon Kim ²

Affiliations: ¹ School of Computing, Soongsil University, Republic of Korea; ² Computer and Information Sciences, Towson University, United States

Keyword(s): Web crawler, Parallel crawler, Scalability, Web database.

Related Ontology Subjects/Areas/Topics: Cloud Computing ; Collaboration and e-Services ; Data Engineering ; e-Business ; Enterprise Information Systems ; Mobile Software and Services ; Ontologies and the Semantic Web ; Services Science ; Software Agents and Internet Computing ; Software Engineering ; Software Engineering Methods and Techniques ; Telecommunications ; Web Services ; Wireless Information Networks and Systems

Abstract: As the Web grows, it becomes increasingly important to parallelize the crawling process so that pages can be downloaded in a reasonable amount of time. This paper presents the design and implementation of an effective parallel web crawler. We first present various design choices and strategies for a parallel web crawler, then describe our crawler’s architecture and implementation techniques. In particular, we investigate the URL distributor used for URL balancing and the scalability of our crawler.
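As a rough illustration of what a URL distributor in a parallel crawler might do, the sketch below assigns each URL to one of several crawling processes by hashing its host name. This is one common partitioning approach and is not taken from the paper; the function names `assign_crawler` and `distribute` and the choice of MD5 as the hash are assumptions made for this example only.

```python
import hashlib
from urllib.parse import urlsplit


def assign_crawler(url: str, num_crawlers: int) -> int:
    """Map a URL to a crawler index by hashing its host name.

    Hypothetical helper: URLs from the same host always land on the
    same crawler, so no two crawlers fetch from one site at once.
    """
    host = urlsplit(url).hostname or ""
    digest = hashlib.md5(host.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % num_crawlers


def distribute(urls, num_crawlers):
    """Group URLs into one queue per crawler (hypothetical helper)."""
    queues = [[] for _ in range(num_crawlers)]
    for url in urls:
        queues[assign_crawler(url, num_crawlers)].append(url)
    return queues
```

Comparing the lengths of the returned queues gives a simple view of how evenly the URLs are balanced. Host-based hashing keeps per-site politeness simple but can be uneven when a few hosts contribute most of the URLs.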

CC BY-NC-ND 4.0

Paper citation in several formats:

Lee, J. Y.; Lee, S. H. and Kim, Y. (2007). SCRAWLER: A SEED-BY-SEED PARALLEL WEB CRAWLER. In Proceedings of the Second International Conference on e-Business (ICETE 2007) - ICE-B; ISBN 978-989-8111-11-1, SciTePress, pages 151-156. DOI: 10.5220/0002108701510156

@conference{ice-b07,
author={Joo Yong Lee and Sang Ho Lee and Yanggon Kim},
title={SCRAWLER: A SEED-BY-SEED PARALLEL WEB CRAWLER},
booktitle={Proceedings of the Second International Conference on e-Business (ICETE 2007) - ICE-B},
year={2007},
pages={151--156},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002108701510156},
isbn={978-989-8111-11-1},
}

TY - CONF
JO - Proceedings of the Second International Conference on e-Business (ICETE 2007) - ICE-B
TI - SCRAWLER: A SEED-BY-SEED PARALLEL WEB CRAWLER
SN - 978-989-8111-11-1
AU - Lee, Joo Yong
AU - Lee, Sang Ho
AU - Kim, Y.
PY - 2007
SP - 151
EP - 156
DO - 10.5220/0002108701510156
PB - SciTePress