loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Luca Deri 1 ; Maurizio Martinelli 1 ; Daniele Sartiano 2 and Loredana Sideri 1

Affiliations: 1 CNR, Italy ; 2 University of Pisa, Italy

Keyword(s): Internet Domain, Web-Content Classification, HTTP Crawling, Web Mining, SVM.

Abstract: Web classification is used in many security devices for preventing users to access selected web sites that are not allowed by the current security policy, as well for improving web search and for implementing contextual advertising. There are many commercial web classification services available on the market and a few publicly available web directory services. Unfortunately they mostly focus on English-speaking web sites, making them unsuitable for other languages in terms of classification reliability and coverage. This paper covers the design and implementation of a web-based classification tool for TLDs (Top Level Domain). Each domain is classified by analysing the main domain web site, and classifying it in categories according to its content. The tool has been successfully validated by classifying all the registered .it Internet domains, whose results are presented in this paper.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.206.13.112

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Deri, L.; Martinelli, M.; Sartiano, D. and Sideri, L. (2015). Large Scale Web-Content Classification. In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - SSTM; ISBN 978-989-758-158-8; ISSN 2184-3228, SciTePress, pages 545-554. DOI: 10.5220/0005635605450554

@conference{sstm15,
author={Luca Deri. and Maurizio Martinelli. and Daniele Sartiano. and Loredana Sideri.},
title={Large Scale Web-Content Classification},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - SSTM},
year={2015},
pages={545-554},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005635605450554},
isbn={978-989-758-158-8},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - SSTM
TI - Large Scale Web-Content Classification
SN - 978-989-758-158-8
IS - 2184-3228
AU - Deri, L.
AU - Martinelli, M.
AU - Sartiano, D.
AU - Sideri, L.
PY - 2015
SP - 545
EP - 554
DO - 10.5220/0005635605450554
PB - SciTePress