Authors:
Ermelinda Oro
and
Massimo Ruffolo
Affiliation:
National Research Council (CNR) and Technest University of Calabria, Italy
Keyword(s):
Big Data, Data Integration, Web Extraction, Text Analysis, Text Processing, Social Network, Sentiment Analysis, Knowledge Extraction, Smart Tourism, Big Sport Event, Tennis Italian Open.
Related
Ontology
Subjects/Areas/Topics:
Biomedical Engineering
;
Cloud Computing
;
Coupling and Integrating Heterogeneous Data Sources
;
Data Engineering
;
Databases and Information Systems Integration
;
Enterprise Information Systems
;
Health Information Systems
;
Information Systems Analysis and Specification
;
Knowledge Management
;
Ontologies and the Semantic Web
;
Semantic Web Technologies
;
Services Science
;
Society, e-Business and e-Government
;
Software Agents and Internet Computing
;
Web 2.0 and Social Networking Controls
;
Web Information Systems and Technologies
Abstract:
Big data generated across the web is assuming growing importance in producing insights useful to understand
real-world phenomena and to make smarter decisions. The tourism is one of the leading growth sectors, therefore,
methods and technologies that simplify and empower web contents gathering, processing, and analysis
are becoming more and more important in this application area. In this paper, we present a web content analytics
method that automates and simplifies content extraction and acquisition from many different web sources,
like newspapers and social networks, accelerate content cleaning, analysis, and annotation, makes faster insights
generation by visual exploration of analysis results. We, also, describe an application to a real-world
use case regarding the analysis of the touristic impact of the Italian Open tennis tournament. Obtained results
show that our method makes the analysis of news and social media posts more easy, agile, and effective.