5. ETL & OLAP Design: The main tasks in this ac-
tivity are:
• ETL design and implementation, that strongly
depends on features of the semantic engine, on
the richness of the meta-data retrieved by the
crawler (e.g., URLs, author, source type), and
on the possible presence of specific data acqui-
sition channels such as CRM.
• KPI design; different kinds of KPIs can be
designed and calculated depending on which
kinds of meta-data the crawler fetches.
• Dashboard design, during which a set of re-
ports is built that captures the user needs ex-
pressed by inquiries during macro-analysis.
6. Execution and Test: has a basic role in the
methodology, as it triggers a new iteration in the
design process. Crawling queries are executed,
the resulting clips are processed, and the reports
are launched over the enriched clips. The specific
tests related to each single activity, described in
the preceding subsections, can be executed sepa-
rately thoughthey are obviouslyinter-related. The
first test executed is normally the one of crawling;
even after a first round, the semantic enrichment
tests can be run on the resulting clips. Similarly,
when the first enriched clips are available, the test
of ETL and OLAP can be triggered.
The analysis of the outcomes of a set of case stud-
ies (Francia et al., 2014) has shown that the adoption
of a proper methodology strongly impacts on the ca-
pability of keeping under control execution time, re-
quired resources and effectiveness of the results. In
particular the key points of the proposed methodol-
ogy are: (1) a clear organization of goals and tasks
for each activity, (2) the adoption of a protocol and a
set of templates to record and share information be-
tween activities and (3) the implementation of a set of
tests to be applied during the methodology phases.
5 CONCLUSIONS
In this paper we discussed some of the key issues
related to the emerging area of SBI. Although some
commercial solutions is already available, this types
of applications deserve further investigations. SBI
is at the crossroad between different disciplines, this
makes researches more challenging but it potentially
opens to more interesting results.
REFERENCES
Castellanos, M., Dayal, U., Hsu, M., Ghosh, R., Dekhil, M.,
Lu, Y., Zhang, L., and Schreiman, M. (2011). LCI:
a social channel analysis platform for live customer
intelligence. In Proc. SIGMOD, pages 1049–1058.
Dayal, U., Gupta, C., Castellanos, M., Wang, S., and
Garc´ıa-Solaco, M. (2012). Of cubes, DAGs and hi-
erarchical correlations: A novel conceptual model for
analyzing social media data. In Proc. ER, pages 30–
49.
Francia, M., Golfarelli, M., and Rizzi, S. (2014). A method-
ology for social bi. In Proc. IDEAS.
Gallinucci, E., Golfarelli, M., and Rizzi, S. (2013). Meta-
stars: multidimensional modeling for social business
intelligence. In Proc. DOLAP, pages 11–18.
Garca-Moya, L., Kudama, S., Aramburu, M., and Berlanga,
R. (2013). Storing and analysing voice of the mar-
ket data in the corporate data warehouse. Information
Systems Frontiers, 15(3):331–349.
Garc´ıa-Moya, L., Kudama, S., Aramburu, M. J., and Lla-
vori, R. B. (2013). Storing and analysing voice of the
market data in the corporate data warehouse. Informa-
tion Systems Frontiers, 15(3):331–349.
Lee, J., Grossman, D. A., Frieder, O., and McCabe, M. C.
(2000). Integrating structured data and text: A multi-
dimensional approach. In Proc. ITCC, pages 264–271.
Liu, B. and Zhang, L. (2012). A survey of opinion mining
and sentiment analysis. In Mining Text Data, pages
415–463. Springer.
Pang, B., Lee, L., and Vaithyanathan, S. (2002). Thumbs
up? sentiment classification using machine learning
techniques. In Proc. EMNLP, volume 10, pages 79–
86.
Pedersen, T. B., Jensen, C. S., and Dyreson, C. E. (2001). A
foundation for capturing and querying complex multi-
dimensional data. Inf. Syst., 26(5):383–423.
Ravat, F., Teste, O., Tournier, R., and Zurfluh, G. (2008).
Top keyword: An aggregation function for textual
document OLAP. In Proc. DaWaK, pages 55–64.
Rehman, N., Mansmann, S., Weiler, A., and Scholl, M.
(2012a). Building a data warehouse for twitter stream
exploration. In Advances in Social Networks Analy-
sis and Mining (ASONAM), 2012 IEEE/ACM Interna-
tional Conference on, pages 1341–1348.
Rehman, N. U., Mansmann, S., Weiler, A., and Scholl,
M. H. (2012b). Building a data warehouse for Twitter
stream exploration. In Proc. ASONAM, pages 1341–
1348.
Taboada, M., Brooke, J., Tofiloski, M., Voll, K. D., and
Stede, M. (2011). Lexicon-based methods for senti-
ment analysis. Computational Linguistics, 37(2):267–
307.
Zhang, D., Zhai, C., and Han, J. (2009). Topic Cube:
Topic modeling for OLAP on multidimensional text
databases. In Proc. SDM, pages 1123–1134.