Hadoop-based Framework for Information Extraction from Social Text
Ferdaous Jenhani, Mohamed Salah Gouider, Lamjed Bensaid
2017
Abstract
Social data analysis becomes a real business requirement regarding the frequent use of social media as a new business strategy. However, their volume, velocity and variety are challenging their storage and processing. In a previous contribution [11, 12], we proposed an events extraction system in which we focused only on data variety and we did not handle volume and velocity dimensions. So, our solution cannot be considered a big data system. In this work, we port previously proposed system to a parallel and distributed framework in order to reduce the complexity of task and scale up to larger volumes of data continuously growing. We propose two loosely coupled Hadoop clusters for entity recognition and events extraction. In experiments, we carried time test and accuracy test to check the performance of the system on extracting drug abuse behavioral events from 1000000 tweets. Hadoop-based system achieves better performance compared to old system.
DownloadPaper Citation
in Harvard Style
Jenhani F., Salah Gouider M. and Bensaid L. (2017). Hadoop-based Framework for Information Extraction from Social Text.In Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, ISBN 978-989-758-271-4, pages 233-237. DOI: 10.5220/0006501402330237
in Bibtex Style
@conference{kdir17,
author={Ferdaous Jenhani and Mohamed Salah Gouider and Lamjed Bensaid},
title={Hadoop-based Framework for Information Extraction from Social Text},
booktitle={Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR,},
year={2017},
pages={233-237},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006501402330237},
isbn={978-989-758-271-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR,
TI - Hadoop-based Framework for Information Extraction from Social Text
SN - 978-989-758-271-4
AU - Jenhani F.
AU - Salah Gouider M.
AU - Bensaid L.
PY - 2017
SP - 233
EP - 237
DO - 10.5220/0006501402330237