Hadoop-based Framework for Information Extraction from Social Text

Ferdaous Jenhani, Mohamed Salah Gouider, Lamjed Bensaid

2017

Abstract

Social data analysis becomes a real business requirement regarding the frequent use of social media as a new business strategy. However, their volume, velocity and variety are challenging their storage and processing. In a previous contribution [11, 12], we proposed an events extraction system in which we focused only on data variety and we did not handle volume and velocity dimensions. So, our solution cannot be considered a big data system. In this work, we port previously proposed system to a parallel and distributed framework in order to reduce the complexity of task and scale up to larger volumes of data continuously growing. We propose two loosely coupled Hadoop clusters for entity recognition and events extraction. In experiments, we carried time test and accuracy test to check the performance of the system on extracting drug abuse behavioral events from 1000000 tweets. Hadoop-based system achieves better performance compared to old system.

Download


Paper Citation


in Harvard Style

Jenhani F., Salah Gouider M. and Bensaid L. (2017). Hadoop-based Framework for Information Extraction from Social Text.In Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, ISBN 978-989-758-271-4, pages 233-237. DOI: 10.5220/0006501402330237


in Bibtex Style

@conference{kdir17,
author={Ferdaous Jenhani and Mohamed Salah Gouider and Lamjed Bensaid},
title={Hadoop-based Framework for Information Extraction from Social Text},
booktitle={Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR,},
year={2017},
pages={233-237},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006501402330237},
isbn={978-989-758-271-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR,
TI - Hadoop-based Framework for Information Extraction from Social Text
SN - 978-989-758-271-4
AU - Jenhani F.
AU - Salah Gouider M.
AU - Bensaid L.
PY - 2017
SP - 233
EP - 237
DO - 10.5220/0006501402330237