Text Analysis of User-Generated Contents for Health-care Applications - Case Study on Smoking Status Classification

Deema Abdal Hafeth; Amr Ahmed; David Cobham

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Text Analysis of User-Generated Contents for Health-care Applications - Case Study on Smoking Status Classification

Topics: Information Extraction; Mining Text and Semi-Structured Data

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 0IC3K, 242-249, 2014 , Rome, Italy

Authors: Deema Abdal Hafeth ; Amr Ahmed and David Cobham

Affiliation: University of Lincoln, United Kingdom

Keyword(s): Smoking Status Classification, Text Mining, User-Generated Contents.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Symbolic Systems

Abstract: Text mining techniques have demonstrated a potential to unlock significant patient health information from unstructured text. However, most of the published work has been done using clinical reports, which are difficult to access due to patient confidentiality. In this paper, we present an investigation of text analysis for smoking status classification from User-Generated Contents (UGC), such as online forum discussions. UGC are more widely available, compared to clinical reports. Based on analyzing the properties of UGC, we propose the use of Linguistic Inquiry Word Count (LIWC) an approach being used for the first time for such a health-related task. We also explore various factors that affect the classification performance. The experimental results and evaluation indicate that the forum classification performs well with the proposed features. It has achieved an accuracy of up to 75% for smoking status prediction. Furthermore, the utilized features set is compact (88 features only ) and independent of the dataset size. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.108

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Abdal Hafeth, D., Ahmed, A. and Cobham, D. (2014). Text Analysis of User-Generated Contents for Health-care Applications - Case Study on Smoking Status Classification. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR; ISBN 978-989-758-048-2; ISSN 2184-3228, SciTePress, pages 242-249. DOI: 10.5220/0005080502420249

@conference{kdir14,
author={Deema {Abdal Hafeth} and Amr Ahmed and David Cobham},
title={Text Analysis of User-Generated Contents for Health-care Applications - Case Study on Smoking Status Classification},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR},
year={2014},
pages={242-249},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005080502420249},
isbn={978-989-758-048-2},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR
TI - Text Analysis of User-Generated Contents for Health-care Applications - Case Study on Smoking Status Classification
SN - 978-989-758-048-2
IS - 2184-3228
AU - Abdal Hafeth, D.
AU - Ahmed, A.
AU - Cobham, D.
PY - 2014
SP - 242
EP - 249
DO - 10.5220/0005080502420249
PB - SciTePress