Goal-based Evaluation of Text Mining Results in an Industrial Use Case

Jens Drawehn; Matthias Blohm; Maximilien Kintz; Monika Kochanowski

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Goal-based Evaluation of Text Mining Results in an Industrial Use Case

Topics: Document Analysis; Ensemble Methods; Feature Selection and Extraction; Information Retrieval; Natural Language Processing

In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods ICPRAM - Volume 1, 183-191, 2020 , Valletta, Malta

Authors: Jens Drawehn ; Matthias Blohm ; Maximilien Kintz and Monika Kochanowski

Affiliation: Fraunhofer Institute for Industrial Engineering IAO, Nobelstraße 12, 70569 Stuttgart, Germany

Keyword(s): Text Mining, Feature Extraction, Artificial Intelligence, Evaluation.

Abstract: Artificial intelligence boosted the interest in text mining solutions in the last few years. Especially in nonEnglish-speaking countries, where there might not be clear market leaders, a variety of solutions for different text mining scenarios has become available. Most of them support special use cases and have strengths and weaknesses in others. In text or page classification, standard measures like precision, recall, sensitivity or F1-score are prevalent. However, evaluation of feature extraction results requires more tailored approaches. We experienced many issues on the way to benchmarking feature extraction results from text, like whether a result is correct, partly correct, helpful or useless. The main contribution of this work is a method for designing a tailored evaluation procedure in an individual text extraction benchmark for one specific use case. In this context, we propose a general way of mapping the common CRISP-DM process to particularities of text mining projects. Furthermore, we describe possible goals of information extraction, the features to be extracted, suitable evaluation criteria and a corresponding customized scoring system. This is applied in detail in an industrial use case. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.144.29.148

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Drawehn, J., Blohm, M., Kintz, M. and Kochanowski, M. (2020). Goal-based Evaluation of Text Mining Results in an Industrial Use Case. In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-397-1; ISSN 2184-4313, SciTePress, pages 183-191. DOI: 10.5220/0008973801830191

@conference{icpram20,
author={Jens Drawehn and Matthias Blohm and Maximilien Kintz and Monika Kochanowski},
title={Goal-based Evaluation of Text Mining Results in an Industrial Use Case},
booktitle={Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2020},
pages={183-191},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008973801830191},
isbn={978-989-758-397-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Goal-based Evaluation of Text Mining Results in an Industrial Use Case
SN - 978-989-758-397-1
IS - 2184-4313
AU - Drawehn, J.
AU - Blohm, M.
AU - Kintz, M.
AU - Kochanowski, M.
PY - 2020
SP - 183
EP - 191
DO - 10.5220/0008973801830191
PB - SciTePress