loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Joscha Grüger 1 and Georg J. Schneider 2

Affiliations: 1 Computer Science Department, Trier University of Applied Sciences, Main Campus, Trier, Germany, University of Trier, Department of Business Information Systems II, 54286 Trier and Germany ; 2 Computer Science Department, Trier University of Applied Sciences, Main Campus, Trier and Germany

Keyword(s): Data Analysis, Web Mining, Natural Language Processing, Information Retrieval, Machine Learning, Job Ads, Skills.

Abstract: The paper presents a concept and a system for the automatic identification of skills in German-language job advertisements. The identification process is divided into Data Acquisition, Language Detection, Section Classification and Skill Recognition. Online job exchanges served as the data source. For identification of the part of a job advertisement containing the requirements, different machine-learning approaches were compared. Skills were extracted based on a POS-template. For classification of the found skills into predefined skill classes, different similarity measures were compared. The identification of the part of a job advertisement containing the requirements works with the pre-trained LinearSVC model for 100% of the tested job advertisements. Extracting skills is difficult because skills can be written in different ways in the German language – especially since the language allows ad-hoc creation of compound. For extraction of skills, POS templates were used. This approac h worked for 87.33% of the skills. The combination of a fasttext model and Levenshtein distance achieved a correct assignment of skills to skill classes for 75.33% of the recognized skills. The results show that extracting required skills from German-language job ads is complex. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.144.21.206

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Grüger, J. and Schneider, G. (2019). Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements. In Proceedings of the 15th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-386-5; ISSN 2184-3252, SciTePress, pages 226-233. DOI: 10.5220/0008068202260233

@conference{webist19,
author={Joscha Grüger. and Georg J. Schneider.},
title={Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements},
booktitle={Proceedings of the 15th International Conference on Web Information Systems and Technologies - WEBIST},
year={2019},
pages={226-233},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008068202260233},
isbn={978-989-758-386-5},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 15th International Conference on Web Information Systems and Technologies - WEBIST
TI - Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements
SN - 978-989-758-386-5
IS - 2184-3252
AU - Grüger, J.
AU - Schneider, G.
PY - 2019
SP - 226
EP - 233
DO - 10.5220/0008068202260233
PB - SciTePress