Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements
Joscha Grüger, Georg Schneider
2019
Abstract
The paper presents a concept and a system for the automatic identification of skills in German-language job advertisements. The identification process is divided into Data Acquisition, Language Detection, Section Classification and Skill Recognition. Online job exchanges served as the data source. For identification of the part of a job advertisement containing the requirements, different machine-learning approaches were compared. Skills were extracted based on a POS-template. For classification of the found skills into predefined skill classes, different similarity measures were compared. The identification of the part of a job advertisement containing the requirements works with the pre-trained LinearSVC model for 100% of the tested job advertisements. Extracting skills is difficult because skills can be written in different ways in the German language – especially since the language allows ad-hoc creation of compound. For extraction of skills, POS templates were used. This approach worked for 87.33% of the skills. The combination of a fasttext model and Levenshtein distance achieved a correct assignment of skills to skill classes for 75.33% of the recognized skills. The results show that extracting required skills from German-language job ads is complex.
DownloadPaper Citation
in Harvard Style
Grüger J. and Schneider G. (2019). Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements.In Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-386-5, pages 226-233. DOI: 10.5220/0008068202260233
in Bibtex Style
@conference{webist19,
author={Joscha Grüger and Georg Schneider},
title={Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements},
booktitle={Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2019},
pages={226-233},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008068202260233},
isbn={978-989-758-386-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements
SN - 978-989-758-386-5
AU - Grüger J.
AU - Schneider G.
PY - 2019
SP - 226
EP - 233
DO - 10.5220/0008068202260233