Authors:
Jaakko Tervonen
1
;
Tuomas Sormunen
1
;
Arttu Lämsä
1
;
Johannes Peltola
1
;
Heidi Kananen
2
and
Sari Järvinen
1
Affiliations:
1
VTT Technical Research Centre of Finland, Kaitoväylä 1, Oulu, Finland
;
2
Kaleva Media, Solistinkatu 4, Oulu, Finland
Keyword(s):
BERT, Headline Effectiveness, Journalism, Machine Learning, Natural Language Processing.
Abstract:
The decision to read an article in online news media or social networks is often based on the headline, and thus writing effective headlines is an important but difficult task for the journalists and content creators. Even defining an effective headline is a challenge, since the objective is to avoid click-bait headlines and be sure that the article contents fulfill the expectations set by the headline. Once defined and measured, headline effectiveness can be used for content filtering or recommending articles with effective headlines. In this paper, a metric based on received clicks and reading time is proposed to classify news media content into four classes describing headline effectiveness. A deep neural network model using the Bidirectional Encoder Representations from Transformers (BERT) is employed to classify the headlines into the four classes, and its performance is compared to that of journalists. The proposed model achieves an accuracy of 59% on the four-class classificat
ion, and 72-78% on corresponding binary classification tasks. The model outperforms the journalists being almost twice as accurate on a random sample of headlines.
(More)