Sub_period_6_News is the total viewing time
for genre News in the time period
21:00 – 00:00.
Weekday classification: 1 for weekday and
0 for weekend.
Age_Gender_Group is a household
composition by age and gender.
Max_cos_similarity is the largest coefficient of
similarity with virtual users of the same
household of the same inhabitant of the age and
sex group.
Day is viewing day.
Table 5: Household ID 2-L4A-440 viewing data.
Day Sub_
period_
1_
Animation
Sub_
period_
6_
News
Week
days
Age_
gender_
group
21.09.
2019.
0 2.1 0
G44-F*
G44-M
22.09.
2019.
1.3 2.9 0
G44-F*
G44-M
23.09.
2019.
0 1 1
G44-F*
G44-M
24.09.
2019.
0 0 1
G44-F*
G44-M
5 CONCLUSIONS
The quality of the predictive model is significantly
affected by the quality of the viewing data, which
means that it is essential to carry out data quality
checks and to produce data in line with the needs of
the predictive model.
It would be necessary to establish/implement
monitoring tools to operate the prototype in the
production environment, which would allow the
correct execution of the model during training and
predictive operation.
Suggestions for model quality maintenance
include the following: the prototype validation can be
done by testing against the composition of household
inhabitants. This way, the accuracy of the prototype
can be checked by forecasting the composition of
household inhabitants using a test data set that not
used to train the model. Such a test data set may be
obtained by repeated questionnaires or by obtaining
data on the actual composition of household
inhabitants in any other way.
The prototype can be supplemented with
automated error handling and checksum verification
mechanisms in each of the model execution steps,
which will allow checking the identification of the
model's performance and input data quality level.
Assess how quickly the full training cycle of the
prototype achieved and how long each of the model
training steps performed. This information will be
essential for developing a product solution and
measuring individual components of the solution or
optimizing the speed of data loading.
Given that prototype components have been
created using open-source libraries and programming
languages, it is essential to develop a scalable
architecture for a prototype production version and to
deploy extensive data and analysis platforms at the
disposal of Tet.
ACKNOWLEDGEMENTS
The research leading to these results has received
funding from the research project "Development of a
solution for multi-factorial television real-time
audience profiling and responsive ads targeting" of
EU Structural funds, contract No. 1.1.1.1/18/A/127
signed between SIA “Tet” and Central Finance and
Contracting Agency.
REFERENCES
Adomavicius, G., Tuzhilin, A., 2015. Context-Aware
Recommender Systems, in Recommender Systems
Handbook. Springer, Boston, MA, pp. 191–226.
D'Mello, S. K., Kory, J., 2015. A review and meta-analysis
of multimodal affect detection systems. ACM
Computing Surveys (CSUR), 47(3), 43.
Goldberg, L. R., 1993. The structure of phenotypic
personality traits. American psychologist, 48(1), 26.
Gomez-Uribe, C. A., Hunt, N., 2015. The Netflix
Recommender System: Algorithms, Business Value,
and Innovation, ACM Trans. Manage. Inf. Syst., vol. 6,
no. 4, pp. 13:1–13:19.
Masthoff, J., 2015. Group Recommender Systems:
Aggregation, Satisfaction and Group Attributes, in
Recommender Systems Handbook. Springer, Boston,
MA, pp. 743–776.
Veras, D., Prota, T., Pereira, A., Prudencio, R., and Ferraz,
C., 2015, A literature review of recommender systems
in the television domain, Expert Systems with
Applications, vol. 42, no. 22, pp. 9046–9076.
Vyncke, P., 2002. Lifestyle segmentation: From attitudes,
interests and opinions, to values, aesthetic styles, life
visions and media preferences. European journal of
communication, 17(4), pp. 445-463.
Wang, Z., He, L., 2016. User identification for enhancing
IP-TV recommendation. Knowledge-Based Systems 98,
pp. 68-75.