5 CONCLUSION
In this study, Cinema Ensemble Model (CEM) is
introduced to predict the financial success of a movie.
The model is based on machine learning methods and
six features of movies: genre, sequel, number of plays
on the first day of release, movie buzz before the
release, transmedia storytelling and star buzz. Then
seven different machine learning classification
algorithms are used to predict the level of box office
for movies: adaptive tree boosting, gradient tree
boosting, linear discriminant, logistic regression,
neural networks, random forests and support vector
classifier. After evaluating the performance of each
model, it is shown that GTB, LD, LR and FR have the
best performances, so they are selected to be the
component models in the ensemble model (CEM).
The four component models estimate the class and
the class of the largest number of votes wins the
estimation. The result shows that CEM model has
58.5% of accuracy, which is generally higher than
previous researches.
This approach of movie box office prediction can
be applied by movie investors and cinemas to make
financial decisions before movies are released. That
is, to decide whether to invest the movie or how much
screen will be provided for that movie. It will help
them to make more accurate and advisable decisions
and therefore produce more revenue for the film
industry.
For researches in the future, firstly, more kinds of
component models can be used and more times of
trials can be applied to increase the accuracy of the
CEM model. Secondly, more features can be used to
identify the movies. For example, sentiment analysis
for dialogues in movies can be applied to describe the
structure of movies and the peaks in movies.
REFERENCES
Dhar, TI., Sun, G.,& Weinberg, C. B. (2012). The long-term
box office performance of sequel movies. Marketing
Letters, 23(1), 13-29.
Lee, K., Park, J., Kim, I. et al. Predicting movie success
with machine learning techniques: ways to improve
accuracy. Inf Syst Front 20, 577–588 (2018).
https://doi.org/10.1007/s10796-016-9689-z.
Liu, Y. (2006). Word of mouth for movies: its dynamics and
impact on box office revenue. Journal of Marketing,
70(3), 74 89.
Murschetz, P. C., Bruneel, C. ., Guy, J.-L. ., Haughton, D. .,
Lemercier, N. ., McLaughlin, M.-D. ., Mentzer, K. .,
Vialle, Q. ., Zhang, C. ., Murschetz, P. C. ., &
Bakhtawar, B. (2020). Movie Industry Economics:
How Data Analytics Can Help Predict Movies’
Financial Success. Nordic Journal of Media
Management, 1(3), 339–359.
https://doi.org/10.5278/njmm.2597-0445.5871.
Sharda, R., & Delen, D. (2006). Predicting box-office
success of motion pictures with neural networks.
Expert Systems with Applications,30(2), 243-254.
Steven F. Lehrer, Tian Xie (2021) The Bigger Picture:
Combining Econometrics with Analytics Improves
Forecasts of Movie.