select lagged values that are most correlated to recent
events in a time series. In this study, however, ACF
coefficients were used to introduce time dependency
characteristics as input vectors of AdaGrad in our
method AA-ACF that combines Auto.Arima and
AdaGrad in the time series forecasting.
The results obtained using the method suggest that
positive gains can be observed in different datasets,
and for different forecasting horizons. For the 19
subsets evaluated, the average gain varied from
1.65% (for 6 to 48 forecasts) to 2.081% (when
considered forecasting horizon equals 1). The gain
values calculated for TSDL datasets (5.528% for
frequency 1 series, 12.429% for frequency 4 series,
and 8.307% for frequency 12 series) were expressive.
Besides that, the results obtained by the processing of
the 1428 series of M3 monthly dataset (3.875%) also
can be noted.
The combined use of different methods is not a
novelty. However, a combination of a statistical
algorithm and DSM methods is not evident in the
literature, especially for the prediction of series
relatively short in length.
Finally, we highlight that the experiments were
performed using time series with fixed length and
using a batch mode processing. Therefore, the support
to data streams is envisioned in future versions, as
well as tests with intermittent time series data, helping
to assess the method's applicability in other scenarios.
Additional studies regarding the algorithm selection
strategy shall be evaluated, as the analysis of the
oracle established that the best overall forecast results
can be obtained designating AdaGrad for 55,912
(39.5%) from the 141,558 analysed series. Thus,
improvements in the selection criteria having the
oracle as a goal may lead to better overall results.
ACKNOWLEDGEMENTS
This study was financed in part by the Coordenação
de Aperfeiçoamento de Pessoal de Nível Superior -
Brasil (CAPES) - Finance Code 001.
REFERENCES
Armstrong, J. (1985). Long-range forecasting: from crystal
ball to computer. Wiley, 2nd edition edition.
Bifet, A., Gavalda, R., Holmes, G., and Pfahringer, B.
(2018). Machine Learning for Data Streams with
Practical Examples in MOA. MIT Press.
Bifet, A., Holmes, G., Kirkby, R., and Pfahringer, B. (2010).
MOA: Massive Online Analysis. Technical report.
Bischl, B., Lang, M., Bossek, J., Horn, D., Richter, J., and
Surmann, D. (2017). BBmisc: Miscellaneous Helper
Functions for B. Bischl. R package version 1.11.
Bontempi, G., Ben Taieb, S., and Le Borgne, Y.-A. (2013).
Machine Learning Strategies for Time Series
Forecasting, volume 138.
Box, G. and Jenkins, G. (1976). Time series analysis:
forecasting and control. Holden-Day series in time
series analysis and digital processing. Holden-Day.
Dong, E., Du, H., and Gardner, L. (2020). An interactive
web-based dashboard to track covid-19 in real time. The
Lancet Infectious Diseases, 20(5):533–534.
Duchi, J., Hazan, E., and Singer, Y. (2011). Adaptive
subgradient methods for online learning and stochastic
optimization. J. Mach. Learn. Res., 12(null):2121–
2159.
Duong, Q.-H., Ramampiaro, H., and Nørvåg, K. (2018).
Applying temporal dependence to detect changes in
streaming data. Applied Intelligence, 48(12):4805–
4823.
Esling, P. and Agon, C. (2012). Time-series data mining.
ACM Comput. Surv., 45(1).
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., and
Bouchachia, A. (2014). A survey on concept drift
adaptation. ACM Computing Surveys, 46(4):1–37.
Hamner, B. and Frasco, M. (2018). Metrics: Evaluation
Metrics for Machine Learning. R package version
0.1.4.
Hyndman, R., Athanasopoulos, G., Bergmeir, C., Caceres,
G., Chhay, L., O’Hara-Wild, M., Petropoulos, F.,
Razbash, S., Wang, E., and Yasmeen, F. (2019).
forecast: Forecasting functions for time series and
linear models. R package version 8.9.0.9000.
Hyndman, R. and Yang, Y. (2020). tsdl: Time Series Data
Library. https://finyang.github.io/tsdl/,
https://github.com/FinYang/tsdl.
Hyndman, R. J. (2014). Forecasting: Principles and
Practice.
Hyndman, R. J. and Khandakar, Y. (2008). Automatic time
series forecasting: the forecast package for R. Journal
of Statistical Software, 26(3):1–22.
Makridakis, S. (1976). A Survey of Time Series. Technical
Report 1.
Makridakis, S. and Hibon, M. (2000). The M3-competition:
Results, conclusions and implications. International
Journal of Forecasting, 16:451–476.
Makridakis, S., Spiliotis, E., and Assimakopoulos, V.
(2018a). The M4 competition: Results, findings,
conclusion and way forward. International Journal of
Forecasting, 34(4):802 – 808.
Makridakis, S., Spiliotis, E., and Assimakopoulos, V.
(2018b). Statistical and machine learning forecasting
methods: Concerns and ways forward. PLOS ONE,
13(3):1–26.
Makridakis, S., Spiliotis, E., and Assimakopoulos, V.
(2020). The M5 accuracy competition: Results,
findings and conclusions.
Mochinski, M. A., Paul Barddal, J., and Enembreck, F.
(2020). Improving multiple time series forecasting with
data stream mining algorithms. In 2020 IEEE