In general terms all of the classifiers, with the ex-
ception of BayesNet, achieve good results. In terms
of its F-Score results, RandomForest outperforms the
rest, achieving the best F-Score in most of the cases
(6/9), followed by IBk (3/9). According to this criteria
we can sort these five classifiers by their performance:
RandomForest, IBk, PART, J48 and BayesNet.
In order to verify and provide quantitative evi-
dences for supporting these results, tests of statisti-
cal significance were performed to determine if sta-
tistical differences existed among the performances
achieved by the five classifiers. The test results con-
firmed that RandomForest, IBk and PART are sta-
tistically the best classifiers, although RandomForest
achieved slightly better results. On the other hand,
J48 and BayesNet present significant performance
differences, and therefore they do not show to be so
valid from an experimental viewpoint.
In this work we have conceived a feasible model
for providing rapid and accurate linguistic predictions
in an intermediate language composed of linguistic la-
bels. As future work we aim to extend this analysis to
test with more powerful classification methods such
as SVM or Artificial Neural Networks and develop a
NLG-oriented approach which generate textual fore-
casts from the intermediate language obtained by the
classifiers.
ACKNOWLEDGEMENTS
This work was supported by the Spanish Ministry of
Economy and Competitiveness under grant TIN2011-
29827-C02-02. I. Rodriguez-Fdez is supported by the
Spanish Ministry of Education, under the FPU Fel-
lowships Plan. A. Ramos-Soto is supported by the
Spanish Ministry for Economy and Competitiveness
(FPI Fellowship Program). This work was also sup-
ported in part by the European Regional Development
Fund (ERDF/FEDER) under grants CN2012/151 and
GRC2014/030 of the Galician Ministry of Education.
REFERENCES
Adeyanju, I. (2012). Generating weather forecast texts with
case based reasoning. International Journal of Com-
puter Applications, 45.
AEMET (2014). Spanish meteorological agency website.
http://www.aemet.es/, Retrieved: 2014-10-08.
Aha, D. W., Kibler, D., and Albert, M. K. (1991).
Instance-based learning algorithms. Machine learn-
ing, 6(1):37–66.
Bouckaert, R. R. (2004). Bayesian network classifiers in
weka. Department of Computer Science, University
of Waikato.
Breiman, L. (2001). Random forests. Machine learning,
45(1):5–32.
Coch, J. (1998). Multimeteo: multilingual production of
weather forecasts. ELRA Newsletter, 3(2).
Finner, H. (1993). On a monotonicity problem in step-down
multiple test procedures. Journal of the American Sta-
tistical Association, 88(423):920–923.
Frank, E. and Witten, I. H. (1998). Generating accurate
rule sets without global optimization. In Shavlik, J.,
editor, Fifteenth International Conference on Machine
Learning, pages 144–151. Morgan Kaufmann.
Goldberg, E., Driedger, N., and Kittredge, R. I. (1994). Us-
ing natural-language processing to produce weather
forecasts. IEEE Expert, 9(2):45–53.
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann,
P., and Witten, I. H. (2009). The weka data min-
ing software: an update. ACM SIGKDD explorations
newsletter, 11(1):10–18.
Iman, R. L. and Davenport, J. M. (1980). Approximations
of the critical region of the fbietkan statistic. Commu-
nications in Statistics-Theory and Methods, 9(6):571–
595.
Intellicast (2014). Intellicast website. http://
www.intellicast.com/, Retrieved: 2014-10-08.
MetOffice (2014a). British meteorological office data2text
website. http://www.metoffice.gov.uk/public/weather/
forecast-data2text, Retrieved: 2014-10-08.
MetOffice (2014b). British meteorological office website.
http://www.metoffice.gov.uk/, Retrieved: 2014-10-08.
NWF (2014). National weather forecast website. http://
www.weather.gov/, Retrieved: 2014-10-08.
Quinlan, J. R. (1993). C4. 5: programs for machine learn-
ing, volume 1. Morgan kaufmann.
Ramos Soto, A., Bugarin, A., Barro, S., and Taboada, J.
(2014). Linguistic descriptions for automatic gener-
ation of textual short-term weather forecasts on real
prediction data. IEEE Transactions on Fuzzy Systems,
Early Access.
Reiter, E., Dale, R., and Feng, Z. (2000). Building natural
language generation systems, volume 33. MIT Press.
Sokolova, M., Japkowicz, N., and Szpakowicz, S. (2006).
Beyond accuracy, f-score and roc: a family of dis-
criminant measures for performance evaluation. In AI
2006: Advances in Artificial Intelligence, pages 1015–
1021. Springer.
STAC (2014). Stac: Web platform for algorithms compar-
ison through statistical tests. http://tec.citius.usc.es/
stac/, Retrieved: 2014-10-08.
Van Deemter, K., Krahmer, E., and Theune, M. (2005).
Real versus template-based natural language genera-
tion: A false opposition? Computational Linguistics,
31(1):15–24.
WeatherForecast (2014). Weather-forecast website. http://
www.weather-forecast.com/, Retrieved: 2014-10-08.
ICAART2015-InternationalConferenceonAgentsandArtificialIntelligence
562