Also noteworthy is the fact that by constructing
high quality features, simple methods, such as linear
regression can work as well or even better than other
more sophisticated algorithms (such as Random
Forests, SVR, etc.). By using less complex models,
optimal model parameters are found more readily,
the models run a lot faster, and they are easier to
understand and maintain.
We also state that the main disadvantage of
models presented in this research, is its inability to
predict unusual traffic events. Even though common
traffic status is informative for a commuter in a new
environment, unusual traffic is the most informative
information for local commuter who is aware of
usual traffic. The main reason for this disadvantage
is that current models uses only historical traffic
data. Since, some of unusually traffic events are
caused by other related events (such as nearby traffic
accidents, bad weather, holidays, etc.), we believe
that by including additional data sources in the
model, prediction of such events could be
significantly improved.
Therefore, our future plan is to collect several
quality traffic related data sources (such as weather
forecasts, traffic alerts, special days statuses, bigger
social events, etc.) and fuse them with loop counters
data in order to generate better traffic prediction
models. We intend to test different data fusion
approaches, such as: early (or full) integration;
which transforms data sources into a single feature-
based table, late (or decision) integration; where
each data source give rise to a separate model and
predictions are later fused, and intermediate (or
partial) integration, where data are fused through
inference of a single joint model with a recent matrix
factorization based algorithms, providing very good
results in the field of bioinformatics (Žitnik and
Zupan, 2013).
ACKNOWLEDGEMENTS
This work was supported by Slovenian Research
Agency and the ICT Programme of the EC under
MobiS (FP7-318452).
REFERENCES
Draper, N.R. and Smith, H., 1998. Applied regression
analysis, New York: Wiley, 3
th
edition.
Van Hinsenberg, C.P.I., Lint, J.W.C. Van and Sanders,
F.M., 2007. Short Term Traffic Prediction Models.
Proc., 14th World Congress on Intelligent Transport
Systems: ITS for a Better Life.
Karlaftis, M.G. and Vlahogianni, E.I., 2011. Statistical
methods versus neural networks in transportation
research: Differences, similarities and some insights.
Transportation Research Part C: Emerging
Technologies, 19(3), pp.387–399.
Van Lint, J.W.C. and Van Hinsbergen, C.P.I.., 2012.
Short-Term Traffic and Travel Time Prediction
Models, in Artificial Intelligence Applications to
Critical Transportation Issues. Transportation
Research Circular, E-C168 (November).
Liu, Y. et al., 2006. A scalable distributed stream mining
system for highway traffic data. Knowledge Discovery
in …, pp.309–321.
Pedregosa, F. et al., 2011. Scikit-learn: Machine Learning
in Python. Journal of Machine Learning Research, 12,
pp.2825–2830.
Rice, J. and van Zwet, E., 2004. A simple and effective
method for predicting travel times on freeways.
Intelligent Transportation Systems, IEEE
Transactions, pp.200–207.
Stathopoulos, A. and Karlaftis, M.G., 2003. A multivariate
state space approach for urban traffic flow modeling
and prediction. Transportation Research Part C:
Emerging Technologies, 11(2), pp.121–135.
Vanajakshi, L. and Rilett, L.R., 2007. Support Vector
Machine Technique for the Short Term Prediction of
Travel Time. 2007 IEEE Intelligent Vehicles
Symposium, pp.600–605.
Vlahogianni, E.I., Karlaftis, M.G. and Golias, J.C., 2014.
Short-term traffic forecasting: Where we are and
where we’re going. Transportation Research Part C:
Emerging Technologies.
Zhang, J. et al., 2011. Data-Driven Intelligent
Transportation Systems: A Survey. IEEE Transactions
on Intelligent Transportation Systems, 12(4), pp.1624–
1639.
Žitnik, M. and Zupan, B., 2013. Data Fusion by Matrix
Factorization. IEEE Transactions on Pattern Analysis
and Machine Intelligence, pp.13.
TrafficFlowPredictionfromLoopCounterSensorDatausingMachineLearningMethods
127