The remainder of this paper is structured as
follows: Section 2 reviews existing works related to
our research. In Section 3 the proposed approach is
presented along with a description of the data used
and the normalization techniques applied. Section 4
discusses the preliminary results obtained and the
validation of the proposed approach. In section 5 the
benefits of the selected neural network and future
work are discussed. Finally, the conclusions are
drawn in Section 6.
2 RELATED WORK
Traditional statistical methods of data analysis are
well described (Halls-Moore, 2019), (Baralis et al.,
2017), (Bekiros, 2015), (Zhong and Enke, 2017) and
include technologies/methodologies such as Online
Analytical Processing (Chaudhuri and Dayal, 1997)
but are mainly focused on testing pre-formulated
hypotheses. Many researchers have used Recurrent
Neural Networks (RNN) (Hagan at al., 2016), (Dixon,
2018), (Lee and Soo, 2017) to provide predictions for
financial time-series (Chen at al., 2018), (Wang at al.,
2016) but they are not so good at filtering out noise
from very large amounts of input data. Furthermore,
in the selection of the period of interest, fixed time
interval units (weeks, days, hours, minutes) are often
used within which critical information can be lost due
to the smoothing effect of integrating the transaction
data within each fixed equal unit time interval over
the set of intervals covering that period. Such critical
information (dependent on the behaviour of the
financial market participants) might be time intervals
between transactions, large share/contract (volume)
individual transactions, and pivot point indicators. In
contrast to RNN, Convolution Neural Networks
(CNN) process input data in a reduced-resolution
form allowing all data to contribute to the training
process and finding patterns in the data that otherwise
would not be revealed (Samarasinghe, 2016).
A CNN is a type of artificial neural network that
has successfully been applied to analyzing two-
dimensional visual imagery. The work of a CNN (as
applied to an image) is usually interpreted as a
transition from specific image features to more
abstract details via a series of stages culminating in a
set of high-level concepts. At the same time the
network self-tunes its weights and generates the
necessary hierarchy of abstract attributes, filtering
unimportant details and highlighting the essential
properties. The same principle of creating the
abstracts may be employed for analyzing financial
market data. Several approaches have used technical
indicators in market transaction data to identify
patterns (Chen and Liao, 2018), (Dymova,
Sevastjanov and Kaczmarek, 2016), (Chen and Chen,
2016). In (Sezer and Ozbayoglu, 2018), the authors
utilised 15 images of time series charts from stock
market and exchange traded funds and 15 technical
indicators to train a CNN over a 15-day period.
However, technical indicators are based on integral
(averaged) parameters and significant market
information can be lost (Gocken at al., 2016). The
approach of representing financial time-series in the
form of two-dimensional images for further analysis
using standard CNN for image recognition has also
gained popularity (Chen at al., 2016). For the analysis
of medium-term movements, a combination of a long
short-term memory neural network and CNN has
been used (Zhang, Zohren and
Roberts, 2019). All the
above papers examine medium-term movements of
financial markets that are longer than one day. On the
other hand, CNN has also been used to predict
outcomes specifically for high-frequency financial
market data (Doering, Fairbank and Markose, 2017),
although it remains to be seen if this technique
performs better than other machine learning methods.
The purpose of the current research is to present
an intraday trading method that uses a CNN, with raw
tick data with variable interval transactions that
preserve critical information in order to predict the
probabilities of the directions and depths of the next
price movement. As opposed to other research, this
paper does not aim to predict a specific price level in
the future, but only determines the probabilities of
directions and depths of movement, which is
sufficient to open a long or short position in a
financial instrument. Closing a position is carried out
by standard methods of money and risk management.
Thus, this model is a classification algorithm, not a
regression. Also, this paper differs from previous
work by using a CNN to analyse the intraday
movements of the financial market without use of
high-frequency trading (HFT) order books so
allowing utilization of publicly available GPU
machine resources. In addition, this paper presents a
unique method of nonlinear normalisation of prices,
intervals and volumes to improve the quality of
probability prediction. The authors also propose a
method for reducing the number of calculations by
pre-processing the data and considering only patterns
ready for significant movement.