contain a hierarchy of primitive patterns. They were
classified into five (5) groups providing an average
hit ratio of 72% for buy/shell signals. Additionally,
another study made by Gaginalp and Laurent (1998),
showed that specific candlestick patterns (“three-
white-soldiers”, “three-black-crows” etc.) have
predictive capability and indicate a profit of 1%
during a two-day holding period. In contrary, in a
paper by Marshall et al., (2006), the authors found
candlestick technical analysis has no value on U.S.
Dow Jones Industrial Average stocks during the
period from 1992–2002. The common characteristic
of all previous studies is that all are dealing with a-
priori known specific patterns. A related work of
discovering unknown (hidden) profitable candlestick
patterns was carried out by Sheng et al., (2006).
They designed a Knowledge Representation Model
which held the information of three (3) successive
candlesticks using a bit codification method, called
Relative Price Movement (RPM). The training daily
data was from January 1, 1994 to December 31,
1998, of 82 stocks, while the testing data was from
January 1, 1999 to December 31, 1999. In total, in
the test set, the mined patterns occurred less than
100 times.
1.2 Position
Our position states that by using advanced
computational methods (machine learning, data
mining, computational intelligence) for pattern
recognition, one can obtain high quality actionable
information (knowledge) that no one else has.
Particularly, although the same raw information is
available to everyone, not everyone has the ability to
analyze it successfully and so there is opportunity
for profit while the market adjusts its prices. So, our
hypothesis is that while the market is mostly
efficient for large periods of time, there exist periods
of time (starting and ending at certain events) when
the efficient market hypothesis breaks down. During
these periods, profits can be realized.
Our preliminary results found more than 15,000
different patterns which can predict the direction of
next day(s) price. These hidden patterns cannot be
visually detected by a trader not only due to the
large number (thousands) of patters but also due to
complexity of information they present. There is no
common point with the well-known candlestick
patterns (which are not more than 100).This work
starts with the research question: Are there any
candlestick patterns useful beyond the well-known
candlestick patterns? If there are, how we can
discover them? How we can construct a dynamic
rule-based system to extract such information?
2 SYSTEM ARCHITECTURE
Figure 1: The Rules Codification Engine.
The system architecture consists of an engine
that produces ten (10) different methods of
generating patterns and a lot of dynamic statistical
and technical indicator filters. The specification
engine contains rule-based expressions which can
record three types of information: Bits refered to
candlestick itself, bits refered to exact position
(relationship) among two or more candlesticks and
bits presenting the strength of the price movement.
The translation system uses these rules in a specific
order to construct ten (10) types of binary products:
Pattern of 3 candlesticks (simple or detailed), Pattern
of 4 candlesticks (simple or detailed), Pattern of 5
candlesticks (simple or detailed), Pattern of 3
candlesticks with complicated codified filters,
Pattern of 4 candlesticks with complicated codified
filters, Pattern of 3 candlesticks with numerical
simple filters, Pattern of 4 candlesticks with
numerical simple filters. The created binary pattern
is stored in a relational database which corresponds
to a specific day and stock price data (open, high,
low, close of specific day). Around five million
patterns were stored in the database (4,982,994 to be
exact). For clarity, we present one bit-condification
rule for each category:
i) Condification rule for Simple pattern: if body
down of the candlestick is geater than its (high +
low)/2 return ‘1’ else return ‘0’.
ii) Condification Rule for Simple pattern
relationship: If current open price is greater than
previous close price return ‘1’ else return ‘0’.
iii) Detailed Condification Rule (strangth): If open-
lose difference is less than moving average (MA) of
UsefulPatternMiningonTimeSeries-ApplicationsintheStockMarket
609