Innovation Cycles Control through Markov Decision Processes
Vassil S. Sgurev, Stanislav T. Drangajov, Lyubka A. Doukovska and Vassil G. Nikov
Institute of Information and Communication Technologies, Bulgarian Academy of Sciences,
Acad. G. Bonchev Str., Bl. 2, 1113 Sofia, Bulgaria
sgurev@bas.bg, sdrangajov@gmail.com, doukovska@iit.bas.bg, vasilnikov@abv.bg
Keywords: Innovation Introduction, Markov Decision Processes.
Abstract: Innovations are introduced in several cycles, or steps which are of stochastic character. Successful
completion of each cycle results in the beginning of the next one. Initial stages are connected with expenses
of risk (venture) capital and the investments are returned in the final stages, usually with quite big profit. A
helpful approach for control of the innovation process is the use of Markov decision processes which have
proved to be an efficient tool for control of multi state stochastic processes. Those stages may be
summarizes as: 1 – prestart stage; 2 – start stage; 3 – initial expansion stage; 4 – quick expansion stage; 5 –
stage of reaching liquidity of venture investments; 6 – stage of project failure and its cancelling. The
transition from state to state may be controlled through control techniques of Markov Decision Processes so
that maximum profit is achieved in shortest time. The stages are conditional and some of them may be
united, e.g. 1 and 2, or 3 and 4.
1 INTRODUCTION
It is known that the innovations’ introduction
through the respective innovation cycles as a rule is
accompanied with considerable uncertainty and it is
of definitely expressed stochastic character. As the
successful completion of each innovation project
very often results in considerable profit this
stimulates the investment of considerable venture
(risk) means. A very important task arises for
preliminary careful considering and calculating the
stochastic character of the on going processes.
A multi step discrete Markov decision process
with mixed policies is proposed in the present work,
for the innovation risks interpretation. The
innovation process is accomplished, and probably
finished, as a rule, in a cycle of the following 6
stages: 1 – prestart and start stage; 2 – initial
expansion stage; 3 – quick expansion stage; 4 –
preparatory stage; 5 – stage of reaching liquidity of
the venture investment; 6 – stage of project failure
and its liquidation (Grossi, 1990, Cormican, 2004,
Bernsteina, 2006). Besides, the process at each stage
may be in different states where the decision maker
may undertake different actions which result in the
transition to a new state with respective profits and
losses. The first three stages are connected with
initial investments and respective losses. The
objective is they to be minimized. The last three
stages may generate profit and ensure full return of
the investments and considerable gains, but it may
also result in considerable loss if the innovation
product is a failure. It is to be clearly noticed that the
innovation introduction is a risky enterprise and not
each attempt is successful and winning.
It should be explicitly noticed that the innovation
process may only pass from a given stage to the next
one and can never return to a previous stage. No
other stages except the last ones – success or failure,
are absorbing - i.e. the innovation process may not
stay for ever in any of the initial stages or it fails.
The process may stay in a given stage for some time.
It is a responsibility of the decision maker to
undertake such control actions that the process
leaves as soon as possible the first three stages,
which generate expenses, with min losses and
reaches the final stage, which generate profit.
It is to be also noted, that depending on the
decision makers actions a stage may be omitted, e.g.
to pass directly from stage r to stage r+2. I.e. stages
so described are to some degree conditional but
nonetheless the process may develop in only forward
direction.
286
S. Sgurev V., T. Drangajov S., Doukovska L. and G. Nikov V.