et al. (2012) using robust optimization and learning
approaches. Many models consider continuous time
models with finite horizon and limited inventory. In
most existing models, discounting is not included and
the demand is assumed to be of a special functional
form. We consider infinite horizon models with un-
limited inventory (i.e., products can be reproduced or
reordered). Demand is allowed to depend generally
on time as well as the prices of all market participants.
While many publications concentrate on (the ex-
istence of) equilibrium strategies, we do not assume
that all market participants act rationally. In many
markets it can be observed that automated strategies
that are used by firms are relatively simple and aggres-
sive. The most common strategy is to slightly under-
cut the competitor’s price, cf. Kephart et al. (2000).
In order to be able to respond to various potentially
suboptimal pricing strategies we provide applicable
solution algorithms that allow to compute optimal re-
sponse strategies.
The main contribution of this paper is threefold.
We (i) derive optimal price response strategies that
anticipate competitors’ prices, (ii) we quantify the im-
pact of different (randomized) reaction times on ex-
pected long-term profits of all market participants,
and (iii) we are able to explain different types of price
cycles.
This paper is organized as follows. In Section 2,
we describe the stochastic dynamic oligopoly model
with infinite time horizon (durable goods). We allow
sales probabilities to depend on competitor prices as
well as on time (seasonal effects). The state space
is characterized by time and the actual competitors’
prices. The stochastic dynamic control problem is ex-
pressed in discrete time. In Section 3, we consider
a duopoly competition. The competitor is assumed
to frequently adjust its prices using a predetermined
strategy. We assume that the price reactions of com-
petitors as well as their reaction times can be antic-
ipated. We set up a firm’s Hamilton-Jacobi-Bellman
equation and use recursive methods (value iteration)
to approximate the value function. We are able to
compute optimal feedback prices as well as expected
long-term profits of the two competing firms. Evalu-
ating price paths over time, we are able to explain spe-
cific price cycles. Furthermore, the results obtained
are generalized to scenarios with randomized reaction
times and mixed strategies.
In Section 4, we analyze optimal response strate-
gies in the presence of active and passive competi-
tors. We study how the duopoly game of two active
competitors is affected by additional passive competi-
tors. We show how to compute optimal pricing strate-
gies and to evaluate expected profits. We also illus-
trate how the cyclic price paths of the active competi-
tors are affected by different price levels of passive
competitors. Finally, we evaluate the expected prof-
its when different strategies are played against each
other. Conclusions and managerial recommendations
are offered in final Section 5.
2 MODEL DESCRIPTION
We consider the situation where a firm wants to sell
goods (e.g., gasoline, groceries, technical devices) on
a digital market platform (e.g., Amazon, eBay). We
assume that several sellers compete for the same mar-
ket, i.e., customers are able to compare prices of dif-
ferent competitors.
We assume that the time horizon is infinite. We
assume that firms are able to reproduce or reorder
products (promise to deliver), and the ordering is de-
coupled from pricing decisions. If a sale takes place,
shipping costs c have to be paid, c ≥ 0. A sale of one
item at price a, a ≥ 0, leads to a net profit of a − c.
Discounting is also included in the model. For the
length of one period, we will use the discount factor
δ, 0 < δ < 1.
Since in many practical applications prices cannot
be continuously adjusted, we consider a discrete time
model. The sales intensity of our product is denoted
by λ. Due to customer choice, the sales intensity will
particularly depend on our offer price a and the com-
petitors’ prices. We also allow the sales intensity to
depend on time, e.g., the time of the day or the week.
We assume that the time dependence is periodic and
has an integer cycle length of J periods. In our model,
the sales intensity λ is a general function of time, our
offer price a and the competitors’ prices ~p. Given the
prices a and ~p in period t, the jump intensity λ satis-
fies, t = 0, 1,2,..., a ≥ 0, ~p ≥
~
0,
λ
t
(a,~p) = λ
t mod J
(a,~p). (1)
In our discrete time model, we assume the sales
probabilities (for one period) to be Poisson dis-
tributed. I.e., the probability to sell exactly i items
within one period of time is given by, t = 0,1,2, ...,
a ≥ 0, ~p ≥
~
0, i = 0, 1,2,...,
P
t
(i,a,~p) =
λ
t
(a,~p)
i
i!
· e
−λ
t
(a,~p)
. (2)
For each period t, a price a has to be chosen. We
call strategies (a
t
)
t
admissible if they belong to the
class of Markovian feedback policies; i.e., pricing de-
cisions a
t
≥ 0 may depend on time t and the current
ICORES 2017 - 6th International Conference on Operations Research and Enterprise Systems
48