mum which in our case were γ
21
= 0.001, γ
22
= 0.001.
In addition, we have the design freedom in γ
2k
val-
ues selection to enhance system performance. From
Figs. 2(b) to 2(c), the states converge to values close
to zero.
7 CONCLUSIONS
In this paper, we analyzed an output feedback cumu-
lant differential game control problem using cost cu-
mulant optimization approach. We investigated a lin-
ear stochastic system with two players and derived
a 2-player near-optimal strategies for the tractable
auxiliary problem. The efficiency of our proposed
method has been demonstrated using a numerical ex-
ample where a neural network series method was ap-
plied to solve the HJB equations.
REFERENCES
Aberkane, S., Ponsart, J. C., Rodrigues, M., and Sauter, D.
(2008). Output feedback control of a class of stochas-
tic hybrid systems. Automatica, 44:1325–1332.
Basar, T. (1999). Nash Equilibria of Risk-Sensitive Non-
linear Stochastic Differential Games. Journal of Opti-
mization Theory and Applications, 100(3):479–498.
Bensoussan, A. and Schuppen, J. H. V. (1985). Opti-
mal Control of Partially Observable Stochastic Sys-
tems with an Exponential-of-Integral Performance In-
dex. SIAM Journal of Control and Optimization,
23(4):599–613.
Charilas, D. E. and Panagopoulos, A. D. (2010). A sur-
vey on game theory applications in wireless networks.
Computer Networks, 54(18):3421–3430.
Chen, T., Lewis, F. L., and Abu-Khalaf, M. (2007). A
Neural Network Solution for Fixed-Final Time Op-
timal Control of Nonlinear Systems. Automatica,
43(3):482–490.
Chen, Z. and Jagannathan, S. (2008). Generalized
Hamilton-Jacobi-Bellman Formulation-Based Neural
Network Control of Affine Nonlinear Discrete-Time
Systems. IEEE Transactions on Neural Networks,
19(1):90–106.
Cruz, J. B., Simaan, M. A., Gacic, A., and Liu, Y. (2002).
Moving Horizon Nash Strategies for a Military Air
Operation. IEEE Transactions on Aerospace and
Electronic Systems, 38(3):989–999.
Davis, M. (1977). Linear Estimation and Stochastic Con-
trol. Chapman and Hall, London, UK.
Finlayson, B. A. (1972). The Method of Weighted Residu-
als and Variational Principles. Academic Press, New
York, NY.
Fleming, W. H. and Rishel, R. W. (1975). Determinis-
tic and Stochastic Optimal Control. Springer-Verlag,
New York, NY.
Geromel, J. C., de Souza, C. C., and Skelton, R. E. (1998).
Static Output Feedback Controllers: Stability and
Convexity. IEEE Transactions on Automatic Control,
43(1):120–125.
Kailath, T. (1968). An Innovations Approach to Least
Square Estimation Part I: Linear Filtering in Additive
White Noise. IEEE Transactions on Automatic Con-
trol, 13(6):646–655.
Klompstra, M. B. (2000). Nash equilibria in risk-sensitive
dynamic games. IEEE Transactions on Automatic
Control, 45(7):1397–1401.
Mukaidani, H., Xu, H., and Dragon, V. (2010). Static Out-
put Feedback Strategy of Stochastic Nash Games for
Weakly-Coupled Large-Scale Systems. In Proc. of the
American Control Conference, pages 361–366, Balti-
more, MD.
Sain, M. K. (1966). Control of Linear Systems According
to the Minimal Variance Criterion—A New Approach
to the Disturbance Problem. IEEE Transactions on
Automatic Control, AC-11(1):118–122.
Sain, M. K. and Liberty, S. R. (1971). Performance Measure
Densities for a Class of LQG Control Systems. IEEE
Transactions on Automatic Control, AC-16(5):431–
439.
Sain, M. K., Won, C.-H., Spencer, Jr., B. F., and Liberty,
S. R. (2000). Cumulants and risk-sensitive control:
A cost mean and variance theory with application to
seismic protection of structures. In Filar, J., Gaitsgory,
V., and Mizukami, K., editors, Advances in Dynamic
Games and Applications, volume 5 of Annals of the
International Society of Dynamic Games, pages 427–
459. Birkhuser Boston.
Sandberg, I. W. (1998). Notes on Uniform Approx-
imation of Time-Varying Systems on Finite Time
Intervals. IEEE Transactions on Circuit and
Systems-1:Fundamental Theory and Applications,
AC-45(8):863–865.
Smith, P. J. (1995). A Recursive Formulation of the Old
Problem of Obtaining Moments from Cumulants and
Vice Versa. The American Statistician, (49):217–219.
Van De Water, H. and Willems, J. C. (1981). The Cer-
tainty Equivalence Property in Stochastic Control
Theory. IEEE Transactions on Automatic Control,
AC-26(5):1080–1087.
Won, C.-H., Diersing, R. W., and Kang, B. (2010). Sta-
tistical Control of Control-Affine Nonlinear Systems
with Nonquadratic Cost Function: HJB and Verifica-
tion Theorems. Automatica, 46(10):1636–1645.
Wonham, W. M. (1968). On the Seperation Theorem
of Stochastic Control. SIAM Journal of Control,
6(2):312–326.
Zheng, D. (1989). Some New Results on Optimal and Sub-
optimal Regulators of the LQ Problem with Output
Feedback. IEEE Transactions on Automatic Control,
34(5):557–560.
Zhu, Q., Han, Z., and Basar, T. (2012). A differential
game approach to distributed demand side manage-
ment in smart grid. In IEEE International Conference
on Communications (ICC), pages 3345–3350.
Two-playerAdhocOutput-feedbackCumulantGameControl
59