1: \begin{abstract}
2: Generalising the classical EM algorithm, which is widely used for computing maximum likelihood estimates, we propose an EM-Control (EM-C) algorithm for solving multi-period, finite-time-horizon stochastic control problems. The new algorithm sequentially updates the control policy in each time period using Monte Carlo simulation in a forward-backward manner: in each iteration, it goes forward in simulation and backward in optimization.
3: Like the EM algorithm, the EM-C algorithm improves performance monotonically in each iteration, which leads to good convergence properties.
4: We demonstrate the effectiveness of the algorithm by solving stochastic control problems in the monopoly pricing of perishable assets and in the study of real business cycles.
5: %The new algorithm extends the existing literature as follows: %(2) If the optimal policy can be  spanned by the basis functions, then the EM-C algorithm will converge in one iteration;
6: %(2) The EM-C algorithm applies to general stochastic control problems that may not satisfy the dynamic programming principle.
7: %(3) We do not assume any particular dynamics of the stochastic processes such as diffusion or jump diffusions.
8: 
9: \emph{Keywords}: EM algorithm, stochastic control, recursive model, dynamic programming, monopoly pricing, real business cycle, numerical methods, stochastic approximation
10: 
11: \emph{JEL classification}: C44, C61, C63, D4, E3
12: 
13: % C44	Operations Research, Statistical Decision Theory
14: % C61	Optimization Techniques,  Programming Models, Dynamic Analysis
15: % C63	Computational Techniques, Simulation Modeling
16: % D4	Market Structure, Pricing, and Design, D42	Monopoly
17: % E3	Prices, Business Fluctuations, and Cycles, E32	Business Fluctuations  Cycles
18: 
19: % MSC Class: 93E20, 93E35, 91G60, 91G80, 90B05, 90C15, 90C35, 90C39, 90C40
20: 
21: \end{abstract}
22: