21b1e42c006a17e3.tex
1: \begin{abstract}
2: We propose a new algorithm to estimate the structural parameters in
3: dynamic discrete choice models. The algorithm is based on the conditional
4: choice probability approach, but uses the idea of Temporal-Difference
5: learning from the Reinforcement Learning literature to estimate the
6: different terms in the value functions. In estimating these terms
7: with functional approximations using basis functions, our approach
8: has the advantage of naturally allowing for continuous state spaces.
9: Furthermore, it does not require specification of transition probabilities,
10: and even estimation of choice probabilities can be avoided using a
11: recursive procedure. Computationally, our algorithm only requires
12: solving a low dimensional linear equation. We find that it is substantially
13: faster than existing approaches when the finite dependence property
14: does not hold, and comparable in speed to approaches that exploit
15: this property. For the estimation of dynamic games, our procedure
16: does not require integrating over the actions of other players, which
17: further heightens the computational advantage. We show that our estimator
18: is consistent, and efficient under discrete state spaces. In settings
19: with continuous states, we propose easy to implement locally robust
20: corrections in order to achieve parametric rates of convergence. Preliminary
21: Monte Carlo simulations confirm the workings of our algorithm.
22: \end{abstract}
23: