
\begin{abstract}

% Endogeneity in explanatory variables usually arises from omitted variables, measurement error, and simultaneity \citep{wooldridge2010econometric}. However, traditional prediction-based machine learning methods fail to deal with endogeneity and could lead to false causal discovery.

Structural equation models (SEMs) are widely
used across the sciences, from economics to psychology,
to uncover the causal relationships underlying a complex system
and to estimate structural parameters of interest.
We study estimation in a class of generalized SEMs where the object
of interest is defined as the solution to a linear operator equation.
We formulate the linear operator equation as a min-max game, where both
players are parameterized by neural networks (NNs), and learn the
parameters of these neural networks using stochastic gradient descent.
We consider both two-layer and multi-layer NNs with ReLU activation
functions and prove global convergence in an overparameterized regime, in which
the number of neurons diverges. The results are established using
techniques from online learning and local linearization of NNs,
and improve on the current state of the art in several respects. For the first
time, we provide a tractable estimation procedure for SEMs
based on NNs with provable convergence and without the need for sample
splitting.

\end{abstract}
