1: \begin{abstract}
2:
3: Multivariate density estimation and graphical models play important roles
4: in statistical learning. The estimated density can be used to construct a
5: graphical model that reveals conditional relationships whereas a graphical
6: structure can be used to build models for density estimation. Our goal is
7: to construct a consolidated framework that can perform both density and
8: graph estimation.
9: Denote $\bm{Z}$ as the random vector of interest with density function
10: $f(\bz)$. Splitting $\bm{Z}$ into two parts, $\bm{Z}=(\bm{X}^T,\bm{Y}^T)^T$
11: and writing $f(\bz)=f(\bx)f(\by|\bx)$ where $f(\bx)$ is the density function
12: of $\bm{X}$ and $f(\by|\bx)$ is the conditional density of $\bm{Y}|\bm{X}=\bx$.
13: We propose a semiparametric framework that models $f(\bx)$ nonparametrically
14: using a smoothing spline ANOVA (SS ANOVA) model and $f(\by|\bx)$ parametrically
15: using a conditional Gaussian graphical model (cGGM). Combining flexibility of
16: the SS ANOVA model with succinctness of the cGGM, this framework allows us to
17: deal with high-dimensional data without assuming a joint Gaussian distribution.
18: We propose a backfitting estimation procedure for the cGGM with
19: a computationally efficient approach for selection of tuning parameters. We also
20: develop a geometric inference approach for edge selection. We establish
21: asymptotic convergence properties for both the parameter and density estimation.
22: The performance of the proposed method is evaluated through
23: extensive simulation studies and two real data applications.
24:
25: KEY WORDS: cross-validation, high dimensional data, penalized likelihood,
26: reproducing kernel Hilbert space, smoothing spline ANOVA
27:
28: \end{abstract}
29: