\begin{abstract}

		Multivariate density estimation and graphical models play important roles
		in statistical learning. The estimated density can be used to construct a
		graphical model that reveals conditional relationships, whereas a graphical
		structure can be used to build models for density estimation. Our goal is
		to construct a consolidated framework that can perform both density and
		graph estimation.
		Let $\bm{Z}$ denote the random vector of interest with density function
		$f(\bz)$. We split $\bm{Z}$ into two parts, $\bm{Z}=(\bm{X}^T,\bm{Y}^T)^T$,
		and write $f(\bz)=f(\bx)f(\by|\bx)$, where $f(\bx)$ is the density function
		of $\bm{X}$ and $f(\by|\bx)$ is the conditional density of $\bm{Y}|\bm{X}=\bx$.
		We propose a semiparametric framework that models $f(\bx)$ nonparametrically
		using a smoothing spline ANOVA (SS ANOVA) model and $f(\by|\bx)$ parametrically
		using a conditional Gaussian graphical model (cGGM). Combining the flexibility of
		the SS ANOVA model with the succinctness of the cGGM, this framework allows us to
		deal with high-dimensional data without assuming a joint Gaussian distribution.
		We propose a backfitting estimation procedure for the cGGM with
		a computationally efficient approach for selecting the tuning parameters. We also
		develop a geometric inference approach for edge selection. We establish
		asymptotic convergence properties for both parameter and density estimation.
		The performance of the proposed method is evaluated through
		extensive simulation studies and two real data applications.

		KEY WORDS: cross-validation, high-dimensional data, penalized likelihood,
		reproducing kernel Hilbert space, smoothing spline ANOVA

	\end{abstract}