53d0bde9497bcdfa.tex
1: \begin{abstract}
2: 		
3: 		Multivariate density estimation and graphical models play important roles 
4: 		in statistical learning. The estimated density can be used to construct a 
5: 		graphical model that reveals conditional relationships whereas a graphical 
6: 		structure can be used to build models for density estimation. Our goal is 
7: 		to construct a consolidated framework that can perform both density and 
8: 		graph estimation. 
9: 		Denote $\bm{Z}$ as the random vector of interest with density function 
10: 		$f(\bz)$. Splitting $\bm{Z}$ into two parts, $\bm{Z}=(\bm{X}^T,\bm{Y}^T)^T$ 
11: 		and writing $f(\bz)=f(\bx)f(\by|\bx)$ where $f(\bx)$ is the density function 
12: 		of $\bm{X}$ and $f(\by|\bx)$ is the conditional density of $\bm{Y}|\bm{X}=\bx$. 
13: 		We propose a semiparametric framework that models $f(\bx)$ nonparametrically 
14: 		using a smoothing spline ANOVA (SS ANOVA) model and $f(\by|\bx)$ parametrically 
15: 		using a conditional Gaussian graphical model (cGGM). Combining flexibility of 
16: 		the SS ANOVA model with succinctness of the cGGM, this framework allows us to 
17: 		deal with high-dimensional data without assuming a joint Gaussian distribution.  
18: 		We propose a backfitting estimation procedure for the cGGM with 
19: 		a computationally efficient approach for selection of tuning parameters. We also 
20: 		develop a geometric inference approach for edge selection. We establish 
21: 		asymptotic convergence properties for both the parameter and density estimation.
22: 		The performance of the proposed method is evaluated through 
23: 		extensive simulation studies and two real data applications.
24: 		
25: 		KEY WORDS: cross-validation, high dimensional data, penalized likelihood, 
26: 		reproducing kernel Hilbert space, smoothing spline ANOVA
27: 		
28: 	\end{abstract}
29: