f7869ce64a18334c.tex
1: \begin{abstract}
2: Covariance estimation for high-dimensional datasets is a fundamental
3: problem in modern day statistics with numerous applications. In these
4: high dimensional datasets, the number of variables $p$ is typically
5: larger than the sample size $n$. A popular
6: way of tackling this challenge is to induce sparsity
7: in the covariance matrix, its inverse or a relevant transformation.
8: In particular, methods inducing sparsity in the Cholesky parameter of
9: the inverse covariance matrix can be useful as they are guaranteed to
10: give a positive definite estimate of the covariance matrix. Also, the estimated sparsity
11: pattern corresponds to a Directed Acyclic Graph (DAG) model for
12: Gaussian data. In recent years, two useful penalized likelihood methods for
13: sparse estimation of this Cholesky parameter (with no restrictions on
14: the sparsity pattern) have been developed. However, these methods
15: either consider a non-convex optimization problem which can lead to
16: convergence issues and singular estimates of the covariance matrix 
17: when $p > n$, or achieve a convex formulation by
18: placing a strict constraint on the conditional variance parameters. In
19: this paper, we propose a new penalized likelihood method for sparse
20: estimation of the inverse covariance Cholesky parameter that aims to
21: overcome some of the shortcomings of current methods, but retains
22: their respective strengths. We obtain a jointly convex formulation for
23: our objective function, which leads to convergence guarantees, even
24: when $p > n$. The approach always leads to a positive definite and
25: symmetric estimator of the covariance matrix. We establish
26: high-dimensional estimation and graph selection consistency, and
27: also demonstrate finite sample performance on simulated/real data. 
28: %These experiments demonstrate that our approach is competitive with 
29: %previous approaches for graph selection and can lead to significant 
30: %improvements in estimation, especially when $n < p$. 
31: \end{abstract}
32: