abstract:f7869ce64a18334c.tex

1: \begin{abstract}

2: Covariance estimation for high-dimensional datasets is a fundamental

3: problem in modern day statistics with numerous applications. In these

4: high dimensional datasets, the number of variables $p$ is typically

5: larger than the sample size $n$. A popular

6: way of tackling this challenge is to induce sparsity

7: in the covariance matrix, its inverse or a relevant transformation.

8: In particular, methods inducing sparsity in the Cholesky parameter of

9: the inverse covariance matrix can be useful as they are guaranteed to

10: give a positive definite estimate of the covariance matrix. Also, the estimated sparsity

11: pattern corresponds to a Directed Acyclic Graph (DAG) model for

12: Gaussian data. In recent years, two useful penalized likelihood methods for

13: sparse estimation of this Cholesky parameter (with no restrictions on

14: the sparsity pattern) have been developed. However, these methods

15: either consider a non-convex optimization problem which can lead to

16: convergence issues and singular estimates of the covariance matrix

17: when $p > n$, or achieve a convex formulation by

18: placing a strict constraint on the conditional variance parameters. In

19: this paper, we propose a new penalized likelihood method for sparse

20: estimation of the inverse covariance Cholesky parameter that aims to

21: overcome some of the shortcomings of current methods, but retains

22: their respective strengths. We obtain a jointly convex formulation for

23: our objective function, which leads to convergence guarantees, even

24: when $p > n$. The approach always leads to a positive definite and

25: symmetric estimator of the covariance matrix. We establish

26: high-dimensional estimation and graph selection consistency, and

27: also demonstrate finite sample performance on simulated/real data.

28: %These experiments demonstrate that our approach is competitive with

29: %previous approaches for graph selection and can lead to significant

30: %improvements in estimation, especially when $n < p$.

31: \end{abstract}

32: