1: \begin{abstract}
2: The problem of finding large average submatrices of a real-valued matrix arises in the exploratory analysis of data from a variety of disciplines, ranging from genomics to social sciences.
3: %Large average submatrices often reveal substructures and sample-variable associations of interest in high dimensional problems.
4: In this paper we provide a detailed asymptotic analysis of large average submatrices of an $n \times n$ Gaussian random matrix.
5: % when both the underlying matrix and the submatrices of interest have equal numbers of rows and columns.
6: The first part of the paper addresses global maxima.
7: For fixed $k$, we identify the average and the joint distribution of the $k \times k$ submatrix having
8: the largest average value.
9: As a dual result, we establish that the size of the largest square submatrix with average
10: greater than a fixed positive constant is, with high probability, equal to one of two consecutive
11: integers that depend on the threshold and the matrix dimension $n$.
12: The second part of the paper addresses local maxima. Specifically, we consider
13: submatrices with dominant row and column sums that arise as the local optima
14: of iterative search procedures for large average submatrices. For fixed $k$, we identify the limiting average value and joint distribution of a $k \times k$ submatrix conditioned to be a local maximum. In order to understand the density of such local optima and explain the quick convergence of such iterative procedures, we analyze the number $L_n(k)$ of local maxima, beginning with exact asymptotic expressions for the mean and fluctuation behavior of $L_n(k)$.
15: For fixed $k$, the mean of $L_{n}(k)$ is $\Theta(n^{k}/(\log{n})^{(k-1)/2})$ while the standard deviation is $\Theta(n^{k^2/(k+1)}/(\log{n})^{k^2/(2(k+1))})$.
16: Our principal result is a Gaussian central limit theorem for $L_n(k)$ that is based on a new variant of Stein's method.
17: \end{abstract}
18: