1: \begin{abstract}
2: We develop an information-theoretic view of the stochastic block
3: model, a popular statistical model for the large-scale structure of
4: complex networks.
5: A graph $G$ from such a model is generated by first assigning vertex
6: labels at random from a finite alphabet, and then connecting vertices
7: with edge probabilities depending on the labels of the endpoints.
8: In the case of the symmetric two-group model, we
9: establish an explicit `single-letter' characterization of the
10: per-vertex mutual information between the vertex labels and the graph.
11:
12: The explicit expression of the mutual information is intimately
13: related to estimation-theoretic quantities, and --in particular--
14: reveals a phase transition at the critical point for community detection. Below
15: the critical point the per-vertex mutual information is asymptotically the same
16: as if edges were independent. Correspondingly, no algorithm can
17: estimate the partition better than random guessing.
18: Conversely, above the threshold, the per-vertex mutual information is
19: strictly smaller than the independent-edges upper bound.
20: In this regime there exists a procedure that estimates the vertex
21: labels better than random guessing.
22: \end{abstract}
23: