\begin{abstract}
%\boldmath
The degree distribution is one of the most fundamental properties of
real-world graphs. It has been widely observed in numerous domains that graphs
typically have a \emph{tailed} or \emph{scale-free} degree distribution: while the average
degree is usually small, the variance is high and there are vertices
with degrees at all scales. We focus on the problem of approximating the degree distribution
of a large streaming graph using small storage. We design an algorithm, \degdist{}, whose main novelty
is a new estimator of infrequent degrees based on truncated geometric random
variables. We give a mathematical analysis of \degdist{}
and show that it has excellent behavior in practice. We can process streams with millions
of edges with storage less than $1\%$ and get extremely accurate approximations
for \emph{all} scales in the degree distribution.

We also introduce a new notion of \emph{Relative Hausdorff} distance between tailed
histograms. Existing notions of distance between distributions are not suitable, since they ignore
infrequent degrees in the tail. The Relative Hausdorff distance measures deviations at all scales
and is therefore better suited to comparing degree distributions. By tracking this new measure,
we are able to give strong empirical evidence of the convergence of \degdist{}.
\end{abstract}