1: %\documentclass[12pt,preprint]{aastex}
2: \documentclass[]{aastex}
3: \usepackage{emulateapj5}
4: \usepackage{epsfig}
5: \citestyle{aa}
6:
7: \newcommand{\kmsmpc}{{\rm\ km\ s^{-1}\ Mpc^{-1}}}
8: \newcommand{\LCDM}{$\Lambda$CDM}
9: \newcommand{\WMAP}{{\it WMAP}}
10: \newcommand{\SDSS}{{\it SDSS}}
11: \newcommand{\msun}{{M_{\odot}}}
12: \newcommand{\hmsun}{{{\rm h}^{-1}\msun}}
13: \newcommand{\hmpc}{{{\rm h}^{-1}\, {\rm Mpc}}}
14: \newcommand{\dd}{{\rm d}}
15:
16: \newenvironment{inlinefigure}{
17: \def\@captype{figure}
18: \noindent\begin{minipage}{0.999\linewidth}\begin{center}}
19: {\end{center}\end{minipage}\smallskip}
20:
21: %----------------------------------------------------
22: \begin{document}
23: \slugcomment{{\em submitted to Astrophysical Journal Letters}}
24:
25: \shorttitle{Cosmic Variance in GOODS}
26:
27: \shortauthors{Somerville et al.}
28:
29: \title{Cosmic Variance in the Great Observatories Origins Deep Survey}
30: %\title{A Cosmic Variance Cheatsheet for the GOODS Survey}
31:
32: %-------------------------------------------------------------
33: \author{Rachel S. Somerville\altaffilmark{1}, Kyoungsoo Lee\altaffilmark{2},
34: Henry C. Ferguson\altaffilmark{1},
35: Jonathan P. Gardner\altaffilmark{3}, Leonidas
36: A. Moustakas\altaffilmark{1}, Mauro Giavalisco\altaffilmark{1}}
37:
38: \altaffiltext{1}{Space Telescope Science Institute, Baltimore MD 21218;
39: somerville,ferguson,leonidas,mauro@stsci.edu}
40:
41: \altaffiltext{2}{Department of Physics and Astronomy, Johns Hopkins
42: University, Baltimore, MD 21218; soolee@stsci.edu}
43:
44: \altaffiltext{3}{Laboratory for Astronomy and Solar Physics, Code 681, Goddard Space Flight Center, Greenbelt MD 20771; jonathan.p.gardner@nasa.gov
45: }
46:
47: \begin{abstract}
48: Cosmic variance is the uncertainty in observational estimates of the
49: volume density of extragalactic objects such as galaxies or quasars
50: arising from the underlying large-scale density fluctuations. This is
51: often a significant source of uncertainty, especially in deep galaxy
52: surveys, which tend to cover relatively small areas. We present
53: estimates of the relative cosmic variance for one-point statistics
54: (i.e. number densities) for typical scales and volumes sampled by the
55: Great Observatories Origins Deep Survey (GOODS). We use two
56: approaches: for objects with a known two-point correlation function
57: that is well-approximated by a power law, one can use the standard
58: analytic formalism to calculate the cosmic variance (in excess of shot
59: noise). We use this approach to estimate the cosmic variance for
60: several populations that are being studied in the GOODS program:
61: Extremely Red Objects (ERO) at $z\sim1$, and Lyman Break Galaxies
62: (LBG) at $z\sim3$ and $z\sim4$, using clustering information for
63: similar populations in the literature. For populations with unknown
64: clustering, one can use predictions from Cold Dark Matter theory to
65: obtain a rough estimate of the variance as a function of number
66: density. We present a convenient plot which allows one to use this
67: approach to read off the cosmic variance for a population with a known
68: mean redshift and estimated number density. We conclude that for the
69: volumes sampled by GOODS, cosmic variance is a significant source of
70: uncertainty for strongly clustered objects ($\sim 40$-60\% for EROs)
71: and less serious for less clustered objects, $\sim 10$-20\% for LBGs.
72:
73: \end{abstract}
74:
75: \subjectheadings{galaxies: statistics --- large scale structure of universe}
76:
77: %=======================
78: % 1
79: \section{Introduction}
80: \label{sec:intro}
81: %=======================
82:
83: The number density of observed extragalactic populations in the
84: Universe is a fundamental property which may hold clues to the nature
85: of the objects. However, observational estimates of the number density
86: of any clustered population are plagued by uncertainty due to
87: \emph{cosmic variance}, the field-to-field variation (in excess of
88: Poisson shot noise) due to large scale structure. Clearly, if one can
89: sample a volume that is very large compared with the intrinsic
90: clustering scale of the objects in question, cosmic variance will be
91: insignificant. In practice, especially in high-redshift studies, the
92: volumes sampled are small enough that cosmic variance is often a
93: significant source of uncertainty. Perhaps the majority of published
94: cosmological number densities and related quantities (e.g. luminosity
95: functions, integrated luminosity densities, etc.) do not properly
96: account for cosmic variance in their quoted error budgets.
97:
98: Cosmic variance has frequently been invoked as a motivation for
99: carrying out deep pencil-beam surveys along multiple sightlines.
100: While the term ``cosmic variance'' is generally understood, the effects
101: are of course dependent on the clustering properties of the sources of
102: interest, a fact that is often lost in discussions of deep survey
103: strategy. With the availability of the GOODS data, it seems
104: appropriate to cast this variance in practical terms, calculating
105: explicitly the expected uncertainties due to clustering for various
106: source populations under study. A simple exposition of this cosmic
107: variance may be useful both to researchers using the GOODS data and to
108: those planning future studies.
109:
110: The mean $\langle N\rangle$ and variance $\langle N^2\rangle$ are the
111: first and second moments of the probability distribution function
112: $P_N(V)$, which represents the probability of counting $N$ objects
113: within a volume $V$. We define the \emph{relative} cosmic variance:
114: \begin{equation}
115: \sigma^2_{v} \equiv \frac{\langle N^2\rangle -\langle
116: N\rangle^2}{\langle N \rangle^2} - \frac{1}{\langle N\rangle} \, .
117: \end{equation}
118: Note that the last term is the usual correction for Poisson shot
119: noise, which for the samples considered here will typically be
120: negligible. In any case, it is relatively straightforward to perform
121: this correction, so we do not discuss this term further. In the
122: general hierarchical scenario of structure formation, in which density
123: perturbations grow via gravitational instability, $P_N$ is expected to
124: have non-zero higher moments (e.g. skewness and kurtosis). For a
125: detailed and general treatment of the cosmic error, see \citet{csjc},
126: \citet{szapudi:99,scjc}, and references therein. Here, we concentrate
127: solely on one-point statistics (i.e. counts in cells) and do not
128: address the cosmic error with respect to two-point or higher order
129: statistics such as correlation functions. We defer treatment of these
130: issues to future works.
131:
132: For a population with a known two-point correlation function $\xi(r)$,
133: it is straightforward to calculate the cosmic variance as a function
134: of cell radius $R$ or equivalently, cell volume $V$ \citep[see
135: e.g.][or Section~\protect\ref{sec:pl} below]{peebles:80}. There are,
136: however, several potential practical difficulties with this simple
137: approach. While the correlation function of galaxies is typically
138: well-approximated by a power law in the strongly non-linear regime ($r
139: \la 10$-15 Mpc), on larger scales, in the linear regime, the
140: correlation function is expected to deviate from the power-law slope
141: measured on smaller scales. Also, estimating the correlation function
142: (especially its slope) of an observed population is more difficult
143: than estimating the number density, so often the latter quantity is
144: known while the former is not. In this situation, we can use the
145: theory of clustering and bias in the Cold Dark Matter (CDM) paradigm
146: to estimate the cosmic variance for a population with a known mean
147: redshift and average comoving number density.
148:
149: In this \emph{Letter}, we estimate the uncertainty due to cosmic
150: variance for several populations that have been identified in the
151: GOODS survey, and present general results based on CDM theory that can
152: be used to estimate the cosmic variance for populations at $z<6$.
153: Throughout, we assume cosmological parameters consistent with the
154: recent analysis of \WMAP\ data \citep{spergel:03}: matter density
155: $\Omega_m = 0.3$, baryon density $\Omega_b =0.044$, cosmological
156: constant $\Omega_{\Lambda}=0.70$, Hubble parameter $H_0=70$ km/s/Mpc,
157: fluctuation amplitude $\sigma_8 = 0.9$, and a scale-free primordial
158: power spectrum $n_s=1$.
159:
160: \section{The geometry of GOODS}
161:
162: \begin{figure*}
163: \epsscale{2.0}
164: %\clearpage
165: %\begin{figure}
166: \plotone{f1.eps}
167: \caption{\small [left] The comoving volume per unit redshift spanned
168: by (from bottom to top), the original HDF, the UDF, the GOODS field
169: and the GEMS field. [right] The transverse comoving size of the HDF,
170: UDF, GOODS, and GEMS fields.
171: \label{fig:scale}}
172: \end{figure*}
173: %\end{figure}
174: %\clearpage
175: The Great Observatories Origins Deep Survey (GOODS) covers two fields,
176: the Chandra Deep Field South (CDFS) and the Hubble Deep Field North
177: (HDFN). The CDFS field has dimensions 10'$\times$16', and the GOODS
178: HDFN field has similar dimensions. For more details and a general
179: overview of the GOODS program, see \citet{giavalisco:03}. Here, we
180: treat the case of a single CDFS-sized field. For widely separated
181: fields, the cosmic variance goes as $1/N_{\rm field}$, so the variance
182: will decrease by a factor of two when the second field is included.
183: In Fig.~\ref{fig:scale}, we show the comoving volume per unit redshift
184: for several recent, ongoing, and planned deep HST surveys: the
185: original Hubble Deep Field North \citep{williams}, the GOODS CDFS
186: field, the GEMS (Galaxy Evolution from Morphology and SEDs; Rix et
187: al. in prep)
188: field\footnote{http://www.mpia.de/homes/barden/gems/gems.htm}, and the
189: planned Ultra Deep Field (UDF)
190: \footnote{http://www.stsci.edu/science/udf/}. For redshifts $z\ga1$
191: and $\Delta z \sim 0.5$--1, GOODS samples a volume of a few $\times
192: 10^5\, {\rm Mpc^3}$. Fig.~\ref{fig:scale} also shows the average
193: transverse size ($L = \sqrt{10'\times 16'} = 12.7'$) of the GOODS
194: field as a function of redshift, again compared with the original HDF
195: ($L = \sqrt{5.7 \rm arcmin^2} = 2.4'$).
196:
197: \section{The Power-Law Model}
198: \label{sec:pl}
199:
200: The relative cosmic variance for a population with known two-point
201: correlation function $\xi(r)$ is given by:
202: \begin{equation}
203: \sigma^2_v = \frac{1}{V^2} \int^R_0 \xi(|{\bf r_1}- {\bf r_2}|)\, dV_1 dV_2
204: \end{equation}
205: \citep[see, e.g.][p. 234]{peebles:80}. If the correlation function can
206: be represented by a power-law $\xi(r) = (r_0/r)^\gamma$, then this
207: expression can be evaluated in closed form:
208: \begin{equation}
209: \sigma^2_v = J_2\, (r_0/r)^\gamma
210: \label{eqn:cv_pl}
211: \end{equation}
212: where $J_2 = 72.0/[(3-\gamma)(4-\gamma)(6-\gamma)2^\gamma]$
213: \citep[][p. 230]{peebles:80}. Assuming spherical cells, the variance
214: may be equivalently expressed in terms of the cell radius $R$ or the
215: cell volume $V\equiv 4 \pi R^3/3$.
216:
217: \begin{table*}
218: \begin{center}
219: \caption{Summary of parameters for representative populations
220: \label{tab:param}}
221: \begin{tabular}{lcccccc}
222: \tableline
223: object & $\bar{z}$ & mag. limit &
224: $n$ (h$^3$ Mpc$^{-3}$) & $r_0$ (h$^{-1}$ Mpc) & $\gamma$ & Ref. \\
225: \tableline
226: ERO & 1.2 & $K_s<19.2$ & $\sim 10^{-3}$ & $12 \pm 3$ & [1.8] & D2001 \\
227: %\protect\citet{daddi:01} \\
228: ERO & 1.2 & $18 < H < 20.5$ & $(1.0 \pm 0.1) \times 10^{-3}$ & $9.5\pm5$ & [1.8] & M2001\\
229: %\protect\citet{mccarthy} \\
230: U-drop & 3 & $\mathcal{R}<25.5$ & $4.7 \times 10^{-3}$ & $3.96 \pm 0.29$ & $1.55 \pm 0.15$ & A2003 \\
231: %\protect\citet{adel:03}\\
232: B-drop & 4 & $i'<26$ & $1.78 \times 10^{-3}$ & $2.7^{+0.5}_{-0.6}$ & [1.8] & O2001\\
233: %\protect\citet{ouchi:01}\\
234: \tableline
235: \end{tabular}
236: \tablecomments{The mean redshift, magnitude limit, number density,
237: correlation length, and correlation function slope for the populations
238: shown in Fig.~\protect\ref{fig:var_pl}. References are as follows:
239: D2001 -- \protect\citet{daddi:01}; M2001 -- \protect\citet{mccarthy};
240: A2003 -- \protect\citet{adel:03}; O2001 -- \protect\citet{ouchi:01}.
241: Where the correlation function slope is in brackets, this indicates
242: that the value was assumed in, rather than derived from, the
243: analysis.}
244: \end{center}
245: \end{table*}
246:
247: For objects with a known correlation function that is well represented
248: by a power-law, we can simply use Eqn.~\ref{eqn:cv_pl} to compute the
249: cosmic variance for a given effective volume, as illustrated in
250: Fig.~\ref{fig:var_pl}. We show $\sigma_v$ as a function of volume, for
251: three populations with correlation function estimates from the
252: literature: Extremely Red Objects (EROs) at mean redshift $\bar{z}\sim
253: 1.2$, U-band dropouts at $\bar{z}\sim 3$ (also known as Lyman break
254: galaxies (LBG)), and B-band dropouts at $\bar{z}\sim 4$. The
255: magnitude limit and color selection used for each of these populations
256: selects objects in a given redshift range, resulting in an effective
257: volume $V_{\rm eff}$. Characteristic number densities for each of
258: these populations, along with correlation function parameters and the
259: relevant references, are summarized in Table~\ref{tab:param}. For
260: example, for EROs in the GOODS field, $\sigma_v \sim 0.4-0.6$, while
261: for the less clustered LBGs, $\sigma_v \sim 0.15-0.2$. Note that we
262: have assumed here a spherical geometry for the cells, while in fact
263: for the GOODS survey the cells are very elongated, with the redshift
264: dimension being much longer (about a factor of ten) than the
265: transverse dimension in comoving distance units. We have also ignored
266: the evolution in clustering that occurs over the time interval between
267: the `back' and the `front' of the cell. It should be noted that, for
268: two fields with the same volume, the cosmic variance is \emph{smaller}
269: for an elongated (parallelepiped or cylindrical) field than for a
270: compact (cubical or spherical) field \citep[see
271: e.g.][]{newman:02}. This is because an elongated field samples more
272: independent (uncorrelated) regions. Therefore, the estimates given
273: here provide an upper bound on the cosmic variance.
274:
275: %\clearpage
276: \begin{inlinefigure}
277: \begin{center}
278: \resizebox{\textwidth}{!}{\includegraphics{f2.eps}}
279: \end{center}
280: \figcaption{\small The square root of the cosmic variance, $\sigma_v$,
281: is plotted as a function of cell volume $V$. The middle set of lines
282: are for objects which are as clustered as the dark matter at
283: $z=0$. The slightly curved line shows $\sigma_{\rm DM}$ from linear
284: theory, while the straight line is a power law model with $r_0=5
285: \hmpc$ and $\gamma=1.8$. The topmost solid and dashed lines are for
286: objects as clustered as EROs at $z=1.2$ (with $r_0=9.0 \hmpc$ and
287: $r_0=12 \hmpc$, respectively). The second to the bottom line is for
288: objects that cluster like U-dropouts (LBGs) at $z=3$, and the
289: bottom-most line is for objects as clustered as B-dropouts at
290: $z\sim4$. The arrows show representative effective volumes for the
291: original HDF and GOODS, for EROs (top set) and LBGs (bottom set).
292: \label{fig:var_pl}}
293: \end{inlinefigure}
294:
295: %\clearpage
296:
297: \section{CDM models}
298: %\clearpage
299: \begin{figure*}
300: %\begin{figure}
301: %\epsscale{2.0}
302: \plottwo{f3a.eps}{f3b.eps}
303: \caption{\small [left] Bias as a function of comoving number density,
304: for dark matter halos at $z=6$, 5, 4, 3, 2, 1, and $z=0$ (from top to
305: bottom). [right] Variance of dark matter from linear theory, for the
306: same redshifts ($z=0$--6 from top to bottom).
307: \label{fig:bias}}
308: \end{figure*}
309: %\clearpage
310: We now consider the situation in which we know only the number density
311: but not the correlation function of a population. In this situation,
312: we can use predictions from CDM to estimate the clustering strength of
313: a population with a given number density at a known average
314: redshift. In CDM models, both the number density and clustering
315: strength of dark matter halos are a strong function of halo
316: mass. Fig.~\ref{fig:bias} shows the average bias as a function of
317: number density, for dark matter halos at various redshifts (as
318: described in the figure caption), computed using the analytic model of
319: \citet{st:99}. The bias is defined as the ratio of the root variance
320: of the halos and the dark matter, $b \equiv
321: \sigma_{h}/\sigma_{DM}$. It is likely that this relationship is more
322: complicated for galaxies, since there is probably not a one-to-one
323: correspondence between galaxies and dark matter halos. Similar
324: relations for more general `occupation functions' (i.e., allowing
325: varying numbers of galaxies per halo) are given in
326: e.g. \citet{ms:02}. Fig.~\ref{fig:bias} also shows the variance of
327: dark matter $\sigma_{DM}$ as a function of cell volume for the same
328: redshifts, as predicted by linear theory ($\sigma(R, z) = \sigma(R, 0)
329: \, D_{\rm lin}$, where $D_{\rm lin}$ is the linear growth
330: function). We now have all the ingredients necessary to obtain a rough
331: estimate of the cosmic variance for any population associated with
332: dark matter halos with a known number density and mean redshift:
333:
334: \begin{enumerate}
335: \item Read off the average bias $b$ for objects of a given number
336: density and mean redshift from the left panel of Fig.~\ref{fig:bias}.
337: \item Obtain the value of $\sigma_{DM}$ at the relevant scale $V$ and
338: redshift from the right panel of Fig.~\ref{fig:bias}.
339: \item The cosmic variance for the population is then given by $\sigma_v =
340: b\,\sigma_{DM}$.
341: \end{enumerate}
342:
343: As a consistency check, we can use the values given in
344: Table~\ref{tab:param} to estimate the cosmic variance for the same
345: populations discussed in the previous section. For EROs, using the
346: number density $n=1.0\times10^{-3} {\rm h^3} {\rm Mpc^{-3}}$, we would
347: estimate a bias of $b\sim1.8$, resulting in $\sigma_v \sim 0.7$ at
348: $z=1$, in reasonable agreement with our earlier estimate of $\sigma_v
349: \sim 0.6$ at $z=1.2$. Similarly, for LBGs at $z=3$, we find
350: $b\sim2.5$, resulting in $\sigma_v \sim 0.25$, again in agreement with
351: the earlier estimate of $\sigma_v \sim 0.2$. One reason that these
352: estimates are not in precise agreement with the values obtained from
353: the calculation based on the actual correlation length is due to the
354: unknown halo occupation distribution (i.e., the number of galaxies per
355: halo as a function of halo mass). It has been shown previously that
356: one cannot simultaneously exactly reproduce both the number density
357: and observed correlation length of either of these populations under
358: the simple assumption of one galaxy per halo \citep{ms:02}, adopted
359: here.
360:
361: \section{Conclusions}
362:
363: Cosmic variance can be a significant source of uncertainty in
364: estimates of the number density or related quantities in deep
365: surveys. We have given empirical estimates of the uncertainty due to
366: cosmic variance for several populations that have been identified in
367: the GOODS survey: EROs at $z\sim1$, U-dropouts at $z\sim3$ and
368: B-dropouts at $z\sim4$. These empirical estimates were based on
369: correlation function measurements from the literature for similarly
370: defined populations, and may be refined once correlation function
371: estimates have been obtained for the actual populations identified in
372: GOODS. From this calculation, we saw that the cosmic variance is much
373: reduced in GOODS compared with the original HDF (40--60\% rather than
374: a factor of 2 for for very strongly clustered populations such as
375: EROs, 15--20\% rather than 40\% for less clustered populations such as
376: LBGs). We have also presented predictions from the theory of
377: clustering and bias in a \LCDM\ Universe, which allow one to estimate
378: the cosmic variance for a population of a known average redshift and
379: number density but unknown clustering strength. We emphasize that this
380: approach is intended to give only a simple first order estimate of the
381: cosmic variance. More detailed estimates, tailored to individual
382: populations and including treatments of e.g. a generalized halo
383: occupation distribution formalism, geometric effects, the
384: observational selection function, and clustering evolution and the
385: change in absolute magnitude limit over the redshift range of the
386: sample, will be addressed in future works.
387:
388: %===================================
389: \section*{Acknowledgments}
390: \begin{small}
391:
392: We thank Emanuele Daddi and Mike Fall for useful discussions and
393: comments. Support for this work was provided by NASA through grant
394: GO09583.01-96A from the Space Telescope Science Institute, which is
395: operated by the Association of Universities for Research in Astronomy,
396: under NASA contract NAS5-26555.
397:
398: \end{small}
399: %=====================================
400:
401: \bibliographystyle{apj}
402: \bibliography{apj-jour,cosvar}
403:
404: %\clearpage
405:
406: \end{document}
407:
408: