0707.3445/ms.tex
1: 
2: \documentclass[apj]{emulateapj}
3: \usepackage{apjfonts}
4: \usepackage{amsbsy}
5: 
6: 
7: %\documentclass[apj]{../new_apjstyle/emulateapj}
8: %\usepackage{../new_apjstyle/apjfonts}
9: %\usepackage{../new_apjstyle/amsbsy}
10: 
11: %\documentstyle[11pt, preprint]{../../aastex52/aastex}
12: %\documentstyle[11pt]{../../aastex52/aastex}
13: %\documentclass[12pt, preprint]{aastex}
14: 
15: \input{newcommands.tex}
16: \bibliographystyle{apj}
17: 
18: \def\p0{P_0(r)}
19: \def\pu{P_U(r)}
20: \def\mrlogh{M_r-5\log h}
21: \def\mbjlogh{M_{b_J}-5\log h}
22: \def\nn{{\mathcal N}}
23: \def\msat{M_{\rm sat}}
24: \def\dev{\Delta P/\sigma_{\rm SDSS}}
25: \def\zlim{z_{\rm lim}}
26: \def\xibar{\bar{\xi}}
27: \def\nbar{\bar{N}}
28: \def\ngbar{\bar{n}_g(r)}
29: \def\fred{f_{\rm red}}
30: \def\mnl{M_{\rm nl}}
31: 
32: \def\xvpf{\chi^2_{\rm VPF}}
33: \def\xupf{\chi^2_{\rm UPF}}
34: \def\xwp{\chi^2_{w_p}}
35: 
36: \begin{document}
37: 
38: \title{Void Statistics in Large Galaxy Redshift Surveys: Does Halo
39:   Occupation of Field Galaxies Depend on Environment?}
40: 
41: \author{
42: Jeremy L. Tinker\altaffilmark{1,2},
43: Charlie Conroy\altaffilmark{1,3},
44: Peder Norberg\altaffilmark{4}, \\
45: Santiago G. Patiri\altaffilmark{5},
46: David H. Weinberg\altaffilmark{6},
47: \& Michael S. Warren\altaffilmark{7}
48: }
49: \altaffiltext{1}{Kavli Institute for Cosmological Physics, University of Chicago}
50: \altaffiltext{2}{Department of Astronomy \& Astrophysics, University of Chicago}
51: %\altaffiltext{2}{Department of Astrophysical Sciences, Peyton Hall,
52: %Princeton University, Princeton, NJ 08544, USA}
53: \altaffiltext{3}{Department of Astrophysical Sciences,
54: Princeton University}
55: \altaffiltext{4}{Institute for Astronomy, University of Edinburgh}
56: \altaffiltext{5}{Instituto de Astrof{\'\i}sica de Canarias}
57: \altaffiltext{6}{Department of Astronomy, Ohio State University}
58: \altaffiltext{7}{Theoretical Astrophysics, Los Alamos National Labs}
59: 
60: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
61: % ABSTRACT ABSTRACT ABSTRACT ABSTRACT ABSTRACT ABSTRACT ABSTRACT 
62: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
63: 
64: \begin{abstract}
65: 
66: We use measurements of the projected galaxy correlation function
67: \wp\ and galaxy void statistics to test whether the galaxy content of
68: halos of fixed mass is systematically different in low density
69: environments. We present new measurements of the void probability
70: function (VPF) and underdensity probability function (UPF) from Data
71: Release Four of the Sloan Digital Sky Survey (SDSS), as well as new
72: measurements of the VPF from the full data release of the Two-Degree
73: Field Galaxy Redshift Survey. We compare these measurements to
74: predictions calculated from models of the Halo Occupation Distribution
75: (HOD) that are constrained to match both the projected correlation
76: function \wp\ and the space density of galaxies $\ngavg$. The standard
77: implementation of the HOD assumes that galaxy occupation depends on
78: halo mass only, and is independent of local environment. For
79: luminosity-defined samples, we find that the standard HOD prediction
80: is a good match to the observations, and the data exclude models in
81: which galaxy formation efficiency is reduced in low-density
82: environments. For $L_\star$ samples we cannot rule out a slight
83: increase in galaxy formation efficiency at low densities. More
84: remarkably, we find that the void statistics of red and blue galaxies
85: (at $L\sim 0.4L_\star$) are perfectly predicted by standard HOD models
86: matched to the correlation function of these samples, ruling out
87: ``assembly bias'' models in which galaxy color is correlated with
88: large-scale environment at fixed halo mass. We conclude that the
89: luminosity and color of field galaxies are determined predominantly by
90: the mass of the halo in which they reside and have little direct
91: dependence on the environment in which the host halo formed. In
92: broader terms, our results show that the sizes and emptiness of voids
93: found in the distribution of $L\gtrsim 0.2L_\star$ galaxies are in
94: excellent agreement with the predictions of a standard cosmological
95: model with a simple connection between galaxies and dark matter halos.
96: 
97: \end{abstract}
98: \keywords{cosmology:theory --- galaxies:halos --- large scale structure of the universe}
99: 
100: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
101: % INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION 
102: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
103: 
104: \section{Introduction}
105: 
106: The Halo Occupation Distribution (HOD) has become one of the primary
107: methods for analyzing and interpreting galaxy clustering measurements
108: (e.g., \citealt{kauffmann_etal:97, jing_etal:98, benson_etal:00,
109:   seljak:00, peacock_smith:00, ma_fry:00, roman_etal:01,
110:   berlind_weinberg:02}). The unique and powerful aspect of the halo
111: occupation approach is that it quantifies the bias of a class of
112: galaxies with respect to the underlying dark matter distribution
113: through the statistical relationship between galaxies and the dark
114: matter halos in which they reside. In the HOD formalism, the bias of a
115: galaxy sample is specified by the quantity $P(N|M)$, the probability
116: that a halo of mass $M$ contains $N$ galaxies. Along with assumptions
117: about the spatial and velocity biases of galaxies with respect to the
118: dark matter within their host halos, $P(N|M)$ describes the bias of
119: the sample on all scales and for any clustering measure. The implicit
120: assumption of this approach is that $P(N|M)$ depends only the mass of
121: the halo and is independent of the halo's larger-scale
122: environment. This ``standard implementation'' of the HOD has been
123: called into question by a number of recent theoretical results. Thus
124: it is important to test the underlying assumptions of the HOD and
125: quantify any residuals of the standard implementation, reducing
126: systematic uncertainties in the cosmological constraints derived from
127: HOD modeling. In turn these tests lead to insight on the processes of
128: galaxy formation. In this paper we use new measurements of void
129: probability statistics in the Sloan Digital Sky Survey (SDSS,
130: \citealt{york_etal:00}) and Two-Degree Field Galaxy Redshift Survey
131: (2dFGRS, \citealt{colless_etal:01, colless_etal:03}) to test whether
132: the relation between the properties of field galaxies and their host
133: halos depends on mass only.  We define field galaxies as
134: isolated systems residing in low density regions of the galaxy distribution. In the
135: halo occupation context, these are galaxies that live at the center
136: of halos at or near the minimum halo mass scale for the given galaxy
137: class.
138: 
139: In \cite{tinker_etal:06b} (hereafter, Paper I), we demonstrated that
140: the statistics of galaxy voids are a sensitive diagnostic for
141: environmental dependence of halo occupation. The statistics explored
142: in Paper I were the void probability function (VPF, denoted $P_0$),
143: and the underdensity probability function (UPF, denoted $P_U$). The
144: VPF is defined as the probability that a sphere of radius $r$ contains
145: zero galaxies of a given type. The UPF is defined as the probability
146: that a sphere of radius $r$ has a galaxy density less than some
147: fraction of the overall mean density for that galaxy type. Here we set
148: that fraction to the conventional value of 0.2.  Previous theoretical
149: studies sought to determine what information, if any, void statistics
150: alone offer for constraining galaxy bias (\citealt{little_weinberg:94,
151:   benson:01, berlind_weinberg:02}). Paper I explored void statistics
152: in conjunction with other clustering measures, demonstrating that
153: standard HOD models that match observations of the projected two-point
154: correlation function \wp, and the number density of galaxies $\ngavg$,
155: predict nearly degenerate void statistics regardless of the mapping
156: between halo mass and central galaxy luminosity or the amplitude of
157: dark matter clustering, conclusions similar to those of
158: \cite{conroy_etal:05}. The remarkable robustness of void statistics
159: (under the assumptions of the standard HOD) implies that they can be
160: used to test these underlying assumptions. The two-point correlation
161: function is dominated by galaxies in mean and high density regions of
162: the universe. If one uses this statistic to constrain galaxy
163: occupation and correctly predicts another clustering measure that
164: probes underdense regions, then one infers that halo occupation at
165: fixed mass does not change between high and low densities.
166: 
167: Early studies concluded that the properties of dark matter halos, such
168: as their formation times and merger histories, were independent of, or
169: weakly dependent on environment (\citealt{lemson_kauffmann:99,
170:   sheth_tormen:04}). More recent results, aided by higher resolution
171: and larger-volume simulations, detect a clear relation between
172: formation times and local environment (\citealt{gao_etal:05,
173:   harker_etal:05, wechsler_etal:06, zhu_etal:06, gao_white:06,
174:   wetzel_etal:07}). These studies conclude that this correlation is
175: strongest for low-mass halos, with a sign such that older halos form
176: in higher density regions. Attempts to measure this effect in high
177: mass halos observationally have met with conflicting results
178: (\citealt{yang_etal:06, berlind_etal:06}). Although the correlation
179: between halo formation time and environment is now firmly established,
180: the effect on the galaxy population is less
181: clear. \cite{croton_etal:06a, croton_etal:07} use their
182: semi-analytical galaxy formation model to quantify this ``assembly
183: bias'' in the galaxy population. \cite{croton_etal:07} quantify
184: assembly bias by its effect on the large-scale galaxy two-point
185: correlation function, $b_x = \sqrt{\xi/\xi_0}$, where $\xi_0$ is the
186: clustering amplitude of a model once the assembly bias has been
187: removed from the sample by scrambling galaxies among halos of the same
188: mass.  For luminosity-defined samples, they find $b_x-1 \approx 0.05$
189: for faint galaxies, decreasing to $-0.05$ for bright galaxies. The
190: effect is strongest in their model for faint, red, central galaxies,
191: increasing the amplitude of the correlation function of these objects
192: by nearly a factor of 4. Because central galaxies define the voids (in
193: the statistical sense of the VPF and UPF), our approach is well-suited
194: to testing this effect in the true galaxy distribution. Observational
195: tests of the environmental dependence of galaxy properties by
196: \cite{blanton_etal:06a} have shown that the blue fraction correlates
197: with the galaxy density on small-scales (i.e., the scale of a large
198: halo), but not with the larger-scale density field (see also
199: \citealt{blanton_etal:06b}).  \cite{abbas_sheth:05, abbas_sheth:06}
200: use the halo occupation formalism to calculate galaxy clustering as a
201: function of local galaxy density, concluding that the standard
202: $P(N|M)$ approach correctly predicts the clustering of SDSS galaxies
203: as a function of their local environment. \cite{skibba_etal:06} use
204: the standard HOD approach to accurately predict the
205: luminosity-weighted correlation function of SDSS galaxies. Our use of
206: void statistics is complementary to these tests, in that voids probe
207: the most extreme galaxy environments. While the papers above are
208: sensitive to assembly bias of satellite galaxies or galaxies in mean
209: and high-density environments, void statistics are affected by the
210: bias of a small subset of the overall galaxy sample, making them more
211: sensitive to assembly bias in central galaxies and at low densities.
212: 
213: In this paper we present new measurements of the VPF and UPF from Data
214: Release Four of the SDSS (DR4, \citealt{dr4}).  Through the use of a
215: larger observational sample, this work extends earlier analysis of
216: void statistics from the CfA redshift survey
217: (\citealt{vogeley_etal:94}), the 2dFGRS (\citealt{croton_etal:04,
218:   hoyle_vogeley:04, patiri_etal:06a}) and Data Release Two of the SDSS
219: (\citealt{conroy_etal:05}). We also present new measurements of the
220: VPF from the full data release of the 2dFGRS that are better suited to
221: the purposes of this study than earlier analyses. We compare these
222: data to predictions for the VPF and UPF created with the standard
223: implementation of the HOD and for models in which the occupation of
224: central galaxies depends on environment. All models are constrained to
225: match \wp\ and $\ngavg$. Using the parameterization of Paper I, we
226: create density-dependent models in which the minimum mass scale for
227: hosting a central galaxy shifts by a factor $\fmin$ in environments
228: where the density falls below a threshold value $\dc$. A value of
229: $\fmin>1$ physically represents a model in which galaxy formation
230: become less efficient in low-density regions, creating positive
231: assembly bias ($b_x>1$). Models with $\fmin<1$ imply an increase in
232: galaxy formation efficiency, in the sense that a given mass halo can
233: host a more luminous galaxy, yielding negative assembly bias
234: ($b_x<1$). We show that the void statistics for faint galaxies,
235: $\mrlogh<-19$, are accurately predicted by the standard HOD, while
236: models with density dependence always produce a worse fit to the
237: observational data. The void statistics for bright galaxies,
238: $\mrlogh<-21$, are adequately fit by the standard HOD prediction,
239: while models with positive assembly bias are strongly excluded. (For
240: reference, the characteristic luminosity $L_\star$ in the
241: \cite{blanton_etal:03} $r$-band luminosity function is $M_r - 5\log h
242: = -20.44$.) We also make predictions for void statistics in the
243: 2dFGRS. We find once again that the standard HOD accurately predicts
244: the VPF in these samples, leaving little room for assembly bias.
245: 
246: We also explore models for faint color-defined galaxy samples. The
247: dependence of galaxy color and morphology on local environment is well
248: established (e.g.\ \citealt{dressler:80, postman_geller:84}). The
249: correlation between color and environment has been refined with the
250: increased statistics of the SDSS, (\citealt{blanton_etal:05a,
251:   park_etal:07}). \cite{berlind_etal:05} use cosmological hydrodynamic
252: simulations to demonstrate that these variations of color with
253: environment can be explained by the variations of the halo mass
254: function with environment only, without variations of halo occupation
255: at fixed mass. The observational results of \cite{blanton_etal:06a}
256: support this conclusion. However, the theoretical results of
257: \cite{croton_etal:07} imply that environmental effects of halo
258: occupation should be strong for color-defined samples. In their
259: semi-analytic model, faint red central galaxies preferentially occupy
260: halos in dense regions (at fixed halo mass). This is contrary to the
261: standard HOD assumption that the central galaxy of a halo has a
262: probability of being red that is independent of environment.  We show
263: that the measured VPFs are well-fit by the standard HOD predictions
264: for these samples. An assembly bias as strong as that in the
265: semi-analytic model of \cite{croton_etal:07} would likely be
266: detectable within the given errors of our VPF measurements.
267: 
268: Section 2 presents the details of our measurements of the VPF and UPF
269: from the SDSS, and our methodology for making predictions for these
270: statistics from the HOD. Section 3 presents the results for
271: luminosity-defined samples from the SDSS, comparing observational
272: measurements to HOD predictions using both the standard implementation
273: and models with density dependence. In \S 4 we show results for
274: color-selected galaxy samples from the SDSS, and compare to HOD
275: models. In \S 5 we present results for luminosity-defined samples from
276: the 2dFGRS. In \S 6 we discuss these results.
277: 
278: 
279: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
280: % SECTION 2
281: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
282: 
283: \section{SDSS Data and Modeling}
284: 
285: \begin{figure*}
286: \centerline{\epsfxsize=5.5truein\epsffile{hod_4win.ps}}
287: %\epsscale{1.0}
288: %\plotone{hod_4win.ps}
289: %\vspace{-4.5cm}
290: \caption{\label{hod_4win} Projected correlation function data and HOD
291:   fits for the $\mrlogh<-19$ sample (panels [a] and [c], respectively)
292:   and the $\mrlogh<-20$ sample (panels [b] and [d], respectively). In
293:   the top panels, points with error bars are the SDSS data of Z05,
294:   while the gray region represents the range in HOD fits with $\Delta
295:   \xwp<1$ with respect to the best-fit HOD model. Bottom panels plot
296:   the mean occupation functions $\navg$ for 20 randomly chosen HOD
297:   fits with $\Delta \xwp<1$. Results in panels (b) and (d) are for the
298:   $\mrlogh<-20$ sample restricted to $z\le 0.06$.  }
299: \end{figure*}
300: 
301: \begin{figure*}
302: \centerline{\epsfxsize=5.5truein\epsffile{hod_4win2.ps}}
303: %\epsscale{1.0}
304: %\plotone{hod_4win2.ps}
305: %\vspace{-4.5cm}
306: \caption{\label{hod_4win2} Projected correlation function data and HOD
307:   fits for the $M_r<-21$ sample (panels [a] and [c], respectively) and
308:   the $M_r<-22$ sample (panels [b] and [d], respectively). In the top
309:   panels, points with error bars are the SDSS data of
310:   Z05, while the gray region represents the range in
311:   HOD fits with $\Delta \xwp<1$ with respect to the best-fit HOD
312:   model. Bottom panels plot the mean occupation functions $\navg$ for
313:   20 randomly chosen HOD fits with $\Delta \xwp<1$.  }
314: \end{figure*}
315: 
316: 
317: 
318: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
319: % Subsection
320: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
321: 
322: \subsection{Observational Samples and Measurements}
323: 
324: For SDSS galaxies, we use measurements of \wp\ from
325: \cite{zehavi_etal:05} (hereafter Z05). These measurements were
326: performed on volume-limited samples from a spectroscopic sample of
327: nearly $200,000$ galaxies, from an angular survey area of 2497
328: deg$^2$, approximately the size of Data Release Two of the SDSS (DR2,
329: \citealt{dr2}). We use four volume-limited samples defined by $r$-band
330: magnitude thresholds $\mrlogh = -19$, $-20$, $-21$, and $-22$. For all
331: samples, we utilize the full covariance error matrix of the
332: measurements when comparing HOD models of \wp\ to observations. To
333: measure the void statistics in DR4 of the SDSS we use the NYU Value
334: Added Galaxy Catalog (\citealt{blanton_etal:05b}).  This sample is
335: larger in volume than the Z05 sample; the survey area for DR4 is 4783
336: deg$^2$, but the flanking fields and other isolated patches are not
337: well suited for our measurements, and are not used. As in the Z05
338: samples, all galaxies are $k$-corrected to redshift $z=0.1$ using the
339: software package {\tt kcorrect} (\citealt{blanton_etal:06c}). Although
340: the larger volume of DR4 might lead to differences in \wp, the samples
341: for which \wp\ have been measured in DR4 are within the errors of the
342: Z05 data (I. Zehavi, private communication). As we will demonstrate in
343: \S 2.3, the measurements of \wp\ used in this paper are sufficient to
344: constrain the HOD for the $\mrlogh<-19$ and $-21$ samples, so that the
345: uncertainties in HOD parameters are nearly negligible in comparison to
346: the measurement errors on the VPF and UPF. When analyzing the data we
347: use the full error covariance matrix, also taken from Z05.
348: 
349: 
350: %%%%%%%%%%%%%%%%%%%%%%
351: % TABLE 1
352: %%%%%%%%%%%%%%%%%%%%%%
353: 
354: \begin{deluxetable*}{cccccc}
355: \tablecolumns{6} 
356: \tablewidth{28pc} 
357: \tablecaption{Properties of the SDSS Volume Limited Catalogs}
358: \tablehead{\colhead{Sample} & \colhead{$z_{\rm min}$} & \colhead{$z_{\rm max}$} & \colhead{$\ngavg$} & 
359: \colhead{$f_{\rm comp}$} & \colhead{Volume $[(h^{-1}{\rm Mpc})^3]$} }
360: 
361: \startdata
362: 
363: $-19$ & 0.02 & 0.06 & $1.19\times 10^{-2}$ & 0.873 & $1.78\times 10^6$ \\
364: $-20$ & 0.02 & 0.06 & $4.33\times 10^{-3}$ & 0.873 & $1.78\times 10^6$ \\
365: \,$-20^\prime$ & 0.02 & 0.10 & $4.93\times 10^{-3}$ & 0.873 & $8.28\times 10^6$ \\
366: $-21$ & 0.03 & 0.15 & $1.01\times 10^{-3}$ & 0.876 & $2.11\times 10^7$ \\
367: $-22$ & 0.05 & 0.22 & $5.77\times 10^{-5}$ & 0.876 & $8.15\times 10^7$ \\
368: \hline
369: red & 0.02 & 0.06 & $3.28\times 10^{-3}$ & 0.873 & $1.78\times 10^6$ \\
370: blue & 0.02 & 0.06 & $4.33\times 10^{-3}$ & 0.873 & $1.78\times 10^6$
371: 
372: \enddata \tablecomments{Number densities are given in units of
373:   $($\hmpc$)^3$. $f_{\rm comp}$ is the mean completeness of each
374:   sample. $-20^\prime$ refers to the unrestricted sample that includes
375:   the Sloan Great Wall. See \S 2.2 for discussion. $V$ is the volume
376:   of each sample. Samples are defined with luminosity thresholds, but
377:   the red and blue samples are restricted to the magnitude range $-19
378:   < \mrlogh < -20$. }
379: \end{deluxetable*}
380: 
381: To measure the UPF and VPF from the survey at a given $r$, we randomly
382: place a large number of spheres with radius $r$ within the survey,
383: counting the number of galaxies located in each sphere. We limit the
384: number of spheres to a maximum of $10^7$ and minimum of $10^6$,
385: numbers that have been tested to ensure convergence. The largest
386: number of spheres is used at small radii to reduce the shot noise in
387: the measurement at those scales. Once the counts in each cell are
388: determined, the VPF is defined as the fraction of empty spheres, i.e.,
389: 
390: \begin{equation}
391: \label{e.vpf}
392: P_0(r) = \frac{\nn_0}{\nn_{\rm tot}},
393: \end{equation}
394: 
395: \noindent where $\nn_N$ refers to the number of spheres that contain
396: $N$ galaxies, and $\nn_{\rm tot}$ indicates the total number of
397: spheres. The UPF is defined as the fraction of spheres that contain
398: less than 20\% of the expected number of galaxies from the mean
399: density,
400: 
401: \begin{equation}
402: \label{e.upf}
403: P_U(r) = \nn_{\rm tot}^{-1}\sum_{N=0}^{N_U(r)}\nn_N
404: \end{equation}
405: 
406: \noindent where $N_U(r)={\tt floor}( 0.2\times \ngavg 4\pi
407: r^3/3)$. While $P_0(r)$ rapidly approaches zero at radii larger than
408: the mean galaxy separation, $P_U(r)$ falls off approximately as an
409: exponential function and is less subject to shot noise at larger
410: $r$. Thus is it possible to measure $P_U(r)$ more accurately at larger
411: scales than $P_0(r)$. Paper I also demonstrated that these statistics
412: have somewhat complementary information when testing for
413: density-dependence in $\navg$; altering galaxy formation efficiency
414: may eliminate galaxies from low-density regions without entirely
415: emptying them. 
416: 
417: Our handling of the SDSS survey geometry and completeness closely
418: parallels that of \cite{conroy_etal:05}. The completeness, defined as
419: the ratio of successfully attained redshifts to targetable objects,
420: varies non-trivially from 0 to 1 as a function of right ascension and
421: declination.  Sophisticated software has been developed to efficiently
422: handle complex survey geometry such as the SDSS.  In order to identify
423: and avoid regions of low completeness we use the \texttt{Mangle}
424: package (\citealt{hamilton_tegmark:04}) to generate a densely sampled
425: angular window function.  This window function incorporates regions of
426: the sky not surveyed, either because the region lies outside the
427: bounds of the survey or because of bright foreground stars, and
428: incompleteness within the survey due either to fiber collisions (no
429: two fibers can be separated by less than 55 arcseconds, affecting ~7\%
430: of targetable redshifts) or objects that could not be assigned a
431: reliable redshift, affecting $\sim1$\% of targetable objects.
432: 
433: In order to treat edge-effects arising from measuring the VPF and UPF
434: via counts-in-spheres, we convolve the window function with a circular
435: smoothing kernel of angular radius $\theta(r,z)$ proportional to a
436: sphere projected onto the plane of the sky with comoving radius $r$ at
437: redshift $z$. This convolution yields the total completeness of the
438: survey at each point in the sky for a given angular sphere size, where
439: the incompleteness could arise from either a sphere lying partially
440: off the edge of the survey or being in a region of the survey with low
441: spectroscopic completeness. We then place random spheres only at
442: points above a minimum convolved completeness. This allows us to
443: robustly avoid regions of bright stars, regions of low completeness
444: (due, for example, to inclement weather during observations) and the
445: edges of the survey.  The distribution of completenesses is
446: approximately a Gaussian centered at $\sim 88$\% with an additional
447: constant component extending to low completeness. Motivated by this
448: distribution, we place spheres only in regions above a minimum
449: convolved completeness of $83$\%, noting that our results are
450: insensitive to this exact value.  Spheres are placed uniformly along
451: the line of sight because each sample is volume limited.
452: 
453: In the above methodology, completeness issues are handled by including
454: only those regions of the survey which are both high and uniform in
455: completeness and then incorporating the remaining small incompleteness
456: effects into model predictions (which we will discuss below). An
457: alternative methodology has been proposed by \cite{croton_etal:04}, in
458: which incompleteness effects are treated by correcting the measured
459: VPF in order to recover the ``true'' underlying VPF of the galaxy
460: distribution.  This particular correction scheme counts the number of
461: galaxies within a sphere of radius $r'=r/f^{1/3}$ as contributing to
462: the VPF at radius $r$ ($f$ is the convolved completeness at $r$).
463: This scheme in essence treats incompleteness as missed volume rather
464: than missed galaxies. Although this correction is exact in the Poisson
465: limit, it will over-correct the VPF to some degree at larger $r$ or
466: lower $\p0$. The systematic error accrued is difficult to estimate
467: without the use of detailed mock catalogs, reducing the usefulness of
468: the correction method in the first place. Thus to compare our models
469: to data, we modify the theoretical predictions to match the
470: incompleteness of the survey, rather than trying to remove the
471: incompleteness from the survey itself. We will discuss this further in
472: \S 2.3.
473: 
474: As in Z05, we create two separate volume-limited samples with
475: $\mrlogh<-20$. The maximum redshift for these objects is $z=0.10$, but
476: this redshift limit includes the so-called `Sloan Great Wall'
477: supercluster (\citealt{gott_etal:05}; see also \citealt{baugh_etal:04}
478: for results from the 2dFGRS). This structure dominates the overall
479: clustering of the full $-20$ sample, and the presence of such a
480: structure makes it difficult to accurately estimate the true cosmic
481: variance for this sample. We follow Z05 in focusing on a sample
482: restricted to the same redshift limit as the $\mrlogh<-19$ sample of
483: $z\le 0.06$. Unless otherwise stated, all results for these galaxies
484: use the restricted redshift sample.
485: 
486: 
487: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
488: % Subsection
489: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
490: 
491: \subsection{HOD Modeling}
492: 
493: We constrain the occupation function by fitting the observed \wp\ and
494: $\ngavg$ for each sample with the analytic model for \wp\ described in
495: \cite{tinker_etal:05} (see also \citealt{zheng:04,
496:   zehavi_etal:04}). The mean occupation function is divided into two
497: terms; central galaxies located at the center of mass of the halo, and
498: satellite galaxies distributed throughout the halo. For SDSS samples
499: defined by a luminosity threshold, the central occupation function
500: takes the form
501: 
502: \begin{equation}
503: \label{e.ncen_sdss}
504: \ncen = \frac{1}{2}\left[ 1+\mbox{erf}\left(\frac{\log M - \log
505:     \mmin}{\sigmaM} \right) \right],
506: \end{equation}
507: 
508: \noindent where $\mmin$ is a cutoff mass scale and all logarithms are
509: base-10. Formally, in equation (\ref{e.ncen_sdss}) $\mmin$ is the mass
510: at which $\ncen=0.5$. The parameter $\slogm$ describes the shape of
511: the central galaxy cutoff. Physically, this parameter represents the
512: scatter between halo mass and central galaxy luminosity; if this
513: scatter is large then a fraction of low-mass halos will be included in
514: the sample and the shape of the cutoff will be soft. If this scatter
515: is small then central galaxies follow a nearly one-to-one mapping of
516: mass to luminosity, and $\ncen$ resembles a step function.
517: 
518: The satellite galaxy occupation function is modeled as a truncated
519: power law,
520: 
521: \begin{equation}
522: \label{e.nsat_sdss}
523: \nsat = \left( \frac{M-\mcut}{\msat}\right)^{\asat},
524: \end{equation}
525: 
526: \noindent where $\mcut$ is a cutoff mass scale for satellites, $\msat$
527: is the amplitude of the power law, and $\asat$ is its slope. In
528: equation (\ref{e.nsat_sdss}) the mass at which halos host on average
529: one satellite is $M_1 = \mcut+\msat$. The total occupation function is
530: $\navg = \ncen + \nsat$. As expressed in equations (\ref{e.ncen_sdss})
531: and (\ref{e.nsat_sdss}) the occupation function has five free
532: parameters. In practice, the number of free parameters is reduced to
533: four because $\mmin$ is set by $\ngavg$ once the other parameters have
534: been chosen. One can accurately fit \wp\ with only a three-parameter
535: occupation function (e.g., \citealt{zehavi_etal:04, zehavi_etal:05}),
536: but we allow $\navg$ extra freedom to explore how variations in the
537: shape of $\navg$ alter the predicted void statistics. In Paper I we
538: demonstrated that the void statistics are relatively insensitive to
539: $\slogm$ and $\mmin$ allowed by \wp\ and $\ngavg$, but to quantify the
540: uncertainty in our predicted void statistics we leave all parameters
541: free. For each \wp, the best-fit model is found by $\chi^2$
542: minimization using the full covariance error matrix of the data. To
543: minimize $\chi^2$ we use the Monte Carlo Markov chain method
544: (MCMC). While less efficient than other techniques, MCMC quantifies
545: the errors on the HOD parameters. For each sample, we randomly select
546: twenty HOD fits from the MCMC chain that have a $\Delta\chi^2<1$ with
547: respect to the best-fit model. These 20 fits will be used to estimate
548: the range in HOD predictions for the void statistics allowed by the
549: \wp\ data. The best-fit models for each sample are listed in Table 2.
550: 
551: Figure \ref{hod_4win} presents the results of the HOD analysis of the
552: $\mrlogh<-19$ and $-20$ samples. Figures \ref{hod_4win}a and
553: \ref{hod_4win}b plot the data with diagonal error bars, along with the
554: sample of twenty HOD fits from the MCMC chain. Figures \ref{hod_4win}c
555: and \ref{hod_4win}d present the occupation functions for each of those
556: twenty fits for faint and bright samples, respectively. For the
557: $\mrlogh<-19$ sample, the twenty projected correlation functions
558: calculated from the HOD fits are nearly indistinguishable. But the
559: occupation functions in \ref{hod_4win}c differ substantially at low
560: masses. Because $\mmin$ for this sample is significantly below the
561: nonlinear mass scale $\mstar = 8.60\times 10^{12}$ \hmsol\ for this
562: cosmology, \wp\ is relatively unaffected by softer or harder central
563: cutoffs; the mean bias of the HOD is largely unaffected by variations
564: in $\slogm$. In Paper I we demonstrated that the distribution of voids
565: is also unaffected by such changes to the occupation function,
566: yielding degenerate VPFs and UPFs. Figure \ref{hod_4win}d presents the
567: twenty occupation function for the $\mrlogh<-20$ sample. For this
568: sample, the shape of the central galaxy cutoff is essentially
569: unconstrained; the range in $\slogm$ from the twenty MCMC models is
570: 1.4 to 0.05. Because the volume of this sample is the same as the
571: $\mrlogh<-19$ sample, the differences in the constraints are somewhat
572: surprising. The size of the diagonal errors on \wp\ are similar, but
573: the data points for the brighter galaxies are more correlated,
574: reducing the constraining power for this sample.
575: 
576: Figure \ref{hod_4win2} shows the same quantities as the previous
577: figure, but now for the $\mrlogh<-21$ sample, and the $\mrlogh<-22$
578: sample. Figure \ref{hod_4win}c presents the twenty occupation
579: functions for the $\mrlogh<-21$ sample. For this sample, $\mmin \sim
580: \mstar$, thus the constraints on $\slogm$ from \wp\ alone are
581: substantially stronger than for the fainter samples. For the brightest
582: galaxies, Figure \ref{hod_4win2}b shows large differences in one-halo
583: clustering among acceptable models, resulting in significant
584: differences in $\nsat$ in Figure \ref{hod_4win2}d. The lack of strong
585: constraints on the HOD prevent the use of this sample and the
586: $\mrlogh <-20$ sample for testing assembly bias in the void
587: statistics. But, as we will show in the following section, for these
588: the constraints on the HOD can be enhanced moderately through the
589: addition of the VPF and UPF.
590: 
591: 
592: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
593: % Subsection
594: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
595: 
596: \subsection{Mock Catalogs}
597: 
598: Once the best-fit HOD model is identified, we predict void statistics
599: by populating the halos identified in dark matter N-body
600: simulations. Central galaxies are located at the center of mass of the
601: halo, and satellite galaxies are placed randomly throughout the halo
602: such that they follow the density profile of \cite{nfw:97} with a
603: concentration parameter given by the model of
604: \cite{bullock_etal:01}. Central galaxies are given the velocity of the
605: halo center of mass, while satellite galaxies are given an additional
606: random velocity in each Cartesian direction drawn from a Gaussian
607: distribution with dispersion equal to the virial velocity of the halo
608: $\sigma_{\rm vir}^2 = GM/2R_{\rm vir}$, where we have defined $R_{\rm
609:   vir}$ to be the radius at which the mean interior density of the
610: halo is 200 times the background density. All calculations of the VPF
611: and UPF are performed in redshift space using the distant observer
612: approximation, with the $z$-axis as the line of sight. Our results are
613: insensitive to the value of $\om$ or possible velocity bias of the
614: galaxies within reasonable limits (i.e., variations less than $\sim
615: 40\%$). Although these parameters alter the redshift space positions of
616: galaxies, the net effect on the void statistics is negligible. As with
617: the observational measurements, we calculate the VPF and UPF using
618: $10^6-10^7$ random spheres at each radius. Errors on the calculations
619: are estimated by jackknife sampling of the simulation volume into 125
620: subsamples.
621: 
622: We use two simulations to calculate void statistics, a smaller box 400
623: \hmpc\ on a side and a larger box 1086 \hmpc\ on a side. Both
624: simulations are inflationary cold dark matter models with identical
625: cosmologies. The linear matter power spectrum used to create the
626: initial conditions of each simulation was calculated with {\small
627:   CMBFAST} (\citealt{cmbfast}) with the parameter set
628: $(\om,\s8,\omb,n_s,h) = (0.3,0.9,0.04,1.0,0.7)$. For fainter galaxies
629: we utilize the smaller simulation to make predictions. This is the
630: same simulation used in Paper I, consisting of $1280^3$ particles,
631: yielding a particle mass of $2.54\times 10^9$ \hmsol. To model the
632: brighter galaxies we populate the larger simulation. This simulation
633: contains $1024^3$ particles, yielding a particle mass of $9.95\times
634: 10^{10}$ \hmsol. For both simulations, the initial conditions are
635: integrated with the hashed oct-tree code of \cite{warren_salmon:93},
636: with Plummer force softening lengths of 14.6 \hkpc\ and 40 \hkpc\ for
637: the small and large boxes, respectively. Halos are identified in the
638: simulations using the friends-of-friends algorithm with a linking
639: parameter of 0.2 times the mean interparticle separation, a value that
640: selects halos roughly corresponding to our adopted definition of a
641: virial overdensity of 200 (\citealt{davis_etal:85}). To be
642: self-consistent, all analytic calculations are performed with the same
643: set of cosmological parameters listed above. For these calculations,
644: the halo mass function is obtained with the fitting function of
645: \cite{jenkins_etal:01}. For the halo bias function, we use the fitting
646: function from \cite{tinker_etal:05}. This bias relation utilizes the
647: functional form presented in \cite{sheth_mo_tormen:01}, but with
648: parameter values ($a=0.707$, $b=0.35$ and $c=0.8$) calibrated on a
649: set of larger-volume N-body simulation with widely varying
650: cosmologies.
651: 
652: As mentioned in \S 2.1, we modify the number density of galaxies in
653: each mock to match that measured in each observational sample. At each
654: radius the mean number density of SDSS galaxies within spheres
655: $\ngbar$ is calculated. The maximum deviation of $\ngbar$ from the
656: overall mean averaged over all radial bins in less than $\sim 2$\% for
657: each luminosity sample, demonstrating that our treatment of the survey
658: mask is robust and that we are probing the same volume with each
659: sphere size. The mean number densities for each sample are listed in
660: Table 1, along with other details of each sample. Due to
661: incompleteness, these number densities are less than that expected
662: from the measurement of the $r$-band luminosity function by
663: \cite{blanton_etal:03}. When calculating the VPF in our mock galaxy
664: distributions, we dilute the mocks to match the number densities of
665: the data at each radius. Using the overall mean density produces
666: minimal differences in the theoretical predictions, with small
667: differences at the lowest-$r$ points where the VPF is Poisson
668: dominated.
669: 
670: The galaxy number density required by the HOD analysis of \wp\ is the
671: {\it true} number density, which must be estimated from observational
672: samples and has an error associated with it due to cosmic variance. As
673: noted in \cite{kev_etal:05}, this error is $\lesssim 5\%$ for the
674: $\mrlogh<-21$ sample. To test the sensitivity of our model predictions
675: to errors in the true number density, we alter $\ngavg$ by $+/- 10\%$
676: and re-fit \wp. The resulting void statistics, once matched to the
677: {\it sample} number densities, are within the errors on the
678: theoretical estimates. Due to the steepness of the halo mass function,
679: increasing or decreasing $\ngavg$ by 10\% alters the mass scales of
680: the HOD parameters ($\mmin$, $M_1$, $\mcut$) by $\sim 5\%$ (with the
681: opposite sign of the change in $\ngavg$), but the overall shape of the
682: HOD is nearly unchanged. We conclude that cosmic variance errors on
683: the true galaxy number density do not bias our results.
684: 
685: 
686: 
687: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
688: % Subsection
689: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
690: 
691: \subsection{Error Estimation and Systematics}
692: 
693: We estimate the errors on the measured void statistics with our
694: simulations, described the previous section. The volume of the
695: $\mrlogh < -19$ and $-20$ samples is approximately equal to a cube
696: $120$ \hmpc\ per side, $1/37$ the volume of the $400$ \hmpc\ box. Once
697: the HOD predictions have been calculated, using the best-fit HOD from
698: the \wp\ fitting, the box is divided into 27 cubic subregions, each
699: 133 \hmpc\ per side. The dispersion among the subregions is scaled by
700: $(133/120)^{3/2}=1.17$ to correct for the fact that the subregions do
701: not exactly match the volume of the observational sample. This scaled
702: dispersion is taken to be the error on the observational quantity. We
703: also estimate the covariance matrix from this method. This method is
704: more robust than estimating the errors directly from the observational
705: sample due to variations of the galaxy number density on 120
706: \hmpc\ scales. When estimated directly from the data, these
707: fluctuations cause the errors to be underestimated with respect to the
708: dispersion amongst the simulation subregions. At large scales, $r\sim
709: 10$ \hmpc\ in the fainter samples, proper error estimation from the
710: data is also inhibited by small sample volume. To estimate errors for
711: the brighter samples, the same process is followed using the $1086$
712: \hmpc\ box. For the $\mrlogh<-21$ sample, this larger simulation is
713: approximately $47$ times larger. For $\mrlogh<-22$ galaxies, the
714: volume of the large simulation is equivalent to 16 times the
715: observational sample. When calculating $\chi^2$ for a given model, we
716: neglect the innermost ($r=1$ \hmpc ) data point. In tests we find that
717: the errors on this scale require a prohibitive number of random
718: spheres to converge, and including this point in the covariance matrix
719: introduces significant noise to the error estimate. The data at this
720: scale contain little useful information anyway; the behavior of the
721: VPF is nearly Poisson at $r\lesssim 1$ \hmpc\ for all luminosity
722: samples (\citealt{croton_etal:04}).
723: 
724: For clarity, we will refer to $\chi^2$ values with respect to $\p0$,
725: $\pu$, and \wp\ as $\xvpf$, $\xupf$, and $\xwp$, respectively.
726: 
727: 
728: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
729: % SECTION THREE SECTION THREE SECTION THREE SECTION THREE SECTION THREE 
730: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
731: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
732: % SECTION THREE SECTION THREE SECTION THREE SECTION THREE SECTION THREE 
733: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
734: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
735: % SECTION THREE SECTION THREE SECTION THREE SECTION THREE SECTION THREE 
736: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
737: 
738: \section{Results for Luminosity-Defined SDSS Samples}
739: 
740: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
741: % Subsection
742: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
743: 
744: \begin{figure*}
745: \centerline{\epsfxsize=5.5truein\epsffile{4win_vpf.ps}}
746: %\epsscale{1.0}
747: %\plotone{4win_vpf.ps}
748: %\vspace{-4.5cm}
749: \caption{\label{vpf_hod} Comparison of the measured SDSS VPF to HOD
750:   predictions from fitting \wp. The luminosity sample is labeled in
751:   each panel. The results for each sample are presented in two panels;
752:   the upper panel presents the SDSS $P_0(r)$ and the HOD prediction,
753:   while the lower panel plots difference between the data and
754:   prediction, relative the the errorbar on the data. The errors on the
755:   HOD prediction are calculated from the simulation by the jackknife
756:   method. The shaded region in the lower panel represents the range in
757:   predictions from a sample of HODs with $\Delta \xwp<1$ with
758:   respect to the best-fit model. The data and model in the $M_r<-20$
759:   panels are using the restricted volume-limited sample. The yellow shaded
760:   region plots the results from using the full sample, $z\le 0.10$. }
761: \end{figure*}
762: 
763: \begin{figure*}
764: \centerline{\epsfxsize=5.5truein\epsffile{4win_upf.ps}}
765: %\epsscale{1.0}
766: %\plotone{4win_upf.ps}
767: %\vspace{-4.5cm}
768: \caption{\label{upf_hod} Comparison of the measured SDSS UPF to HOD
769:   predictions from fitting \wp. Jumps in the predictions and data
770:   occur when the number of galaxies corresponding to 20\% of the mean
771:   density crosses an integer boundary in the number of galaxies
772:   required to make a sphere ``underdense''. For the $M_r<-22$ sample,
773:   the number density is so low that the UPF only differs from the VPF
774:   at $r>26$ \hmpc, so we have plotted $P_1(r)$ instead. As in Figure
775:   \ref{vpf_hod}, the $M_r<-20$ panels have an additional yellow shaded
776:   region comparing the $P_0(r)$ from the $z\le 0.10$ sample with the
777:   HOD constraints from \wp\ for the same sample. }
778: \end{figure*}
779: 
780: 
781: 
782: \subsection{Observational Results and HOD Predictions}
783: 
784: Our approach is to take a random sample of 20 HOD models that all
785: produce accurate fits to the \wp\ data, and for each model calculate
786: the VPF and UPF. All HOD models are $\Delta\xwp<1$ with respect to
787: the best-fitting model. As shown in Table 1, the best-fit models all
788: yield $\xwp/\nu\lesssim 1$. \cite{conroy_etal:05} and Paper I
789: concluded that $P_0(r)$ contained little additional information about
790: the galaxy distribution relative to the two-point correlation
791: function. If this is exactly true, and the precision of the
792: measurements of the different statistics are equal, we would expect
793: that 1) all 20 models will produce good fits to the void statistics,
794: and 2) that the range in $\xvpf$ will be approximately 1, just as with
795: the distribution of $\xwp$ values. If the void statistics do contain
796: complementary information about the galaxy distribution, one or both
797: of these expectations will be violated. An alternate method would be
798: to perform a joint fit to \wp, $\p0$, and $\pu$ simultaneously, and
799: then compare the constraints on HOD parameters to the analysis in
800: which \wp\ is considered alone. Because calculating the void
801: statistics involves the use of an N-body simulation this procedure is
802: time intensive. It also requires an estimate of the covariance between
803: all three data sets, which is not available. This method is more
804: rigorous than the one we employ, but our approach provides a
805: straightforward test of the HOD models, and discrepancies between
806: predictions and measurements are readily detectable and quantifiable.
807: 
808: Figure \ref{vpf_hod} plots the measured SDSS VPF for the four
809: luminosity samples in Table 1 and compares them to the best-fit models
810: using the standard implementation of the HOD. In each panel, the
811: points with error bars represent the observed SDSS values. Lines show
812: the VPF obtained from the populated simulations. The lower panels in
813: each quadrant present the residuals of the model from the data. We
814: define the residual as $\dev \equiv (P_0^{\rm HOD} - P_0^{\rm
815:   SDSS})/\sigma_{\rm SDSS}$, where $\sigma_{\rm SDSS}$ is the diagonal
816: error bar on the SDSS data. We divide by the error to more clearly
817: present the differences between theory and observations; the
818: fractional error on the VPF (and the UPF) can range from $10^{-3}$ at
819: small radii to $\sim 1$ at large $r$. The data are highly correlated,
820: so a $\dev \sim 1$ for several consecutive data points is still only a
821: $\sim 1\sigma$ deviation overall. The errors on the lines are the
822: jackknife error bars, quantifying the theoretical uncertainty in our
823: predicted VPF for the HOD model that best fits \wp, resulting from the
824: finite volume of the simulation. The shaded region in the lower window
825: of each panel represents the range in $P_0(r)$ from the MCMC models,
826: quantifying the uncertainty in the theoretical prediction associated
827: with the uncertainty in the HOD parameters. We now describe each sample
828: in detail.
829: 
830: Figure \ref{vpf_hod}a presents the results for the $\mrlogh<-19$
831: sample. Due to the low luminosity threshold, this sample has the
832: smallest volume and largest observational errors on both \wp\ and the
833: void statistics. It also has the highest number density, driving the
834: VPF to zero at the smallest value of $r$ of all four samples. The
835: agreement between the measured VPF and that predicted by the best-fit
836: HOD, which assumes no density dependence to $\navg$, is excellent. The
837: residuals are approximately $0.5\sigma_{\rm SDSS}$ or less at all
838: $r$. The $\xvpf$ for the best-fit model is 8.99 for 10 data points
839: (note that ``best-fit'' here refers to \wp, and no parameters are
840: adjusted to match the VPF itself).  Due to the errors on \wp, the
841: range in predicted $P_0(r)$ from the set of acceptable HOD fits is
842: larger than the jackknife errors on $P_0(r)$ for an individual model,
843: but $\dev\lesssim 1$ for all HOD models with \wp\ fits of $\Delta
844: \xwp<1$. The $\xvpf$ values for these models range from 8.27 to
845: 11.7. We attribute the larger range of $\xvpf$ of 3.4
846: mostly to the increased volume of the DR4 sample relative to the
847: \wp\ sample.
848: 
849: Figure \ref{vpf_hod}b presents the results for $\mrlogh<-20$
850: galaxies. The points with error bars in the upper panel of Figure
851: \ref{vpf_hod}b show the results from the restricted sample. The
852: best-fit HOD prediction is $\sim 2\sigma$ low at $r\ge 4$ \hmpc,
853: yielding $\xvpf=54.2$ for 10 data points. The range in $\xvpf$ from the
854: twenty MCMC models is 11.4 to 118. This is in sharp contrast to the
855: results in \ref{vpf_hod}a, in which a set of \wp\ models with $\Delta
856: \xwp\le 1$ produces a set of VPFs with $\Delta \xvpf \le 4$. This is
857: due to the large range in $\slogm$ allowed by the \wp\ data. Although
858: the VPF is most sensitive to the fraction of galaxies that are
859: satellites, large variations in the central occupation function still
860: influence the size of voids to some degree (Paper I, Figure 6). The
861: value of $\xvpf$ correlates with the $\slogm$ such that sharper
862: central cutoffs yield more accurate predictions for $\p0$, with a
863: correlation coefficient $r=0.94$.  Combined with the fact that the
864: central cutoff shape is ill-constrained by \wp\ alone, the VPF adds
865: significant information for constraining the HOD for this sample;
866: models with $\slogm<0.3$ yield $\xvpf\lesssim 12$. The yellow shaded
867: region presents the residuals for VPF predictions for the same
868: analysis as the orange shaded region, but now using the full $z\le
869: 0.10$ volume. The larger volume and smaller \wp\ errors tighten the
870: constraints on the HOD, but as noted in Z05 the presence of the Sloan
871: Great Wall makes it difficult to find an HOD model that accurately
872: fits the amplitude of the correlation function in the two-halo
873: regime. The supercluster boosts the large-scale power in the two-point
874: clustering, and dramatically alters the three-point clustering
875: (\citealt{nichol_etal:06, baugh_etal:04, gaztanaga_etal:05}). This
876: amplification of clustering creates larger voids in the galaxy
877: distribution, producing residuals with respect to the HOD predictions
878: that are significantly negative.
879: 
880: Figure \ref{vpf_hod}c presents the results for the $\mrlogh<-21$
881: sample. Although this sample includes the Sloan Great Wall, the volume
882: of this sample is large enough such that the inclusion of this
883: structure does not significantly alter the clustering
884: statistics. Recall that for this sample (and for the $\mrlogh<-22$
885: sample), we use the 1086 \hmpc\ box to calculate the HOD predictions
886: and estimate the observational errors. The residuals of the best-fit
887: model are $\dev\lesssim 1$ for $r< 10$ \hmpc, but at larger scales the
888: residuals gradually increase to the point where the residuals between
889: the best-fit model and data are $\sim 2\sigma_{\rm SDSS}$ are $r>12$
890: \hmpc. The $\xvpf$ for the best-fit model prediction is 27.1 for 19
891: data points. The range of $\xvpf$ values for the MCMC sample of models
892: is $\xvpf=22.1$ to $\xvpf=29.2$, indicating that $\p0$ adds some
893: complementary information to \wp\ for constraining the occupation
894: function, assuming that the HOD is environment independent. The value
895: of $\xvpf$ is negatively correlated with $\slogm$, but the correlation
896: is much weaker than in Figure \ref{vpf_hod}b, with a correlation
897: coefficient $r=-0.59$. For this model, a joint fit to both \wp\ and
898: $\p0$ would most likely find a solution with a combined
899: $\chi^2/\nu<1$. We will discuss this further in the following section.
900: 
901: Figure \ref{vpf_hod}d presents the results for $\mrlogh<-22$
902: galaxies. This sample has the largest volume, but galaxies above this
903: magnitude threshold are rare. Thus Poisson fluctuations contribute
904: substantially to the jackknife errors on \wp\ at smaller scales, and
905: the Z05 \wp\ for this sample has no pairs at $r_p<1$ \hmpc. The lack
906: of information on clustering in the one-halo regime decreases the
907: constraints that can be placed on the HOD. The best-fit HOD model is
908: in good agreement with the observations, with $\xvpf=29.0$ for 29 data
909: points. The range of $\xvpf$ for the MCMC models is large, extending
910: from 23.4 to 52.0. The shape of the central cutoff for these models
911: varies from $\slogm = 0.5$ to $\slogm=0.8$. The halos that contain
912: galaxies in this magnitude regime lie in the exponential cutoff of the
913: mass function, where the halo bias is a strong function of
914: mass. Models with higher values of $\slogm$ have on average lower
915: $\xvpf$ values with respect to the VPF, yielding $r=-0.74$.  Thus for
916: samples of objects with limited clustering information at small
917: scales, extra constraining power can be obtained through void
918: statistics.
919: 
920: Figures \ref{upf_hod}a--\ref{upf_hod}c present the UPF results for the
921: same four luminosity samples. In each figure the upper panel shows the
922: measured UPF for SDSS galaxies and the best-fit HOD prediction. As in
923: Figure \ref{vpf_hod}, the lower panels plot the residuals between data
924: and best-fit model, as well as the range in predictions from the 20
925: MCMC models. The comparison of this statistic to the HOD predictions
926: are similar to those of the VPF. In Figure \ref{upf_hod}a, the
927: $\xupf$ for the best-fit model model is 9.93 for 11 data points, with
928: a range of 7.17 to 11.7 for the MCMC models. In Figure \ref{upf_hod}b,
929: the best-fit HOD model is once again $\sim 2-\sigma$ below the
930: observations at $r\ge 4$ \hmpc, yielding $\xupf=69.8$ for 11 data
931: points. The range in $\xupf$ values is 10.7 to 93.9, with $\xupf$
932: correlating with the value of $\slogm$ as with the models in Figure
933: \ref{vpf_hod}b. In Figure \ref{upf_hod}c, the HOD predictions are in
934: better agreement with the data at large scales than the VPF results
935: from \ref{vpf_hod}c, yielding $\xupf=20.7$ for 19 data points. The
936: range of $\xupf$ values from the MCMC models is smaller than the VPF
937: results, with maximum and minimum $\chi^2$ values of 22.0 and 16.5,
938: respectively.
939: 
940: For $\mrlogh<-22$ galaxies, the UPF contains little new information
941: with respect to the VPF. The number density of this sample is low
942: enough that a single galaxy in a sphere is enough to make the local
943: density larger than the threshold of 0.2$\ngavg$ for all $r<27$
944: \hmpc. Therefore we compare predictions and measurements for $P_1(r)$,
945: the probability that a random sphere has exactly one galaxy within
946: it. For $r<16$ \hmpc\ this statistic probes overdense regions
947: ($\delta_g>0$). At large and small scales, the agreement between the
948: data and best-fit model are excellent. At intermediate scales,
949: $8<r<18$ \hmpc, the agreement is adequate but the range of predictions
950: is wide, resulting from the lack of tight constraints on the HOD from
951: \wp\ alone.
952: 
953: With the exception of the $\mrlogh<-20$ sample, void statistics do not
954: provide a significant amount of new information about HOD parameters,
955: but for each sample they do tighten the constraints on the shape of
956: the central galaxy cutoff parameter, $\slogm$, relative to \wp\ alone.
957: 
958: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
959: % Subsection
960: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
961: 
962: \subsection{Comparison to Density-Dependent Models}
963: 
964: \begin{figure*}
965: \centerline{\epsfxsize=5.5truein\epsffile{dd_models_19.ps}}
966: %\epsscale{1.0}
967: %\plotone{dd_models_19.ps}
968: %\vspace{-4.5cm}
969: \caption{ \label{dd_19} (a) Upper panel: Points with error bars are
970:   the SDSS measurements of $\p0$ for the $\mrlogh<-19$ sample. Lines
971:   are three density-dependent HOD models with $\fmin=4$ and $\dc=-0.6$
972:   (red line), $-0.4$ (green line), and $-0.2$ (blue line). Bottom
973:   panel: The residuals of the model predictions relative to the
974:   data. (b) Same as (a), but for $\pu$. (c) The $\xvpf$ of the model
975:   predictions as a function of $\dc$ for models with $\fmin=2$ (red
976:   line), and $\fmin=4$ (blue line). The shaded horizontal band is the
977:   range in $\xvpf$ from the twenty MCMC models, all using the standard
978:   HOD implementation. Red and blue lines represent $\fmin=2$ and
979:   $\fmin=4$, respectively. The points along each line indicate models
980:   that produce an assembly bias (in the correlation function) of 5\%
981:   and 10\%, from right to left.  (d) Same as (c), but for the UPF. }
982: \end{figure*}
983: 
984: \begin{figure*}
985: \centerline{\epsfxsize=5.5truein\epsffile{dd_models_21.ps}}
986: %\epsscale{1.0}
987: %\plotone{dd_models_21.ps}
988: %\vspace{-4.5cm}
989: \caption{ \label{dd_21} (a) Upper panel: Points with error bars
990:   represent the SDSS measurements of $\p0$ for $\mrlogh<-21$
991:   galaxies. Lines represent three different density-dependent HOD
992:   models, $\fmin=2$, $\dc=-0.2$ (red line), $\fmin=0.5$, $\dc=-0.2$
993:   (green line), and $\fmin=0.5$, $\dc=+0.2$ (blue line). Lower panel:
994:   The residuals of the model predictions relative to the data. The
995:   filled circles represent a standard HOD model with $\slogm=0.5$. (b)
996:   Same as (a), but for $\pu$. (c) The $\xvpf$ of the HOD predictions
997:   for $\p0$ as a function of $\dc$. Red and blue lines represent
998:   models with $\fmin=2$ and $\fmin=4$, respectively. Green and cyan
999:   lines represent models with $\fmin=0.75$ and $\fmin=0.5$,
1000:   respectively. The shaded horizontal line is the range in $\xvpf$
1001:   from the twenty MCMC models. The filled circle represents the
1002:   $\xvpf$ for the model with $\slogm=0.5$ with no density
1003:   dependence. Colors are the same as for the lines. Points along each
1004:   line indicate models that produce and assembly bias of 5\% and 10\%,
1005:   from right to left. For the $\fmin<1$ models, only the models with a
1006:   5\% assembly bias are indicated. (d) Same as (c), but for the UPF.
1007: }
1008: \end{figure*}
1009: 
1010: 
1011: 
1012: In Paper I we presented a simple model for density-dependent
1013: occupation functions that focuses on changes to the minimum mass scale
1014: for central galaxies. The parameters of this model are $\dc$, the
1015: threshold density below which the HOD changes, and $\fmin$, the factor
1016: by which $\mmin$ changes in these low density regions. We calculate
1017: the local density of each halo with a top-hat smoothing filter with
1018: radius 5 \hmpc. For example, if $\dc=-0.5$ and $\fmin=2$, halos in
1019: regions that are below 50\% of the cosmic mean density must be twice
1020: as massive (relative to halos in denser regions) in order to host a
1021: galaxies above the luminosity threshold. A value of $\fmin=\infty$
1022: corresponds to complete suppression of galaxies in regions below
1023: $\dc$. We can place these models in the context of assembly bias as
1024: defined by \cite{croton_etal:07} by calculating the ratio of the
1025: large-scale correlation function in the density-dependent model to its
1026: standard counterpart, i.e., $b_x = \sqrt{\xi/\xi_0}$.  We find that
1027: $b_x\gtrsim 5\%$ for $\dc\gtrsim -0.5$ and $\fmin>2$ for models of the
1028: $\mrlogh<-19$ sample. For the brighter sample, $b_x\gtrsim 5\%$ for
1029: models with $\dc\gtrsim -0.1$. We demonstrated in \S3.1 that the
1030: $\mrlogh<-19$ and $-21$ void statistics are already well-fit by the
1031: standard HOD. Thus adding two new parameters will not statistically
1032: improve the model. But we explore density dependent models in order to
1033: constrain the level of assembly bias for central galaxies: to what
1034: degree can $\fmin$ differ from unity (the standard HOD assumption) and
1035: still adequately fit the void statistics?
1036: 
1037: When creating density-dependent HOD models, we follow the procedure
1038: outlined in Paper I: the number density of a sample is held constant,
1039: so if $\mmin$ in low-density areas increases $(\fmin>1)$, the overall
1040: $\mmin$ must decrease (slightly) to compensate for the missing
1041: low-density galaxies.
1042: 
1043: Figure \ref{dd_19}a presents three models with $\fmin=4$ and
1044: $\dc=-0.6$, $-0.4$, and $-0.2$. Only $\sim 8\%$ of halos with mass
1045: $M=10^{11.5}$\hmsol\ reside in regions with $\delta<-0.6$ (see Paper
1046: I, Figure 7), but the effect on the void statistics can be seen in the
1047: lower panel of Figure \ref{dd_19}a, which shows the residuals of the
1048: predicted VPF to the data for this model. At $r>5$ \hmpc, $\dev = 1$,
1049: and the discrepancy monotonically increases with increasing $r$. For
1050: $\dc=-0.4$ and $-0.2$, the effect can be seen clearly in the upper
1051: panel, with residuals that are larger than the scale of the lower
1052: panel. Figure \ref{dd_19}c plots $\xvpf$ as a function of $\dc$ for
1053: $\fmin=2$ and $4$. The gray shaded regions shows the range of $\xvpf$
1054: values from the 20 MCMC models. For $\fmin=2$, there is no change to
1055: the VPF at $\dc=-0.9$, but as the threshold density increases, $\xvpf$
1056: rapidly increases, going from $\xvpf=10.7$ at $\dc=-0.8$ to
1057: $\xvpf=32.5$ at $\dc=-0.7$. As noted in Paper I, the effect of
1058: increasing $\dc$ `saturates' at $\dc\approx -0.4$, yielding a maximum
1059: $\xvpf$ of around 300. For $\fmin=4$, $\chi^2$ rapidly increases at
1060: $\dc\ge-0.7$ and saturates at a value of $\sim 1200$. The points along
1061: each $\chi^2$ curve indicate models that produce $b_x=1.05$ and
1062: $b_x=1.1$ (from right to left). Both points lie in the saturation
1063: regime, where the discrepancies with the data are largest. In other
1064: words, in this class of $( \fmin, \dc)$ models, one cannot alter the
1065: large scale bias factor by 5\% without drastically violating
1066: constraints from the VPF. A model with $(\fmin,\dc)=(2,-0.75)$ yields
1067: a $\Delta\chi^2$ of 10 with respect to the standard HOD prediction.
1068: 
1069: At low $\dc$, the overall fraction of galaxies that are ``moved'' from
1070: low-density regions to median- and high-density regions is too small
1071: to affect the overall two-point clustering of the sample. As this
1072: fraction becomes non-negligible, the amplitude of the two-halo term
1073: increases as the mean bias of the sample is altered. Statistically,
1074: however, void statistics are far more sensitive to these changes in
1075: the galaxy distribution. For $\fmin=2$, $\dc=-0.5$, the $\Delta \xwp$
1076: relative to the standard HOD is only 2, while $\Delta\xvpf=210$. Note
1077: also that the values of $\xwp$ are dependent on the value of
1078: $\sigma_8$ assumed in the model. We have adopted a value of
1079: $\sigma_8=0.9$ to match that of the simulation. A lower value of
1080: $\sigma_8$, consistent with new results from cosmic microwave
1081: background anisotropies (\citealt{spergel_etal:06}), could compensate
1082: for the increased amplitude of the two-halo term in \wp\ for
1083: high-$\dc$ models. For the void statistics, no such degeneracy with
1084: $\sigma_8$ exists; in Paper I we showed that models with
1085: $\sigma_8=0.9$ and $0.7$ yielded nearly identical void statistics,
1086: even though the lower value of $\sigma_8$ produced a poor fit to the
1087: observed \wp. Thus $\p0$ is both a more robust and more sensitive test
1088: to density dependence in the central galaxy occupation function.
1089: 
1090: Figures \ref{dd_19}b and \ref{dd_19}d compare the measured $\pu$ to
1091: the same density-dependent models in \ref{dd_19}a and \ref{dd_19}c. As
1092: $\dc$ and $\fmin$ increase, the underdense regions increase in size
1093: and frequency. The advantage of the UPF is that the effect of density
1094: dependence does not saturate at high values of $\dc$; rather, the UPF
1095: will continue to increase as the threshold density
1096: increases. Additionally, the UPF is less susceptible to shot noise,
1097: and the percentage UPF error bars are close to half that of the VPF
1098: errors. However, this statistic is somewhat less sensitive to density
1099: dependence than the VPF because it probes moderately higher
1100: densities. While the $\fmin=2$, $\dc=-0.5$ model yields $\xvpf$
1101: of 220, it yields $\xupf=25.1$. The UPF is still more sensitive to
1102: density dependence than \wp\ alone.
1103: 
1104: For brighter galaxies, the standard HOD prediction for $\p0$ for the
1105: $\mrlogh<-21$ sample in Figures \ref{vpf_hod}c and \ref{upf_hod}c
1106: yields voids that are somewhat large compared with the measured SDSS
1107: statistics. Density-dependent models with $\fmin>1$ only increase the
1108: sizes of voids and make this discrepancy more significant. Therefore,
1109: we present results for models with $\fmin>1$ and $\fmin<1$; in the
1110: latter models, halos in underdense regions host (on average) more
1111: luminous galaxies at fixed halo mass. Figure \ref{dd_21}a compares the
1112: VPFs for three different models to the SDSS data: $(\fmin,\dc) =
1113: (2,-0.2)$, $(0.5, -0.2)$ and $(0.5, +0.2)$. The model with $\fmin=2$
1114: is clearly discrepant and yields residuals larger than the scale of
1115: the lower panel for $r>10$ \hmpc. The two models with $\fmin = 0.5$
1116: appear more consistent with the data than the standard HOD model in
1117: Figure \ref{vpf_hod}c. The residuals are smaller at large scales, but
1118: these models tend to depress the frequency of small voids below what
1119: is measured in the SDSS. In Figure \ref{dd_21}c, we present $\xvpf$ as
1120: a function of $\dc$ for models with $\fmin=0.5, 0.75, 2$ and 4. The
1121: models with $\fmin>1$ produce monotonically increasing $\xvpf$ with
1122: increasing $\dc$, and are always worse fits to the data then the
1123: standard HOD. Models that produce $b_x\ge 1.05$ yield
1124: $\xvpf>100$. Negatively biased models with $\fmin<1$ produce a minimum
1125: at $(\fmin,\dc)=(0.75,-0.2)$, yielding $\xvpf = 11.2$. Figures
1126: \ref{dd_21}b and \ref{dd_21}d present the same results for the UPF. As
1127: with the VPF, models with $\fmin<1$ are in better agreement with
1128: $\pu$, producing a minimum of $\xupf=11.8$ at
1129: $(\fmin,\dc)=(0.75,-0.2)$, as compared with the minimum $\xupf$ of
1130: $16.5$ from the MCMC models. For this model, $b_x-1 = -0.02$. Models
1131: with $b_x-1$ of $-0.05$, indicated with the filled circles in Figures
1132: \ref{dd_21}c and \ref{dd_21}d, do not produce improved fits to the VPF
1133: of UPF.
1134: 
1135: As with the fainter samples, altering $\ncen$ in low density regions
1136: can alter the two-point clustering to some extent. The models with
1137: $\fmin>1$ produce better fits to \wp, resulting from the increased
1138: amplitude of large-scale clustering. In the comparison of the best-fit
1139: HOD model to the SDSS \wp\ data, it can be seen that the model is
1140: slightly below the measured amplitude in the two-halo regime. Thus
1141: redistributing galaxies from low to high density areas produces better
1142: agreement with the data. The models with $\fmin<1$ have the opposite
1143: effect on the two-point clustering; these models lower the bias and
1144: increase the discrepancy with the \wp\ data. If we relax our
1145: constraints on the standard HOD models by setting $\slogm=0.5$, the
1146: same result is achieved. This model yields $\xwp=10.4$, a value
1147: similar to that of the best density-dependent model ($\xwp=11.4$). The
1148: high $\xwp$ is a result of the lower overall bias of the sample; the
1149: lower bias in turn produces smaller voids and yields $\p0$ and $\pu$
1150: that are as accurate as the best density-dependent model. The
1151: residuals of the $\slogm=0.5$ are shown with the black dots in Figures
1152: \ref{dd_21}a and \ref{dd_21}c, and the $\chi^2$ values for the void
1153: statistics are shown in \ref{dd_21}d and \ref{dd_21}d. Combining
1154: results for all data for this model, $\xwp+\xvpf+\xupf=29.4$ for 49
1155: data points and 4 free parameters. This summation neglects the
1156: covariance between statistics, but it implies that a joint analysis of
1157: all data would easily find a set of HOD parameters that accurately
1158: fits both \wp\ and the void statistics. Thus no strong evidence for
1159: $\fmin<1$ density dependence can be inferred.
1160: 
1161: 
1162: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1163: % Section
1164: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1165: 
1166: 
1167: \begin{figure*}
1168: \centerline{\epsfxsize=5.5truein\epsffile{color_samples.ps}}
1169: %\epsscale{1.0}
1170: %\plotone{color_samples.ps}
1171: %\vspace{-4.5cm}
1172: \caption{ \label{color_samples} (a) Open squares with error bars show
1173:   the measured \wp\ for $-20<\mrlogh<-19$ galaxies. Red and blue
1174:   points represent red galaxies and blue galaxies. The lines represent
1175:   the best-fit HOD model to the data, with the same
1176:   color-coordination. (b) The best-fit occupation functions for the
1177:   model in panel (a). Red and blue lines plot $\navg$ for red and blue
1178:   galaxies, respectively. The HOD for the full sample is the sum of
1179:   these two curves. (c) Open squares with error bars show the VPFs for
1180:   red and blue galaxies. Lines plot the HOD prediction for the VPF
1181:   from the best-fit $\navg$ in panel (b). (d) The reduced VPF for blue
1182:   and red galaxies, plotted with blue and red squares,
1183:   respectively. The solid line represents the negative binomial model,
1184:   which provides a good fit to the blue galaxy VPF, but the HOD
1185:   prediction (dotted curve) is much more accurate for the red
1186:   galaxies. }
1187: \end{figure*}
1188: 
1189: \section{Color-Defined SDSS Samples}
1190: 
1191: \subsection{Results for the Standard HOD}
1192: 
1193: To model the occupation function of color samples, the standard HOD
1194: parameterization presented in \S 2 is used to describe the overall
1195: sample, but $\ncen$ and $\nsat$ are multiplied by a coefficient that
1196: specifies the blue fraction $f_b^{\rm cen}$ and $f_b^{\rm sat}$,
1197: respectively. The fraction of blue central galaxies is parameterized
1198: with a lognormal function of the form
1199: 
1200: \begin{equation}
1201: \label{e.fblue}
1202: f_b^{\rm cen}(M) = f_0^{\rm cen} \exp\left[ \frac{ - \left(\log_{10}M - \log_{10}\mmin \right)^2}{2(\sigma_b^{\rm cen})^2}\right],
1203: \end{equation}
1204: 
1205: \noindent (cf.\ Z05, equation 11). Equation (\ref{e.fblue}) is the
1206: same for satellite galaxies, with separate parameters for $f_0^{\rm
1207:   sat}$ and $\sigma_b^{\rm sat}$. This adds four free parameters to
1208: the HOD model, but in practice one of the new parameters is fixed by
1209: the overall blue fraction of galaxies (we choose $f_0^{\rm cen}$). We
1210: fit \wp\ for the full sample, red sample, and blue sample
1211: simultaneously. These samples will be correlated, but we only use the
1212: covariance matrices of each sample independently. Using the full sample
1213: adds some complementary information to the color-only \wp\ functions
1214: because it contains the cross-correlation of the red and blue
1215: galaxies.
1216: 
1217: To measure the color-dependent VPF from DR4, we adopt the same color
1218: cut as Z05, $g-r = -0.03(\mrlogh) + 0.21$. The fraction of blue
1219: galaxies varies significantly with luminosity. Therefore we use
1220: galaxies with magnitudes $-19 < \mrlogh < -20$, rather than a sample
1221: defined by a luminosity threshold, to ensure that the red and blue
1222: samples have similar mean magnitudes. We choose this sample because we
1223: wish to analyze the lowest luminosity sample for which accurate
1224: measurements can be made. \cite{croton_etal:07} find that the assembly
1225: bias of red galaxies monotonically increases with decreasing
1226: luminosity. The use of a magnitude bin sample necessitates an upper
1227: cutoff mass for the central occupation function, representing the mass
1228: at which halos begin hosting central galaxies too bright to be
1229: contained within the sample. For simplicity, we adopt a step function
1230: cutoff at an upper mass limit of $10^{12}$ \hmsol, obtained from
1231: fitting the $\mrlogh<-20$ sample with a step-function $\ncen$ (i.e.,
1232: $\slogm=0$). Like Z05, we also set $\slogm=0$ for the magnitude bin
1233: sample, effectively making the central occupation function a
1234: square-window. Although we are fitting the same data presented in Z05
1235: (see their Figure 23 and Table 3), we use a different linear power
1236: spectrum, and \wp\ must be re-fit. We use $\chi^2$ minimization to
1237: once again determine the best-fit model, the parameters of which are
1238: listed in Table 2.\footnote{ Note that the satellite occupation
1239:   functions in Table 3 of Z05 assume a luminosity threshold sample. To
1240:   obtain $\nsat$ for each magnitude bin, $\nsat$ for the next-brighter
1241:   bin was subtracted off. The parameters of $\nsat$ in this paper are
1242:   for the magnitude bin only and do not require knowledge of $\nsat$
1243:   of brighter galaxies.} The large values of $\sigma_b^{\rm cen}$ and
1244: $\sigma_b^{\rm sat}$ in Table 2 essentially mean that the blue galaxy
1245: fraction is a constant as a function of mass.
1246: 
1247: Figure \ref{color_samples}a shows the results of the HOD modeling of
1248: the color-dependent clustering. The open squares plot the data from
1249: Z05 while the solid lines plot the best-fit HOD models. Blue and red
1250: colors represent blue galaxies, red galaxies, and the full sample,
1251: respectively. The $\xwp$ for the full set of 33 data points is 11.7
1252: (recall however that we have not taken into account the covariance
1253: between samples). The amplitude of clustering increases at all scales
1254: when comparing blue and red galaxies. The known correlation between
1255: galaxy color and environment states that red galaxies exist in more
1256: dense environments, implying that they occupy higher-mass halos that
1257: are strongly biased. Blue galaxies generally live in the field,
1258: implying that they are the central galaxies of lower-mass halos that
1259: are less strongly clustered. The best-fit occupation functions, shown
1260: in Figure \ref{color_samples}b, bears this out. Blue galaxies dominate
1261: the central occupation function, while satellite galaxies are
1262: primarily red galaxies. These results are consistent with those in
1263: Z05.
1264: 
1265: Points in Figure \ref{color_samples}c show the measured VPFs for red
1266: and blue galaxies. The VPF for red galaxies is significantly higher
1267: than for blue galaxies, nearly 0.5 dex at $r=11$ \hmpc. While the
1268: number density of the red sample is below that of the blue sample,
1269: diluting the blue sample randomly to match the red number density only
1270: increases the VPF at $r=11$ \hmpc\ by 0.04 dex; the larger voids in
1271: red galaxies are a consequence of their stronger clustering. Curves
1272: show the VPF predictions of the HOD model from Figure
1273: \ref{color_samples}b, in which a $\sim 10^{11.5}$ \hmsol\ halo has a
1274: $\sim 30\%$ chance of hosting a red galaxy {\it independent of its
1275:   large scale environment}. The agreement with the measured VPFs is
1276: strikingly good, with $\xvpf=9.89$ for red galaxies and $\xvpf=5.77$
1277: for blue galaxies (with 10 data points in each case). We don't perform
1278: the MCMC analysis for this sample, but we expect the results to be
1279: similar to the luminosity-defined $\mrlogh<-19$ sample in Figure
1280: \ref{vpf_hod}a.
1281: 
1282: Figure \ref{color_samples}d presents the data in the form of the
1283: reduced VPF (RVPF), in which the quantity $\chi = -\ln(P_0)/\bar{N}$
1284: is plotted as a function of $\bar{N}\xibar$, where $\bar{N}$ is the mean
1285: number of galaxies in a sphere of radius $r$ and $\bar{\xi}$ is the
1286: volume-averaged two-point correlation function (in
1287: redshift space). The quantity $\xibar$ is related to the variance of
1288: the distribution of cell counts, yielding
1289: 
1290: \begin{equation}
1291: \label{e.xibar}
1292: \bar{\xi} \equiv \frac{3}{r^3}\int_0^r \xi(s)s^2ds = 
1293: \frac{\langle (N - \nbar)^2\rangle - \nbar}{\nbar^2}.
1294: \end{equation}
1295: 
1296: \noindent Under the hierarchical clustering ansatz (see, e.g.,
1297: \citealt{bernardeau_etal:02})), all higher-order $n$-point correlation
1298: functions can be written in terms of powers of the two-point
1299: correlation function and a scaling coefficient. Many different
1300: theoretical models have been proposed for the scaling coefficients
1301: (see \citealt{fry_etal:89}). \cite{croton_etal:04} and
1302: \cite{conroy_etal:05} both demonstrated that luminosity-defined
1303: samples of galaxies exhibit the void statistics predicted by a
1304: negative binomial model, in which the VPF is related to $\xibar$ and
1305: $\bar{N}$ by
1306: 
1307: \begin{equation}
1308: \label{e.negbin}
1309: P_0(r) = (1+\nbar\xibar)^{-1/\xibar}.
1310: \end{equation}
1311: 
1312: 
1313: 
1314: \noindent This result led \cite{conroy_etal:05} to conclude that the
1315: VPF contains no complementary information over the two-point
1316: correlation function for constraining galaxy bias or halo
1317: occupation.\footnote{ It should be noted that \cite{conroy_etal:05}
1318:   use $\xibar$ in redshift space; in essence they utilize more
1319:   information than contained in \wp\ alone. When analyzing the
1320:   clustering in mock galaxy samples, those authors found that the
1321:   negative binomial is not a good description of real-space
1322:   clustering.} The RVPF for blue galaxies in Figure
1323: \ref{color_samples}d is consistent with the negative binomial model,
1324: but for red galaxies the negative binomial is not a good description
1325: of the data, in agreement with the recent results from the 2dFGRS of
1326: \cite{croton_etal:06c}. The HOD model, shown with the red dotted line,
1327: correctly predicts the behavior of the RVPF for this sample. In tests
1328: we find that occupation functions that produce correlation functions
1329: with large residuals from a power law tend to lie away from the
1330: negative binomial model in RVPF space. The high fraction of satellite
1331: galaxies in the red occupation function produces the strong transition
1332: from the one-halo to two-halo regime exhibited by red galaxies in
1333: Figure \ref{color_samples}a, leading to the behavior seen in
1334: \ref{color_samples}d. The correlation function for the blue sample is
1335: very close to a power law and thus is well-described by the negative
1336: binomial. This trend works in the opposite direction as well; HODs
1337: that de-emphasize high mass halos, such as those with a lower value of
1338: $\asat$, lie {\it below} the negative binomial curve, indicating that
1339: the negative binomial is not universal, but depends on the details of
1340: halos occupied by a given class of galaxies.
1341: 
1342: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1343: %  Subsection
1344: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1345: 
1346: \subsection{Comparison to Density-Dependent Models}
1347: 
1348: The assembly bias seen in the \cite{croton_etal:07} semi-analytical
1349: models is strongest for faint red galaxies, implying that low-mass
1350: halos that host red galaxies at their centers almost exclusively
1351: reside near a much larger halo, while in low-density environments the
1352: probability of encountering a red central galaxy is rare. To model
1353: this form of assembly bias in our HOD models, we adopt a
1354: parameterization similar to that for the luminosity-defined samples:
1355: at a density below a threshold $\dc$, the fraction of central galaxies
1356: that are red changes by a factor $\fred$.  
1357: 
1358: Figure \ref{dd_colors} shows the results for models in which
1359: $\fred=0$, implying that there are no red galaxies below $\dc$. Figure
1360: \ref{dd_colors}a compares the \wp\ data for the red sample to
1361: density-dependent models with $\dc=-0.4$, $-0.2$, and $0.0$. As with
1362: the luminosity-defined samples, the amplitude of the two-halo term in
1363: the HOD models increases as red galaxies are removed from low-density
1364: areas and redistributed in mean- and high-density environments. The
1365: model with $\dc=-0.4$, while in reasonable agreement with the
1366: \wp\ data, is clearly discrepant with the VPF, yielding $\xvpf=93.2$
1367: for 10 data points. Less extreme models, $\fred=1/8$ and $1/4$, still
1368: produce VPF $\xvpf$ values of $60.7$ and $29.4$, respectively, at
1369: $\dc=-0.4$. As $\dc$ increases, the discrepancy with both the VPF and
1370: \wp\ data get substantially larger. \cite{baldry_etal:06} have
1371: investigated the environmental dependence of the halo occupation for
1372: central red galaxies in the \cite{croton_etal:07} model. Although
1373: their definition of environment is based on nearest-neighbor
1374: statistics, they find that the red fraction in the model decreases by
1375: nearly a factor of ten around the mean density. In
1376: \cite{croton_etal:07}, the assembly bias for red $\mrlogh = -19$
1377: galaxies increases the large-scale bias of red central galaxies by a
1378: factor of 2. For the overall population of red galaxies at this
1379: magnitude, the assembly bias is $\sim 1.25$, comparable to the
1380: increase in \wp\ in the $\fred=1/8$, $\dc=-0.2$ model, which yields
1381: $\xvpf=138$. A more direct comparison is required to make precise
1382: statements about the form of the assembly bias in Croton et al, but
1383: the results in Figure \ref{dd_colors} only allow for low levels of
1384: assembly bias for faint color-defined samples.
1385: 
1386: \begin{figure*}
1387: \centerline{\epsfxsize=5.5truein\epsffile{dd_colors.eps}}
1388: %\epsscale{1.0}
1389: %\plotone{dd_colors.ps}
1390: %\vspace{-4.5cm}
1391: \caption{ \label{dd_colors} Panel (a): Data and models for \wp\ for
1392:   red galaxies in the $-20<\mrlogh<-19$ sample. Points with error bars
1393:   are SDSS data. Lines are models in which the central occupation
1394:   function is set to zero for halos with local densities below $-0.4$
1395:   (black solid line), $-0.2$ (red solid line), and $0.0$ (dashed
1396:   line). Panel (b): VPFs predicted by those same models. Points with
1397:   error bars are the SDSS data from Figure \ref{color_samples}. In
1398:   both panels, the red line is a model with similar assembly bias as
1399:   that found in \cite{croton_etal:07} for the same luminosity range. }
1400: \end{figure*}
1401: 
1402: 
1403: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1404: %  2df Section
1405: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1406: 
1407: \section{2dFGRS Results}
1408: 
1409: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1410: % SECTION
1411: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1412: 
1413: \subsection{2dFGRS \wp\ Data and Modeling}
1414: 
1415: The approach we take to apply the HOD to clustering measurements from
1416: the 2dFGRS differs slightly from that used above. 2dFGRS measurements
1417: are made on luminosity bins rather than threshold samples. This
1418: necessitates a modified form of the central occupation function and a
1419: somewhat different approach to model fitting. We use the approach
1420: detailed in \cite{tinker_etal:07} for modeling these data. We use
1421: measurements of \wp\ that have been updated from those presented in
1422: \cite{norberg_etal:01} and \cite{norberg_etal:02} to include the full
1423: data release of the 2dFGRS (\citealt{colless_etal:03}), an increase
1424: from $\sim 160,000$ galaxies to $\sim 221,000$ galaxies. The details
1425: of the clustering measurements will be found in Norberg et al.~(in
1426: preparation). We present here a brief summary of the
1427: calculations. Using the full 2dFGRS survey we create four volume
1428: limited samples, with faint limits from $\mbjlogh = -18.0$ to
1429: $\mbjlogh = -21.0$, each sample 1.0 magnitudes wide. All galaxies
1430: brighter than $\mbj = -21$ are grouped into a single sample. As in
1431: \cite{norberg_etal:01, norberg_etal:02}, a careful account of the
1432: selection function is made and the correlation functions are obtained
1433: using the standard \cite{landy_szalay:93} and \cite{hamilton:93}
1434: estimators, with typically 100 times more randoms than galaxies. The
1435: projected correlation function is estimated by integrating
1436: $\xi(r_p,r_\pi)$ out to $r_{\pi,{\rm max}} = 70$~\hmpc, providing a
1437: stable estimate for \wp\ out to at least $r_p = 40$~\hmpc. Due to the
1438: sensitivity of the results on close pair incompleteness, we only use
1439: data from scales beyond $r_p\gtrsim 150$~\hkpc. The correlation
1440: function is measured in twelve radial bins, spaced evenly by 0.2 in
1441: $\log_{10}r$ beginning at $\log_{10}r = -0.7$. The errors on the
1442: clustering measurements are estimated by a bootstrap resampling
1443: technique on roughly equal sized subregions, of which there are 16 in
1444: total (8 in each 2dFGRS region, covering in each region
1445: approximatively the same survey area; see
1446: \citealt{porciani_norberg:06} for further details). We estimate the
1447: full covariance matrix for each sample using 100 bootstrap
1448: resamplings. The analysis in \cite{tinker_etal:07} was performed on
1449: bins of width 0.5 magnitudes. The bins used here are a full magnitude
1450: wide, so we redo the analysis on these new data.
1451: 
1452: In contrast to HOD models of luminosity threshold samples, binned
1453: samples require both a minimum and a maximum mass scale for central
1454: galaxies; as halo mass increases, central galaxies become
1455: brighter. The central occupation function for these samples is denoted
1456: $\nceni$, where $i$ denotes magnitude bin. The sum over all $\nceni$
1457: must be less than or equal to unity. Thus we use a modified form of
1458: equation (\ref{e.ncen_sdss}) that subtracts off brighter galaxies, i.e.,
1459: 
1460: \begin{eqnarray}
1461: \label{e.ncen2}
1462: \nceni & = & \frac{1}{2}\left[ 1+\mbox{erf}\left(\frac{\log M - \log \mmini}{\sigmaMi} \right)
1463: 	\right] - \ncenip, \,\,\, 1\le i\le 3, \nonumber \\ 
1464: \nceni & = & \frac{1}{2}\left[ 1+\mbox{erf}\left(\frac{\log M - \log \mmini}{\sigmaMi} \right)
1465: 	\right], \,\,\, i=4,
1466: \end{eqnarray}
1467: 
1468: \noindent where $\mmini$ is the cutoff mass scale for central
1469: galaxies, and $\sigmaMi$ controls the width of the cutoff mass
1470: range. In equation (\ref{e.ncen_sdss}), $\mmin$ is defined as the mass
1471: at which $\ncen=0.5$, but in equation (\ref{e.ncen2}) this mass can
1472: differ from $\mmini$. The form we use for the satellite galaxy
1473: occupation function is
1474: 
1475: \begin{equation}
1476: \label{e.nsat2}
1477: \nsati = \exp \left(-\frac{\mcuti}{M-\mmini}\right) \left( \frac{M}{\msat^i}\right)^{\asat}.
1478: \end{equation}
1479: 
1480: Because information about all the bins is required to calculate
1481: $\navgi$ for any bin $i<4$, the best-fit occupation functions are
1482: determined simultaneously for all four bins. The model has 13 free
1483: parameters, with each $\mmini$ once again constrained by the number
1484: density of galaxies within each bin, calculated from the 2dFGRS
1485: luminosity function (\cite{norberg_lumfunc:02}, updated to include the
1486: results from the full data release). For 48 data points, the best-fit
1487: $\xwp$ is 30.3, yielding a $\chi^2$ per degree of freedom of 0.87. The
1488: parameters of the best-fit model are listed in Table 3. We make our
1489: predictions for the VPF by populating the 400 \hmpc\ box in the same
1490: manner as for the SDSS samples.
1491: 
1492: 
1493: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1494: % SDSS HOD parameters
1495: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1496: 
1497: \begin{deluxetable*}{ccccccc}
1498: \tablecolumns{7} 
1499: \tablewidth{32pc} 
1500: \tablecaption{Best-fit HOD model parameters for SDSS galaxies}
1501: \tablehead{\colhead{Sample} & \colhead{$\chi^2/\nu$} & \colhead{$\mmin$} & \colhead{$\msat$} & \colhead{$\asat$} &
1502: \colhead{$\mcut$} & \colhead{$\slogm$} }
1503: 
1504: \startdata
1505: 
1506: -19 & 4.89/7 & $3.76\times 10^{11}$ & $9.23\times 10^{12}$ & 1.11 & $4.23\times 10^{9}$ & 0.158 \\
1507: -20 & 4.77/7 & $2.69\times 10^{12}$ & $2.46\times 10^{13}$ & 1.13 & $2.12\times 10^{10}$ & 0.915 \\
1508: \,-20$^\prime$ & 8.63/7 & $9.37\times 10^{11}$ & $1.39\times 10^{13}$ & 1.01 & $9.54\times 10^{12}$ & 0.084 \\
1509: -21 & 7.48/7 & $4.89\times 10^{12}$ & $1.05\times 10^{14}$ & 1.23 & $3.58\times 10^{12}$ & 0.052 \\
1510: -22 & 0.87/3 & $1.17\times 10^{14}$ & $4.21\times 10^{14}$ & 1.20 & $2.40\times 10^{14}$ & 0.615  \\
1511: %\hline
1512: $[$-$19,$-$20]$ & 11.7/28 & $3.91\times 10^{11}$ & $1.32\times 10^{13}$ & 1.06 & --- & --- \\
1513: \hline
1514:  & & $\sigma_b^{\rm cen}$ & $f_b^{\rm cen}$ & $\sigma_b^{\rm sat}$ & $f_b^{\rm sat}$ & \\
1515: $[$-$19,$-$20]$&  & 7.97 & 0.68 & 9.46 & 0.33 & 
1516: 
1517: \enddata      
1518: 
1519: \tablecomments{\colhead{Sample} } \tablecomments{All masses are in units
1520:   of \hmsol. The bottom two rows are parameters for modeling
1521:   color-selected samples. See \S 4 for a discussion.}
1522: \end{deluxetable*}
1523: 
1524: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1525: % 2dF HOD parameters
1526: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1527: 
1528: \begin{deluxetable*}{cccccc}
1529: \tablecolumns{6} 
1530: \tablewidth{30pc} 
1531: \tablecaption{Best-fit HOD model parameters for 2dFGRS galaxies}
1532: \tablehead{\colhead{Sample} & \colhead{$\chi^2$} & \colhead{$\mmini$} & \colhead{$M_{\rm sat}^i$} & 
1533: \colhead{$\mcuti$} & \colhead{$\slogmi$} }
1534: 
1535: \startdata
1536: 
1537: $[-18.0,-19.0]$ & 5.7 & $2.79\times 10^{11}$ & $9.19\times 10^{12}$ & $5.07\times 10^{11}$ & 0.25 \\
1538: $[-19.0,-20.0]$ & 10.0 & $6.14\times 10^{11}$ & $1.50\times 10^{13}$ & $1.55\times 10^{12}$ & 0.07 \\
1539: $[-20.0,-21.0]$ & 8.8 & $3.15\times 10^{12}$ & $4.23\times 10^{13}$ & $1.35\times 10^{13}$ & 0.23 \\
1540: $<-21.0$ & 5.8 & $5.21\times 10^{13}$ & $3.55\times 10^{14}$ & $1.27\times 10^{14}$ & 0.53 
1541: 
1542: \enddata \tablecomments{ All masses are in units of \hmsol. All
1543:   samples are analyzed simultaneously, so $\chi^2/\nu=30.3/(48-13)=0.87$. }
1544: \end{deluxetable*}
1545: 
1546: 
1547: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1548: % SUBSECTION
1549: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1550: 
1551: 
1552: \subsection{VPF Measurements and HOD Predictions}
1553: 
1554: Measurements of the VPF for the 2dFGRS have been presented by
1555: \cite{hoyle_vogeley:04}, \cite{croton_etal:04}, and
1556: \cite{patiri_etal:06a}. For the purposes of this study, none of these
1557: measurements is entirely adequate. \cite{hoyle_vogeley:04} use the
1558: $k+e$ correction of \cite{norberg_etal:01} in their analysis, which is
1559: significantly different than the latest $k+e$ correction for 2dFGRS
1560: galaxies presented in \cite{cole_etal:05} used in the
1561: \wp\ measurements described above. This leads to a difference in the
1562: number densities of galaxies between the \wp\ samples and the $\p0$
1563: samples. This difference becomes larger with the mean redshift of the
1564: sample, and the $\mbjh<-21$ sample in \cite{hoyle_vogeley:04} has more
1565: than twice as many galaxies in it as the sample analyzed here. Note
1566: that the effect of this mismatch is quite different from the
1567: difference in number densities in the SDSS samples and the HOD models
1568: in \S 3. That discrepancy is due to the survey selection function, but
1569: \cite{hoyle_vogeley:04} essentially analyze different sets of
1570: galaxies, which have different clustering and void
1571: statistics. Thus one would not expect our HOD predictions to match
1572: their $\p0$ measurements, even if we adjusted our models to match
1573: their $\ngavg$. For the \cite{croton_etal:04} data, the completeness
1574: correction applied to them accrues an unquantified systematic error
1575: that is difficult to model. Their measurements also employ an outdated
1576: $k+e$ correction from \cite{norberg_lumfunc:02}, although the
1577: differences between this correction and the \cite{cole_etal:05}
1578: correction are substantially smaller. \cite{patiri_etal:06a} construct
1579: volume-limited samples within the 2dFGRS with magnitude thresholds of
1580: $\mbjh<-19.32$ and $\mbjh<-20.181$, values that do not correspond to
1581: the unit magnitude bins of our \wp\ measurements. Thus for comparison
1582: with our HOD predictions we repeat the analysis of
1583: \cite{patiri_etal:06a}, making several adjustments to better facilitate
1584: the comparison. We use the \cite{cole_etal:05} $k+e$ correction, and
1585: all galaxies are corrected to $z=0.1$. We construct volume-limited
1586: samples that match our \wp\ samples, and we keep track of the number
1587: density of galaxies at each $r$ in order to repeat the procedure used
1588: above for comparing to SDSS data.
1589: 
1590: We create HOD predictions by populating the 400 \hmpc\ simulation with
1591: the best-fit HOD parameters for each magnitude bin and scaling the
1592: number density at each $r$ to the value measured, as with the SDSS
1593: data. Error bars on the data are also obtained from this
1594: simulation. The mean incompleteness of the 2dFGRS is larger than in
1595: the SDSS, and the variation of $\ngbar$ is also larger. We are unable
1596: to use the 1086 \hmpc\ box for modeling the brighter two samples
1597: because the occupation functions extend below the resolution limit of
1598: that simulation. For these reasons, we do not perform a detailed
1599: statistical analysis as with the SDSS samples, but rather compare the
1600: data and models more qualitatively.
1601: 
1602: Figure \ref{2df_vpf}a and \ref{2df_vpf}b show the results for the
1603: $-19<\mbjlogh<-18$ and $-20<\mbjlogh-19$ magnitude bins. The best-fit
1604: HOD model accurately predicts the the VPF for these two
1605: samples. Figure \ref{2df_vpf}c shows the results for
1606: $-21<\mbjlogh-20$. The model slightly over-predicts $\p0$ for $r\ge 10$
1607: \hmpc\ in much the same way as the $\mrlogh<-21$ SDSS sample but with
1608: smaller significance (with respect to diagonal error bars only). For
1609: the brightest 2dFGRS galaxies in Figure \ref{2df_vpf}d, our model is a
1610: poor fit to the observed VPF. The voids in the data are clearly much
1611: smaller than those predicted by the HOD. As a rough guide, the
1612: diagonal-only $\chi^2_{\rm diag}=338$ for the best-fit \wp\ model (not
1613: shown in this Figure). These rare galaxies reside in rare, highly
1614: biased halos and the model predictions are more sensitive to the value
1615: of $\slogm$ than for other samples. To explore this effect, we analyze
1616: this sample separately in an MCMC chain. The upper and lower bounds of
1617: the shaded region are models from the chain with the lowest and
1618: highest values, respectively, of $\slogm$ with $\Delta\xwp<1$ with
1619: respect to the best-fit model. The lower bound, with $\slogm=0.9$,
1620: lies closer to the data but is still significantly
1621: discrepant. Although it is possible to construct density dependent
1622: models to match the measured $\p0$, these models will be highly
1623: discrepant with the measurements of \wp\ since they require an
1624: increase in the galaxy formation efficiency in lower density
1625: regions. \cite{berlind_etal:06} investigated the clustering of massive
1626: galaxy groups, demonstrating that at fixed mass, systems with bluer
1627: central galaxies are more strongly clustered than redder central
1628: galaxies. Because 2dFGRS is a blue-selected survey, the effect of
1629: density dependence would make the voids in the brightest 2dFGRS
1630: galaxies larger than in the standard HOD, the opposite of the
1631: discrepancy in Figure \ref{2df_vpf}d.
1632: 
1633: The conflict in \ref{2df_vpf}d can be resolved if the brightest 2dFGRS
1634: galaxies are not always in the most massive halos. The solid line in
1635: Figure \ref{2df_vpf}d represents a model for this sample in which the
1636: maximum value of $\ncen$ for this model is 0.5 rather than
1637: unity. Equation (\ref{e.ncen2}) is modified by a simple multiplicative
1638: factor of 0.5, preserving the shape of the cutoff. Physically this
1639: model implies that the relationship between host halo mass and central
1640: galaxy luminosity $L_c$ becomes essentially flat at $M\gtrsim 10^{14}$
1641: \hmsol\ in $b_J$. Setting the maximum value of $\ncen$ to 0.5 reduces
1642: $\mmin$ for this sample in order to match the number density and the
1643: overall bias of the model is reduced. This model is conceptually
1644: similar to one with a very large value of $\slogm$, large enough such
1645: that $\ncen$ never reaches unity at the largest resolved halos. But
1646: due to the functional form of equation (\ref{e.ncen2}) values of
1647: $\slogm$ large enough to resolve the discrepancy with the $\p0$ data
1648: place a non-negligible fraction of the brightest galaxies in $\sim
1649: 10^8$ \hmsol\ halos, which is both physically unreasonable and
1650: significantly lowers the amplitude of \wp. The large-scale amplitude
1651: of the low-$\ncen$ model is also below that of the fiducial model, but
1652: the increase in $\xwp$ is a modest $\sim 2$. The effect on the void
1653: statistics is marked: the low-$\ncen$ model VPF is in good agreement
1654: with the observations. The lower-bound of the shaded region in Figure
1655: \ref{2df_vpf}d (the model with $\slogm=0.9$) yields $\chi^2_{\rm
1656:   diag}=91.3$, while the low-$\ncen$ model yields $\chi^2_{\rm
1657:   diag}=35.3$. To properly compare these models, larger simulations
1658: with the proper mass resolution are required to estimate the
1659: covariance matrices, but it is clear that the low-$\ncen$ model is an
1660: improvement. The clustering of the galaxies in the next-brightest bin
1661: are unaffected by this change in the HOD; although fainter galaxies
1662: can occupy the highest mass halos, the overall number of these
1663: galaxies is insignificant, and both \wp\ and $\p0$ are unchanged.
1664: 
1665: For galaxy groups in the 2dFGRS, \cite{yang_etal:05} find that $L_c$
1666: increases with halo mass as $M^{2/3}$ at low masses, but becomes
1667: shallower for $M>10^{13}$\hmsol, increasing as $M^{1/4}$. At
1668: $M>10^{14}$\hmsol, the mass at which $\ncen$ reaches its maximum, the
1669: scatter in the $L_c-M$ relation becomes large, covering nearly half a
1670: dex in $L_c$. For redder bands like Sloan-$r$, a continual monotonic
1671: relation between $L_c$ and $M$ is well motivated, but just due to the
1672: width of the color-magnitude relation some galaxies from a lower $M_r$
1673: magnitude bin will fall in the brightest $b_J$ bin. As
1674: \cite{cole_etal:06} recently pointed out, the color distributions of the
1675: SDSS and 2dFGRS are substantially different, with the SDSS being
1676: dominated by red galaxies and the 2dFGRS dominated by blue
1677: objects. Convolved with the magnitude errors of the 2dFGRS, which are
1678: larger than those in the SDSS (and will scatter asymmetrically from
1679: lower luminosities to higher luminosities), a complete sample of the
1680: brightest $M_r$ galaxies in one survey may not contain all of the
1681: brightest $b_J$ galaxies in the other.
1682: 
1683: \begin{figure*}
1684: \centerline{\epsfxsize=5.5truein\epsffile{2df_vpf.ps}}
1685: %\epsscale{1.0}
1686: %\plotone{2df_vpf.ps}
1687: %\vspace{-4.5cm}
1688: \caption{ \label{2df_vpf} Panels (a)--(c) show a comparison between
1689:   2dFGRS VPF data and HOD model predictions. Points with error bars
1690:   observational measurements. Solid lines are the HOD predictions from
1691:   the best-fit model, obtained from the 400 \hmpc\ box. In panel (d),
1692:   the points show the 2dFGRS data, while the shaded region shows the
1693:   range of predictions for models with the highest and lowest values
1694:   of $\slogm$ that produce a $\Delta\xwp<1$ with respect to the
1695:   best-fit model, which has a value of $\slogm=0.53$. The lower bound
1696:   represents a model with $\slogm=0.9$, and the upper bound represents
1697:   a model with $\slogm=0.05$. The solid line is a model in which the
1698:   maximum value of $\ncen$ is 0.5, as opposed to 1 for the other
1699:   models. This low-$\ncen$ model has a value of $\slogm=0.73$.}
1700: \end{figure*}
1701: 
1702: 
1703: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%0%%%%%%%%%%%%%%%%%
1704: %  Summary
1705: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1706: 
1707: \section{Discussion}
1708: 
1709: In this paper we have demonstrated that an environmentally independent
1710: approach to halo occupation can simultaneously model both the
1711: two-point clustering and the void statistics of galaxy samples
1712: selected by both luminosity and color. Because \wp\ and $\p0$ weight
1713: environments differently, with \wp\ determined predominantly by halos
1714: that sit at or above the mean density and $\p0$ determined by halos
1715: that reside in low-density regions, our results show that, to the
1716: limit these statistics can be measured, $\ncen$ is independent of
1717: environment. Although we are not explicitly testing environmental
1718: dependence of satellite galaxy occupation, the results in the paper
1719: offer an implicit test: If the number of satellite galaxies strongly
1720: correlates with halo environment, then the HOD inferred from modeling
1721: \wp\ will be systematically biased and could predict the wrong void
1722: distribution regardless of whether central galaxies exhibit assembly
1723: bias. In Paper I we demonstrated that \wp\ constrains the fraction of
1724: galaxies that are satellites, and thus the complementary fraction that
1725: are central. The central galaxy fraction, in turn, strongly influences
1726: the distribution of void sizes. If the HOD model for \wp\ is
1727: systematically under- or overestimating this quantity, then the
1728: predicted void statistics will not match observations.
1729: 
1730: It has been suggested that voids and void galaxies represent a
1731: challenge to the \lcdm\ model (\citealt{peebles:01}). If there exists
1732: substantial mass in underdense regions, the argument goes, then the
1733: observed paucity of low-luminosity galaxies in these regions is
1734: incompatible with the standard hierarchical clustering picture because
1735: low-mass halos in the voids will contain low-luminosity
1736: galaxies. \cite{wechsler_etal:06} propose that the assembly bias of
1737: low mass halos may be related to this so-called `void phenomenon',
1738: because low mass halos in underdense regions form later, and the gas
1739: within them may therefore form stars less efficiently due to an
1740: increased photoionizing background. Our results suggest that there
1741: is no void phenomenon for galaxies as faint as $\sim 0.2L_\star$ (a
1742: halo mass of $\sim 0.02\mstar$, the minimum mass probed in the
1743: \citealt{wechsler_etal:06} results). The observed voids in samples of
1744: low-luminosity galaxies match the voids predicted by the typical halos
1745: those galaxies occupy. The data presented here leave little room for a
1746: shift in galaxy formation efficiency (or a shift in the typical halo
1747: occupied) in low-density regions. These results are in agreement with
1748: the results of semi-analytic models of \cite{mathis_white:02} and
1749: \cite{benson_etal:03}, which find that low luminosity galaxies avoid
1750: the voids defined by the brighter galaxies. As \cite{wechsler_etal:06}
1751: suggest, assembly bias may influence the formation of fainter void
1752: galaxies, but larger observational samples are required to fully
1753: address this problem.
1754: 
1755: In the semi-analytic results of \cite{croton_etal:07}, the impact of
1756: the assembly bias on the correlation function ranges from $b_x-1=0.05$
1757: for faint galaxies to $b_x-1=-0.05$ for bright samples. We have shown
1758: that, at least in the class of $(\fmin,\dc)$ models considered here,
1759: assembly bias of this order for central galaxies cannot be reconciled
1760: with the measured void statistics. Density dependent models that
1761: produce acceptable fits to $\p0$ and $\pu$ produce values of
1762: $|b_x-1|\le 0.02$. At some level, assembly bias should be present in
1763: the galaxy population, but we have ruled out a strong dependence of
1764: $\ncen$ on $\delta$ that could bias measurements of halo occupation
1765: parameters or constraints on cosmological parameters obtained through
1766: the application of the HOD. At the level of precision of the current
1767: generation of large-scale redshift surveys, our results suggest that
1768: assembly bias is generally not a concern, though it could still have
1769: some influence on statistical measures not constrained (directly or
1770: indirectly) by our analysis. The assembly bias issue will need to be
1771: revisited for accurate analysis of the next generation of galaxy
1772: redshift surveys, when percent-level effects become significant.
1773: 
1774: For luminosity samples, it is perhaps not unexpected that
1775: environmental effects are small. For color-defined samples, on the
1776: other hand, our results are more surprising. Because halo formation
1777: time depends so strongly on local density, with younger low mass halos
1778: living in low density regions, one might naturally expect the stellar
1779: populations within these halos to reflect this trend.  This would lead
1780: to an assembly bias such that, at fixed mass, the lower the local
1781: density, the larger the fraction of central galaxies that are blue in
1782: color. This is exactly the type of bias seen in the models of \cite{croton_etal:07}:
1783: low mass halos with red central galaxies form near $z\sim 2$, while
1784: low mass halos with blue central galaxies have formation redshifts of $z
1785: \sim 1.5$ or less. However, in our ``standard'' HOD analysis of
1786: color-defined samples, we assume that the red central galaxy fraction
1787: is independent of environment. Equation (\ref{e.fblue}) implies, when
1788: applied to the samples explored \S 4, that a $\sim 10^{11.5}$
1789: \hmsol\ halo has a $\sim 30\%$ chance of hosting a red central galaxy
1790: regardless of environment or formation time. In \cite{croton_etal:07},
1791: assembly bias results in voids in the red galaxies that are
1792: significantly larger than in the HOD prediction. The results of
1793: Figures \ref{color_samples} and \ref{dd_colors} support a central red
1794: fraction that is environment independent. In contrast to
1795: luminosity-defined samples, the current precision of the SDSS is
1796: sufficient to exclude the level of assembly bias measured in
1797: \cite{croton_etal:07} for color-defined samples, which in their model
1798: is driven primarily by central galaxies.
1799: 
1800: Inconsistencies between observed properties of the red galaxy
1801: population and the predictions of \cite{croton_etal:07} have been
1802: reported elsewhere as well. \cite{springel_etal:05} show that the
1803: amplitude of the two-point correlation function of red galaxies in the
1804: Millennium Run semi-analytic galaxy population is much higher than
1805: observations at all scales. \cite{weinmann_etal:06}, using a catalog
1806: of galaxy groups created from the SDSS, demonstrate that the red
1807: galaxy fraction of groups is too high in the \cite{croton_etal:06a}
1808: model. \cite{baldry_etal:06} investigate the red fraction as a
1809: function of local galaxy density in the SDSS, measuring a monotonic
1810: decrease in red fraction with decreasing density. Such a correlation
1811: can naturally result from the dependence of the halo mass function of
1812: local environment without invoking assembly bias
1813: (\citealt{berlind_etal:05}): red galaxies are primarily satellites in
1814: high mass halos (see Figure \ref{color_samples}b), and the frequency
1815: of such halos correlates strongly with local
1816: density. \cite{baldry_etal:06} find that the correlation of red galaxy
1817: fraction with environment is much steeper in the
1818: \cite{croton_etal:06a} model than measured in the SDSS. They show that
1819: this result is primarily due to the correlation between red central
1820: galaxies and environment. Although Baldry \etal\ define density by
1821: local galaxy density in redshift space using a nearest neighbor
1822: criterion, the relation they find in the \cite{croton_etal:06a} model
1823: between the red fraction of central galaxies and density is similar to
1824: the models tested in \S 3.3 with a sharp decrease in red central
1825: fraction by nearly an order of magnitude at densities below the mean.
1826: 
1827: These discrepancies between semi-analytic models and observations
1828: discrepancies offer insight into galaxy formation processes. The
1829: aspect of the \cite{croton_etal:06a} model that most directly
1830: influences galaxy color is its treatment of AGN feeding and feedback,
1831: which heats the gas and halts star formation. In the model, this
1832: mechanism is correlated with environment to produce the
1833: color-dependent assembly bias. This work, and the papers listed above,
1834: suggest that a gas-heating mechanism less sensitive to halo
1835: environment will bring the models into better agreement with the
1836: clustering data.
1837: 
1838: Regardless of the details of galaxy formation, it is well-established
1839: now that correlations exists between halo properties and environment,
1840: especially for the low-mass halos that contain $\mrlogh \sim -19$
1841: galaxies. \cite{gao_white:06} show that environment correlates with
1842: halo formation time, concentration, and spin for $M<\mstar$. Why then
1843: is the correlation with galaxy properties so weak? To produce the
1844: observed void statistics, the luminosity of a central galaxy must be
1845: largely uncorrelated with halo properties other than mass, in the
1846: sense that the correlation must be significantly smaller than the
1847: scatter in $L_c$ at a given halo formation time or formation
1848: history. Additionally, central galaxy color must also be weakly
1849: correlated with halo formation. The amount of star formation required
1850: to make a red galaxy blue is relatively small, while the amount of
1851: time required for a blue galaxy to passively evolve into a red object
1852: can be $\lesssim 1$ Gyr (see \citealt{faber_etal:05} and references
1853: therein). If the color distribution is determined mainly by the
1854: occurrence of recent star formation, galaxy colors may be a stochastic
1855: process in better agreement with the assumptions of the
1856: HOD. \cite{rojas_etal:04, rojas_etal:05} find that void galaxies have
1857: higher specific star formation rates than galaxies in higher-density
1858: environments. As with the color-density relation
1859: (\cite{berlind_etal:05}), the correlation of star formation rate with
1860: environment may reflect changes in the underlying halo mass function
1861: between low and high densities rather than a correlation with
1862: formation history.
1863: 
1864: The nature of voids has been an important question since their
1865: discovery in the first large galaxy redshift survey
1866: (\citealt{gregory_thompson:78, kirshner_etal:81}). Are voids truly
1867: empty of matter or just deficient in galaxies? Is a non-gravitational
1868: process required to explain their observed sizes? These questions have
1869: become better defined through convergence on a standard cosmological
1870: model and better understanding of the relation between galaxies and
1871: dark matter halos. We find that the sizes and emptiness of observed
1872: voids are in excellent agreement with straightforward theoretical
1873: predictions. \\
1874: 
1875: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1876: %  Acknowledgments
1877: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1878: 
1879: \vspace{2cm}
1880: 
1881: \noindent The authors wish to thank Darren Croton, Brant Robertson,
1882: Ravi Sheth, Michael Vogeley, Risa Wechsler, Martin White, and Andrew
1883: Zentner for many useful discussions. JLT acknowledges the use of the
1884: computing facilities of the Department of Astronomy at Ohio State
1885: University. JLT also would like to acknowledge the generous
1886: hospitality of the Institute for Computational Cosmology at the
1887: University of Durham, where part of this work was completed. DW
1888: acknowledges the support of NSF grant AST-0407125. Portions of this
1889: work were performed under the auspices of the U.S. Dept. of Energy,
1890: and supported by its contract \#W-7405-ENG-36 to Los Alamos National
1891: Laboratory.  Computational resources were provided by the LANL open
1892: supercomputing initiative. CC thanks the Instituto de Astrofisica de
1893: Andalucia (CSIC) for their wonderful espresso bar and financial
1894: support in the Spring of 2006.
1895: 
1896: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1897: %  Bibliography
1898: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1899: 
1900: \bibliography{risa}
1901: 
1902: 
1903: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1904: % FIGURES FIGURES FIGURES FIGURES FIGURES FIGURES FIGURES FIGURES 
1905: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1906: 
1907: 
1908: 
1909: 
1910: \end{document}
1911: