1: % Template article for New Astronomy
2:
3: \documentclass{elsart}
4:
5: % the harvard package allows Harvard style referencing
6: %\usepackage{harvard}
7: \usepackage{natbib}
8: \usepackage{graphicx}
9:
10: % if you use PostScript figures in your article
11: % use the graphics package for simple commands
12: % \usepackage{graphics}
13: % or use the graphicx package for more complicated commands
14: % \usepackage{graphicx}
15: % or use the epsfig package if you prefer to use the old commands
16: % \usepackage{epsfig}
17:
18: % The amssymb package provides various useful mathematical symbols
19: \usepackage{amssymb}
20: \usepackage{amsmath}
21: \usepackage{bm}
22:
23: % definitions for astronomical objects, hyperlinks and ADS bibcodes
24: \def\astrobj#1{#1}
25: \def\url#1{{\ttfamily\def\/{/\discretionary{}{}{}}#1}}
26: %\def\bibcode#1{(\texttt{#1})}
27: \def\bibcode#1{}
28: \def\ellb{\mbox{\boldmath $\ell$}}
29: \def\thetab{\mbox{\boldmath $\theta$}}
30: \begin{document}
31:
32: \begin{frontmatter}
33: \title{Finding Clusters in SZ Surveys}
34: \author[address1]{Chris Vale\thanksref{cvemail}},
35: \author[address1,address2]{Martin White\thanksref{mwemail}}
36: \address[address1]{Department of Physics, University of California,
37: Berkeley, CA, 94720}
38: \address[address2]{Department of Astronomy, University of California,
39: Berkeley, CA, 94720}
40: \thanks[cvemail]{E-mail: cvale@astro.berkeley.edu}
41: \thanks[mwemail]{E-mail: mwhite@astro.berkeley.edu}
42:
43: \begin{abstract}
44: We use simulated maps to investigate the ability of high resolution, low
45: noise surveys of the CMB to create catalogues of Clusters of galaxies by
46: detecting the characteristic signature imprinted by the Sunyaev Zeldovich
47: effect. We compute the completeness of the catalogues in our simulations
48: for several survey strategies, and evaluate the relative merit of some
49: Fourier and wavelet based filtering techniques.
50: \end{abstract}
51:
52: \begin{keyword}
53: Cosmology \sep Large-Scale structures \sep Theory
54: \PACS 98.65.Dx \sep 98.80.Es \sep 98.70.Vc
55: \end{keyword}
56: \end{frontmatter}
57:
58: \section{Introduction}
59:
60: Future measurement of the distribution and number density of clusters of
61: galaxies will place increasingly important constraints on the nature
62: of the universe we live in \cite[e.g. ][]{Bahcall99,RBN02,Voit04},
63: and is a major science goal of upcoming surveys such as
64: SZA\footnote{http://astro.uchicago.edu/sza/},
65: APEX-SZ\footnote{http://bolo.berkeley.edu/apexsz/},
66: the South Pole Telescope (SPT\footnote{http://astro.uchicago.edu/spt/})
67: and the Atacama Cosmology Telescope
68: (ACT\footnote{http://www.hep.upenn.edu/$\sim$angelica/act/act.html})
69: These surveys will map the millimeter and
70: sub-millimeter sky with unprecedented power and resolution, which will
71: enable the construction of a catalogue of clusters detected through the
72: thermal Sunyaev Zel'dovich effect (SZE)
73: \citep[][for recent reviews see \citealt{Reph95,Birk99,Carl02}]{SZ72,SZ80}
74: In this paper, we examine different survey strategies
75: and signal processing methodologies to enhance this effort.
76:
77: The imprint of the SZE on the cosmic microwave background (CMB) is
78: an integrated effect from the time of last scattering to the present era,
79: and as such the SZE signal suffers from projection effects due to other
80: objects along the line of sight. This introduces non-linear complications
81: to our signal processing efforts, and makes it impossible to conclusively
82: determine the best method by analytic means alone. We therefore test
83: and compare three promising filtering techniques: discrete wavelets,
84: continuous wavelets, and Fourier methods. These are applied to mock SZ maps
85: for several different survey strategies, and results for the different
86: strategies and filters are computed.
87:
88: The mock SZ maps are created using an N-body simulation of sufficient volume
89: to be a fair sample of the universe. Due to the current uncertainty in
90: both the magnitude of the SZE and of relevant astrophysical foregrounds,
91: a detailed modeling of the signal and noise is not currently possible.
92: However, some of the complications that will be encountered by actual
93: surveys, such as confusion due to projection effects, irregularly shaped
94: sources, maps with edges and holes, and spatially varying noise, are
95: included in our tests prospective filters.
96:
97: The outline of the paper is as follows. We describe our simulations in
98: Section \ref{sec:simulations} and our filtering schemes in
99: Section \ref{sec:wavelets}. We then present our results in the context of
100: various survey scenarios and signal processing techniques in
101: Section \ref{sec:results}, and discuss our conclusions in
102: Section \ref{sec:conclusions}.
103:
104: \section{Simulating the SZE} \label {sec:simulations}
105:
106: \begin{figure}
107: \begin{center}
108: {\includegraphics*[height=6.8cm,width=6.8cm]{fig1a.ps}
109: \includegraphics*[height=6.8cm,width=6.8cm]{fig1b.ps}}
110: {\includegraphics*[height=6.8cm,width=6.8cm]{fig1c.ps}
111: \includegraphics*[height=6.8cm,width=6.8cm]{fig1d.ps}}
112: \end{center}
113: \caption{An example of the maps before filtering. The full map (top left)
114: includes the SZE signal (which shows up as cold spots on the map) and all
115: sources of ``noise''. Since the signal is overwhelmed by the primary CMB on
116: the angular scale shown here, we display the same map without the CMB
117: (top right), but still including point source and instrument noise.
118: The relative importance of these two effects can be seen in the bottom maps,
119: where we have displayed the SZE with point sources but no instrument noise
120: (left) and vice versa (right). The maps are
121: $3^{\circ} \times 3^{\circ}$ and contain $1024^2$ pixels, rebinned to
122: $256^2$ for display. The color scale of the maps is linear, and
123: span $100 \mu$K, except for the map including the primary CMB,
124: which spans $500 \mu$K. This particular map is made at 150 GHz for a 12
125: meter dish, assuming $10 \mu$K-arcmin of instrument noise, and with point
126: source contributions near the high end of the expected magnitude.}
127: \label{fig:maps}
128: \end{figure}
129: Since we use the method outlined in \cite{SW03} to create maps of the SZE, we
130: provide only a brief description here. The maps are created from a large
131: volume, high resolution N-body simulation containing a fair sample of the
132: universe, for a flat $\Lambda$CDM cosmology with
133: $\Omega_m = 0.3$, $\Omega_b h^2 = 0.02$, $h = 0.7$, and $\sigma_8 = 1$.
134: We use a semi-analytic model, in which baryonic matter traces the dark
135: matter in our clusters, in order to include the gas physics responsible
136: for the SZE. This assumption is likely to be a good approximation
137: everywhere except at the cluster cores, which will not be resolved by the
138: surveys considered here. We identify clusters in the N-body simulation
139: using a friends-of-friends (FOF) algorithm \citep{Davis85} with a linking
140: length $b = 0.15$ times the mean interparticle spacing. The mass contained
141: by hot gas is set to $\Omega_b / \Omega_m$ of the total, and each cluster
142: is set to be isothermal at a temperature given by
143: \begin{equation} \label{eq:clustertemp}
144: {k_B T \over keV} \sim
145: \left ( {H(z) M \over 10^{15} h^{-1} M_\odot } \right )^{2 / 3}
146: \end{equation}
147: where $H(z)$ is the hubble parameter. This effectively reproduces the results
148: of the hydrodynamic simulations of \cite{WHS02}. The normalization has been
149: set to pass through the lower envelope of the CBI deep field \citep{Mason03}
150: and through the BIMA point \citep{Dawson01} on small angular scales. We
151: generate Compton-Y maps by projecting along each line of sight, so that
152: \begin{equation} \label{eq:comptony}
153: y=\int \sigma_T n_{e}{k_B T \over m_{e} c^2} dl \qquad
154: \end{equation}
155: where $\sigma_T$ is the Thompson scattering cross section, $n_e$ is the
156: electron number density, and $m_e$ is the electron mass. The temperature
157: fluctuation for a given frequency $\nu$ is related to the Y-maps by
158: \begin{equation} \label{eq:DeltaTonT}
159: {\Delta T \over T} = y \left ( x {e^x + 1 \over e^x -1} - 4 \right )
160: \end{equation}
161: where $x = h \nu / k_B T_{CMB} \simeq \nu / 56.84$GHz is the dimensionless
162: frequency. Ten maps are made in this manner (see Figure \ref{fig:maps} for
163: an example), each with $1024^2$ pixels and $3^\circ$ on a side. These are
164: not as accurate as those produced using full hydrodynamic simulations, but
165: they allow us to probe a larger volume and therfore provide a better sample
166: of large clusters situated in their proper cosmological context.
167:
168: In the absence of perfect spectral information, confusion due to the primary
169: CMB temperature anisotropy and to point sources may impede the detection of
170: clusters. We simulate the former using realizations of a Gaussian
171: random fields convolved with the CMB power spectrum computed using
172: CMBfast \citep{SeZa96}. We then add radio and infrared (IR) point sources to
173: the maps using the model of \cite{WM04}. For radio sources, this
174: is a fit to the Q-band data of WMAP \citep{Bennett03}, while IR
175: sources are fit using the 350 GHz observations of \cite{Borys03}
176: with the Submillimeter Common User Bolometer Array
177: \citep[SCUBA:][]{Holland99} on the James Clerk Maxwell Telescope. We
178: note that there is substantial uncertainty in extrapolating these fits
179: to frequencies relevant to us here, so we examine two different
180: extrapolations likely to span the magnitude of the effect.
181:
182: The maps are then smoothed with a Gaussian beam, and Gaussian white noise is
183: added. Although we have ignored many effects which may be important in real
184: world observations (such as offsets, drifts, and atmosphere), we treat the
185: maps as completed and ready for signal processing.
186:
187: \section{Filtering methods} \label{sec:wavelets}
188:
189: In this section, we describe the filters we use to process the simulated
190: maps discussed in Section \ref{sec:simulations}. These maps have a
191: complex structure which makes it impossible to analytically determine
192: the best filter to aid our efforts at cluster identification, so we
193: explore several different methods. We begin by describing
194: the optimal Fourier based filter of \cite{Tegmark98},
195: then briefly describe filtering in the discrete wavelet basis, where we
196: focus on the Daubechies wavelet family \citep{Daub92}. Finally, we
197: discuss filtering using the continuous mexican hat wavelet filter
198: \cite[see][for more discussion of filtering with continuous
199: wavelets]{Pierpaoli04}.
200:
201: The signal power in the SZE is expected to exceed that of the primary CMB
202: and (for a sufficiently powerful survey) instrument noise on roughly arcminute
203: scales (Figure \ref{fig:szwindow}). The optimal filter derived in
204: \cite{Tegmark98}
205: is essentially a bandpass window which can be centered on the appropriate
206: angular scale, and is therefor an obvious candidate for our purpose.
207: This filter is azimuthally symmetric, and its radial dependence in
208: Fourier space is
209: \begin{equation} \label{eq:tegfilt}
210: \tilde{\psi}_{match}(\ell) \sim
211: {e^{\theta^2 \ell(\ell +1)/2} \over C_{\ell}^{\rm{Tot}} }
212: \end{equation}
213: where $\theta$ is the full width half max (FWHM) beam size and
214: $ C_{\ell}^{\rm{Tot}}$ is the total power spectrum of all ``noise''.
215: See Figure \ref{fig:szwindow} for an example of this filter appropriate
216: for our fiducial surveys.
217: \begin{figure}
218: \begin{center}
219: {\includegraphics*[height=4.8cm]{fig2a.ps}
220: \includegraphics*[height=4.8cm]{fig2b.ps}}
221: \end{center}
222: \caption{(Left) The expected magnitude of the SZE, the CMB, and
223: instrument noise for a fiducial $10 \mu K$ per $1^{\prime}$ beam survey,
224: shown here in Fourier space. The SZE signal exceeds both the noise and
225: CMB on roughly arcminute scales. (Right) The optimal and mexican hat
226: filters in Fourier space. The filters are wedge shaped band pass filters,
227: designed to pass scales where the SZE is large relative to the CMB and
228: instrument noise.}
229: \label{fig:szwindow}
230: \end{figure}
231:
232: Wavelets have emerged as a powerful tool for signal processing
233: \cite[see the Appendix for a brief discussion and e.g.][for a
234: review of wavelet signal processing]{Mallat99}.
235: They are simultaneously (but imperfectly) localized in both real space and
236: Fourier space, and are therefore a natural choice for processing data
237: which possesses both real space and Fourier space correlations, such
238: as we expect from our fiducial surveys. We employ the wavelet transform
239: algorithm outlined in \cite{Press92}, and although we focus on the
240: Daubechies wavelets, we note that we have also explored the Coiflet,
241: Symlet, and Morlet wavelet families, and as these offer essentially
242: the same results as the Daubechies wavelets, we do not discuss
243: them further.
244:
245: To generate our filter, we follow \cite{Pen99} and estimate the
246: expected signal given data $\langle S | D \rangle$ for each coefficient in
247: the wavelet transform. This amounts to Wiener filtering
248: in the limit of Gaussian noise and Gaussian signal, and to
249: thresholding for highly non-Gaussian signal distributions, such as unsmoothed
250: point sources. This approach is likely to be superior (in the sense of
251: minimizing the least squared error in the reconstruction) if the
252: signal or noise include substantial non-Gaussian behavior, but requires that
253: the signal and noise probability density functions (PDF) be known.
254: Although \cite{Pen99} suggests computing $\langle S | D \rangle$ directly
255: from the non-Gaussian behavior of the observed maps, we find that the
256: procedure outlined there is subject to numerical artifacts due to the
257: finite size of sky in our simulations, so we compute this function directly
258: from our 10 input signal maps. This will not be possible for real surveys,
259: and so should be considered an upper limit to the performance of the
260: technique.
261:
262: We implement a third class of filter, the continuous wavelet filter, using
263: the mexican hat wavelet transform \citep[e.g. ][]{Cayon00,Maisinger04}. The
264: continuous wavelet transform $W(a,b)$ of a one dimensional function $f(x)$
265: is a real space convolution of $f(x)$ with a ``mother wavelet'' $\psi(x)$
266: \begin{equation} \label{eq:continuous}
267: W(a,b) = \int dx f(x) {1 \over \sqrt{a}} \psi \left ( {x - b \over a} \right )
268: \end{equation}
269: where $a$ and $b$ are the scale and position parameters. This convolution
270: is normally performed as a multiplication in the Fourier domain, so that
271: $\tilde{W} = \sqrt{a} \tilde{f}(\ell) \tilde{\psi}(a \ell)$,
272: where the tilde denotes the Fourier transform. The mexican hat wavelet is
273: the second derivative of a Gaussian, so that its Fourier transform is
274: \begin{equation} \label{eq:mexhat}
275: \tilde{\psi}_{mex}(a \ell) \sim ( a \ell)^2 \rm{exp}
276: \left [ - {(a \ell)^2 \over 2} \right ]
277: \end{equation}
278: Like the optimal filter, the continuous wavelet transform is essentially
279: a filter in Fourier space, as can be seen in the side by side comparison
280: in Figure \ref{fig:szwindow}.
281:
282: In the next section, we demonstrate the use of these filters on the simulated
283: maps described in Section \ref{sec:simulations} for various survey
284: strategies.
285: \begin{figure}
286: \begin{center}
287: {\includegraphics*[height=6.8cm,width=6.8cm]{fig3a.ps}
288: \includegraphics*[height=6.8cm,width=6.8cm]{fig3b.ps}}
289: {\includegraphics*[height=6.8cm,width=6.8cm]{fig3c.ps}
290: \includegraphics*[height=6.8cm,width=6.8cm]{fig3d.ps}}
291: \end{center}
292: \caption{The input SZE (top left) and the filtered maps for the discrete
293: Daubechies wavelet filter (top right), the mexican hat continuous wavelet
294: filter (bottom left), and for the optimal Fourier filter (bottom right).
295: The color scale is linear, and structures
296: less than $1 \sigma$ of the noise have been suppressed for visual clarity.}
297: \label{fig:filtermaps}
298: \end{figure}
299:
300: \section{Results} \label{sec:results}
301:
302: In this section, we examine the maps after application of the filters
303: described in Section \ref{sec:wavelets}. As can be seen in the examples
304: shown in Figure \ref{fig:filtermaps},
305: the three filters clearly all succeed in improving signal
306: to noise, with large structures in the filtered maps all corresponding to
307: massive clusters. We quantify the level of this success using the peak
308: finding algorithm of \cite{SW03}, and conclude that for the surveys we
309: consider here, the optimal filter performs at least as well as the
310: wavelet based filters for creating complete, efficient surveys of clusters.
311: We then examine cluster finding in the context of several survey strategies.
312:
313: Of the three filtered maps shown here, the one created using the discrete
314: wavelet (specifically, Daub6) ``$\langle S | D \rangle$'' technique
315: described in Section \ref{sec:wavelets} best reconstructs the input SZE
316: signal in the sense of minimizing the least squared error of the
317: reconstruction. However, this reconstruction does not do as well at
318: creating complete, efficient surveys of clusters. This is because a
319: large fraction of the improved signal recovery is associated with a few
320: giant clusters. Since these produce enormous signal, they are easy to
321: find, regardless of the filter, and improving their reconstruction
322: does not aid in the completeness of the catalogue. However, these
323: clusters have signal on relatively large angular scales, so that the
324: ``$\langle S | D \rangle$'' technique smooths the maps more than is
325: optimal for finding smaller clusters at the threshold of detection.
326: Although this can be accounted for by eliminating the large clusters from
327: consideration when formulating the filter, the result is to simply
328: scale the signal by a constant at each level, so that the smaller
329: cluster signal is effectively in the Wiener filter limit. Although we do
330: not show them here, we have also considered maps with holes, rough edges,
331: and spatially varying noise. The wavelet filter performs better in this
332: context than the other filters for extreme conditions, but there is no
333: detectable advantage for realistic assumptions.
334: \begin{figure}
335: \begin{center}
336: {\includegraphics*[height=5.8cm]{fig4a.ps}
337: \includegraphics*[height=5.8cm]{fig4b.ps}}
338: \end{center}
339: \caption{Clusters found (left) and missed (right) as a function of mass and
340: redshift for our fiducial $10 \ \mu$K per arcminute beam survey. The survey
341: size is 90 square degrees, and we have required a 75\% detection efficiency
342: for clusters of mass above $10^{14} \ \rm{M}_\odot$.}
343: \label{fig:foundmissed}
344: \end{figure}
345:
346: \begin{figure}
347: \begin{center}
348: {\includegraphics*[height=8.5cm,width=5.5in]{fig5.ps}}
349: \end{center}
350: \caption{The number of massive clusters associated with the 5000 largest
351: peaks in the filtered signal for our 10 maps, shown here in descending
352: order of signal strength, for our fiducial $10 \ \mu$K per arcminute
353: beam survey. The largest signal peaks are always associated
354: with a cluster. However, noise and projection effects cause
355: substantial scatter in the mass-observable relation. In the small
356: signal regime, this scatter begins to dominate, so that many
357: smaller peaks are not associated with a cluster of mass
358: $\rm{M} > 10^{14} \ \rm{M}_\odot$. We note that there are a total of
359: 1900 clusters in the (solid) mass range of
360: $10^{14} < \rm{M} < 2 \times 10^{14} \ \rm{M}_\odot$
361: and 550 of (dashed) mass $ \rm{M} > 2 \times 10^{14} \ \rm{M}_\odot$
362: in our simulation.}
363: \label{fig:completeness1}
364: \end{figure}
365:
366: \begin{figure}
367: \begin{center}
368: {\includegraphics*[height=8.5cm,width=5.5in]{fig6.ps}}
369: \end{center}
370: \caption{Completeness of clusters detected above a mass threshold for three
371: survey strategies as a function of the number of signal peaks included in the
372: analysis. The upper family
373: of curves is the completeness for clusters of mass greater than
374: $2.0 \times 10^{14} \ \rm{M}_\odot$, while the lower family is for clusters of
375: mass less than this but greater than
376: $1.5 \times 10^{14} \ \rm{M}_\odot$. The efficiency for a threshold mass of
377: $10^{14} \ \rm{M}_\odot$ for one survey is also shown. The totals are derived
378: from ten $3^{\circ} \times 3^{\circ}$ simulated maps.}
379: \label{fig:completeness}
380: \end{figure}
381: The mexican hat filter and the optimal filter of \cite{Tegmark98} perform
382: about equally well. In fact, we find that the primary CMB anisotropy is
383: easy to separate morphologically from the SZE for the small beam sizes
384: we consider here, and any reasonable hi-pass filter can be used for
385: this task. If the two filters are then set to be roughly the same on small
386: angular scales, then they perform about equally well at cluster finding. In
387: particular, we find that for clusters which are smaller than
388: the beam of the survey, simply smoothing the noisy maps by the beam
389: performs best for small scale filtering.
390:
391: We use two methods to identify signal peaks in the filtered maps. The
392: first simply flags local maxima, while the second computes the total
393: flux in all pixels surrounding (and including) the local maxima which are
394: greater than one quarter the peak value. Since the choice of method does
395: not significantly alter our results, the results presented here are for the
396: simpler local maxima technique unless stated otherwise.
397:
398: Once we have identified signal peaks in the maps, we compare these to a
399: list of clusters in our simulation. As can be seen in
400: Figure \ref{fig:foundmissed}, massive clusters are nearly always
401: identified using this method, while no particular dependence on
402: redshift is evident. Also, large signal peaks are nearly always
403: associated with a cluster (Figure \ref{fig:completeness1}), although
404: noise and projection effects cause substantial scatter in the
405: mass-observable relation. The efficiency of the survey (that is,
406: the chance that a signal peak corresponds to a cluster) is therefor
407: nearly 100\% for large peaks.
408:
409: We now turn our attention to survey strategy, where we examine the
410: use of multiple vs single frequency measurements. Although the primary CMB
411: anisotropy is not a serious contaminant, contributions from point sources
412: are non-negligible. Spectral information can alleviate this issue, but at
413: the price of either reduced signal to instrument noise, or less sky coverage,
414: per unit of telescope time. If only a single frequency is used, then point
415: source contributions must be considered in addition to signal strength,
416: instrument noise, and beam size.
417:
418: To investigate this issue, we begin with a fiducial survey at 150 GHz for a
419: 12 meter dish (roughly a $0.^{'} 8$ beam at 150 GHz) and $10 \mu$K-arcmin
420: instrument noise over our 90 square degrees of simulated sky. As expected,
421: this survey does well at cluster identification in the absence of point
422: sources. However, including them substantially worsens the result
423: (Figure \ref{fig:completeness}), even for a level at the low end of the
424: expected confusion noise, and a deeper integration at the same frequency only
425: marginally improves the outcome. A better result is achieved by combining
426: the 150 GHz survey with a 220 GHz observation and differencing the results to
427: remove the point source contribution. Although the 220 GHz channel contains
428: no signal and is noisier than the 150 GHz channel
429: (we assume $\sim 15 \mu$K-arcmin for the same integration time), the
430: effective signal to instrument noise is only marginally worse. This is
431: because the point source contribution is roughly twice as large at 220 GHz as
432: it is at 150 GHz, so that (roughly speaking) the point source elimination is
433: accomplished by including only half the noise of the instrument noise at 220
434: GHz. Added in quadrature, this is effectively a $12 \mu$K-arcmin survey.
435: \begin{figure}
436: \begin{center}
437: {\includegraphics*[height=5.3cm]{fig7a.ps}
438: \includegraphics*[height=5.3cm]{fig7b.ps}}
439: \end{center}
440: \caption{The number of galaxies brighter than (left) R = 24 and
441: (right) R = 25 within the virial radius of clusters in our sample
442: with mass greater than $10^{14} \rm{M}_\odot$ and at redshift
443: $Z<1$. The number within the core region, where the contrast
444: against the background is highest, is about 10\% of $N_{gal}(< r_{200})$.}
445: \label{fig:ngal24}
446: \end{figure}
447:
448: An alternative strategy is explored in the form of a single frequency
449: survey at 95 GHz. This offers the advantage of a larger signal and lower
450: point source contamination, but is noisier (we assume $17 \mu$K-arcmin for
451: equal integration time) and has a larger ($\sim 1.^{`} 3$) beam. This
452: results in improved detections, and although the level of improvement
453: depends upon the model of point source confusion assumed, this survey
454: strategy is superior in our simulations for the models we have considered
455: here. We have not accounted for clustering of point sources,
456: which may change this result.
457:
458: Finally, we note that optical and near-IR emission is still the least
459: expensive way of measuring cluster redshifts, and that redshift
460: information is crucial to the physical interpretation of the cluster
461: sample. We use the method outlined in \cite{White03} to provide a
462: rough estimate (Figure \ref{fig:ngal24}) of the number of galaxies
463: brighter than $R=24$ and $R=25$ within the virial radius of the
464: clusters detected (at 75\% efficiency above mass
465: $M > 10^{14} \ M_{\odot}$ and at redshift $Z < 1$) in our fiducial
466: survey. The number in the core region, where the contrast against the
467: background is the highest, will obviously be smaller. If the galaxies
468: follow the mass, approximately 20\% of the galaxies lie within the break
469: radius ($0.2r_{200}$) and 8\% within the core radius ($0.1r_{200}$).
470: Although the results we show here are for clusters at redshift $Z < 1$,
471: the results for higher redshifts will clearly be somewhat worse.
472: Optical follow up will no doubt be an integral but challenging complement
473: to an SZE cluster survey similar to that discussed here.
474:
475: \section{Conclusions} \label{sec:conclusions}
476:
477: Measurement of the CMB using the unprecedented combination of power,
478: resolution, and sky coverage expected in upcoming surveys will return a
479: wealth of information, including high resolution detections in the SZE sky
480: sufficient to provide enormous catalogues of galaxy clusters. We have
481: studied the use of several filtering techniques to aid the cluster
482: identification process, and evaluated the likely detection of clusters
483: using simulated maps of the CMB. We note that while further signal analysis
484: will be required to optimally measure cluster properties, we have not
485: addressed that issue here.
486:
487: We have tested three filter techniques, using Fourier methods, continuous
488: wavelets, and discrete wavelets, and have found that all of these can
489: be efficiently used to enhance the signal to noise in our maps. Although
490: the discrete wavelets perform better under some extreme conditions, we
491: find that each of these techniques may be used effectively to aid cluster
492: detection.
493:
494: We have also examined the success of survey strategies in creating complete
495: catalogues of clusters for a given mass threshold. We find that the primary
496: CMB anisotropy is not an important source of noise for the high resolution
497: surveys we have considered here, but that point sources may in some cases be
498: more important than instrument noise. Accordingly, multi-frequency
499: measurements are likely required if the sky is probed in frequency bands
500: where point sources are large relative to the signal strength. A single
501: frequency band may be used effectively if the band center is selected at a
502: frequency where point source contamination is not expected to overwhelm the
503: signal. We note that the magnitude of the point source confusion and
504: clustering are not as yet well measured at frequencies relevant to us here,
505: so that we are dependent upon models for our results.
506:
507: A well understood, nearly complete catalogue of massive clusters over a
508: large fraction of the sky would be a major achievement for cosmology, and
509: will likely be available in the near future as powerful surveys begin
510: operation. The road to optimizing the results includes a determination
511: of the best survey strategies and signal processing techniques, and in this
512: endeavor, simulations can play an important role. We have made the raw
513: maps, along with some auxiliary data products, freely available to the
514: community at http://mwhite.berkeley.edu/ in the hope that they will be
515: useful in taking the next step.
516:
517: CV would like to thank J.D. Cohn, Tom Crawford, Steve Myers, and Wayne Hu
518: for useful discussions. The simulations used here were performed on the
519: IBM-SP at the National Energy Research Scientific Computing Center.
520: This research was supported by the NSF and NASA.
521:
522: \appendix \section{Appendix: Wavelets} \label{sec:appendix}
523:
524: In this section, we provide a brief discussion of discrete wavelets to
525: orient the reader \citep[see e.g.][for a more substantial introduction to
526: wavelet signal processing]{Mallat99}.
527: To get a feel for wavelets, let us consider the first order Daubechies
528: wavelet, Daub1, also called the Haar wavelet. In one dimension, the
529: first level (not to be confused with order) Daub1 transform involves
530: computing the two pixel average
531: $\mathbf{a}$ and difference $\mathbf{d}$ of a signal
532: $\mathbf{f} = (f_1, f_2, \dots, f_N)$, so that the elements $a_m$ of
533: $\mathbf{a}$ are defined by
534: \begin{equation} \label{eq:trend}
535: a_m = {f_{2m-1} + f_{2m} \over \sqrt{2}}
536: \end{equation}
537: and similarly for $\mathbf{d}$, but with a minus sign on the right hand side
538: of Eq.(\ref{eq:trend}). Like all discrete wavelet transforms, the Daub1
539: transform decomposes a signal into two subsignals half the length of the
540: original: a running average $\mathbf{a}$ called the trend, and a running
541: difference $\mathbf{d}$ called the fluctuation. Note that this transform is
542: linear, invertible, and preserves the total sum of squares of the pixels
543: (the latter is often called ``conservation of energy'' in wavelet parlance).
544: For higher order transforms, the trend and fluctuation subsignals are no
545: longer simple averages and differences (for example, the Daub2 transform uses
546: a four pixel linear fit rather than a two pixel average), but the basic idea
547: is the same.
548:
549: An essential component of wavelet based analysis is the simultaneous
550: processing of data at multiple scales. This ``Multi-Resolution Analysis''
551: (MRA) is implemented by a hierarchical application of the wavelet transform
552: on the data, so that the first level transform, which probes the smallest
553: physical (and highest frequency) scales, is applied to the original signal.
554: The second level transform is then computed by taking the wavelet transform of
555: the first level trend signal, and so on, so that for an $\rm{n}^{th}$ level
556: transform, the result is a single trend $\mathbf{a^n}$ and n fluctuations
557: $\mathbf{d}^1, \mathbf{d}^2, \dots, \mathbf{d}^n$.
558: \begin{figure}
559: \begin{center}
560: {\includegraphics*[height=8.5cm,width=5.5in]{fig8.ps}}
561: \end{center}
562: \caption{The SZE signal is comparable to the level of confusion from the
563: primary CMB and instrument noise on intermediate angular scales. Here,
564: we display this effect for the $6^{th}$ order Daubechies wavelet, and the
565: resulting level dependent filter. }
566: \label{fig:daub6}
567: \end{figure}
568:
569: Most discrete wavelet filtering techniques are based on the ``thresholding''
570: modality, where wavelet coefficients with an absolute value less than a
571: chosen threshold value are discarded. This is effective when the signal is
572: much larger than the noise, and when the goal is to recover an image which
573: is visually appealing to the human eye. An approach more suited to our
574: purpose is to attempt to reconstruct the signal with a minimum least
575: squared error. For a Gaussian signal and uncorrelated Gaussian noise,
576: this implies a Wiener filter in Fourier space. The equivalent filter in
577: wavelet space can be constructed by estimating the energy of the signal
578: divided by the data for each level and scaling the transformed data at each
579: level by this ratio. The required estimate of the signal energy can be
580: obtained either using simulations, or directly from the data if the
581: noise is well understood. We show an example of this filter for the Daub6
582: wavelet in Figure \ref{fig:daub6}.
583:
584: One simple use of a wavelet filter is to account for spatially varying
585: noise. If the statistical properties of the noise is known as a function
586: of position, then a wavelet filter can adjust to accommodate this in a
587: more natural way than filters with no localized spatial properties. We
588: show an example of this in Figure \ref{fig:varynoise} vs. the optimal
589: filter, for an extreme case where the noise fluctuates by a factor of several
590: hundred in the maps. Although the reconstruction is notably improved in
591: this highly artificial case, no improvement was evident in our maps
592: for more realistic noise.
593: \begin{figure}
594: \begin{center}
595: {\includegraphics*[height=4.5cm,width=4.5cm]{fig8a.ps}
596: \includegraphics*[height=4.5cm,width=4.5cm]{fig8b.ps}}
597: {\includegraphics*[height=4.5cm,width=4.5cm]{fig8c.ps}}
598: \end{center}
599: \caption{The input SZE (left) and the filtered maps for the optimal
600: filter (center) and the discrete Daubechies wavelet filter (right),
601: in an extreme case where the noise level in the maps is varied by a
602: factor of several hundred. The recovered maps shown here are in low
603: noise regions, and in this case, the localized nature of the wavelet
604: filter allows for a substantial improvement in the reconstruction.
605: For realistic noise, this advantage is not detected in our maps.}
606: \label{fig:varynoise}
607: \end{figure}
608:
609: \begin{thebibliography}{}
610:
611: \harvarditem{Bahcall et al.}{1999}{Bahcall99}
612: Bahcall N., Ostriker J., Perlmutter S., Steinhardt P.,
613: 1999, Science, 284, 1481
614:
615: \harvarditem{Bennett et al.}{2003}{Bennett03}
616: Bennett C.L., et al., 2003, ApJS, 148, 97 [astro-ph/0302208]
617:
618: \harvarditem{Birkinshaw}{1999}{Birk99}
619: Birkinshaw M., 1999, Phys. Rep., 310, 98
620:
621: \harvarditem{Borys et al.}{2003}{Borys03}
622: Borys C., Chapman S.C., Halpern M., Scott D., 2003, MNRAS, 344, 385
623: [astro-ph/0305444]
624:
625: \harvarditem{Carlstrom, Holder, \& Reese}{2002}{Carl02}
626: Carlstrom J., Holder G., Reese E., 2002, ARAA, 40, 643
627:
628: \harvarditem{Cayon et al.}{2000}{Cayon00}
629: Cayon L., et al., 2000, MNRAS, 315, 757
630:
631: \harvarditem{Daubechies}{1992}{Daub92}
632: Daubechies I., 1992, Ten Lectures on Wavelets, SIAM, Philadelphia
633:
634: \harvarditem{Davis et al.}{1985}{Davis85}
635: Davis M., Efstathiou G., Frenk C.S., White S.D.M., 1985, ApJ, 292, 371
636:
637: \harvarditem{Dawson et al.}{2001}{Dawson01}
638: Dawson K.S., Holzapfel W.L., Carlstrom J.E., Joy M., LaRoque S.J.,
639: Reese E.D., 2001, ApJ, 553, L1
640:
641: \harvarditem{Holland et al.}{1999}{Holland99}
642: Holland W.S, et al., 1999, MNRAS, 303, 659
643:
644: \harvarditem{Maisinger, Hobson, \& Lasenby}{2004}{Maisinger04}
645: Maisinger K., Hobson M.P., Lasenby A.N., 2004,
646: MNRAS, 347, 339 [astro-ph/0303246]
647:
648: \harvarditem{Mallat}{1999}{Mallat99}
649: Mallat S., 1999, A Wavelet Tour of Signal Processing, Academic Press
650:
651: \harvarditem{Mason et al.}{2003}{Mason03}
652: Mason B.S., et al., 2003, ApJ, 591, 540 [astro-ph/0205384]
653:
654: \harvarditem{Pen}{1999}{Pen99}
655: Pen U., 1999, Phil.Trans.Roy.Soc.Lond. A357, 2561 [astro-ph/9904170]
656:
657: \harvarditem{Pierpaoli et al.}{2004}{Pierpaoli04}
658: Pierpaoli E., Anthoine S., Huffenberger K., Daubechies I., 2004,
659: submitted to MNRAS [astro-ph/0412197]
660:
661: \harvarditem{Press et al.}{1992}{Press92}
662: Press W.H., Teukolsky S.A., Vetterling W.T., Flannery B.P., 1992,
663: Numerical Recipes in C, 2nd edition, Cambridge University Press
664:
665: \harvarditem{Rephaeli}{1995}{Reph95}
666: Rephaeli, Y., 1995, ARA\&A, 33, 541
667:
668: \harvarditem{Rosati, Borgani, \& Norman}{2002}{RBN02}
669: Rosati P., Borgani S., Norman C., 2002, ARA\&A, 40, 539
670:
671: \harvarditem{Schulz \& White}{2003}{SW03}
672: Schulz A., White M., 2003, ApJ, 586, 723 [astro-ph/0210667]
673:
674: \harvarditem{Seljak \& Zaldarriaga}{1996}{SeZa96}
675: Seljak U., Zaldarriaga M., 1996, ApJ, 469, 437
676:
677: \harvarditem{Sunyaev \& Zel'dovich}{1972}{SZ72}
678: Sunyaev R.A., Zel'dovich Ya. B., 1972, Comm. Astrophys. Space Phys., 4, 173
679:
680: \harvarditem{Sunyaev \& Zel'dovich}{1980}{SZ80}
681: Sunyaev R.A., Zel'dovich Ya. B., 1980, ARA\&A, 18, 537
682:
683: \harvarditem{Tegmark \& de Oliveira-Costa}{1998}{Tegmark98}
684: Tegmark M., de Oliveira-Costa A., 1998, ApJ, 500, L83 [astro-ph/9802123]
685:
686: \harvarditem{Voit}{2004}{Voit04}
687: Voit G.M., 2004, Rev. Mod. Phys. (in press) [astro-ph/0410173]
688:
689: \harvarditem{White}{2003}{White03}
690: White M., 2003, ApJ, 597, 650 [astro-ph/0302371]
691:
692: \harvarditem{White, Hernquist \& Springel}{2002}{WHS02}
693: White M., Hernquist L., Springel V., 2002, ApJ, 579, 16 [astro-ph/0205437]
694:
695: \harvarditem{White \& Majumdar}{2004}{WM04}
696: White M., Majumdar S., 2004, ApJ, 602, 565 [astro-ph/0308464]
697:
698: \end{thebibliography}
699: \end{document}
700: