astro-ph0501132/ms.tex
1: % Template article for New Astronomy
2: 
3: \documentclass{elsart}
4: 
5: % the harvard package allows Harvard style referencing
6: %\usepackage{harvard}
7: \usepackage{natbib}
8: \usepackage{graphicx}
9: 
10: % if you use PostScript figures in your article
11: % use the graphics package for simple commands
12: % \usepackage{graphics}
13: % or use the graphicx package for more complicated commands
14: % \usepackage{graphicx}
15: % or use the epsfig package if you prefer to use the old commands
16: % \usepackage{epsfig}
17: 
18: % The amssymb package provides various useful mathematical symbols
19: \usepackage{amssymb}
20: \usepackage{amsmath}
21: \usepackage{bm}
22: 
23: % definitions for astronomical objects, hyperlinks and ADS bibcodes
24: \def\astrobj#1{#1}
25: \def\url#1{{\ttfamily\def\/{/\discretionary{}{}{}}#1}}
26: %\def\bibcode#1{(\texttt{#1})}
27: \def\bibcode#1{}
28: \def\ellb{\mbox{\boldmath $\ell$}}
29: \def\thetab{\mbox{\boldmath $\theta$}}
30: \begin{document}
31: 
32: \begin{frontmatter}
33: \title{Finding Clusters in SZ Surveys}
34: \author[address1]{Chris Vale\thanksref{cvemail}},
35: \author[address1,address2]{Martin White\thanksref{mwemail}}
36: \address[address1]{Department of Physics, University of California,
37: Berkeley, CA, 94720}
38: \address[address2]{Department of Astronomy, University of California,
39: Berkeley, CA, 94720}
40: \thanks[cvemail]{E-mail: cvale@astro.berkeley.edu}
41: \thanks[mwemail]{E-mail: mwhite@astro.berkeley.edu}
42: 
43: \begin{abstract} 
44: We use simulated maps to investigate the ability of high resolution, low 
45: noise surveys of the CMB to create catalogues of Clusters of galaxies by 
46: detecting the characteristic signature imprinted by the Sunyaev Zeldovich 
47: effect.  We compute the completeness of the catalogues in our simulations 
48: for several survey strategies, and evaluate the relative merit of some 
49: Fourier and wavelet based filtering techniques.
50: \end{abstract}
51: 
52: \begin{keyword}
53: Cosmology \sep Large-Scale structures \sep Theory
54: \PACS 98.65.Dx \sep 98.80.Es \sep 98.70.Vc
55: \end{keyword}
56: \end{frontmatter}
57: 
58: \section{Introduction}
59: 
60: Future measurement of the distribution and number density of clusters of 
61: galaxies will place increasingly important constraints on the nature 
62: of the universe we live in \cite[e.g. ][]{Bahcall99,RBN02,Voit04}, 
63: and is a major science goal of upcoming surveys such as 
64: SZA\footnote{http://astro.uchicago.edu/sza/},
65: APEX-SZ\footnote{http://bolo.berkeley.edu/apexsz/},
66: the South Pole Telescope (SPT\footnote{http://astro.uchicago.edu/spt/})
67: and the Atacama Cosmology Telescope
68: (ACT\footnote{http://www.hep.upenn.edu/$\sim$angelica/act/act.html})  
69: These surveys will map the millimeter and 
70: sub-millimeter sky with unprecedented power and resolution, which will 
71: enable the construction of a catalogue of clusters detected through the 
72: thermal Sunyaev Zel'dovich effect (SZE) 
73: \citep[][for recent reviews see \citealt{Reph95,Birk99,Carl02}]{SZ72,SZ80}
74: In this paper, we examine different survey strategies 
75: and signal processing methodologies to enhance this effort.
76: 
77: The imprint of the SZE on the cosmic microwave background (CMB) is 
78: an integrated effect from the time of last scattering to the present era, 
79: and as such the SZE signal suffers from projection effects due to other 
80: objects along the line of sight.  This introduces non-linear complications 
81: to our signal processing efforts, and makes it impossible to conclusively 
82: determine the best method by analytic means alone.  We therefore test 
83: and compare three promising filtering techniques: discrete wavelets, 
84: continuous wavelets, and Fourier methods.  These are applied to mock SZ maps 
85: for several different survey strategies, and results for the different 
86: strategies and filters are computed.
87: 
88: The mock SZ maps are created using an N-body simulation of sufficient volume 
89: to be a fair sample of the universe.  Due to the current uncertainty in 
90: both the magnitude of the SZE and of relevant astrophysical foregrounds, 
91: a detailed modeling of the signal and noise is not currently possible.  
92: However, some of the complications that will be encountered by actual 
93: surveys, such as confusion due to projection effects, irregularly shaped 
94: sources, maps with edges and holes, and spatially varying noise, are 
95: included in our tests prospective filters.
96: 
97: The outline of the paper is as follows.  We describe our simulations in 
98: Section \ref{sec:simulations} and our filtering schemes in 
99: Section \ref{sec:wavelets}.  We then present our results in the context of 
100: various survey scenarios and signal processing techniques in 
101: Section \ref{sec:results}, and discuss our conclusions in 
102: Section \ref{sec:conclusions}.
103: 
104: \section{Simulating the SZE} \label {sec:simulations}
105: 
106: \begin{figure}
107: \begin{center}
108: {\includegraphics*[height=6.8cm,width=6.8cm]{fig1a.ps}
109:  \includegraphics*[height=6.8cm,width=6.8cm]{fig1b.ps}}
110: {\includegraphics*[height=6.8cm,width=6.8cm]{fig1c.ps}
111:  \includegraphics*[height=6.8cm,width=6.8cm]{fig1d.ps}}
112: \end{center}
113: \caption{An example of the maps before filtering.  The full map (top left) 
114: includes the SZE signal (which shows up as cold spots on the map) and all 
115: sources of ``noise''.  Since the signal is overwhelmed by the primary CMB on 
116: the angular scale shown here, we display the same map without the CMB 
117: (top right), but still including point source and instrument noise.  
118: The relative importance of these two effects can be seen in the bottom maps, 
119: where we have displayed the SZE with point sources but no instrument noise 
120: (left) and vice versa (right).  The maps are 
121: $3^{\circ} \times 3^{\circ}$ and contain $1024^2$ pixels, rebinned to 
122: $256^2$ for display.  The color scale of the maps is linear, and 
123: span $100 \mu$K, except for the map including the primary CMB, 
124: which spans $500 \mu$K.  This particular map is made at 150 GHz for a 12 
125: meter dish, assuming $10 \mu$K-arcmin of instrument noise, and with point 
126: source contributions near the high end of the expected magnitude.}
127: \label{fig:maps}
128: \end{figure}
129: Since we use the method outlined in \cite{SW03} to create maps of the SZE, we 
130: provide only a brief description here.  The maps are created from a large 
131: volume, high resolution N-body simulation containing a fair sample of the 
132: universe, for a flat $\Lambda$CDM cosmology with 
133: $\Omega_m = 0.3$, $\Omega_b h^2 = 0.02$, $h = 0.7$, and $\sigma_8 = 1$.  
134: We use a semi-analytic model, in which baryonic matter traces the dark 
135: matter in our clusters, in order to include the gas physics responsible 
136: for the SZE.  This assumption is likely to be a good approximation 
137: everywhere except at the cluster cores, which will not be resolved by the 
138: surveys considered here.  We identify clusters in the N-body simulation 
139: using a friends-of-friends (FOF) algorithm \citep{Davis85} with a linking 
140: length $b = 0.15$ times the mean interparticle spacing.  The mass contained 
141: by hot gas is set to $\Omega_b / \Omega_m$ of the total, and each cluster 
142: is set to be isothermal at a temperature given by
143: \begin{equation} \label{eq:clustertemp}
144: {k_B T \over keV}  \sim 
145: \left ( {H(z) M \over 10^{15} h^{-1} M_\odot } \right )^{2 / 3} 
146: \end{equation}
147: where $H(z)$ is the hubble parameter.  This effectively reproduces the results 
148: of the hydrodynamic simulations of \cite{WHS02}.  The normalization has been 
149: set to pass through the lower envelope of the CBI deep field \citep{Mason03} 
150: and through the BIMA point \citep{Dawson01} on small angular scales.  We 
151: generate Compton-Y maps by projecting along each line of sight, so that
152: \begin{equation} \label{eq:comptony}
153: y=\int \sigma_T n_{e}{k_B T \over m_{e} c^2} dl \qquad
154: \end{equation}
155: where $\sigma_T$ is the Thompson scattering cross section, $n_e$ is the 
156: electron number density, and $m_e$ is the electron mass.  The temperature 
157: fluctuation for a given frequency $\nu$ is related to the Y-maps by
158: \begin{equation} \label{eq:DeltaTonT}
159: {\Delta T \over T} = y \left ( x {e^x + 1 \over e^x -1} - 4 \right )
160: \end{equation}
161: where $x = h \nu / k_B T_{CMB} \simeq \nu / 56.84$GHz is the dimensionless 
162: frequency.  Ten maps are made in this manner (see Figure \ref{fig:maps} for 
163: an example), each with $1024^2$ pixels and $3^\circ$ on a side.  These are 
164: not as accurate as those produced using full hydrodynamic simulations, but 
165: they allow us to probe a larger volume and therfore provide a better sample 
166: of large clusters situated in their proper cosmological context.
167: 
168: In the absence of perfect spectral information, confusion due to the primary 
169: CMB temperature anisotropy and to point sources may impede the detection of 
170: clusters.  We simulate the former using realizations of a Gaussian 
171: random fields convolved with the CMB power spectrum computed using 
172: CMBfast \citep{SeZa96}.  We then add radio and infrared (IR) point sources to 
173: the maps using the model of \cite{WM04}.  For radio sources, this 
174: is a fit to the Q-band data of WMAP \citep{Bennett03}, while IR 
175: sources are fit using the 350 GHz observations of \cite{Borys03} 
176: with the Submillimeter Common User Bolometer Array 
177: \citep[SCUBA:][]{Holland99} on the James Clerk Maxwell Telescope.  We 
178: note that there is substantial uncertainty in extrapolating these fits 
179: to frequencies relevant to us here, so we examine two different 
180: extrapolations likely to span the magnitude of the effect.
181: 
182: The maps are then smoothed with a Gaussian beam, and Gaussian white noise is 
183: added.  Although we have ignored many effects which may be important in real 
184: world observations (such as offsets, drifts, and atmosphere), we treat the 
185: maps as completed and ready for signal processing.  
186: 
187: \section{Filtering methods} \label{sec:wavelets}
188: 
189: In this section, we describe the filters we use to process the simulated 
190: maps discussed in Section \ref{sec:simulations}.  These maps have a 
191: complex structure which makes it impossible to analytically determine 
192: the best filter to aid our efforts at cluster identification, so we 
193: explore several different methods.  We begin by describing 
194: the optimal Fourier based filter of \cite{Tegmark98}, 
195: then briefly describe filtering in the discrete wavelet basis, where we 
196: focus on the Daubechies wavelet family \citep{Daub92}.  Finally, we 
197: discuss filtering using the continuous mexican hat wavelet filter 
198: \cite[see][for more discussion of filtering with continuous 
199: wavelets]{Pierpaoli04}.
200: 
201: The signal power in the SZE is expected to exceed that of the primary CMB 
202: and (for a sufficiently powerful survey) instrument noise on roughly arcminute 
203: scales (Figure \ref{fig:szwindow}).  The optimal filter derived in 
204: \cite{Tegmark98} 
205: is essentially a bandpass window which can be centered on the appropriate 
206: angular scale, and is therefor an obvious candidate for our purpose.  
207: This filter is azimuthally symmetric, and its radial dependence in 
208: Fourier space is
209: \begin{equation} \label{eq:tegfilt}
210: \tilde{\psi}_{match}(\ell) \sim 
211: {e^{\theta^2 \ell(\ell +1)/2} \over C_{\ell}^{\rm{Tot}} }
212: \end{equation}
213: where $\theta$ is the full width half max (FWHM) beam size and 
214: $ C_{\ell}^{\rm{Tot}}$ is the total power spectrum of all ``noise''.  
215: See Figure \ref{fig:szwindow} for an example of this filter appropriate 
216: for our fiducial surveys.
217: \begin{figure}
218: \begin{center}
219: {\includegraphics*[height=4.8cm]{fig2a.ps}
220: \includegraphics*[height=4.8cm]{fig2b.ps}}
221: \end{center}
222: \caption{(Left) The expected magnitude of the SZE, the CMB, and 
223: instrument noise for a fiducial $10 \mu K$ per $1^{\prime}$ beam survey, 
224: shown here in Fourier space.  The SZE signal exceeds both the noise and 
225: CMB on roughly arcminute scales.  (Right) The optimal and mexican hat 
226: filters in Fourier space.  The filters are wedge shaped band pass filters, 
227: designed to pass scales where the SZE is large relative to the CMB and 
228: instrument noise.}
229: \label{fig:szwindow}
230: \end{figure}
231: 
232: Wavelets have emerged as a powerful tool for signal processing 
233: \cite[see the Appendix for a brief discussion and e.g.][for a 
234: review of wavelet signal processing]{Mallat99}.  
235: They are simultaneously (but imperfectly) localized in both real space and 
236: Fourier space, and are therefore a natural choice for processing data 
237: which possesses both real space and Fourier space correlations, such 
238: as we expect from our fiducial surveys.  We employ the wavelet transform 
239: algorithm outlined in \cite{Press92}, and although we focus on the 
240: Daubechies wavelets, we note that we have also explored the Coiflet, 
241: Symlet, and Morlet wavelet families, and as these offer essentially 
242: the same results as the Daubechies wavelets, we do not discuss 
243: them further.  
244: 
245: To generate our filter, we follow \cite{Pen99} and estimate the 
246: expected signal given data $\langle S | D \rangle$ for each coefficient in 
247: the wavelet transform.  This amounts to Wiener filtering 
248: in the limit of Gaussian noise and Gaussian signal, and to 
249: thresholding for highly non-Gaussian signal distributions, such as unsmoothed 
250: point sources.  This approach is likely to be superior (in the sense of 
251: minimizing the least squared error in the reconstruction) if the 
252: signal or noise include substantial non-Gaussian behavior, but requires that 
253: the signal and noise probability density functions (PDF) be known.  
254: Although \cite{Pen99} suggests computing $\langle S | D \rangle$ directly 
255: from the non-Gaussian behavior of the observed maps, we find that the 
256: procedure outlined there is subject to numerical artifacts due to the 
257: finite size of sky in our simulations, so we compute this function directly 
258: from our 10 input signal maps.  This will not be possible for real surveys, 
259: and so should be considered an upper limit to the performance of the 
260: technique.  
261: 
262: We implement a third class of filter, the continuous wavelet filter, using 
263: the mexican hat wavelet transform \citep[e.g. ][]{Cayon00,Maisinger04}.  The 
264: continuous wavelet transform $W(a,b)$ of a one dimensional function $f(x)$ 
265: is a real space convolution of $f(x)$ with a ``mother wavelet'' $\psi(x)$
266: \begin{equation} \label{eq:continuous}
267: W(a,b) = \int dx f(x) {1 \over \sqrt{a}} \psi \left ( {x - b \over a} \right )
268: \end{equation}
269: where $a$ and $b$ are the scale and position parameters.  This convolution 
270: is normally performed as a multiplication in the Fourier domain, so that 
271: $\tilde{W} = \sqrt{a} \tilde{f}(\ell) \tilde{\psi}(a \ell)$, 
272: where the tilde denotes the Fourier transform.  The mexican hat wavelet is 
273: the second derivative of a Gaussian, so that its Fourier transform is
274: \begin{equation} \label{eq:mexhat}
275: \tilde{\psi}_{mex}(a \ell) \sim ( a \ell)^2 \rm{exp} 
276: \left [ - {(a \ell)^2 \over 2} \right ]
277: \end{equation}
278: Like the optimal filter, the continuous wavelet transform is essentially 
279: a filter in Fourier space, as can be seen in the side by side comparison 
280: in Figure \ref{fig:szwindow}.  
281: 
282: In the next section, we demonstrate the use of these filters on the simulated 
283: maps described in Section \ref{sec:simulations} for various survey 
284: strategies.
285: \begin{figure}
286: \begin{center}
287: {\includegraphics*[height=6.8cm,width=6.8cm]{fig3a.ps}
288: \includegraphics*[height=6.8cm,width=6.8cm]{fig3b.ps}}
289: {\includegraphics*[height=6.8cm,width=6.8cm]{fig3c.ps}
290: \includegraphics*[height=6.8cm,width=6.8cm]{fig3d.ps}}
291: \end{center}
292: \caption{The input SZE (top left) and the filtered maps for the discrete 
293: Daubechies wavelet filter (top right), the mexican hat continuous wavelet 
294: filter (bottom left), and for the optimal Fourier filter (bottom right).  
295: The color scale is linear, and structures 
296: less than $1 \sigma$ of the noise have been suppressed for visual clarity.}
297: \label{fig:filtermaps}
298: \end{figure}
299: 
300: \section{Results} \label{sec:results}
301: 
302: In this section, we examine the maps after application of the filters 
303: described in Section \ref{sec:wavelets}.  As can be seen in the examples 
304: shown in Figure \ref{fig:filtermaps}, 
305: the three filters clearly all succeed in improving signal 
306: to noise, with large structures in the filtered maps all corresponding to 
307: massive clusters.  We quantify the level of this success using the peak 
308: finding algorithm of \cite{SW03}, and conclude that for the surveys we 
309: consider here, the optimal filter performs at least as well as the 
310: wavelet based filters for creating complete, efficient surveys of clusters.  
311: We then examine cluster finding in the context of several survey strategies.
312: 
313: Of the three filtered maps shown here, the one created using the discrete 
314: wavelet (specifically, Daub6) ``$\langle S | D \rangle$'' technique 
315: described in Section \ref{sec:wavelets} best reconstructs the input SZE 
316: signal in the sense of minimizing the least squared error of the 
317: reconstruction.  However, this reconstruction does not do as well at 
318: creating complete, efficient surveys of clusters.  This is because a 
319: large fraction of the improved signal recovery is associated with a few 
320: giant clusters.  Since these produce enormous signal, they are easy to 
321: find, regardless of the filter, and improving their reconstruction 
322: does not aid in the completeness of the catalogue.  However, these 
323: clusters have signal on relatively large angular scales, so that the 
324: ``$\langle S | D \rangle$'' technique smooths the maps more than is 
325: optimal for finding smaller clusters at the threshold of detection.  
326: Although this can be accounted for by eliminating the large clusters from 
327: consideration when formulating the filter, the result is to simply 
328: scale the signal by a constant at each level, so that the smaller 
329: cluster signal is effectively in the Wiener filter limit.  Although we do 
330: not show them here, we have also considered maps with holes, rough edges, 
331: and spatially varying noise.  The wavelet filter performs better in this 
332: context than the other filters for extreme conditions, but there is no 
333: detectable advantage for realistic assumptions.  
334: \begin{figure}
335: \begin{center}
336: {\includegraphics*[height=5.8cm]{fig4a.ps}
337: \includegraphics*[height=5.8cm]{fig4b.ps}}
338: \end{center}
339: \caption{Clusters found (left) and missed (right) as a function of mass and 
340: redshift for our fiducial $10 \ \mu$K per arcminute beam survey.  The survey 
341: size is 90 square degrees, and we have required a 75\% detection efficiency 
342: for clusters of mass above $10^{14} \ \rm{M}_\odot$.}
343: \label{fig:foundmissed}
344: \end{figure}
345: 
346: \begin{figure}
347: \begin{center}
348: {\includegraphics*[height=8.5cm,width=5.5in]{fig5.ps}}
349: \end{center}
350: \caption{The number of massive clusters associated with the 5000 largest 
351: peaks in the filtered signal for our 10 maps, shown here in descending 
352: order of signal strength, for our fiducial $10 \ \mu$K per arcminute 
353: beam survey.  The largest signal peaks are always associated 
354: with a cluster.  However, noise and projection effects cause 
355: substantial scatter in the mass-observable relation.  In the small 
356: signal regime, this scatter begins to dominate, so that many 
357: smaller peaks are not associated with a cluster of mass 
358: $\rm{M} > 10^{14} \ \rm{M}_\odot$.  We note that there are a total of 
359: 1900 clusters in the (solid) mass range of 
360: $10^{14} < \rm{M} < 2 \times 10^{14} \ \rm{M}_\odot$ 
361: and 550 of (dashed) mass $ \rm{M} > 2 \times 10^{14} \ \rm{M}_\odot$ 
362: in our simulation.}
363: \label{fig:completeness1}
364: \end{figure}
365: 
366: \begin{figure}
367: \begin{center}
368: {\includegraphics*[height=8.5cm,width=5.5in]{fig6.ps}}
369: \end{center}
370: \caption{Completeness of clusters detected above a mass threshold for three 
371: survey strategies as a function of the number of signal peaks included in the 
372: analysis.  The upper family 
373: of curves is the completeness for clusters of mass greater than 
374: $2.0 \times 10^{14} \ \rm{M}_\odot$, while the lower family is for clusters of 
375: mass less than this but greater than 
376: $1.5 \times 10^{14} \ \rm{M}_\odot$.  The efficiency for a threshold mass of 
377: $10^{14} \ \rm{M}_\odot$ for one survey is also shown.  The totals are derived 
378: from ten $3^{\circ} \times 3^{\circ}$ simulated maps.}
379: \label{fig:completeness}
380: \end{figure}
381: The mexican hat filter and the optimal filter of \cite{Tegmark98} perform 
382: about equally well.  In fact, we find that the primary CMB anisotropy is 
383: easy to separate morphologically from the SZE for the small beam sizes 
384: we consider here, and any reasonable hi-pass filter can be used for 
385: this task.  If the two filters are then set to be roughly the same on small 
386: angular scales, then they perform about equally well at cluster finding.  In 
387: particular, we find that for clusters which are smaller than 
388: the beam of the survey, simply smoothing the noisy maps by the beam 
389: performs best for small scale filtering.
390: 
391: We use two methods to identify signal peaks in the filtered maps.  The 
392: first simply flags local maxima, while the second computes the total 
393: flux in all pixels surrounding (and including) the local maxima which are 
394: greater than one quarter the peak value.  Since the choice of method does 
395: not significantly alter our results, the results presented here are for the 
396: simpler local maxima technique unless stated otherwise.
397: 
398: Once we have identified signal peaks in the maps, we compare these to a 
399: list of clusters in our simulation.  As can be seen in 
400: Figure \ref{fig:foundmissed}, massive clusters are nearly always 
401: identified using this method, while no particular dependence on 
402: redshift is evident.  Also, large signal peaks are nearly always 
403: associated with a cluster (Figure \ref{fig:completeness1}), although 
404: noise and projection effects cause substantial scatter in the 
405: mass-observable relation.  The efficiency of the survey (that is, 
406: the chance that a signal peak corresponds to a cluster) is therefor 
407: nearly 100\% for large peaks.  
408: 
409: We now turn our attention to survey strategy, where we examine the 
410: use of multiple vs single frequency measurements.  Although the primary CMB 
411: anisotropy is not a serious contaminant, contributions from point sources 
412: are non-negligible.  Spectral information can alleviate this issue, but at 
413: the price of either reduced signal to instrument noise, or less sky coverage, 
414: per unit of telescope time.  If only a single frequency is used, then point 
415: source contributions must be considered in addition to signal strength, 
416: instrument noise, and beam size.  
417: 
418: To investigate this issue, we begin with a fiducial survey at 150 GHz for a 
419: 12 meter dish (roughly a $0.^{'} 8$ beam at 150 GHz) and $10 \mu$K-arcmin 
420: instrument noise over our 90 square degrees of simulated sky.  As expected, 
421: this survey does well at cluster identification in the absence of point 
422: sources.  However, including them substantially worsens the result 
423: (Figure \ref{fig:completeness}), even for a level at the low end of the 
424: expected confusion noise, and a deeper integration at the same frequency only 
425: marginally improves the outcome.  A better result is achieved by combining 
426: the 150 GHz survey with a 220 GHz observation and differencing the results to 
427: remove the point source contribution.  Although the 220 GHz channel contains 
428: no signal and is noisier than the 150 GHz channel 
429: (we assume $\sim 15 \mu$K-arcmin for the same integration time), the 
430: effective signal to instrument noise is only marginally worse.  This is 
431: because the point source contribution is roughly twice as large at 220 GHz as 
432: it is at 150 GHz, so that (roughly speaking) the point source elimination is 
433: accomplished by including only half the noise of the instrument noise at 220 
434: GHz.  Added in quadrature, this is effectively a $12 \mu$K-arcmin survey.
435: \begin{figure}
436: \begin{center}
437: {\includegraphics*[height=5.3cm]{fig7a.ps}
438: \includegraphics*[height=5.3cm]{fig7b.ps}}
439: \end{center}
440: \caption{The number of galaxies brighter than (left) R = 24 and 
441: (right) R = 25 within the virial radius of clusters in our sample 
442: with mass greater than $10^{14} \rm{M}_\odot$ and at redshift 
443: $Z<1$.  The number within the core region, where the contrast 
444: against the background is highest, is about 10\% of $N_{gal}(< r_{200})$.}
445: \label{fig:ngal24}
446: \end{figure}
447: 
448: An alternative strategy is explored in the form of a single frequency 
449: survey at 95 GHz.  This offers the advantage of a larger signal and lower 
450: point source contamination, but is noisier (we assume $17 \mu$K-arcmin for 
451: equal integration time) and has a larger ($\sim 1.^{`} 3$) beam.  This 
452: results in improved detections, and although the level of improvement 
453: depends upon the model of point source confusion assumed, this survey 
454: strategy is superior in our simulations for the models we have considered 
455: here.  We have not accounted for clustering of point sources, 
456: which may change this result.
457: 
458: Finally, we note that optical and near-IR emission is still the least 
459: expensive way of measuring cluster redshifts, and that redshift 
460: information is crucial to the physical interpretation of the cluster 
461: sample.  We use the method outlined in \cite{White03} to provide a 
462: rough estimate (Figure \ref{fig:ngal24}) of the number of galaxies 
463: brighter than $R=24$ and $R=25$ within the virial radius of the 
464: clusters detected (at 75\% efficiency above mass 
465: $M > 10^{14} \ M_{\odot}$ and at redshift $Z < 1$) in our fiducial 
466: survey.  The number in the core region, where the contrast against the 
467: background is the highest, will obviously be smaller.  If the galaxies 
468: follow the mass, approximately 20\% of the galaxies lie within the break 
469: radius ($0.2r_{200}$) and 8\% within the core radius ($0.1r_{200}$).  
470: Although the results we show here are for clusters at redshift $Z < 1$, 
471: the results for higher redshifts will clearly be somewhat worse.  
472: Optical follow up will no doubt be an integral but challenging complement 
473: to an SZE cluster survey similar to that discussed here.
474: 
475: \section{Conclusions} \label{sec:conclusions}
476: 
477: Measurement of the CMB using the unprecedented combination of power, 
478: resolution, and sky coverage expected in upcoming surveys will return a 
479: wealth of information, including high resolution detections in the SZE sky 
480: sufficient to provide enormous catalogues of galaxy clusters.  We have 
481: studied the use of several filtering techniques to aid the cluster 
482: identification process, and evaluated the likely detection of clusters 
483: using simulated maps of the CMB.  We note that while further signal analysis 
484: will be required to optimally measure cluster properties, we have not 
485: addressed that issue here.
486: 
487: We have tested three filter techniques, using Fourier methods, continuous 
488: wavelets, and discrete wavelets, and have found that all of these can 
489: be efficiently used to enhance the signal to noise in our maps.  Although 
490: the discrete wavelets perform better under some extreme conditions, we 
491: find that each of these techniques may be used effectively to aid cluster 
492: detection.  
493: 
494: We have also examined the success of survey strategies in creating complete 
495: catalogues of clusters for a given mass threshold.  We find that the primary 
496: CMB anisotropy is not an important source of noise for the high resolution 
497: surveys we have considered here, but that point sources may in some cases be 
498: more important than instrument noise.  Accordingly, multi-frequency 
499: measurements are likely required if the sky is probed in frequency bands 
500: where point sources are large relative to the signal strength.  A single 
501: frequency band may be used effectively if the band center is selected at a 
502: frequency where point source contamination is not expected to overwhelm the 
503: signal.  We note that the magnitude of the point source confusion and 
504: clustering are not as yet well measured at frequencies relevant to us here, 
505: so that we are dependent upon models for our results.  
506: 
507: A well understood, nearly complete catalogue of massive clusters over a 
508: large fraction of the sky would be a major achievement for cosmology, and 
509: will likely be available in the near future as powerful surveys begin 
510: operation.  The road to optimizing the results includes a determination 
511: of the best survey strategies and signal processing techniques, and in this 
512: endeavor, simulations can play an important role.  We have made the raw 
513: maps, along with some auxiliary data products, freely available to the 
514: community at http://mwhite.berkeley.edu/ in the hope that they will be 
515: useful in taking the next step.
516: 
517: CV would like to thank J.D. Cohn, Tom Crawford, Steve Myers, and Wayne Hu 
518: for useful discussions.  The simulations used here were performed on the 
519: IBM-SP at the National Energy Research Scientific Computing Center.
520: This research was supported by the NSF and NASA.
521: 
522: \appendix \section{Appendix:  Wavelets} \label{sec:appendix}
523: 
524: In this section, we provide a brief discussion of discrete wavelets to 
525: orient the reader \citep[see e.g.][for a more substantial introduction to 
526: wavelet signal processing]{Mallat99}.  
527: To get a feel for wavelets, let us consider the first order Daubechies 
528: wavelet, Daub1, also called the Haar wavelet.  In one dimension, the 
529: first level (not to be confused with order) Daub1 transform involves 
530: computing the two pixel average 
531: $\mathbf{a}$ and difference $\mathbf{d}$ of a signal 
532: $\mathbf{f}  = (f_1, f_2, \dots, f_N)$, so that the elements $a_m$ of 
533: $\mathbf{a}$ are defined by
534: \begin{equation} \label{eq:trend}
535: a_m = {f_{2m-1} + f_{2m} \over \sqrt{2}}
536: \end{equation}
537: and similarly for $\mathbf{d}$, but with a minus sign on the right hand side 
538: of Eq.(\ref{eq:trend}).  Like all discrete wavelet transforms, the Daub1 
539: transform decomposes a signal into two subsignals half the length of the 
540: original:  a running average $\mathbf{a}$ called the trend, and a running 
541: difference $\mathbf{d}$ called the fluctuation.  Note that this transform is 
542: linear, invertible, and preserves the total sum of squares of the pixels 
543: (the latter is often called ``conservation of energy'' in wavelet parlance).  
544: For higher order transforms, the trend and fluctuation subsignals are no 
545: longer simple averages and differences (for example, the Daub2 transform uses 
546: a four pixel linear fit rather than a two pixel average), but the basic idea 
547: is the same.
548: 
549: An essential component of wavelet based analysis is the simultaneous 
550: processing of data at multiple scales.  This ``Multi-Resolution Analysis'' 
551: (MRA) is implemented by a hierarchical application of the wavelet transform 
552: on the data, so that the first level transform, which probes the smallest 
553: physical (and highest frequency) scales, is applied to the original signal.  
554: The second level transform is then computed by taking the wavelet transform of 
555: the first level trend signal, and so on, so that for an $\rm{n}^{th}$ level 
556: transform, the result is a single trend $\mathbf{a^n}$ and n fluctuations 
557: $\mathbf{d}^1, \mathbf{d}^2, \dots, \mathbf{d}^n$.  
558: \begin{figure}
559: \begin{center}
560: {\includegraphics*[height=8.5cm,width=5.5in]{fig8.ps}}
561: \end{center}
562: \caption{The SZE signal is comparable to the level of confusion from the 
563: primary CMB and instrument noise on intermediate angular scales.  Here, 
564: we display this effect for the $6^{th}$ order Daubechies wavelet, and the 
565: resulting level dependent filter.  }
566: \label{fig:daub6}
567: \end{figure}
568: 
569: Most discrete wavelet filtering techniques are based on the ``thresholding'' 
570: modality, where wavelet coefficients with an absolute value less than a 
571: chosen threshold value are discarded.  This is effective when the signal is 
572: much larger than the noise, and when the goal is to recover an image which 
573: is visually appealing to the human eye.  An approach more suited to our 
574: purpose is to attempt to reconstruct the signal with a minimum least 
575: squared error.  For a Gaussian signal and uncorrelated Gaussian noise, 
576: this implies a Wiener filter in Fourier space.  The equivalent filter in 
577: wavelet space can be constructed by estimating the energy of the signal 
578: divided by the data for each level and scaling the transformed data at each 
579: level by this ratio.  The required estimate of the signal energy can be 
580: obtained either using simulations, or directly from the data if the 
581: noise is well understood.  We show an example of this filter for the Daub6 
582: wavelet in Figure \ref{fig:daub6}.
583: 
584: One simple use of a wavelet filter is to account for spatially varying 
585: noise.  If the statistical properties of the noise is known as a function 
586: of position, then a wavelet filter can adjust to accommodate this in a 
587: more natural way than filters with no localized spatial properties.  We 
588: show an example of this in Figure \ref{fig:varynoise} vs. the optimal 
589: filter, for an extreme case where the noise fluctuates by a factor of several 
590: hundred in the maps.  Although the reconstruction is notably improved in 
591: this highly artificial case, no improvement was evident in our maps 
592: for more realistic noise.
593: \begin{figure}
594: \begin{center}
595: {\includegraphics*[height=4.5cm,width=4.5cm]{fig8a.ps}
596: \includegraphics*[height=4.5cm,width=4.5cm]{fig8b.ps}}
597: {\includegraphics*[height=4.5cm,width=4.5cm]{fig8c.ps}}
598: \end{center}
599: \caption{The input SZE (left) and the filtered maps for the  optimal 
600: filter (center) and the discrete Daubechies wavelet filter (right), 
601: in an extreme case where the noise level in the maps is varied by a 
602: factor of several hundred.  The recovered maps shown here are in low 
603: noise regions, and in this case, the localized nature of the wavelet 
604: filter allows for a substantial improvement in the reconstruction.  
605: For realistic noise, this advantage is not detected in our maps.}
606: \label{fig:varynoise}
607: \end{figure}
608: 
609: \begin{thebibliography}{} 
610: 
611: \harvarditem{Bahcall et al.}{1999}{Bahcall99}
612:  Bahcall N., Ostriker J., Perlmutter S., Steinhardt P., 
613:  1999, Science, 284, 1481
614: 
615: \harvarditem{Bennett et al.}{2003}{Bennett03}
616:  Bennett C.L., et al., 2003, ApJS, 148, 97 [astro-ph/0302208]
617: 
618: \harvarditem{Birkinshaw}{1999}{Birk99}
619:  Birkinshaw M., 1999, Phys. Rep., 310, 98
620: 
621: \harvarditem{Borys et al.}{2003}{Borys03}
622:  Borys C., Chapman S.C., Halpern M., Scott D., 2003, MNRAS, 344, 385 
623:  [astro-ph/0305444]
624: 
625: \harvarditem{Carlstrom, Holder, \& Reese}{2002}{Carl02}
626:  Carlstrom J., Holder G., Reese E., 2002, ARAA, 40, 643
627: 
628: \harvarditem{Cayon et al.}{2000}{Cayon00}
629:  Cayon L., et al., 2000, MNRAS, 315, 757
630: 
631: \harvarditem{Daubechies}{1992}{Daub92}
632:  Daubechies I., 1992, Ten Lectures on Wavelets, SIAM, Philadelphia
633: 
634: \harvarditem{Davis et al.}{1985}{Davis85}
635:  Davis M., Efstathiou G., Frenk C.S., White S.D.M., 1985, ApJ, 292, 371
636: 
637: \harvarditem{Dawson et al.}{2001}{Dawson01}
638:  Dawson K.S., Holzapfel W.L., Carlstrom J.E., Joy M., LaRoque S.J., 
639:  Reese E.D., 2001, ApJ, 553, L1
640: 
641: \harvarditem{Holland et al.}{1999}{Holland99}
642:  Holland W.S, et al., 1999, MNRAS, 303, 659
643: 
644: \harvarditem{Maisinger, Hobson,  \& Lasenby}{2004}{Maisinger04}
645:  Maisinger K., Hobson M.P., Lasenby A.N., 2004, 
646:  MNRAS, 347, 339 [astro-ph/0303246]
647: 
648: \harvarditem{Mallat}{1999}{Mallat99}
649:  Mallat S., 1999, A Wavelet Tour of Signal Processing, Academic Press
650: 
651: \harvarditem{Mason et al.}{2003}{Mason03}
652:  Mason B.S., et al., 2003, ApJ, 591, 540 [astro-ph/0205384]
653: 
654: \harvarditem{Pen}{1999}{Pen99}
655:  Pen U., 1999, Phil.Trans.Roy.Soc.Lond. A357, 2561 [astro-ph/9904170]
656: 
657: \harvarditem{Pierpaoli et al.}{2004}{Pierpaoli04}
658:  Pierpaoli E., Anthoine S., Huffenberger K., Daubechies I., 2004, 
659:  submitted to MNRAS [astro-ph/0412197]
660: 
661: \harvarditem{Press et al.}{1992}{Press92}
662:  Press W.H., Teukolsky S.A., Vetterling W.T., Flannery B.P., 1992, 
663:  Numerical Recipes in C, 2nd edition, Cambridge University Press
664: 
665: \harvarditem{Rephaeli}{1995}{Reph95}
666:  Rephaeli, Y., 1995, ARA\&A, 33, 541
667: 
668: \harvarditem{Rosati, Borgani, \& Norman}{2002}{RBN02}
669:  Rosati P., Borgani S., Norman C., 2002, ARA\&A, 40, 539
670: 
671: \harvarditem{Schulz \& White}{2003}{SW03}
672:  Schulz A., White M., 2003, ApJ, 586, 723 [astro-ph/0210667]
673: 
674: \harvarditem{Seljak \& Zaldarriaga}{1996}{SeZa96}
675:  Seljak U., Zaldarriaga M., 1996, ApJ, 469, 437
676: 
677: \harvarditem{Sunyaev \& Zel'dovich}{1972}{SZ72}
678:  Sunyaev R.A., Zel'dovich Ya. B., 1972, Comm. Astrophys. Space Phys., 4, 173
679: 
680: \harvarditem{Sunyaev \& Zel'dovich}{1980}{SZ80}
681:  Sunyaev R.A., Zel'dovich Ya. B., 1980, ARA\&A, 18, 537
682: 
683: \harvarditem{Tegmark \& de Oliveira-Costa}{1998}{Tegmark98}
684:  Tegmark M., de Oliveira-Costa A., 1998, ApJ, 500, L83 [astro-ph/9802123]
685: 
686: \harvarditem{Voit}{2004}{Voit04}
687: Voit G.M., 2004, Rev. Mod. Phys. (in press) [astro-ph/0410173]
688: 
689: \harvarditem{White}{2003}{White03}
690:  White M., 2003, ApJ, 597, 650 [astro-ph/0302371]
691: 
692: \harvarditem{White, Hernquist \& Springel}{2002}{WHS02}
693:  White M., Hernquist L., Springel V., 2002, ApJ, 579, 16 [astro-ph/0205437]
694: 
695: \harvarditem{White \& Majumdar}{2004}{WM04}
696:  White M., Majumdar S., 2004, ApJ, 602, 565 [astro-ph/0308464]
697: 
698: \end{thebibliography}
699: \end{document}
700: